feat: change thread scheduling method in ThreadPool class #2648

Open
wants to merge 11 commits into base: unstable

Conversation


@QlQlqiqi QlQlqiqi commented May 12, 2024

The logic is based on the function WriteThread::AwaitState in RocksDB (link).

Before:

  1. All workers and the main thread that pushes tasks into the queue wait on the same lock, which causes very intense contention.
  2. When a worker finishes a task, it tries to acquire the lock again for a new task through the function await. Because of the intense contention, the worker is very likely to go to sleep, and sleeping and waking up cost a lot of time.

After:

  1. This is a standard producer-consumer model, so we can use a lock-free list to relieve the contention.
  2. When a worker wakes up, it tries to take tasks. If it finds none, it loops for a while to wait for new tasks. Under high throughput the wait for new tasks is very short, so this loop will NOT cause serious blocking. To reduce the blocking time, the loop has 3 levels (a sketch follows this list).
    2.1. Level 1: wait in a spin loop.
    2.2. Level 2: wait in a longer loop. The worker may yield the CPU when certain conditions are met, and a piece of sampled data stores the probability of entering this level.
    2.3. Level 3: wait for new tasks via the function await.
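A minimal sketch of the three-level wait, assuming an atomic list head newest_node_ and a condition-variable fallback for the await path; the class layout, names, and thresholds are illustrative, not the exact code in this PR:

#include <atomic>
#include <chrono>
#include <condition_variable>
#include <cstdint>
#include <mutex>
#include <thread>

struct Node { Node* link_older = nullptr; /* task payload */ };

class ThreadPoolWaitSketch {
 public:
  // Detach everything currently queued, or block until something arrives.
  Node* WaitForTasks() {
    // Level 1: short spin loop (default 200 iterations).
    for (uint32_t tries = 0; tries < 200; ++tries) {
      if (Node* n = TryTakeAll()) return n;
    }
    // Level 2: bounded yield loop (default max_yield_usec_ = 100).
    // The PR gates entry to this level with a sampled probability; omitted here.
    auto deadline = std::chrono::steady_clock::now() + std::chrono::microseconds(100);
    while (std::chrono::steady_clock::now() < deadline) {
      if (Node* n = TryTakeAll()) return n;
      std::this_thread::yield();
    }
    // Level 3: block on a condition variable until a producer signals.
    std::unique_lock<std::mutex> lk(mu_);
    Node* n = nullptr;
    cv_.wait(lk, [&] { return (n = TryTakeAll()) != nullptr; });
    return n;
  }

 private:
  // Grab the whole pending list with a single atomic exchange.
  Node* TryTakeAll() { return newest_node_.exchange(nullptr, std::memory_order_acquire); }

  std::atomic<Node*> newest_node_{nullptr};
  std::mutex mu_;
  std::condition_variable cv_;
};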

params

  1. the iteration count of the level-1 loop:
    default: 200. Too large a count may cause high CPU load; too small a count may make the spin pointless.
  2. queue_slow_size_:
    default: std::min(worker_num, 100). When the number of tasks in the queue exceeds it, the main thread that calls the function Schedule calls std::this_thread::yield().
  3. max_queue_size_:
    default: max_queue_size. When the number of tasks in the queue exceeds it, the main thread that calls the function Schedule calls std::this_thread::yield() until the number of tasks in the queue falls below the threshold.
  4. max_yield_usec_:
    default: 100. The maximum duration (in microseconds) of the level-2 loop.
  5. slow_yield_usec_:
    default: 3. If a single std::this_thread::yield() call takes longer than this threshold, the sampling data may be updated.
  6. kMaxSlowYieldsWhileSpinning:
    default: 3. If the condition in (5) is reached this many times, the sampling data will be updated.
  7. sampling_base:
    default: 256. It means the probability of entering the level-2 loop is not lower than 1/sampling_base.
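For the producer side, a hedged sketch of how queue_slow_size_ and max_queue_size_ could shape Schedule; the member names follow the description above, but the bodies and defaults are illustrative assumptions, not the PR's exact code:

#include <atomic>
#include <condition_variable>
#include <cstddef>
#include <thread>

struct Node { Node* link_older = nullptr; /* task payload */ };

class ScheduleSketch {
 public:
  void Schedule(Node* task) {
    // Soft limit (queue_slow_size_): yield once so workers can catch up.
    if (node_cnt_.load(std::memory_order_relaxed) >= queue_slow_size_) {
      std::this_thread::yield();
    }
    // Hard limit (max_queue_size_): keep yielding until the queue drains.
    while (node_cnt_.load(std::memory_order_relaxed) >= max_queue_size_) {
      std::this_thread::yield();
    }
    // Lock-free push onto the shared list, then wake a blocked worker.
    Node* old = newest_node_.load(std::memory_order_relaxed);
    do {
      task->link_older = old;
    } while (!newest_node_.compare_exchange_weak(old, task, std::memory_order_release,
                                                 std::memory_order_relaxed));
    node_cnt_.fetch_add(1, std::memory_order_relaxed);  // workers decrement after consuming
    cv_.notify_one();
  }

 private:
  std::atomic<Node*> newest_node_{nullptr};
  std::atomic<std::size_t> node_cnt_{0};
  std::condition_variable cv_;
  const std::size_t queue_slow_size_ = 100;    // std::min(worker_num, 100)
  const std::size_t max_queue_size_ = 100000;  // illustrative default
};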

@QlQlqiqi changed the title from "change thread scheduling method in ThreadPool class" to "feat: change thread scheduling method in ThreadPool class" on May 12, 2024
@github-actions bot added the ✏️ Feature (New feature or request) label on May 12, 2024
@QlQlqiqi requested a review from AlexStocks on May 12, 2024 06:43
// 1. loop for short time
for (uint32_t tries = 0; tries < 200; ++tries) {
  if (newest_node_.load(std::memory_order_acquire) != nullptr) {
    // Detach the entire pending list with one exchange.
    last = newest_node_.exchange(nullptr);
Collaborator

Here the thread that arrives first detaches the entire list, keeps it for itself, and then consumes it sequentially. This may cause large latency fluctuations, so I suggest distributing tasks as evenly as possible among the workers in the thread pool. After all, on Pika's read/write path these are its own threads, which is quite different from RocksDB's threading model (in RocksDB it is the application threads that run concurrently for each writer), so this part probably needs more consideration.

Author

I thought about it; there are roughly two ways to take only a limited number of tasks at a time:
1. Give each worker its own lock-free list, and push new tasks into these lists randomly or by iterating over them (see the sketch below);
2. Keep a single lock-free list but cap its capacity at a small number, e.g. 10, so that a worker takes at most 10 tasks at a time.
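A minimal sketch of option 1, assuming one lock-free list head per worker and a round-robin producer; all names here are illustrative and not taken from the branch:

#include <atomic>
#include <cstddef>
#include <vector>

struct Node {
  Node* link_older = nullptr;  // next pointer in the per-worker list
  // task payload ...
};

// Option 1: one lock-free list per worker, producers push round-robin.
class PerWorkerQueues {
 public:
  explicit PerWorkerQueues(std::size_t worker_num) : heads_(worker_num) {}

  // Producer side: pick a list round-robin and push with a CAS loop.
  void Push(Node* node) {
    std::size_t idx = next_.fetch_add(1, std::memory_order_relaxed) % heads_.size();
    auto& head = heads_[idx].head;
    Node* old = head.load(std::memory_order_relaxed);
    do {
      node->link_older = old;
    } while (!head.compare_exchange_weak(old, node, std::memory_order_release,
                                         std::memory_order_relaxed));
  }

  // Consumer side: worker i detaches only its own list.
  Node* TakeAll(std::size_t worker_idx) {
    return heads_[worker_idx].head.exchange(nullptr, std::memory_order_acquire);
  }

 private:
  struct alignas(64) Head {  // padded to avoid false sharing between workers
    std::atomic<Node*> head{nullptr};
  };
  std::vector<Head> heads_;
  std::atomic<std::size_t> next_{0};
};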

Author

The second method can be tested directly; for the first method, see my new branch: https://github.com/QlQlqiqi/pika/tree/change-thread-shedule-with-mutil-list

Author

In my tests the two methods are roughly equal in speed; of course, with suitable parameter tuning there should be a noticeably larger gap between them.
