Multithreaded rendering / rasterisation #8

tamara-schmitz · 2017-08-15T21:22:15Z

Essentially two sections can be multithreaded: vertex shading and rasterisation (including pixel shading).
Listed below there a few multithreading concept proposals:

Vertex shader:

put every matrix manipulation per triangle in a task queue and let thread pool process queue

Rasterisation:

split frame and z buffer into slices along the y axis (ideally twice / thrice as many slices as CPU processing cores). give each slice a task queue for storing index of to be processed triangles received from vertex stage (only one worker thread per slice). use bound checks to determine which triangles affect which lines (probably useless if just a few slices). copy vertices from triangle queue into edges and texcoordforedge. do rasterisation. copy each slice into main framebuffer.

tamara-schmitz · 2017-09-26T19:43:50Z

The following is already implemented since at least: 32085e5

### Queue fetches
Use SafeQueue to reduce complications. Main thread should notify every frame about how many triangles have been sent out for rendering. Threads can then decide whether they should use pop() in blocking mode or notifiy the main thread that they have finished doing their work.

tamara-schmitz · 2017-11-03T18:37:23Z

Memory fencing

SafeQueues are in place but we use locks to prevent race conditions.
Read about memory fencing instead: https://www.linuxjournal.com/content/lock-free-multi-producer-multi-consumer-queue-ring-buffer?page=0,1

Circular buffers

Switching from a Queue to a circular buffer seems like a good idea as it guarantees that there are no reallocations during pop and push. Memory allocations are also unnecessary during runtime. However buffer size is pretty static. Buffer stalls if write pointer just in front of read pointer (=> buffer is full).
Check out Wikipedia for more information: https://en.wikipedia.org/wiki/Circular_buffer
Also this may be useful: https://www.codeproject.com/Articles/153898/Yet-another-implementation-of-a-lock-free-circul

Other ideas

Use of a stack which also eliminates reallocations but constant allocs and deallocs may degrade performance.

tamara-schmitz · 2017-11-06T19:03:55Z

Current status

Threading works pretty much (suspect race condition in VP if VP count > 1 though). See 016ed89

Performance results are bad as expected as currently every triangle fetch from the rasteriser requires a lock.

tamara-schmitz · 2017-11-07T15:27:17Z

Other possible improvements

Profiling is required but VertexProcessorObjs may slow things down as they all have shared contains pointing at one texture. Concurrent reference counting could have a significant influence on performance.

tamara-schmitz · 2022-01-29T21:10:10Z

SafeQueue was rewritten to be the only queue type required. Only issues left are in copying rasteriser textures back to the main thread and rendering them.

tamara-schmitz added the feature label Aug 15, 2017

tamara-schmitz added this to the Pseudo 3D engine milestone Aug 15, 2017

tamara-schmitz pushed a commit that referenced this issue Sep 27, 2017

Implemented #15. Prepared for #8.

32085e5

tamara-schmitz pushed a commit that referenced this issue Jan 26, 2022

Implemented #15. Prepared for #8.

6bff4ff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multithreaded rendering / rasterisation #8

Multithreaded rendering / rasterisation #8

tamara-schmitz commented Aug 15, 2017

tamara-schmitz commented Sep 26, 2017 •

edited

tamara-schmitz commented Nov 3, 2017 •

edited

tamara-schmitz commented Nov 6, 2017

tamara-schmitz commented Nov 7, 2017

tamara-schmitz commented Jan 29, 2022

Multithreaded rendering / rasterisation #8

Multithreaded rendering / rasterisation #8

Comments

tamara-schmitz commented Aug 15, 2017

Vertex shader:

Rasterisation:

tamara-schmitz commented Sep 26, 2017 • edited

tamara-schmitz commented Nov 3, 2017 • edited

Memory fencing

Circular buffers

Other ideas

tamara-schmitz commented Nov 6, 2017

Current status

tamara-schmitz commented Nov 7, 2017

Other possible improvements

tamara-schmitz commented Jan 29, 2022

tamara-schmitz commented Sep 26, 2017 •

edited

tamara-schmitz commented Nov 3, 2017 •

edited