avx: AVX1 support for matrix inverse #64

recp · 2018-10-30T07:07:18Z

cglm already has an AVX version of mat4_mul, but mat4_inv was missing one. I implemented an AVX1 version of the matrix inverse.

After I upgrade my MacBook Pro I'll try to implement AVX2 + FMA too, but my current CPU does not support them, so I can't do that for now.

I tested mat4_inv on an Ivy Bridge CPU and got performance similar to SSE (not better), but the result may be different on newer CPUs. I'll try to remove some shuffles later to improve performance.

New functions:

glm_mat4_scale_avx(mat4 m, float s)
glm_mat4_inv_avx(mat4 mat, mat4 dest)

These are selected automatically if -mavx is set.
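
For readers unfamiliar with this kind of compile-time selection, here is a minimal sketch of how it could look. It is not the code from this PR. The names mat4_sketch, mat4_scale_scalar, mat4_scale_avx_sketch and mat4_scale_sketch are hypothetical, and the sketch assumes the compiler defines __AVX__ when -mavx is passed (GCC and Clang do). It uses unaligned loads and stores so it makes no assumption about how the matrix is aligned.

```c
#include <stddef.h>
#if defined(__AVX__)
#  include <immintrin.h>
#endif

/* Hypothetical stand-in for cglm's mat4: 16 contiguous floats. */
typedef float mat4_sketch[4][4];

/* Scalar fallback, used when the build does not enable AVX. */
static void mat4_scale_scalar(mat4_sketch m, float s) {
  for (size_t c = 0; c < 4; c++)
    for (size_t r = 0; r < 4; r++)
      m[c][r] *= s;
}

#if defined(__AVX__)
/* AVX path: a 256-bit register holds 8 floats, so the 16 floats of
   the matrix are scaled with two load/multiply/store steps. */
static void mat4_scale_avx_sketch(mat4_sketch m, float s) {
  __m256 k = _mm256_set1_ps(s);
  _mm256_storeu_ps(&m[0][0], _mm256_mul_ps(_mm256_loadu_ps(&m[0][0]), k));
  _mm256_storeu_ps(&m[2][0], _mm256_mul_ps(_mm256_loadu_ps(&m[2][0]), k));
}
#endif

/* Entry point: the preprocessor picks the implementation at build time,
   so there is no runtime CPU check. */
static void mat4_scale_sketch(mat4_sketch m, float s) {
#if defined(__AVX__)
  mat4_scale_avx_sketch(m, s);
#else
  mat4_scale_scalar(m, s);
#endif
}
```

Because the choice is made by the preprocessor, a binary built with -mavx will only run on CPUs that support AVX; the same source built without the flag falls back to the scalar loop.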

I'll try to optimize SIMD-ed functions with SSE3 and SSE4 later.

coveralls · 2018-10-30T07:10:05Z

Coverage remained the same at 11.487% when pulling 01b93b0 on simd into 07e60bd on master.

recp · 2021-04-30T22:50:40Z

glm_mat4_scale_avx() has been added to master, and I'll try to re-implement the AVX version.

recp added 3 commits October 30, 2018 09:27

avx: optimize (re-use) mat4_mul registers

abfa355

avx: implement mat4_inv for AVX1

e9b51fc

avx: implement scale matrix using AVX

9aebdc7

Merge branch 'master' into simd

01b93b0

recp self-assigned this Mar 6, 2020

recp added the enhancement, feature, and in progress labels Mar 6, 2020
