Add SIMD #30
Would this break GopherJS compatibility? I'm starting to rely on mathgl as my vector library when developing with WebGL, so it'd be really nice to be able to continue to do that. Currently:
I'll definitely add either a buildtag or an extra make command that does …
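As a sketch of how a build tag could gate the assembly path while keeping GopherJS working (file names, package layout, and the `Add` method are illustrative here, not mathgl's actual code), the idea is two files with mutually exclusive build constraints, so GopherJS compiles only the pure-Go fallback:

```go
// vec_amd64.go — assembly-backed path, compiled only on amd64 and never for js
// +build amd64,!js

package mgl32

// Add is implemented in vec_amd64.s using SSE instructions.
func (v Vec4) Add(u Vec4) Vec4
```

```go
// vec_generic.go — pure-Go fallback; GopherJS and non-amd64 targets get this
// +build !amd64 js

package mgl32

func (v Vec4) Add(u Vec4) Vec4 {
	return Vec4{v[0] + u[0], v[1] + u[1], v[2] + u[2], v[3] + u[3]}
}
```

The two constraint lines are complements of each other, so exactly one file is compiled for any given target.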
Off topic: Woah @ GopherJS
Other note: apparently SIMD for a Mat3 determinant is also not worth it, though this is because of the necessary passthrough to another function (you can't define assembly on a pointer receiver).
Auto-vectorization in the compiler would be the ultimate ideal, but one could also make a code-generation tool that vectorizes your math-heavy functions into assembly routines. Edit: this would be a tool for the mathgl user, to vectorize entire algorithms — not for vectorizing each individual Mat4 routine, etc.
@Jragonmiris can you post your SIMD code?
Let me see if I can dig it up.
This is a miscellaneous issue for adding SIMD. I've been doing a lot of work, and what's become clear is that adding SIMD requires a lot of profiling, so this may take a while to come to fruition.
Miscellaneous things I've found:
Using explicit assembly (SIMD or not) for anything on a Vec3 or smaller is NOT WORTH IT because of compiler inlining. This includes the cross product, though I haven't checked vector Len yet. Only if you disable all compiler optimizations (-N) is the assembly usually an improvement. I suppose theoretically, if you could convince the compiler to magically inline your SIMD it would work fine, but you can't, so...
(Yes, I know SIMD loads 4 values at a time, you can interleave them and use junk slots for Vec3 and Vec2. I figured it was worth experimenting with)
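A minimal sketch of why inlining wins here (the `Vec3` type and `Cross` method shape below are assumptions mirroring mathgl's `mgl32.Vec3`, not its actual source): the body is so small that the Go compiler inlines it at the call site, while a hand-written assembly routine can never be inlined, so the call overhead alone swamps the arithmetic.

```go
package main

import "fmt"

// Vec3 mirrors mathgl's mgl32.Vec3 layout: three float32 components.
type Vec3 [3]float32

// Cross is small enough for the compiler to inline at call sites.
// An assembly version would pay a full function-call round trip
// for nine multiplies and three subtractions.
func (v Vec3) Cross(u Vec3) Vec3 {
	return Vec3{
		v[1]*u[2] - v[2]*u[1],
		v[2]*u[0] - v[0]*u[2],
		v[0]*u[1] - v[1]*u[0],
	}
}

func main() {
	a := Vec3{1, 0, 0}
	b := Vec3{0, 1, 0}
	fmt.Println(a.Cross(b)) // x × y = z, i.e. [0 0 1]
}
```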
However, the improvements gained on a Vec4 are big enough to be worth it. Combined with pointers it's a massive improvement (from 10 ns/op to just over 1 ns/op for some simple operations like addition).
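For shape, here's what a pointer-based Vec4 add could look like (the `AddOf` name and signature are hypothetical, not mathgl's API). A Vec4 is exactly one 128-bit XMM register, so an assembly body would be a MOVUPS/ADDPS/MOVUPS triple; the pointer arguments avoid copying 16-byte values on every call, which is where the 10 ns/op → ~1 ns/op gap comes from.

```go
package main

import "fmt"

// Vec4 matches the four-float32 layout SIMD operates on:
// one 128-bit XMM register holds the whole vector.
type Vec4 [4]float32

// AddOf writes a + b into dst. This scalar body is what an
// assembly version would replace with a single packed ADDPS.
func AddOf(dst, a, b *Vec4) {
	dst[0] = a[0] + b[0]
	dst[1] = a[1] + b[1]
	dst[2] = a[2] + b[2]
	dst[3] = a[3] + b[3]
}

func main() {
	a := Vec4{1, 2, 3, 4}
	b := Vec4{10, 20, 30, 40}
	var out Vec4
	AddOf(&out, &a, &b)
	fmt.Println(out) // [11 22 33 44]
}
```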
Matrices are still a work in progress, but I'm fairly confident I can do some magic with 4x4 matrix inversion and possibly determinants. We'll see if it matters for 3x3. 4x4 matrix multiplication can probably be improved simply by adding a SIMD dot product and using the dot on Row/Col instead of writing out the operation like we're doing now.
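The Row/Col idea above can be sketched as follows (a self-contained approximation, not mathgl's code — only the column-major layout matches `mgl32.Mat4`): sixteen Row·Col dot products, where the scalar `dot4` is the single slot a SIMD routine (e.g. SSE4.1's DPPS) could replace.

```go
package main

import "fmt"

// Mat4 is column-major like mathgl's mgl32.Mat4:
// element (row r, col c) lives at index c*4 + r.
type Mat4 [16]float32

// dot4 is the replaceable kernel: a SIMD version (DPPS or
// MULPS + horizontal adds) would drop in here unchanged.
func dot4(a, b [4]float32) float32 {
	return a[0]*b[0] + a[1]*b[1] + a[2]*b[2] + a[3]*b[3]
}

func (m Mat4) Row(r int) [4]float32 {
	return [4]float32{m[r], m[4+r], m[8+r], m[12+r]}
}

func (m Mat4) Col(c int) [4]float32 {
	return [4]float32{m[c*4], m[c*4+1], m[c*4+2], m[c*4+3]}
}

// Mul4 computes m * n as sixteen Row·Col dot products instead
// of sixteen written-out sum-of-products expressions.
func (m Mat4) Mul4(n Mat4) Mat4 {
	var out Mat4
	for c := 0; c < 4; c++ {
		col := n.Col(c)
		for r := 0; r < 4; r++ {
			out[c*4+r] = dot4(m.Row(r), col)
		}
	}
	return out
}

func main() {
	ident := Mat4{1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1}
	m := Mat4{1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16}
	fmt.Println(m.Mul4(ident) == m) // multiplying by identity returns m
}
```

One caveat: extracting `Row(r)` re-gathers strided elements per dot product, so a real SIMD version would likely transpose once or keep rows in registers rather than rebuild them sixteen times.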