Update casts for NEON #27

QuLogic · 2018-01-16T11:04:15Z

The problem is that the comparisons return uint32x4_t and MASK is int32x4_t; converting comparison results to float just causes an error trying to convert back to int. The second commit adds some other casts to make unsigned/signed explicit (and now works without -flax-vector-conversions).

This fixes the other problem in #23. Still not able to build on armv7 because it needs android_getCpuFeatures and cpu-features.c doesn't yet build.

QuLogic · 2018-01-16T11:04:59Z

FastNoiseSIMD/FastNoiseSIMD_internal.cpp

@@ -190,7 +191,7 @@ static SIMDf VECTORCALL FUNC(FLOOR)(SIMDf a)
 {
 	SIMDf fval = SIMDf_CONVERT_TO_FLOAT(SIMDi_CONVERT_TO_INT(a));

-	return vsubq_f32(fval, SIMDf_AND(SIMDf_LESS_THAN(a, fval), SIMDf_NUM(1)));
+	return vsubq_f32(fval, SIMDf_CAST_TO_FLOAT(vandq_s32(SIMDf_LESS_THAN(a, fval), SIMDi_NUM(1))));


I'm only unsure whether this is right; the inputs should be int, but I'm not sure if SIMDi_NUM does the right thing.

This won't work, it is meant to conditionally subtract 1 if a < fval. You are subtracting an int 1 instead of a float 1 these are bit casted not converted

OK, that's what I thought; I think it should be correct now.

You shouldn't need to change the floor function. See my latest commit

You haven't pushed any new commits.

My bad, looks like our changes are the same anyway, apart from the floor function

The change to floor is still necessary; on master:

In file included from FastNoiseSIMD/FastNoiseSIMD_neon.cpp:37:0: FastNoiseSIMD/FastNoiseSIMD_internal.cpp: In function ‘SIMDf L5_FUNC_FLOOR(SIMDf)’: FastNoiseSIMD/FastNoiseSIMD_internal.cpp:184:77: error: cannot convert ‘int32x4_t {aka __vector(4) int}’ to ‘float32x4_t {aka __vector(4) float}’ for argument ‘1’ to ‘int32x4_t vreinterpretq_s32_f32(float32x4_t)’ #define SIMDf_AND(a,b) SIMDf_CAST_TO_FLOAT(vandq_s32(vreinterpretq_s32_f32(a),vreinterpretq_s32_f32(b))) ^ FastNoiseSIMD/FastNoiseSIMD_internal.cpp:154:54: note: in definition of macro ‘SIMDf_CAST_TO_FLOAT’ #define SIMDf_CAST_TO_FLOAT(a) vreinterpretq_f32_s32(a) ^ FastNoiseSIMD/FastNoiseSIMD_internal.cpp:193:25: note: in expansion of macro ‘SIMDf_AND’ return vsubq_f32(fval, SIMDf_AND(SIMDf_LESS_THAN(a, fval), SIMDf_NUM(1))); ^~~~~~~~~

and it works on this branch.

QuLogic commented Jan 16, 2018

View reviewed changes

QuLogic force-pushed the arm-casts branch 2 times, most recently from 86bae84 to 691eaf8 Compare January 16, 2018 21:27

Update casts in NEON floor code.

4ad0855

QuLogic force-pushed the arm-casts branch from 691eaf8 to 4ad0855 Compare January 16, 2018 23:38

Auburn merged commit cd10773 into Auburn:master Jan 16, 2018

QuLogic deleted the arm-casts branch January 16, 2018 23:51

QuLogic mentioned this pull request Jul 21, 2018

Add platform-specific flags for NEON. robbmcleod/pyfastnoisesimd#15

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update casts for NEON #27

Update casts for NEON #27

QuLogic commented Jan 16, 2018

QuLogic Jan 16, 2018

Auburn Jan 16, 2018

QuLogic Jan 16, 2018

Auburn Jan 16, 2018

QuLogic Jan 16, 2018

Auburn Jan 16, 2018

QuLogic Jan 16, 2018 •

edited

Loading

Update casts for NEON #27

Update casts for NEON #27

Conversation

QuLogic commented Jan 16, 2018

QuLogic Jan 16, 2018

Choose a reason for hiding this comment

Auburn Jan 16, 2018

Choose a reason for hiding this comment

QuLogic Jan 16, 2018

Choose a reason for hiding this comment

Auburn Jan 16, 2018

Choose a reason for hiding this comment

QuLogic Jan 16, 2018

Choose a reason for hiding this comment

Auburn Jan 16, 2018

Choose a reason for hiding this comment

QuLogic Jan 16, 2018 • edited Loading

Choose a reason for hiding this comment

QuLogic Jan 16, 2018 •

edited

Loading