Replace perturbNormal implementation with a more robust version #21299

zeux · 2021-02-18T05:59:48Z

This change switches to a slightly different formulation using a
cotangent frame described by Christian Schüler in "Normal Mapping
Without Precomputed Tangents" follow-up blog post.

This implementation is nicer as it has fewer opportunities to produce a
NaN output given a degenerate input; it contains one division and one
normalize at the end, and only division needs to be guarded against.

As a result, when the UV mapping is degenerate within a given triangle,
the resulting determinant is 0 and scale is set to 0 as well.

I believe this also handles non-uniform mapping better than the method
we used to use, as it preserves the relative length of T & B. It's worth noting
that this method is also more closely aligned with bump perturbation
(see perturbNormalArb in bumppar_pars_fragment.glsl.js).

Using Mali Offline Shader Compiler on a simple shader that samples a
normal map and converts it to object space using this function, the
resulting code is also slightly faster than before - 20.5 cycles vs 22.3
cycles. The resulting performance on a complete three.js shader is
likely to be unaffected.

Replaces #18871 (which was not merged)
Fixes #19727

This change switches to a slightly different formulation using a cotangent frame described by Christian Schüler in "Normal Mapping Without Precomputed Tangents" follow-up blog post. This implementation is nicer as it has fewer opportunities to produce a NaN output given a degenerate input; it contains one division and one normalize at the end, and only division needs to be guarded against. As a result, when the UV mapping is degenerate within a given triangle, the resulting determinant is 0 and scale is set to 0 as well. Using Mali Offline Shader Compiler on a simple shader that samples a normal map and converts it to object space using this function, the resulting code is also slightly faster than before - 20.5 cycles vs 22.3 cycles. The resulting performance on a complete three.js shader is likely to be unaffected.

zeux · 2021-02-18T06:01:10Z

I've tested this on some artificial examples like the one in the linked issue as well as on three.js normal map example, but any other pointers for scenes / situations to test would be appreciated as this is a rather fundamental function in the shader code :)

donmccurdy · 2021-02-18T06:04:47Z

The NormalTangentTest glTF sample would be a good one to test; unlike NormalTangentMirrorTest it does not have precomputed tangents.

zeux · 2021-02-18T06:07:55Z

FWIW some validation for why this change can be important - when the input model has triangles with degenerate mapping -

Before this change:

After this change:

zeux · 2021-02-18T06:18:31Z

@donmccurdy Thanks! This is a truly fantastic test model. I've confirmed that the model looks correct after this change from both sides on all orientations:

zeux · 2021-02-18T06:36:58Z

Profiling results: http://shader-playground.timjones.io/d229083f2340912fbf707e6f23d8bf63

highp:
Mali: old 23.3 cycles, new 21.3 cycles
PowerVR: old 50 cycles, new 50 cycles

mediump:
Mali: old 21.3 cycles, new 18.5 cycles
PowerVR: old 45 cycles, new 42 cycles

Radeon Graphics Analyzer results on shader playground sadly don't show estimated cycle counts, but the new variant has ~10% fewer instructions so that looks like a general rule of thumb wrt new variant across multiple architectures.

Mugen87

Awesome! We should definitely give this a try!

mrdoob · 2021-02-18T12:27:00Z

Excellent stuff!

mrdoob · 2021-02-18T12:27:15Z

Thanks!

mrdoob · 2021-02-18T12:29:18Z

On a side note... (I didn't know about http://shader-playground.timjones.io/)

Seems like this:

float faceDirection = gl_FrontFacing ? 1.0 : - 1.0;

produces less instructions than this:

float faceDirection = float( gl_FrontFacing ) * 2.0 - 1.0;

Do you have any recommendations?

zeux · 2021-02-18T16:23:00Z

@mrdoob Nice catch - I didn't look at this part since it was generic to all shaders. It does seem that on Mali, conditional select is a bit faster. On PowerVR and AMD it looks like the compiler compiles both to more or less identical code. My usual rule of thumb is that conditional selects are preferable to attempts to emulate them using multiplications since all GPUs have dedicated instructions for this that don't require branching, and multiplications can result in more instructions / cycles in some cases - but in this case it mostly appears to be a wash. Still, since there does appear to be a tiny difference on one vendor it may be worth changing this to ?:.

WestLangley · 2021-02-18T16:31:08Z

@zeux Thanks for doing this!

I've tested this on some artificial examples like the one in the linked issue as well as on three.js normal map example, but any other pointers for scenes / situations to test would be appreciated as this is a rather fundamental function in the shader code :)

Did you try a non-uniform scale test case?

Also, models having mirrored UVs must be verified (they have backwards winding order.). I think 'DamagedHelmet.gltf' has mirrored UVs.

And back-sided faces -- as opposed to double-sided ones.

And, of course, the Adreno hardware should be tested, which has been a problem for us.

zeux · 2021-02-18T16:34:33Z

@WestLangley I've tried a case when UV mapping was degenerate along only one axis and that also worked fine. If by "back-sided faces" you mean models with disabled culling then yeah, the NormalTangentTest covers that. I'll double check wrt mirrored UVs

zeux · 2021-02-18T16:39:12Z

Mirrored UVs look correct as well - I've tested using NormalTangentMirrorTest.gltf with manually patched gltf file to remove TANGENT attribute.

mrdoob · 2021-02-18T16:41:25Z

@WestLangley

Also, models having mirrored UVs must be verified (they have backwards winding order.). I think 'DamagedHelmet.gltf' has mirrored UVs.

If this PR had broken DamagedHelmet.gltf the e2e tests would have caught it.

And, of course, the Adreno hardware should be tested, which has been a problem for us.

The problem with Adreno was gl_FrontFacing inside functions: #21205

zeux · 2021-02-18T16:42:48Z

And, of course, the Adreno hardware should be tested, which has been a problem for us.

I believe the Adreno workarounds here previously included dFdx(vec3) which I've kept as is to not hit this :) Other than that this is mostly just vector math, inversesqrt is also used in rect area light code so hopefully that works. Of course one can never be sure... I don't have a device to test this, but the bugs would be device / driver specific anyhow so we might need to just rely on the community for this.

mrdoob · 2021-02-18T16:43:04Z

@zeux

Still, since there does appear to be a tiny difference on one vendor it may be worth changing this to ?:.

Will do, thanks!

Also, I think Babylon.js does the same.

WestLangley · 2021-02-18T16:43:28Z

@zeux @mrdoob OK, thanks.

I would be most interested, however, in the results of a non-uniform scale test case: before vs after.

zeux · 2021-02-18T16:47:44Z

@mrdoob Indeed - Babylon.js uses the exact same function which I didn't know before. This is great since it also equalizes behavior between the two engines :) The only difference there is that they handle back-facing triangles by negating the UV coordinate instead of negating T & B which is mathematically (and numerically) equivalent.

mrdoob · 2021-02-18T16:57:15Z

This is great since it also equalizes behavior between the two engines :)

Yep!

The only difference there is that they handle back-facing triangles by negating the UV coordinate instead of negating T & B which is mathematically (and numerically) equivalent.

What's faster? 🤓

zeux · 2021-02-18T17:01:29Z

@mrdoob Our approach looks like it's 1 cycle faster on Mali :) Same on PowerVR. Also I guess less prone to bugs with gl_FrontFacing...

WestLangley · 2021-02-19T16:31:01Z

@zeux @mrdoob Under non-uniform scale and rendering with MeshNormalMaterial, it does appear that this PR computes different normals than before.

This is what I expected, but I am not sure if it was wrong before and now less wrong, or correct before and now wrong -- or what.

WestLangley · 2021-02-19T16:37:13Z

@zeux @mrdoob It looks like our shader math is wrong. The normal map was designed to be applied to the normal -- not to the normal after the normal matrix has been applied.

We apply the normal matrix first, in the vertex shader. In theory, we should apply the normal map first.

Under uniform scale -- the base case -- it does not matter.

It might make sense to see how other engines handle this. Maybe we just live with it.

zeux · 2021-02-20T03:33:23Z

We apply the normal matrix first, in the vertex shader. In theory, we should apply the normal map first.

Yeah that's an interesting problem. In particular, if you have a non-uniform scale along axes that are orthogonal to the vertex normal, the vertex normal stays the same! So any approach that just computes TBN from the interpolated normal in the fragment shader is doomed.

I'm not sure if there's a great way to deal with this. On some level I want to say that if an application requires perfect tangent space normal mapping it really needs to store tangents in the vertex data - indeed, canonically when one bakes tangent space normal maps from a high/low-res geometry, the bake results depend on the specific tangent space calculation (for which there's no single standard, although MikkTS is becoming a de-facto one which glTF also recommends).

We could of course deal with this by correcting the TBN matrix using the normal matrix in the fragment shader, but that's rather expensive and has additional issues with instanced geometry (where it's even more expensive since we compute the correctly scaled normal in the shader atm).

WestLangley · 2021-02-20T19:57:10Z

@zeux We are in agreement.

@WestLangley wrote:

I would be most interested, however, in the results of a non-uniform scale test case: before vs after.

It makes no sense to invest time doing that, since (as I noted above) the math in previous steps is incorrect.

But at least the issue is documented here.

Fix code style in the shader

ab7458e

Mugen87 approved these changes Feb 18, 2021

View reviewed changes

mrdoob added this to the r126 milestone Feb 18, 2021

mrdoob merged commit dee3528 into mrdoob:dev Feb 18, 2021

mrdoob mentioned this pull request Feb 18, 2021

ShaderChunk: Replaced gl_FrontFacing multiplication with conditional. #21307

Merged

chubei-oppen mentioned this pull request Apr 29, 2022

Add a r115 compatible version perturbNormal2Arb oppenfuture/three.js#118

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Replace perturbNormal implementation with a more robust version #21299

Replace perturbNormal implementation with a more robust version #21299

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021

donmccurdy commented Feb 18, 2021

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021 •

edited

Loading

zeux commented Feb 18, 2021

Mugen87 left a comment •

edited

Loading

mrdoob commented Feb 18, 2021

mrdoob commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

WestLangley commented Feb 18, 2021

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

WestLangley commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

WestLangley commented Feb 19, 2021

WestLangley commented Feb 19, 2021

zeux commented Feb 20, 2021

WestLangley commented Feb 20, 2021

Replace perturbNormal implementation with a more robust version #21299

Replace perturbNormal implementation with a more robust version #21299

Conversation

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021

donmccurdy commented Feb 18, 2021

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021 • edited Loading

zeux commented Feb 18, 2021

Mugen87 left a comment • edited Loading

Choose a reason for hiding this comment

mrdoob commented Feb 18, 2021

mrdoob commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

WestLangley commented Feb 18, 2021

zeux commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

WestLangley commented Feb 18, 2021

zeux commented Feb 18, 2021

mrdoob commented Feb 18, 2021

zeux commented Feb 18, 2021

WestLangley commented Feb 19, 2021

WestLangley commented Feb 19, 2021

zeux commented Feb 20, 2021

WestLangley commented Feb 20, 2021

zeux commented Feb 18, 2021 •

edited

Loading

Mugen87 left a comment •

edited

Loading