Add SUBRK and DIVRK bytecode instructions to bytecode v5 #1115

zeux · 2023-11-27T18:06:58Z

Right now, we can compile R*K for all arithmetic instructions, but K*R gets compiled into two instructions (LOADN/LOADK + arithmetic opcode).

This is problematic since it leads to reduced performance for some code. However, we'd like to avoid adding reverse variants of ADDK et al for all opcodes to avoid the increase in I$ footprint for interpreter.

Looking at the arithmetic instructions, % and // don't have interesting use cases for K*V; ^ is sometimes used with constant on the left hand side but this would need to call pow() by necessity in all cases so it would be slow regardless of the dispatch overhead. This leaves the four basic arithmetic operations.

For + and *, we can implement a compiler-side optimization in the future that transforms K*R to R*K automatically. This could either be done unconditionally at -O2, or conditionally based on the type of the value (driven by type annotations / inference) -- this technically changes behavior in presence of metamethods, although it might be sensible to just always do this because non-commutative +/* are evil.

However, for - and / it is impossible for the compiler to optimize this in the future, so we need dedicated opcodes. This only increases the interpreter size by ~300 bytes (~1.5%) on X64.

This makes spectral-norm and math-partial-sums 6% faster; maybe more importantly, voxelgen gets 1.5% faster (so this change does have real-world impact).

To avoid the proliferation of bytecode versions this change piggybacks on the bytecode version bump that was just made in 604 for vector constants; we would still be able to enable these independently but we'll consider v5 complete when both are enabled.

Related: #626

Right now, we can compile R*K for all arithmetic instructions, but K*R gets compiled into two instructions (LOADN/LOADK + arithmetic opcode). This is problematic since it leads to reduced performance for some code. However, we'd like to avoid adding reverse variants of ADDK et al for all opcodes to avoid the increase in I$ footprint for interpreter. Looking at the arithmetic instructions, % and // don't have interesting use cases for K*V; ^ is sometimes used with constant on the left hand side but this would need to call pow() by necessity in all cases so it would be slow regardless of the dispatch overhead. This leaves the four basic arithmetic operations. For + and *, we can implement a compiler-side optimization in the future that transforms K*R to R*K automatically. This could either be done unconditionally at -O2, or conditionally based on the type of the value (driven by type annotations / inference) -- this technically changes behavior in presence of metamethods, although it might be sensible to just always do this because non-commutative +/* are evil. However, for - and / it is impossible for the compiler to optimize this in the future, so we need dedicated opcodes. This only increases the interpreter size by ~300 bytes (~1.5%) on X64. This makes spectral-norm and math-partial-sums 6% faster. To avoid the proliferation of bytecode versions this change piggybacks on the bytecode version bump that was just made in 604 for vector constants; we would still be able to enable these independently but we'll consider v5 complete when both are enabled.

…onsistency

CodeGen/src/IrBuilder.cpp

CodeGen/src/IrLoweringX64.cpp

VM/src/lvmexecute.cpp

Co-authored-by: vegorov-rbx <[email protected]>

zeux added 3 commits November 27, 2023 09:43

Fix tests and add more tests.

776f2f7

Add DIVRK test for vectors and also change variable names in VM for c…

dcae3d4

…onsistency

zeux requested a review from vegorov-rbx November 27, 2023 19:00

... oh, this is already tested above actually.

eaa1a36

vegorov-rbx reviewed Nov 28, 2023

View reviewed changes

CodeGen/src/IrBuilder.cpp Show resolved Hide resolved

vegorov-rbx reviewed Nov 28, 2023

View reviewed changes

CodeGen/src/IrLoweringX64.cpp Outdated Show resolved Hide resolved

Add variables per review

a66adcf

vegorov-rbx approved these changes Nov 28, 2023

View reviewed changes

vegorov-rbx reviewed Nov 28, 2023

View reviewed changes

VM/src/lvmexecute.cpp Outdated Show resolved Hide resolved

vegorov-rbx reviewed Nov 28, 2023

View reviewed changes

VM/src/lvmexecute.cpp Outdated Show resolved Hide resolved

Apply suggestions from code review

6874b37

Co-authored-by: vegorov-rbx <[email protected]>

vegorov-rbx merged commit 89b437b into luau-lang:master Nov 28, 2023
7 checks passed

zeux deleted the subdivrk branch November 28, 2023 15:44

BrewTestBot mentioned this pull request Dec 6, 2023

luau 0.605 Homebrew/homebrew-core#156620

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SUBRK and DIVRK bytecode instructions to bytecode v5 #1115

Add SUBRK and DIVRK bytecode instructions to bytecode v5 #1115

zeux commented Nov 27, 2023 •

edited

Loading

Add SUBRK and DIVRK bytecode instructions to bytecode v5 #1115

Add SUBRK and DIVRK bytecode instructions to bytecode v5 #1115

Conversation

zeux commented Nov 27, 2023 • edited Loading

zeux commented Nov 27, 2023 •

edited

Loading