Content.Sink.write(sink, last, utf8Content, callback) could become faster #12469

gnikolaidis · 2024-11-03T09:53:14Z

Jetty version(s)
Jetty 12.0.x

Enhancement Description
Looking at the source code, I see that the method body of Content.Sink.write(sink, last, utf8Content, callback) is using StandardCharsets.UTF_8.encode(utf8Content) to create the UTF-8 ByteBuffer.

I believe that ByteBuffer.wrap(utf8Content.getBytes(StandardCharsets.UTF_8)) would be considerably faster: recent Java versions store strings compactly (as Latin-1 bytes when possible), so for an ASCII-only string the encoding is little more than a simple array copy.
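
For illustration, the two approaches under discussion can be put side by side. This is a minimal sketch, not the actual Jetty code: the class and method names (Utf8EncodeComparison, viaEncode, viaWrapGetBytes) are hypothetical, and only the two JDK calls come from the issue text. For well-formed strings both should yield the same bytes.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, for illustration only (not part of Jetty).
public class Utf8EncodeComparison {
    // Approach the issue describes as current: goes through Charset.encode(String).
    static ByteBuffer viaEncode(String s) {
        return StandardCharsets.UTF_8.encode(s);
    }

    // Proposed approach: encode to a byte[] and wrap that array without copying it again.
    static ByteBuffer viaWrapGetBytes(String s) {
        return ByteBuffer.wrap(s.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) {
        // ASCII, French and Japanese samples, written with escapes to avoid source-encoding issues.
        String[] samples = {"hello", "d\u00e9j\u00e0 vu", "\u3053\u3093\u306b\u3061\u306f"};
        for (String s : samples) {
            // ByteBuffer.equals compares only the remaining bytes, so differing capacities do not matter.
            boolean same = viaEncode(s).equals(viaWrapGetBytes(s));
            System.out.println(s.length() + " chars -> same bytes: " + same);
        }
    }
}
```

Note that the buffer returned by Charset.encode may have a capacity larger than its limit, which is why the comparison relies on the remaining bytes rather than on the backing arrays.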

lorban · 2024-11-04T15:58:21Z

You are right, a quick JMH benchmark clearly shows that wrap/getBytes is consistently faster than encoding:

Fully ASCII string
 JDK 22
  Utf8Benchmark.testEncode        thrpt   10   8642774.694 ± 194220.398  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  52008364.510 ± 271773.779  ops/s
 JDK 17
  Utf8Benchmark.testEncode        thrpt   10   7355221.908 ± 396420.981  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  52114741.560 ± 124515.129  ops/s

French string
 JDK 22
  Utf8Benchmark.testEncode        thrpt   10   4858458.628 ± 269877.435  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  17264867.586 ± 227175.842  ops/s
 JDK 17
  Utf8Benchmark.testEncode        thrpt   10   4346969.813 ± 213333.374  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  16451271.214 ± 125170.260  ops/s

Japanese string
 JDK 22
  Utf8Benchmark.testEncode        thrpt   10  2819575.012 ± 142874.446  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  7802310.700 ±  74366.173  ops/s
 JDK 17
  Utf8Benchmark.testEncode        thrpt   10  2670866.437 ± 155794.533  ops/s
  Utf8Benchmark.testWrapGetBytes  thrpt   10  8240865.962 ±  70066.445  ops/s

so I'm going to create a pull request to submit this enhancement.

Thanks for the suggestion!

joakime · 2024-11-04T15:59:43Z

@lorban what's the GC difference like?

lorban · 2024-11-04T16:14:03Z

@joakime Interesting question. JMH's GCProfiler reports that the encode benchmark generates about 2 to 10 GB/s of garbage while wrap/getBytes generates about 25 GB/s... which is probably about the max memory bandwidth of my machine.

Still, GC'ing seems to be faster than encoding.

sbordet · 2024-11-04T16:22:05Z

How is this not a JDK bug?

gnikolaidis · 2024-11-04T16:51:36Z

My guess as to why it is not a JDK bug is that Charset.encode(String) is very explicit that it must produce the same result as Charset.encode(CharBuffer), which in turn - quoting the Javadoc - always replaces malformed-input and unmappable-character sequences with this charset's default replacement string - which may or may not be the case with JDK's UTF-8 encoding.

gnikolaidis · 2024-11-04T16:53:05Z

> @joakime Interesting question. JMH's GCProfiler reports that the encode benchmark generates about 2 to 10 GB/s of garbage while wrap/getBytes generates about 25 GB/s... which is probably about the max memory bandwidth of my machine.
>
> Still, GC'ing seems to be faster than encoding.

These results are probably just an artifact of the higher overall throughput.

joakime · 2024-11-04T17:12:27Z

> My guess as to why it is not a JDK bug is that Charset.encode(String) is very explicit that it must produce the same result as Charset.encode(CharBuffer), which in turn - quoting the Javadoc - always replaces malformed-input and unmappable-character sequences with this charset's default replacement string - which may or may not be the case with JDK's UTF-8 encoding.

Error handling / Bad input handling is an important aspect of this.

@lorban what happens if the new technique encounters bad/malformed input?
Will it throw an exception, or will it replace characters with the UTF-8 replacement character?

lorban · 2024-11-05T09:58:02Z

@gnikolaidis I've modified my benchmark to make it more apparent that when the given string is purely ASCII, the wrap/getBytes throughput bottlenecks on memory bandwidth, so it's purely a memcopy/GC benchmark in that case. In all other cases, some encoding logic has to run and it's more complex to compare.

@joakime since the source is a String object, wouldn't the invalid bytes be returned as-is in the returned byte array?
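
The malformed-input question can be checked directly with a small experiment. The sketch below is illustrative only: the class and method names are hypothetical, and the input is a String holding a lone high surrogate, which is one way a String can carry malformed UTF-16. The Javadoc of String.getBytes(Charset) states that it always replaces malformed-input and unmappable-character sequences with the charset's default replacement byte array, so neither approach is expected to throw or to pass the bad char through unchanged.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical helper class, for illustration only (not part of Jetty).
public class MalformedInputCheck {
    // Encode via Charset.encode(String) and copy the remaining bytes out of the buffer.
    static byte[] viaEncode(String s) {
        ByteBuffer bb = StandardCharsets.UTF_8.encode(s);
        byte[] out = new byte[bb.remaining()];
        bb.get(out);
        return out;
    }

    // Encode via String.getBytes(Charset).
    static byte[] viaGetBytes(String s) {
        return s.getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        // 'a', a lone high surrogate (malformed on its own), then 'b'.
        String malformed = "a\uD800b";
        System.out.println("encode:   " + Arrays.toString(viaEncode(malformed)));
        System.out.println("getBytes: " + Arrays.toString(viaGetBytes(malformed)));
    }
}
```

Both calls should substitute the single byte 63 ('?') for the lone surrogate, giving [97, 63, 98] in each case. That is the encoder's default replacement byte rather than the encoded form of U+FFFD.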

gnikolaidis added the Enhancement label Nov 3, 2024

joakime added this to Jetty 12.0.16 FROZEN Nov 4, 2024

lorban self-assigned this Nov 4, 2024

lorban moved this to 🏗 In progress in Jetty 12.0.16 FROZEN Nov 4, 2024

lorban added a commit that referenced this issue Nov 4, 2024

#12469 - use faster UTF8 encoding

7948afe

Signed-off-by: Ludovic Orban <[email protected]>

lorban mentioned this issue Nov 4, 2024

Use faster UTF8 encoding in Content.write() #12475

Merged

lorban added a commit that referenced this issue Nov 5, 2024

#12469 - make benchmark single-threaded and its report more readable

bfd384a

Signed-off-by: Ludovic Orban <[email protected]>

lorban added a commit that referenced this issue Nov 5, 2024

#12469 - use faster UTF8 encoding

316065a

Signed-off-by: Ludovic Orban <[email protected]>

lorban closed this as completed in #12475 Nov 6, 2024

lorban added a commit that referenced this issue Nov 6, 2024

#12469 - use faster UTF8 encoding

f325da4

Signed-off-by: Ludovic Orban <[email protected]>

lorban added a commit that referenced this issue Nov 6, 2024

#12469 - make benchmark single-threaded and its report more readable

47a60b6

Signed-off-by: Ludovic Orban <[email protected]>

lorban added a commit that referenced this issue Nov 6, 2024

#12469 - use faster UTF8 encoding

66b494d

Signed-off-by: Ludovic Orban <[email protected]>

github-project-automation bot moved this from 🏗 In progress to ✅ Done in Jetty 12.0.16 FROZEN Nov 6, 2024