Various cache read and write performance optimizations. #5948 (Merged)
hwillson approved these changes on Feb 15, 2020:
Really incredible work @benjamn!
When I wrote this code, I thought this array would usually be so short that indexOf would be faster than Set.prototype.has, but of course the pathological cases are what end up mattering, and I've recently seen some result objects that cause thousands of shallow copies to be made over a series of many merges using one DeepMerger instance.
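The cost difference being described can be sketched as follows. This is not the actual `DeepMerger` implementation, just a minimal illustration (with hypothetical helper names) of why `Set.prototype.has` wins over `Array.prototype.indexOf` once the collection of copied objects grows large:

```typescript
// Hypothetical sketch: tracking which objects a merger has already
// shallow-copied. Array#indexOf does a linear scan on every call,
// while Set#has is a constant-time hash lookup.
function hasCopiedArray(copies: object[], obj: object): boolean {
  return copies.indexOf(obj) >= 0; // O(n) per lookup
}

function hasCopiedSet(copies: Set<object>, obj: object): boolean {
  return copies.has(obj); // O(1) per lookup
}

const objs = Array.from({ length: 5 }, () => ({}));
const arr: object[] = [...objs];
const set = new Set(objs);

// Both report the same membership; only the cost differs once the
// number of copied objects reaches into the thousands.
console.assert(hasCopiedArray(arr, objs[3]) === true);
console.assert(hasCopiedSet(set, objs[3]) === true);
```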
Since shouldInclude gets called for every single field in any read or write operation, it's important that it takes any available shortcuts to handle the common case (no directives) as cheaply as possible.
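The fast path might look something like this sketch (a simplified stand-in, not Apollo's actual `shouldInclude`): bail out immediately when the field carries no directives, so the common case costs a single property check.

```typescript
// Hypothetical sketch of the shortcut: fields without directives are
// the overwhelmingly common case, so handle them before paying for
// any @skip / @include evaluation (simplified semantics below).
interface Directive { name: string; ifValue: boolean }
interface Field { name: string; directives?: Directive[] }

function shouldIncludeSketch(field: Field): boolean {
  if (!field.directives || field.directives.length === 0) {
    return true; // fast path: nothing to evaluate
  }
  // Slow path: honor @skip / @include (simplified here).
  return field.directives.every(d =>
    d.name === "include" ? d.ifValue : d.name === "skip" ? !d.ifValue : true,
  );
}

console.assert(shouldIncludeSketch({ name: "id" }) === true);
console.assert(
  shouldIncludeSketch({
    name: "email",
    directives: [{ name: "skip", ifValue: true }],
  }) === false,
);
```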
Since any of the provided variables could be consumed by any of the fields in a selection set that we're reading, all variables are potentially relevant as part of the result object cache key, so we don't make any attempt to stringify just a subset of the variables. However, since we use the same stringified variables in every cache key, there's no need to perform that encoding repeatedly. JSON.stringify may be fast, but the variables object can be arbitrarily large.
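The idea can be sketched like this (hypothetical function and key format, not the library's actual code): encode the variables once per operation and reuse the string in every per-field cache key.

```typescript
// Hypothetical sketch: stringify the variables object exactly once,
// then share that string across every cache key for the operation,
// instead of re-running JSON.stringify for each field.
function makeFieldKeys(
  fieldNames: string[],
  variables: Record<string, unknown>,
): string[] {
  const varString = JSON.stringify(variables); // encoded once
  return fieldNames.map(name => name + ":" + varString);
}

const keys = makeFieldKeys(["user", "posts"], { limit: 10 });
console.assert(keys[0] === 'user:{"limit":10}');
console.assert(keys[1] === 'posts:{"limit":10}');
```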
Believe it or not, iterating over the values of policies.rootTypenamesById was noticeably expensive according to Chrome devtools profiling. Since this information almost never changes, we might as well maintain it in the format that's most convenient.
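The general technique — maintain rarely-changing derived data eagerly instead of recomputing it on every read — can be sketched as below. The names are hypothetical; the real `Policies` class keeps more state than this.

```typescript
// Hypothetical sketch: keep the inverted lookup up to date at write
// time (rare), so reads (frequent) never have to iterate over the
// values of rootTypenamesById.
const rootTypenamesById: Record<string, string> = Object.create(null);
const rootIdsByTypename: Record<string, string> = Object.create(null);

function setRootTypename(id: string, typename: string): void {
  rootTypenamesById[id] = typename;
  rootIdsByTypename[typename] = id; // maintained, never re-derived
}

setRootTypename("ROOT_QUERY", "Query");
console.assert(rootIdsByTypename["Query"] === "ROOT_QUERY");
```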
Creating a throwaway array just to call JSON.stringify was much more expensive than string concatenation. The exact format of these cache keys is an invisible implementation detail, so I picked something that seemed unlikely ever to be ambiguous, though we can easily change it later.
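A sketch of the trade-off (the separator and key shape here are illustrative, not the format the PR actually chose):

```typescript
// Hypothetical sketch: both functions produce stable, unambiguous
// keys, but the first allocates a throwaway array on every call just
// so JSON.stringify can consume it.
function keyViaArray(typename: string, id: string): string {
  return JSON.stringify([typename, id]); // intermediate allocation
}

function keyViaConcat(typename: string, id: string): string {
  return typename + ":" + id; // plain concatenation, no allocation
}

console.assert(keyViaArray("User", "42") === '["User","42"]');
console.assert(keyViaConcat("User", "42") === "User:42");
```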
Since policies.applyMerges doesn't change anything unless there are custom merge functions to process, we can skip calling it if no merge functions were found while processing the current entity.
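The shortcut can be sketched as follows (hypothetical names and shapes; the real `applyMerges` is considerably more involved):

```typescript
// Hypothetical sketch: only pay for merge processing when custom
// merge functions were actually found while walking the entity.
type MergeFn = (existing: unknown, incoming: unknown) => unknown;

function finishEntity(
  incoming: Record<string, unknown>,
  mergeFns: Map<string, MergeFn>,
): Record<string, unknown> {
  if (mergeFns.size === 0) {
    return incoming; // fast path: applyMerges would change nothing
  }
  const result = { ...incoming };
  mergeFns.forEach((merge, field) => {
    result[field] = merge(undefined, incoming[field]);
  });
  return result;
}

const plain = { id: 1 };
console.assert(finishEntity(plain, new Map()) === plain); // untouched

const merged = finishEntity(
  { count: 2 },
  new Map<string, MergeFn>([["count", (_e, i) => (i as number) + 1]]),
);
console.assert(merged.count === 3);
```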
Instead of recursively calling processSelectionSet to handle fragments, we can simply treat their fields as fields of the current selection set.
This change means fragment results will no longer be cached separately from normal selection set results, which is potentially a loss of caching granularity, but there's also a reduction in caching overhead because we're caching fewer result objects, and we don't have to merge them all together, and (most importantly) the result caching system still tracks dependencies the same way as before. It's as if we transformed the query by inlining fragment selections, except without doing any work!
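The flattening idea can be sketched like this (a toy model, not `processSelectionSet` itself): rather than recursing into each fragment and merging the separate results, append the fragment's fields to the parent's work list.

```typescript
// Hypothetical sketch of fragment inlining: fragment fields are
// treated as fields of the enclosing selection set, so no separate
// per-fragment result objects are created or merged.
interface Field { kind: "Field"; name: string }
interface FragmentSpread { kind: "Fragment"; selections: Selection[] }
type Selection = Field | FragmentSpread;

function flattenSelections(selections: Selection[]): Field[] {
  const fields: Field[] = [];
  const queue = [...selections];
  while (queue.length) {
    const sel = queue.shift()!;
    if (sel.kind === "Field") fields.push(sel);
    else queue.push(...sel.selections); // inline the fragment's fields
  }
  return fields;
}

const flat = flattenSelections([
  { kind: "Field", name: "id" },
  { kind: "Fragment", selections: [{ kind: "Field", name: "name" }] },
]);
console.assert(flat.map(f => f.name).join(",") === "id,name");
```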
Although this may seem like a reversion to forEach instead of a for loop, the for loop had an unexpectedly negative impact on minification, and a Set deduplicates selection objects, so we never re-process the same field through different fragments.
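The deduplication property can be sketched as follows (hypothetical helper; the point is that `Set.prototype.add` silently ignores an object it already contains):

```typescript
// Hypothetical sketch: collect fields into a Set so that a selection
// object reached through several fragments is processed exactly once.
interface Selection { name: string }

function collectFields(selectionGroups: Selection[][]): Set<Selection> {
  const fields = new Set<Selection>();
  selectionGroups.forEach(group =>
    group.forEach(sel => fields.add(sel)), // duplicates are ignored
  );
  return fields;
}

const idField = { name: "id" };
// The same selection object appears in two fragments:
const fields = collectFields([[idField, { name: "name" }], [idField]]);
console.assert(fields.size === 2); // idField counted only once
```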
benjamn force-pushed the read-and-write-performance-optimizations branch from 54d6ed2 to dc61865 on February 16, 2020 at 17:22.
These changes have been released.
Although we've been building Apollo Client 3.0 with performance in mind, making sure (for example) that you only pay for the features you use, we haven't focused on optimizing raw performance per se, until now.
Using an especially large query and response object provided by a customer/contributor, I was able to reduce initial (cold) execution times for the following write/read round-trip from ~150ms (measured using the latest published beta, @apollo/[email protected]) to ~95ms, a 37% improvement, and average (warm) execution times from ~105ms to ~25ms, a 76% improvement.

For this benchmark, I created a new InMemoryCache object for each run, so the results are not skewed by the benefits of result caching (see #3394), though result caching can have huge benefits for actual applications.

As always with performance, exact numbers will vary from query to query and machine to machine, but I used a 2014 dual-core 3GHz MacBook Pro, a 36KB query with lots of fragments, and a 500KB JSON-encoded result.
It's worth reviewing each of these commits separately, as they do not really follow a common theme (except for speeeed).