Rework traversal strategy in Policies#applyMerges. #5880

benjamn · 2020-01-29T23:41:00Z

Previously, I thought it was important to process any child object fields with custom merge functions before processing their parents, which required a post-order traversal of the result tree (in other words, calling policies.applyMerges recursively on incoming.__value before invoking the custom merge function). I have come to realize this was a mistake, for a few different and somewhat subtle reasons.

First, a merge function should be able to return merged data in any format (specified by the TExisting type), but it should also be able to assume the incoming data has not been modified by merge functions. Reasoning about data types is much simpler if the incoming data is always just the type of data returned by the GraphQL server/schema (StoreValue, most generically), and the existing type is always some other type chosen by the developer (TExisting), possibly but not necessarily the same as the schema type (for example, an indexed lookup table, or an ordered history of results). Calling policies.applyMerges recursively before invoking the merge function violated this assumption, because nested incoming data could be turned into TExisting data before the merge function saw it.

Note: this is not just a case of trying to make the type system happy, because TypeScript's flexible type inference isn't powerful enough to provide any reliable warnings or errors here. A stronger static type system would make enforcement easier, but even in a completely dynamic language this kind of type-level reasoning would still be important, because it has real semantic implications.

Second, there's no universal way for the cache to merge arrays automatically (should it replace, concatenate, and/or deduplicate? what if the other value is not an array?), so blindly calling policies.applyMerges on array values before the merge function had a chance to process them was a risky plunge. This commit severs the false linkage between items in different arrays that happen to have the same index.

Third, I have been hoping to provide an options.merge helper function to facilitate forced merges of unidentified data, like { ...existing, ...incoming } but with appropriate regard for custom merge functions. Unfortunately, the risk of repeatedly merging already merged data would have required that all merge functions be idempotent (a useful quality for most merge functions to have, but not always possible). I feared that implementing options.merge was going to require a new field function in addition to read and merge (like merge but taking two TExisting values and merging them idempotently), but thankfully the changes in this commit allow us to implement that functionality with just read and merge.

Previously, I thought it was important to process any child object fields with custom merge functions before processing their parents, which required a post-order traversal of the result tree (in other words, calling policies.applyMerges recursively on incoming.__value *before* invoking the custom merge function). I have come to realize this was a mistake, for a few different and somewhat subtle reasons. First, a merge function should be able to return merged data in any format (specified by the TExisting type), but it should also be able to assume the incoming data has not been modified by merge functions. Reasoning about data types is much simpler if the incoming data is always just the type of data returned by the GraphQL server/schema (StoreValue, most generally), and the existing type is always some other type chosen by the developer (TExisting), possibly but not necessarily the same as the schema type (for example, an indexed lookup table, or an ordered history of results). Calling policies.applyMerges recursively *before* invoking the merge function violated this assumption, because nested incoming data could be turned into TExisting data before the merge function saw it. Note: this is not just a case of trying to make the type system happy, because TypeScript's flexible type inference isn't powerful enough to provide any reliable warnings or errors. A stronger static type system would make enforcement easier, but even in a completely dynamic language this kind of type-level reasoning would still be important, because it has actual semantic implications. Second, there's no universal way for the cache to merge arrays automatically (should it replace, concatenate, and/or deduplicate? what if the other value is not an array?), so blindly calling policies.applyMerges on array values before the merge function had a chance to process them was a risky plunge. This commit severs the false linkage between items in different arrays that happen to have the same index. Third, I have been hoping to provide an options.merge helper function to facilitate forced merges of unidentified data, like { ...existing, ...incoming } but with appropriate regard for custom merge functions. Unfortunately, the risk of repeatedly merging already merged data would have required that all merge functions be idempotent (a useful quality for most merge functions to have, but not always possible). I feared that implementing options.merge was going to require a new field function in addition to read and merge (like merge but taking two TExisting values and merging them idempotently), but thankfully the changes in this commit allow us to implement that functionality with just read and merge.

hwillson

Looks great @benjamn! 🧙‍♂️

Instead of recursively searching for FieldValueToBeMerged wrapper objects anywhere in the incoming data, processSelectionSet and processFieldValue can build a sparse tree specifying just the paths of fields that need to be merged, and then applyMerges can use that tree to traverse only the parts of the data where merge functions need to be called. These changes effectively revert #5880, since the idea of giving merge functions a chance to transform their child data before calling nested merge functions no longer makes as much sense. Instead, applyMerges will be recursively called on the child data before parent merge functions run, the way it used to be (before #5880).

benjamn added 👩‍🏭 refactor 🧞‍♂️ enhancement 🧩 implementation-detail labels Jan 29, 2020

benjamn added this to the Release 3.0 milestone Jan 29, 2020

benjamn requested a review from hwillson January 29, 2020 23:41

benjamn self-assigned this Jan 29, 2020

hwillson approved these changes Jan 29, 2020

View reviewed changes

benjamn added the 👩‍🔬 needs-more-tests label Jan 30, 2020

benjamn merged commit 87b9fd2 into master Jan 30, 2020

benjamn deleted the rework-applyMerges-traversal-strategy branch January 30, 2020 17:58

benjamn mentioned this pull request Jan 31, 2020

Provide options.mergeObjects for easy and correct recursive merging. #5885

Merged

github-actions bot locked as resolved and limited conversation to collaborators Feb 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rework traversal strategy in Policies#applyMerges. #5880

Rework traversal strategy in Policies#applyMerges. #5880

benjamn commented Jan 29, 2020 •

edited

Loading

hwillson left a comment

Rework traversal strategy in Policies#applyMerges. #5880

Rework traversal strategy in Policies#applyMerges. #5880

Conversation

benjamn commented Jan 29, 2020 • edited Loading

hwillson left a comment

Choose a reason for hiding this comment

benjamn commented Jan 29, 2020 •

edited

Loading