Speed up state res in rare case we don't have all events #16116

erikjohnston · 2023-08-15T14:52:27Z

If we don't have all the auth events in a room then not all state events will have a chain cover index. Even so, we can still use the chain cover index on the events that do have it, rather than bailing and using the slower functions.

This situation should not arise for newly persisted rooms, as we check we have the full auth chain for each event, but can happen for existing rooms.

c.f. #15245

If we don't have all the auth events in a room then not all state events will have a chain cover index. Even so, we can still use the chain cover index on the events that do have it, rather than bailing and using the slower functions. This situation should not arise for newly persisted rooms, as we check we have the full auth chain for each event, but can happen for existing rooms. c.f. #15245

erikjohnston · 2023-08-15T15:15:56Z

synapse/storage/databases/main/event_federation.py

-            for event_id in state_set:
-                chain_id, seq_no = chain_info[event_id]
+            for state_id in state_set:
+                chain_id, seq_no = chain_info[state_id]


This was just to make mypy happy, as the changes above caused the type of event_id to change :(

grr at how we don't get Rust's shadowing rules in Python

reivilibre

quite intricate and tricky but I think I get it

synapse/storage/databases/main/event_federation.py

reivilibre · 2023-08-17T17:20:29Z

synapse/storage/databases/main/event_federation.py

+        # We pull out those events with their auth events, which gives us enough
+        # information to construct the auth chain of an event up to auth events
+        # that have the chain cover index.
+        sql = """


if I'm following along properly, this pulls out all (event_id, authing_event_id) pairs with event_id being in the set of chain-cover-uncalculated events for this room,

then annotates these (event_id, authing_event_id) pairs with whether the auth event has a chain cover...

then annotates these (event_id, authing_event_id) pairs with whether the auth event has a chain cover...

I'm not sure what you mean by "annotate" here? We look at all the events we've pulled out to see if they are indexed and mark them as such?

'annotate' as in labels that keypair with a bool value in a map ;-)

reivilibre · 2023-08-17T17:26:07Z

synapse/storage/databases/main/event_federation.py

+            processing = set(auth_ids)
+            to_add = set()
+            while processing:
+                auth_id = processing.pop()
+                to_add.add(auth_id)
+
+                sub_auth_ids = event_to_auth_ids.get(auth_id)
+                if sub_auth_ids is None:
+                    continue
+
+                processing.update(sub_auth_ids - to_add)
+
+            event_id_to_partial_auth_chain[event_id] = to_add


so event_id_to_partial_auth_chain[event_id] is the transitive auth-event closure of event_id?

We say 'partial auth chain' here... what's the partial in respect to?

We don't pull out the full auth chain, we stop when the auth events are indexed.

synapse/storage/databases/main/event_federation.py

reivilibre · 2023-08-17T17:34:23Z

synapse/storage/databases/main/event_federation.py

+        This modifies `state_sets` so that they only include events that have a
+        chain cover index, and returns a set of event IDs that are part of the
+        auth difference.


ah, here's the hidden mut keyword :p

Took me a moment to notice that modifying state_sets is part of the result but not sure I have a better suggestion

It's icky, but here we are. I mostly wanted to avoid needlessly copying the state sets when its easier to just mutate tehm

yeah, reasonable. Half tempted to suggest a _mut suffix convention for this type of thing but probably gets overbearing soon enough. Meh, it will do.

reivilibre · 2023-08-17T17:39:09Z

tests/storage/test_event_federation.py

+    # degree, we "fork" execution and run the algorithm for each node in the
+    # zero degree.


i.e. for each node that has no remaining incoming connections, we give it a chance to be the next in the list?

Yup, indeed.

reivilibre · 2023-08-17T17:40:18Z

tests/storage/test_event_federation.py

+T = TypeVar("T")
+
+
+def get_all_topologically_sorted_orders(


at what point do you start writing tests for your test helpers haha, but looks good to me — if two humans agree on it, it must be right

reivilibre · 2023-08-17T17:41:29Z

tests/storage/test_event_federation.py

+    for ordering in all_topological_orderings:
+        ordering.reverse()
+
+        for idx in range(len(ordering)):
+            graph_subsets.add(frozenset(ordering[:idx]))


so in each topological ordering then we just choose an arbitrary cut-off point, 'forking' as you put it.

reivilibre · 2023-08-17T17:41:38Z

tests/storage/test_event_federation.py

+    return new_paths
+
+
+def get_all_topologically_consistent_subsets(


this test helper also LGTM

reivilibre · 2023-08-17T17:43:39Z

synapse/storage/databases/main/event_federation.py

-            for event_id in state_set:
-                chain_id, seq_no = chain_info[event_id]
+            for state_id in state_set:
+                chain_id, seq_no = chain_info[state_id]


grr at how we don't get Rust's shadowing rules in Python

Co-authored-by: reivilibre <[email protected]>

erikjohnston · 2023-08-18T11:51:34Z

Thanks for the review @reivilibre! Do the answers make sense?

reivilibre

answers make sense, thanks!

erikjohnston added 2 commits August 15, 2023 16:15

Newsfile

cbaadc0

erikjohnston force-pushed the erikj/fix_state branch from 6db50ed to cbaadc0 Compare August 15, 2023 15:15

erikjohnston commented Aug 15, 2023

View reviewed changes

erikjohnston mentioned this pull request Aug 15, 2023

Chain cover edge case degrades to slower state resolution #15245

Closed

erikjohnston marked this pull request as ready for review August 15, 2023 15:35

erikjohnston requested a review from a team as a code owner August 15, 2023 15:35

reivilibre self-assigned this Aug 17, 2023

reivilibre approved these changes Aug 17, 2023

View reviewed changes

erikjohnston and others added 2 commits August 18, 2023 12:47

Apply suggestions from code review

a557df6

Co-authored-by: reivilibre <[email protected]>

Fixup

cf42d8d

reivilibre approved these changes Aug 18, 2023

View reviewed changes

erikjohnston merged commit bd558a6 into develop Aug 18, 2023

erikjohnston deleted the erikj/fix_state branch August 18, 2023 14:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up state res in rare case we don't have all events #16116

Speed up state res in rare case we don't have all events #16116

erikjohnston commented Aug 15, 2023

erikjohnston Aug 15, 2023

reivilibre Aug 17, 2023

reivilibre left a comment

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 18, 2023

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 18, 2023 •

edited

Loading

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 17, 2023

erikjohnston Aug 18, 2023

reivilibre Aug 17, 2023

reivilibre Aug 17, 2023

erikjohnston commented Aug 18, 2023

reivilibre left a comment

		# degree, we "fork" execution and run the algorithm for each node in the
		# zero degree.

		return new_paths


		def get_all_topologically_consistent_subsets(

Speed up state res in rare case we don't have all events #16116

Speed up state res in rare case we don't have all events #16116

Conversation

erikjohnston commented Aug 15, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reivilibre left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

reivilibre Aug 18, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

erikjohnston commented Aug 18, 2023

reivilibre left a comment

Choose a reason for hiding this comment

reivilibre Aug 18, 2023 •

edited

Loading