feat(load_testing): Enable some automatic reconnection handling #159

KaylaBrady · 2024-06-25T19:51:30Z

Summary

Ticket: Repeat load testing at increased scale

What is this PR for?

This adds a couple of changes to the load testing script to support running it at scale.

The main change is to use a long-lived websocket connection with some baked-in retry logic. With this change, I was able to run a load test with 200 users without hitting the `ssl.SSLEOFError` exception I was seeing before. I did see some reconnect logs, indicating that automatic reconnection was performed as expected.

There is some weirdness around the change to use run_forever with threading that I'll comment on specifically.
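For context, retry-on-connect behaviour of the kind described above can be sketched roughly as follows. This is not the code in this PR: `with_reconnect`, its parameters, and the exponential backoff schedule are hypothetical names and choices made for illustration. The sketch relies only on the fact that `ssl.SSLEOFError` is a subclass of `OSError`.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


def with_reconnect(
    connect: Callable[[], T],
    max_attempts: int = 5,                       # hypothetical default
    base_delay: float = 0.5,                     # hypothetical default, in seconds
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `connect` until it succeeds, backing off between failed attempts."""
    last_error = None
    for attempt in range(max_attempts):
        try:
            return connect()
        except OSError as err:  # covers ssl.SSLEOFError and other socket errors
            last_error = err
            if attempt < max_attempts - 1:
                # Exponential backoff: base_delay, 2 * base_delay, 4 * base_delay, ...
                sleep(base_delay * 2 ** attempt)
    raise ConnectionError(f"gave up after {max_attempts} attempts") from last_error
```

Passing `sleep` as a parameter is only a convenience so the backoff can be exercised without real delays. Under locust, `gevent.sleep` would be the natural choice.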

KaylaBrady · 2024-06-25T19:52:20Z

load_testing/locustfile.py

@@ -15,7 +15,7 @@ class MobileAppUser(HttpUser, PhoenixChannelUser):
    wait_time = between(1, 5)
    socket_path = "/socket"

-    prob_reset_map_data = 0.3
+    prob_reset_map_data = 0.02


Decreasing this since resetting map data will be rare. The higher value was causing memory usage to spike.
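The diff does not show how the locustfile consumes `prob_reset_map_data`. Assuming it is compared against a uniform random draw each time a task runs, the effect of the new value can be sketched like this (`should_reset_map_data` is a hypothetical helper, not a function from the script):

```python
import random

PROB_RESET_MAP_DATA = 0.02  # value from the diff above


def should_reset_map_data(rng: random.Random, prob: float = PROB_RESET_MAP_DATA) -> bool:
    """Return True with probability `prob` (assumed usage of the setting)."""
    return rng.random() < prob


# Count how often the reset branch would be taken over many simulated task runs.
rng = random.Random(0)
resets = sum(should_reset_map_data(rng) for _ in range(10_000))
print(f"{resets} resets in 10,000 simulated task runs")
```

At 0.02 the reset branch is taken on roughly 2% of task runs, compared with roughly 30% at the previous value of 0.3.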

github-actions · 2024-06-25T19:57:45Z

Coverage of commit `870c94f`

Summary coverage rate:
  lines......: 75.9% (953 of 1255 lines)
  functions..: 72.7% (436 of 600 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

github-actions · 2024-06-25T20:00:59Z

Coverage of commit `f29a4eb`

Summary coverage rate:
  lines......: 75.9% (953 of 1255 lines)
  functions..: 72.7% (436 of 600 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

KaylaBrady · 2024-06-25T20:29:34Z

load_testing/phoenix_channel.py

        )
        leave_push.send()
        return leave_push.get_reply()

+    def sleep_with_heartbeat(self, seconds):


pulled from dotcom
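The body of `sleep_with_heartbeat` is not shown in this diff hunk. As a rough idea of what a helper with this name might do, here is a minimal sketch, assuming it splits a long wait into chunks and sends a heartbeat between them so the server does not drop an idle socket. The 30-second interval, the `send_heartbeat` callback, and the injectable `sleep` are all assumptions.

```python
import time
from typing import Callable

HEARTBEAT_INTERVAL = 30.0  # seconds; assumed, not taken from the PR


def sleep_with_heartbeat(
    seconds: float,
    send_heartbeat: Callable[[], None],
    interval: float = HEARTBEAT_INTERVAL,
    sleep: Callable[[float], None] = time.sleep,
) -> None:
    """Sleep for `seconds`, sending a heartbeat after each full interval."""
    remaining = seconds
    while remaining > 0:
        chunk = min(interval, remaining)
        sleep(chunk)
        remaining -= chunk
        if remaining > 0:
            # More waiting to do: keep the connection alive before the next chunk.
            send_heartbeat()
```

In a locust task this would presumably be called with `gevent.sleep` as `sleep`, so that other simulated users keep running during the wait.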

KaylaBrady · 2024-06-25T20:30:00Z

load_testing/phoenix_channel.py

+        # run_forever is blocking
+        # https://github.com/websocket-client/websocket-client/issues/980#issuecomment-2065628852
+        daemon = threading.Thread(target=self.run_forever)
+        daemon.daemon = True
+        daemon.start()


I saw this comment suggesting that threading can be avoided by using rel b/c it is async, but I found run_forever was blocking even when using rel as the dispatcher.

I don't love this threading, and I'm pretty sure it is the reason why keyboard interrupt doesn't work when running locust with a single worker.

Actually, I'm going to try getting rid of rel entirely. It seems like it isn't strictly necessary: websocket-client/websocket-client#969
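To illustrate the threading pattern in the snippet above without needing a live server, here is a minimal sketch using a stand-in object. `FakeSocketApp` is hypothetical and only mimics a blocking `run_forever`. It is not the websocket-client class.

```python
import threading


class FakeSocketApp:
    """Stand-in for a client whose run_forever() blocks until closed."""

    def __init__(self) -> None:
        self.started = threading.Event()
        self._closed = threading.Event()

    def run_forever(self) -> None:
        self.started.set()
        self._closed.wait()  # blocks the calling thread, like a socket read loop

    def close(self) -> None:
        self._closed.set()


app = FakeSocketApp()
# daemon=True means the interpreter will not wait for this thread at exit.
worker = threading.Thread(target=app.run_forever, daemon=True)
worker.start()
app.started.wait(timeout=1)

# The main thread remains free to do other work here (e.g. pushing messages).

app.close()             # explicit shutdown instead of relying on daemon teardown
worker.join(timeout=1)
```

Closing the connection and joining the thread explicitly, rather than relying on daemon teardown, is one way to keep shutdown predictable. Whether that would help with the keyboard interrupt behaviour mentioned above is untested.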

boringcactus

If there's a way to get this all working without rel, that seems like it'd be nice, but if that doesn't pan out, this seems like it's fine.

The previous library was experiencing flaky SSL errors when sending messages and had high CPU usage while running locally, even in distributed mode. Switching libraries seems to have resolved those issues: I ran tests for 200 users joining and leaving the same sets of stops and encountered only client errors caused by backend failures.

github-actions · 2024-06-26T15:59:44Z

Coverage of commit `96df63e`

Summary coverage rate:
  lines......: 75.9% (953 of 1255 lines)
  functions..: 72.7% (436 of 600 functions)
  branches...: no data found

Files changed coverage rate: n/a

Download coverage report

KaylaBrady · 2024-06-26T16:02:45Z

@boringcactus could you please re-review? I ended up changing the websocket library from websocket-client to websockets. This seems to have helped the reliability of leaving channels: no more mysterious `ssl.SSLEOFError` exceptions. It also uses less local CPU. I was getting 90% CPU usage warnings with the old version even in distributed mode, but the new version can spawn 200 users outside distributed mode without hitting a CPU warning.

KaylaBrady · 2024-06-26T16:04:25Z

load_testing/phoenix_channel.py

@@ -7,7 +7,7 @@
 from typing import Any

 import gevent
-import websocket
+import websockets.sync.client as websockets


The regular async version uses asyncio, which is not supported in locust.
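For reference, a blocking join using the synchronous client might look roughly like the sketch below. It assumes Phoenix's V2 JSON serializer, where each frame is a `[join_ref, ref, topic, event, payload]` array and the socket URL carries `vsn=2.0.0`. The helper names, the example URI, and the topic are hypothetical, and the PR's actual implementation may differ.

```python
import json
from typing import Any, Optional


def encode_push(topic: str, event: str, payload: Any, ref: str,
                join_ref: Optional[str] = None) -> str:
    """Encode one Phoenix channel frame in the (assumed) V2 array format."""
    return json.dumps([join_ref, ref, topic, event, payload])


def join_channel(uri: str, topic: str, timeout: float = 5.0) -> Any:
    """Open a blocking connection, join `topic`, and return the first reply."""
    # Imported lazily so encode_push can be used without the package installed.
    # The sync client needs websockets >= 11.
    from websockets.sync.client import connect

    with connect(uri) as ws:
        ws.send(encode_push(topic, "phx_join", {}, ref="1", join_ref="1"))
        # recv() blocks the calling greenlet/thread; no asyncio event loop is needed.
        return json.loads(ws.recv(timeout=timeout))


# Hypothetical usage; the host and topic are placeholders:
# reply = join_channel("wss://example.test/socket/websocket?vsn=2.0.0", "room:lobby")
```

Because `connect`, `send`, and `recv` are ordinary blocking calls here, they fit locust's gevent-based execution model without requiring an asyncio event loop.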

KaylaBrady requested a review from a team as a code owner June 25, 2024 19:51

KaylaBrady requested review from boringcactus and removed request for a team June 25, 2024 19:51

KaylaBrady marked this pull request as draft June 25, 2024 19:51

KaylaBrady commented Jun 25, 2024

feat(load_testing): Enable some automatic reconnection handling

f29a4eb

KaylaBrady force-pushed the kb-load-test-scaling branch from 870c94f to f29a4eb Compare June 25, 2024 19:58

KaylaBrady commented Jun 25, 2024

KaylaBrady marked this pull request as ready for review June 25, 2024 20:31

boringcactus approved these changes Jun 25, 2024

KaylaBrady added 4 commits June 26, 2024 11:53

doc(load_testing): Remove note about keyboard interrupt

49e56bc

cleanup: remove unused dependencies

0f544e3

cleanup(load_testing): import order, unused var

96df63e

KaylaBrady commented Jun 26, 2024

boringcactus approved these changes Jun 26, 2024

KaylaBrady merged commit 6429ae9 into main Jun 26, 2024
5 checks passed
