feat(gossipsub): feature gate metrics related code #5711

drHuangMHT · 2024-12-04T02:35:34Z

Description

May close #2923.

Notes & open questions

Breaking change: Contains public API changes.
I'm trying to keep the change as small as possible, which introduces a lot of duplicate code. Is there a better way?

Change checklist

I have performed a self-review of my own code
I have made corresponding changes to the documentation
I have added tests that prove my fix is effective or that my feature works
A changelog entry has been made in the appropriate crates

mergify · 2024-12-04T02:36:12Z

This pull request has merge conflicts. Could you please resolve them @drHuangMHT? 🙏

drHuangMHT · 2024-12-04T03:18:22Z

Oops, it will probably take a while to find where the side effect is.

elenaf9

It would be great if we could avoid the duplicated API for each Behavior::new_* method.

Wdyt of instead:

* by default, don't include metrics when creating a new behaviour
* have a single, feature-gated method
  `fn with_metrics(&mut self, metrics_registry: &mut Registry, metrics_config: MetricsConfig)`
  to enable them? The method could optionally also take & return ownership of self, so that one could call `Behavior::new(..).with_metrics(...)`.
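
The suggestion above could look roughly like the following. This is only a sketch: `Registry`, `MetricsConfig`, `Metrics` and `Behaviour` are simplified stand-ins defined here, not the real prometheus-client or gossipsub types, and `metrics_enabled` is a hypothetical helper added purely so the effect is observable.

```rust
// Stand-ins for the registry and metrics config. Hypothetical and simplified;
// they only exist when the `metrics` feature is enabled.
#[cfg(feature = "metrics")]
#[derive(Default)]
pub struct Registry {
    pub registered: Vec<String>,
}

#[cfg(feature = "metrics")]
#[derive(Default)]
pub struct MetricsConfig {
    pub max_topics: usize,
}

#[cfg(feature = "metrics")]
pub struct Metrics {
    pub max_topics: usize,
}

#[cfg(feature = "metrics")]
impl Metrics {
    fn new(registry: &mut Registry, config: MetricsConfig) -> Self {
        // A real implementation would register its counters here.
        registry.registered.push("gossipsub".to_owned());
        Metrics { max_topics: config.max_topics }
    }
}

pub struct Behaviour {
    // The field itself disappears when the feature is off.
    #[cfg(feature = "metrics")]
    metrics: Option<Metrics>,
}

impl Behaviour {
    /// Metrics are never enabled by default.
    pub fn new() -> Self {
        Behaviour {
            #[cfg(feature = "metrics")]
            metrics: None,
        }
    }

    /// Single feature-gated entry point; takes and returns ownership so it
    /// can be chained after `new`.
    #[cfg(feature = "metrics")]
    pub fn with_metrics(mut self, registry: &mut Registry, config: MetricsConfig) -> Self {
        self.metrics = Some(Metrics::new(registry, config));
        self
    }

    /// Hypothetical helper, only here to make the state visible.
    #[cfg(feature = "metrics")]
    pub fn metrics_enabled(&self) -> bool {
        self.metrics.is_some()
    }

    #[cfg(not(feature = "metrics"))]
    pub fn metrics_enabled(&self) -> bool {
        false
    }
}
```

With the feature enabled, a caller would write `Behaviour::new().with_metrics(&mut registry, MetricsConfig::default())`; without it, `with_metrics` simply does not exist and `Behaviour::new()` is the only constructor.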

drHuangMHT · 2024-12-05T15:30:04Z

* have a single, feature-gated method
  `fn with_metrics(&mut self, metrics_registry: &mut Registry, metrics_config: MetricsConfig)`
  to enable them? The method could optionally also take & return ownership of self, so that one could call `Behavior::new(..).with_metrics(...)`.

A mutable borrow would allow users to change the metrics inside at runtime, so I'll make it take ownership then.

drHuangMHT · 2024-12-05T15:42:16Z

The scoring part can be tricky though. The behaviour needs to provide a reference to PeerScore in order to log metrics.
I have an idea though:

* Use a dummy Metrics struct to bypass the type check; it will never be constructed.
* Keep the scoring code as-is (with gates on metrics-related code).
* Drop the metrics variable immediately in metric_score when the feature is not enabled, to silence the unused-variable warning.
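
One way to read the three points above, as a minimal sketch: with the feature off, `Metrics` becomes an uninhabited enum, so the signature of `metric_score` stays the same but no value can ever be passed in. The field name `score_penalties` and the simplified signature are assumptions made for illustration, not the actual gossipsub code.

```rust
/// With the feature on: a simplified stand-in for the real metrics type.
#[cfg(feature = "metrics")]
pub struct Metrics {
    pub score_penalties: u64,
}

/// With the feature off: an uninhabited placeholder. No value of this type
/// can exist, so any `Option<&mut Metrics>` is necessarily `None`.
#[cfg(not(feature = "metrics"))]
pub enum Metrics {}

/// Hypothetical, heavily simplified scoring function.
pub fn metric_score(base: f64, penalties: u32, metrics: Option<&mut Metrics>) -> f64 {
    // Only compiled when metrics are available.
    #[cfg(feature = "metrics")]
    {
        if let Some(m) = metrics {
            m.score_penalties += u64::from(penalties);
        }
    }
    // Nothing to record without the feature; dropping the argument
    // immediately marks it as used and silences the warning.
    #[cfg(not(feature = "metrics"))]
    drop(metrics);

    base - f64::from(penalties)
}
```

The scoring arithmetic is identical in both configurations; only the bookkeeping is compiled out.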

elenaf9 · 2024-12-05T15:53:17Z

The scoring part can be tricky though. The behaviour needs to provide a reference to PeerScore in order to log metrics. I have an idea though:

* Use a dummy Metrics struct to bypass the type check; it will never be constructed.
* Keep the scoring code as-is (with gates on metrics-related code).
* Drop the metrics variable immediately in metric_score when the feature is not enabled, to silence the unused-variable warning.

Wdyt of:

* score doesn't do anything with metrics, but instead just returns a tuple (usize, usize) with the number of MessageDeficit and IPColocation penalties that occurred.
* metric_score calls score and, based on the returned counters, updates the metrics?

I usually try to avoid such tuple return types, but in this specific case it may be cleaner than constructing a dummy structure?
We could also wrap them in newtypes (MessageDeficit(usize), IpColocation(usize)).
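
A rough sketch of this split, using the newtype variant. Everything here is simplified and hypothetical: `PeerStats`, `Metrics`, the field names, the penalty formulas and the function name `score_with_counters` are stand-ins for illustration, not the real peer_score.rs code.

```rust
/// Hypothetical newtypes so the two counters cannot be swapped by accident.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct MessageDeficit(pub usize);

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub struct IpColocation(pub usize);

/// Simplified stand-in for per-peer statistics.
pub struct PeerStats {
    pub deliveries: f64,
    pub deliveries_threshold: f64,
    pub peers_on_same_ip: usize,
    pub ip_colocation_threshold: usize,
}

/// Stand-in for the metrics sink.
#[derive(Debug, Default)]
pub struct Metrics {
    pub message_deficit_penalties: usize,
    pub ip_colocation_penalties: usize,
}

/// Pure scoring: no metrics access, only counts of the penalties applied.
pub fn score_with_counters(stats: &PeerStats) -> (f64, MessageDeficit, IpColocation) {
    let mut score = 0.0;
    let mut deficits = 0;
    let mut colocations = 0;

    if stats.deliveries < stats.deliveries_threshold {
        let deficit = stats.deliveries_threshold - stats.deliveries;
        score -= deficit * deficit;
        deficits += 1;
    }
    if stats.peers_on_same_ip > stats.ip_colocation_threshold {
        let surplus = (stats.peers_on_same_ip - stats.ip_colocation_threshold) as f64;
        score -= surplus * surplus;
        colocations += 1;
    }

    (score, MessageDeficit(deficits), IpColocation(colocations))
}

/// Metrics-aware wrapper: computes the score, then records the counters.
pub fn metric_score(stats: &PeerStats, metrics: &mut Metrics) -> f64 {
    let (score, MessageDeficit(deficits), IpColocation(colocations)) =
        score_with_counters(stats);
    metrics.message_deficit_penalties += deficits;
    metrics.ip_colocation_penalties += colocations;
    score
}
```

In this shape only `metric_score` and `Metrics` would need a feature gate; the scoring function itself never touches metrics.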

drHuangMHT · 2024-12-05T16:03:30Z

Wdyt of:

score doesn't do anything with metrics, but instead just returns a tuple (usize, usize) with the number of MessageDeficit and IPColocation penalties that occurred.

There are 98 places where score is called (93 of them in tests, though), so it is not easy to change its signature directly. We can define a new function instead and let score call it.

metric_score calls score and, based on the returned counters, updates the metrics?

No problem.
EDIT: Now that I think about it, if the feature is not enabled, the counters are basically useless and you still pay the price of keeping them. We can live with that, but it is unnecessary; that's the point.

I usually try to avoid such tuple return types, but in this specific case it may be cleaner than constructing a dummy structure? We could also wrap them in newtypes (MessageDeficit(usize), IpColocation(usize)).

Just out of curiosity: do aliases work in this case?

elenaf9 · 2024-12-05T16:31:35Z

Hmm, I just noticed that score / metric_score already return an f64... then it's even less ideal to return two additional integers...
Given that in both cases where we log a metric we register a Penalty, the method could also return a Vec<Penalty> that the metric_score function then iterates over.

There are 98 places where score is called (93 of them in tests, though), so it is not easy to change its signature directly. We can define a new function instead and let score call it.

Good idea 👍.

Now that I think about it, if the feature is not enabled, the counters are basically useless and you still pay the price of keeping them. We can live with that, but it is unnecessary; that's the point.

If we do the vec, we could push into it only when metrics are enabled. I would assume that the compiler then just optimizes it away when the feature is not enabled.
But doing it like this is also a bit hacky and would require proper documentation.

drHuangMHT · 2024-12-05T16:41:56Z

If we do the vec, we could push into it only when metrics are enabled. I would assume that the compiler then just optimizes it away when the feature is not enabled. But doing it like this is also a bit hacky and would require proper documentation.

The penalties are fungible, and I don't think the ordering is of any importance, so a number will suffice. And yeah, we don't have to increment the counter when the feature is not enabled; the impact will be minimal.

drHuangMHT · 2024-12-05T17:18:52Z

Well, this probably also improves performance, now that the increment-by-1 won't be called repeatedly.
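
The idea in the last two comments might be sketched as follows: penalties are tallied in a plain integer, the increment is compiled only when the feature is on, and the metric is updated once with the total. `PenaltyCounter`, its `updates` field and the per-topic deficit loop are hypothetical stand-ins for illustration, not the code in this PR.

```rust
/// Stand-in for a Prometheus-style counter. `updates` exists only to show
/// how often the counter is touched.
#[derive(Debug, Default)]
pub struct PenaltyCounter {
    pub total: u64,
    pub updates: u64,
}

impl PenaltyCounter {
    pub fn inc_by(&mut self, n: u64) {
        self.total += n;
        self.updates += 1;
    }
}

/// Hypothetical scoring loop over per-topic delivery deficits.
/// Returns the score and the number of penalties applied.
pub fn score_and_count(deficits: &[f64]) -> (f64, u64) {
    let mut score = 0.0;
    // Stays at zero (and never mutated) when the feature is off.
    #[allow(unused_mut)]
    let mut penalties = 0u64;

    for deficit in deficits {
        if *deficit > 0.0 {
            score -= deficit * deficit;
            // The tally is only compiled in when metrics are enabled.
            #[cfg(feature = "metrics")]
            {
                penalties += 1;
            }
        }
    }
    (score, penalties)
}

/// Records all penalties with a single counter update instead of one
/// update per penalty.
pub fn metric_score(deficits: &[f64], counter: &mut PenaltyCounter) -> f64 {
    let (score, penalties) = score_and_count(deficits);
    counter.inc_by(penalties);
    score
}
```

Whether the batched update is measurably faster would need benchmarking; the sketch only shows that the counter is touched once per call.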

elenaf9

One comment, rest LGTM.
I'd still like to wait for @jxs's review since he's more familiar with gossipsub than I am.

protocols/gossipsub/src/peer_score.rs

jxs

Hi and sorry for the delay! Thanks for the initiative @drHuangMHT, and @elenaf9 for the review.

I am torn on this one. I know #2923 was opened and that gossipsub uses metrics differently than any other behaviour, but I wonder if there is any use case for gossipsub without metrics where the dependency on prometheus-client creates significant problems? Is that your case @drHuangMHT?

Cause I think this PR makes the scoring system even more confusing to understand, with variables declared in places for metrics, like here, where it's not clear why they sit outside the metrics scope just below, and duplicated functions used only for metrics (remove_peer_from_mesh), where it's also not clear why the whole code needs to be duplicated.
If we really wanna make prometheus-client an optional dependency, can we first try to decouple the metrics logic from the scoring system? So that when we make metrics feature-gated it's only:

#[cfg(feature = "metrics")]
if let Some(metrics) = self.metrics.as_mut() {
    // code
}

protocols/gossipsub/src/rpc.rs

drHuangMHT · 2024-12-09T14:10:52Z

I am torn on this one. I know #2923 was opened and that gossipsub uses metrics differently than any other behaviour, but I wonder if there is any use case for gossipsub without metrics where the dependency on prometheus-client creates significant problems? Is that your case @drHuangMHT?

No, I just picked the issue up. But, as thomas said, it wouldn't be a bad thing to do.

Cause I think this PR makes it even more confusing to understand the scoring system, with variables declared in places for metrics like here where it's not clear why it's outside the metrics scope just below.

Ugh, that's because another block of gated code uses the same variable. If I moved it in, there would be a scoping problem. I'll look into it and see what I can do. I'm keeping them separate because I don't want to change the original code structure too much.

and duplicated functions used only for metrics (remove_peer_from_mesh) where it's also not clear why the whole code needs to be duplicated.

Oops that's probably an oversight. I'll look into it, the duplication probably isn't necessary.

If we really wanna make prometheus-client an optional dependency, can we first try to decouple the metrics logic from the scoring system? So that when we make metrics feature-gated it's only

I'll try.

drHuangMHT added 2 commits December 4, 2024 10:25

feature gate metrics related code

21a7ffd

reduce diff and fix test

906e5a0

drHuangMHT added 5 commits December 4, 2024 10:49

Merge branch 'master' of https://github.com/libp2p/rust-libp2p into g…

cc58400

…ossipsub-metric-feature

fix wrongly gated variable

252bbdf

reduce diff

a0970e3

fix borrowing rule violation

b79e15d

formatting

871e36a

sync new changes to scoring

c6ac3f4

elenaf9 reviewed Dec 5, 2024

refactor behaviour constructor

7d71e9c

refactor scoring

efdf89a

drHuangMHT requested a review from elenaf9 December 6, 2024 03:22

elenaf9 reviewed Dec 7, 2024

remove unnecessary feature gates

0764628

jxs reviewed Dec 9, 2024

drHuangMHT added 5 commits December 9, 2024 22:14

remove unnecessary return

ee4b77f

remove duplication of remove_peer_from_mesh

343c6ae

reorder cfg flags

7add7b7

remove unnecessary gates by reordering code

51d3559

changelog

6aef7a5

drHuangMHT requested a review from jxs December 10, 2024 09:12
