
Multi-Chain live edge traversal with random walks #2458

Closed
synctext opened this issue Jul 12, 2016 · 9 comments

synctext (Member) commented Jul 12, 2016

Goal: every peer conducts a crawl of its neighbors and stores a duplicate of their complete chain. This walk is random and resilient against peer failure.

Approach: each peer sends keep-alives to the peers with which it has created a MultiChain record. Incoming introduction-responses are handed to any peer with which we share a MultiChain record. The result is a graph traversal across MultiChain edges; these are live edges, since only online peers are eligible.

[Figure: random_walks]

Each random walk starts at the peer itself. A random walk is conducted of, for instance, 4 steps deep; after these steps a teleport home is conducted. Each visit to a peer results in crawling one or more MultiChain records. Shown above is the effect of repeated random walks and the resulting graph sampling.
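The walk-with-teleport described above could be sketched as follows. This is a minimal sketch: `get_live_neighbors`, `crawl`, and the `is_online` flag are hypothetical names for illustration, not the actual Tribler/Dispersy API.

```python
import random

def random_walk(start_peer, get_live_neighbors, crawl, depth=4):
    """One random walk of `depth` steps over live MultiChain edges,
    then a teleport home. Only online peers are eligible as next hops."""
    current = start_peer
    for _ in range(depth):
        candidates = [p for p in get_live_neighbors(current) if p.is_online]
        if not candidates:
            break  # dead end: teleport home early
        current = random.choice(candidates)
        crawl(current)  # fetch one or more MultiChain records from the visited peer
    return start_peer  # teleport home, ready for the next walk
```

Repeating this walk many times from the same home peer produces the graph sampling shown in the figure.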

Open question: should any MultiChain record be considered as a walk candidate, or only incoming/outgoing edges?

synctext added this to the Backlog milestone Jul 12, 2016
pimveldhuisen (Member) commented:

Design for a Dispersy modification to use trust-based peer selection:

Currently Dispersy keeps a list of around 11 peers, selected by random walks. Every 5 seconds a random peer is replaced. To make this trust-based:

  1. For every new peer, request its last x = 100 blocks.
  2. Calculate the netflow MultiChain score for each peer.
  3. Order all peers from lowest to highest score.
  4. With probability alpha = 0.2, select the peer at the top of the list; otherwise move on to the next peer, until a peer has been selected.
  5. Replace this peer with a new one.

Since low-scoring peers have a higher chance of being replaced, this should converge the list to peers with high scores. However, there is still a chance ((1-alpha)^10 * alpha) that even the best peer is replaced, which avoids clogging top contributors.
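Steps 3-5 of the procedure above could be sketched as below; a sketch only, assuming a hypothetical `score` function, not actual Dispersy code.

```python
import random

def select_peer_to_replace(peers, score, alpha=0.2):
    """Walk the list from lowest to highest score; each peer is
    selected with probability alpha, so low scorers are replaced
    more often, but even the best peer can be hit with probability
    (1 - alpha)^(len(peers) - 1) * alpha."""
    ordered = sorted(peers, key=score)  # lowest netflow score first
    for peer in ordered:
        if random.random() < alpha:
            return peer
    return ordered[0]  # fallback: replace the lowest scorer
```

The fallback to the lowest scorer when no peer is picked in a single pass is an assumption of this sketch; the procedure above instead keeps going "until a peer has been selected".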

The crawl incorporates all 2-hop flows from the last x blocks. Based on previous crawls, the score will also incorporate multi-hop flows. To incorporate more 3-hop flows, extend step 1 with:
1.2: Get the list of active peers from the new peer.
1.3: Crawl x blocks for each of those peers.
This can of course be extended ad infinitum, but is very costly.

pimotte commented Sep 1, 2016

Johan: Specs for this component: a basic, functioning hearsay mechanism with a low-risk, conservative design.

synctext (Member, Author) commented Sep 3, 2016

Fault resilience and a design without any probability of cascading failure

Random graph sampling is safe, but not resilient against resource-exhaustion attacks.
Random fault resilience would be the main argument against making a MultiChain crawler directly dependent on the contents of the MultiChain. As proposed, start with a simple scoring function: 0 = random peer, 1 = connected through MultiChain. Deploy and learn.
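The simple 0/1 scoring function proposed here could bias peer selection as in this sketch; `has_multichain_record` and the score-to-weight mapping are assumptions for illustration, not a deployed design.

```python
import random

def pick_walk_candidate(candidates, has_multichain_record):
    """Bias the walk with the simple score above:
    0 = random peer, 1 = connected through MultiChain.
    Weight = 1 + score, so MultiChain-connected peers get
    weight 2 and random peers weight 1 (an assumed mapping)."""
    weights = [2 if has_multichain_record(p) else 1 for p in candidates]
    return random.choices(candidates, weights=weights, k=1)[0]
```

With this mapping, connected peers are merely favoured, never exclusive, which preserves the safety of random graph sampling argued for above.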

synctext (Member, Author) commented Sep 7, 2016

The key challenge and danger for 2016 and 2017 is that we DDoS ourselves. In scientific terms this is the problem of load balancing. The Sybil attack is a more medium- to long-term concern. Hence my expressed desire for a conservative design that just works.

synctext (Member, Author) commented Sep 8, 2016

Key piece of theory for the MSc thesis "Problem Description": https://www.semanticscholar.org/paper/Estimating-and-Sampling-Graphs-with-Ribeiro-Towsley/2337ba01e237a47c5f965474c1f2d4f4ee4f2643
Not that we want "Frontier Sampling", but the description, theory, and resource-constrained sampling idea are exactly the problem we have in the Tribler ecosystem.

synctext (Member, Author) commented Jan 9, 2017

Simulation results show that a "Pim-Rank" biased walk on the network overloads certain nodes. Simulation with 720 nodes and 45,000 edges: one or more nodes get 14,000 requests to process, while the average outgoing load is 720 requests (1 request / 5 sec for 1 hour).

[Figure: load]

pimveldhuisen (Member) commented Jan 23, 2017

WIP: [screenshot from 2017-01-23 17-17-31]

pimveldhuisen@4c07a82

pimveldhuisen (Member) commented:

I think this issue should be closed or reassigned to @qstokkink

qstokkink (Contributor) commented:

As both Tribler and IPv8 have a live edge implementation, I will close this.
