DHT Optimistic Provide

dennis-tra · November 15, 2021, 4:53pm

Hi everyone,

Over the last weeks, some ResNetLab collaborators (including me) conducted IPFS uptime and content routing measurements with great support from @yiannisbot. Some results can be found here, here and here. In parallel, I went ahead and built another DHT performance measurement tool for the particular use case of investigating the provide performance as it lags far behind the discovery performance.

While investigating how the whole machinery works I had an idea of how to speed things up that I would like to share with you and also would love to get feedback on.

I would call it an “Optimistic Provide” and the TL;DR would be:

For each peer, we come across during content publication, calculate the probability of an even closer one. If the likelihood is low, just store the provider record at that peer optimistically.

For the long version, I’m copying relevant parts from this repository

which includes more (but partially irrelevant) plots than I’m sharing here in this post.

Motivation

When IPFS attempts to store a provider record in the DHT it tries to find the 20 closest peers to the corresponding CID using the XOR distance.
To find these peers, IPFS sends FIND_NODES RPCs to the closest peers in its routing table and then repeats the process for the set of returned peers.
There are two termination conditions for this process:

Termination: The 20 closest peers to the CID were queried for even closer peers but didn’t yield closer ones.
Starvation: All peers in the network were queried (if I interpret this condition correctly)

This can lead to huge delays if some of the 20 closest peers don’t respond timely or are straight out not reachable.
The following graph shows the latency distribution of the whole provide-process for 1,269 distinct provide operations.

In other words, it shows the distribution of how long it takes for the kaddht.Provide(ctx, CID, true) call to return.
At the top of the graph, you can find the percentiles and total sample size. There is a huge spike at around 10s which is probably related to an exceeded context deadline - not sure though.

If we on the other hand look at how long it took to find the peers that we eventually attempted stored the provider records at, we see that it takes less than 1.6s in the vast majority of cases.

Again, sample size and percentiles are given in the figure title. The sample size corresponds to 1269 * 20 as in every Provide-run we attempt to save the provider record at 20 peers.

The same point can be made if we take a look at how many hops it took to find a peer that we eventually attempted to store the provider records at:

Note the log scale of the y-axis. Over 98 % of the time, an appropriate peer to store the provider record at was found in 3 hops or less.

Optimistic Provide

The discrepancy between the time the provide operations take and the time it could have taken led to the idea of just storing provider records optimistically at peers.
This would trade storing these records on potentially more than 20 peers for decreasing the time content becomes available in the network.
Further, it requires a priori information about the current network size.

Procedure

Let’s imagine we want to provide content with the CID C and start querying our closest peers.
When finding a new peer with Peer ID P we calculate the distance to the CID C and derive the expected amount of peers μ that are even closer to the CID (than the peer with peer ID P).

If we norm P and C to the range from 0 to 1 this can be calculated as:

μ = || P - C || * N

Where N is the current network size and || . || corresponds to the normed XOR distance metric.

The logic would be that if the expected value μ is less than 20 peers we store the provider record at the peer P.

This threshold could also consider standard deviation etc. and could generally be tuned to minimize falsely selected peers (peers that are not in the set of the 20 closest peers).

Example

The following graph shows the distribution of normed XOR distances of the peers that were selected to store the provider record to the CID that was provided.

The center of mass of this distribution is roughly at 0.1 %. So if we find a peer that has a distance of || P - C || = 0.1 % while the network has a size of N = 7000 peers we would expect to find 7 peers that are closer than the one we just found.
Therefore, we would store a provider record at that peer right away.

N = 7000 is a realistic assumption based on our crawls.

Network Size

Since the calculation above needs information about the current network size there is the question of how to get to that information locally on every node. I could come up with three strategies:

Hard Coding
Periodic full DHT crawls (was implemented in the new experimental DHT client)
Estimation based on peer-CID proximity of previous provide operations

As said above, measurement methodology and more information are given in this GitHub - dennis-tra/optimistic-provide: A DHT measurement tool and proposal for an optimistic way to store provider records.
repository.

Generally, I would love to hear what you think about it, if I made wrong assumptions, if I got my maths wrong, if anything else seems fishy, etc

Best,
Dennis

PS: @yiannisbot pointed me to this IPFS Content Providing Proposal. If the proposal outlined in this post proofs to be viable, I would consider it a fifth option of techniques to speed things up.

Jorropo · November 15, 2021, 5:10pm

That sounds like an idea.

I will not comment on it yet, however you should know that we rarely work on discuss.libp2p.io (it’s more used for more basic questions / random discussions).

Pls post this in one of thoses repos (create an issue) to avoid this awesome work going unoticed:

GitHub - libp2p/go-libp2p-kad-dht: A Kademlia DHT implementation on go-libp2p
GitHub - libp2p/notes: libp2p Collaborative Notebook for Research
GitHub - libp2p/research-dht: Moved discussion notes to https://github.com/libp2p/notes (archive moved to notes)

I’m not sure which one would be the best to post this to, I belive notes.

dennis-tra · November 15, 2021, 5:21pm

@Jorropo That’s good to hear

I copied the post to the libp2p/notes repository and created this issue:

github.com/libp2p/notes

DHT Optimistic Provide

opened 05:17PM - 15 Nov 21 UTC

dennis-tra

Copying [the forum post](https://discuss.libp2p.io/t/dht-optimistic-provide/1143…) to this repository. --- Hi everyone, Over the last weeks, some ResNetLab collaborators (including me) conducted IPFS uptime and content routing measurements with great support from @yiannisbot. Some results can be found [here](https://github.com/dennis-tra/nebula-crawler-reports/blob/main/calendar-week-44/ipfs/README.md), [here](https://github.com/wcgcyx/nebula-crawler) and [here](https://github.com/ConsenSys/ipfs-lookup-measurement). In parallel, I went ahead and built another [DHT performance measurement tool](https://github.com/dennis-tra/optimistic-provide) for the particular use case of investigating the provide performance as it lags far behind the discovery performance. While investigating how the whole machinery works I had an idea of how to speed things up that I would like to share with you and also would love to get feedback on. I would call it an "Optimistic Provide" and the **TL;DR** would be: > For each peer, we come across during content publication, calculate the probability of an even closer one. If the likelihood is low, just store the provider record at that peer optimistically. For the long version, I'm copying relevant parts from this repository https://github.com/dennis-tra/optimistic-provide which includes more (but partially irrelevant) plots than I'm sharing here in this post. --- **Motivation** When IPFS attempts to store a provider record in the DHT it tries to find the 20 closest peers to the corresponding `CID` using the XOR distance. To find these peers, IPFS sends `FIND_NODES` RPCs to the closest peers in its routing table and then repeats the process for the set of returned peers. There are two termination conditions for this process: 1. **Termination**: The 20 closest peers to the CID were queried for even closer peers but didn't yield closer ones. 2. **Starvation**: All peers in the network were queried (if I interpret [this condition](https://github.com/libp2p/go-libp2p-kad-dht/blob/cd05807c54f3168f01a5a363b37aee5e38fee63d/query.go#L368) correctly) This can lead to huge delays if some of the 20 closest peers don't respond timely or are straight out not reachable. The following graph shows the latency distribution of the whole provide-process for 1,269 distinct provide operations. ![Provide Latencies](https://user-images.githubusercontent.com/11836793/141825213-82559096-b6dc-430c-ab60-ddd3203bdc4e.png) In other words, it shows the distribution of how long it takes for the [`kaddht.Provide(ctx, CID, true)`](https://github.com/libp2p/go-libp2p-kad-dht/blob/0b7ac010657443bc0675b3bd61133fe04d61d25b/fullrt/dht.go#L752) call to return. At the top of the graph, you can find the percentiles and total sample size. There is a huge spike at around 10s which is probably related to an exceeded context deadline - not sure though. If we on the other hand look at how long it took to find the peers that we eventually **attempted** stored the provider records at, we see that it takes less than 1.6s in the vast majority of cases. ![Discover Latencies](https://user-images.githubusercontent.com/11836793/141825243-0fa3bbcb-7644-4b91-9321-524a8ae405d6.png) Again, sample size and percentiles are given in the figure title. The sample size corresponds to `1269 * 20` as in every `Provide`-run we attempt to save the provider record at 20 peers. The same point can be made if we take a look at how many hops it took to find a peer that we eventually **attempted** to store the provider records at: ![Hop Distribution](https://user-images.githubusercontent.com/11836793/141825287-5e1f98ae-d191-4a4c-bea3-f790b7f74586.png) Note the log scale of the `y`-axis. Over 98 % of the time, an appropriate peer to store the provider record at was found in 3 hops or less. **Optimistic Provide** The discrepancy between the time the provide operations take and the time it could have taken led to the idea of just storing provider records optimistically at peers. This would trade storing these records on potentially more than 20 peers for decreasing the time content becomes available in the network. Further, it requires a priori information about the current network size. **Procedure** Let's imagine we want to provide content with the CID `C` and start querying our closest peers. When finding a new peer with Peer ID `P` we calculate the distance to the CID `C` and derive the expected amount of peers `μ` that are even closer to the CID (than the peer with peer ID `P`). If we norm `P` and `C` to the range from `0` to `1` this can be calculated as: ```text μ = || P - C || * N ``` Where `N` is the current network size and `|| . ||` corresponds to the [normed XOR distance metric](https://github.com/dennis-tra/optimistic-provide#normed-xor-distance). The logic would be that if the expected value `μ` is less than 20 peers we store the provider record at the peer `P`. This threshold could also consider standard deviation etc. and could generally be tuned to minimize falsely selected peers (peers that are not in the set of the 20 closest peers). **Example** The following graph shows the distribution of [normed XOR distances](https://github.com/dennis-tra/optimistic-provide#normed-xor-distance) of the peers that were selected to store the provider record to the CID that was provided. ![Selected Peer Distances](https://user-images.githubusercontent.com/11836793/141825337-ec0caca4-4aee-494a-a16d-e883814e7375.png) The center of mass of this distribution is roughly at `0.1 %`. So if we find a peer that has a distance of `|| P - C || = 0.1 %` while the network has a size of `N = 7000` peers we would expect to find `7` peers that are closer than the one we just found. Therefore, we would store a provider record at that peer right away. > `N = 7000` is a realistic assumption based on [our crawls](https://github.com/dennis-tra/nebula-crawler). **Network Size** Since the calculation above needs information about the current network size there is the question of how to get to that information locally on every node. I could come up with three strategies: 1. Hard Coding 2. Periodic full DHT crawls (was implemented in the [new experimental DHT client](https://github.com/libp2p/go-libp2p-kad-dht/releases/tag/v0.12.0)) 3. Estimation based on peer-CID proximity of previous provide operations --- As said above, measurement methodology and more information are given in [this repository](https://github.com/dennis-tra/optimistic-provide). Generally, I would love to hear what you think about it, if I made wrong assumptions, if I got my maths wrong, if anything else seems fishy, etc 👍 Best, Dennis PS: @yiannisbot pointed me to this [IPFS Content Providing Proposal](https://github.com/protocol/web3-dev-team/blob/main/proposals/ipfs-content-providing.md). If the proposal outlined in this post proofs to be viable, I would consider it a fifth option [of techniques to speed things up](https://github.com/protocol/web3-dev-team/blob/main/proposals/ipfs-content-providing.md#brief-plan-of-attack).

Let’s continue the discussion there, then

Topic		Replies	Views
Kad-DHT Protobuf Messages, Peer Routing and Content Providers Implementers and Contributors	10	1579	October 15, 2024
List of noob questsions Users and Developers	3	461	September 1, 2021
Nebula libp2p DHT crawler Implementers and Contributors	7	1609	July 21, 2021
Js-libp2p "peer and content routing" / kad-dht example	33	6655	December 31, 2021
What is "providers of a value to the given key"? Users and Developers	6	385	April 18, 2022

DHT Optimistic Provide

Related topics