More intelligent blockstore garbage collection #3092

Open
jakobvarmose opened this issue Aug 17, 2016 · 15 comments
Labels
need/community-input (Needs input from the wider community), status/deferred (Conscious decision to pause or backlog)

Comments

@jakobvarmose

Type: Feature
Area: Blockstore, Pin

Description:

The current garbage collector deletes all unpinned blocks. This makes the system more fragile, as people don't usually pin a lot of objects. And if I have configured my node to keep up to 10 GB of data, I would also expect it to stay close to that limit at all times.

One solution is to delete blocks at random until disk usage falls below the configured threshold. But blocks could also be deleted based on supply and demand, or on when they were last accessed (see the sketch below).

The current garbage collector uses a mark-and-sweep algorithm, but another option would be to use reference counting, which I think would be better.
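
A minimal sketch of what such a partial, last-access-based GC could look like. BlockInfo, deleteUntilBelow, and the last-access metadata are all hypothetical; the blockstore does not track access times today.

```go
package gcsketch

import (
	"sort"
	"time"
)

// BlockInfo is a hypothetical per-block record; go-ipfs does not keep
// last-access times today, so this is purely illustrative.
type BlockInfo struct {
	Cid        string
	Size       uint64
	Pinned     bool
	LastAccess time.Time
}

// deleteUntilBelow removes unpinned blocks, least-recently-accessed first,
// until total usage drops under the configured limit, instead of sweeping
// every unpinned block at once.
func deleteUntilBelow(limit uint64, blocks []BlockInfo, rm func(cid string) error) error {
	sort.Slice(blocks, func(i, j int) bool {
		return blocks[i].LastAccess.Before(blocks[j].LastAccess)
	})
	var used uint64
	for _, b := range blocks {
		used += b.Size
	}
	for _, b := range blocks {
		if used <= limit {
			return nil
		}
		if b.Pinned {
			continue
		}
		if err := rm(b.Cid); err != nil {
			return err
		}
		used -= b.Size
	}
	return nil
}
```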

@JesseWeinstein
Contributor

It might also be good to have both a min and a max threshold, and have a garbage collection that only removes things until the min threshold is reached. It could be automatically triggered at the max threshold, as is done now.
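
A small sketch of that high/low watermark idea, with hypothetical names (none of these are existing config options or go-ipfs APIs):

```go
// WatermarkConfig is a hypothetical pair of thresholds: GC is triggered once
// repo usage exceeds MaxBytes and keeps removing unpinned blocks until usage
// drops below MinBytes.
type WatermarkConfig struct {
	MaxBytes uint64
	MinBytes uint64
}

// shouldTriggerGC is the automatic trigger condition.
func shouldTriggerGC(repoSize uint64, cfg WatermarkConfig) bool {
	return repoSize > cfg.MaxBytes
}

// gcDone is the stop condition for a partial collection run.
func gcDone(repoSize uint64, cfg WatermarkConfig) bool {
	return repoSize < cfg.MinBytes
}
```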

@whyrusleeping
Member

@jakobvarmose how would your reference counting idea work? Would it count existing references to blocks, or would it simply increment some counter each time a block is referenced by another object?

I agree 100% that the current GC implementation is a bit... blunt. The issue is that the marking phase is relatively expensive, so doing smaller runs is not cheap. There has been some previous discussion on this topic here: ipfs/notes#130

One related topic I was talking about earlier (for ipfs block rm) was keeping a bloom filter of pinned objects stored on disk to expedite pin checking. This could also help out here for a partial GC.
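
A rough, hand-rolled sketch of such a pin bloom filter (hypothetical names, not a go-ipfs API): a negative answer means the block is definitely not pinned, while a positive answer still requires the full pin check.

```go
package gcsketch

import "hash/fnv"

// pinBloom is a toy bloom filter over pinned block keys.
type pinBloom struct {
	bits []byte
	k    int // number of hash probes per key
}

func newPinBloom(sizeBytes, k int) *pinBloom {
	return &pinBloom{bits: make([]byte, sizeBytes), k: k}
}

// probes derives k bit positions for a key by salting an FNV-1a hash.
func (f *pinBloom) probes(key []byte) []uint64 {
	out := make([]uint64, f.k)
	for i := 0; i < f.k; i++ {
		h := fnv.New64a()
		h.Write([]byte{byte(i)}) // salt each probe differently
		h.Write(key)
		out[i] = h.Sum64() % uint64(len(f.bits)*8)
	}
	return out
}

// Add records a pinned block's key in the filter.
func (f *pinBloom) Add(key []byte) {
	for _, p := range f.probes(key) {
		f.bits[p/8] |= 1 << (p % 8)
	}
}

// MaybePinned returns false only when the key is definitely not in the
// filter; true means "possibly pinned, do the expensive check".
func (f *pinBloom) MaybePinned(key []byte) bool {
	for _, p := range f.probes(key) {
		if f.bits[p/8]&(1<<(p%8)) == 0 {
			return false
		}
	}
	return true
}
```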

@jakobvarmose
Author

jakobvarmose commented Aug 17, 2016

> @jakobvarmose how would your reference counting idea work? Would it count existing references to blocks, or would it simply increment some counter each time a block is referenced by another object?

I would add two new variables associated with each block:

directPin bool
refCounter uint

When an object is pinned directly just set b.directPin = true. When recursively pinning an object increment b.refCounter on that block and on each of its descendants.

To check if a block is pinned simply do b.directPin || b.refCounter > 0.
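
A minimal sketch of that bookkeeping, with hypothetical names (pinInfo, pinDirect, pinRecursive, getChildren); it only illustrates the proposed counters, not how they would be persisted:

```go
// pinInfo holds the two proposed per-block fields.
type pinInfo struct {
	directPin  bool
	refCounter uint
}

func getInfo(pins map[string]*pinInfo, cid string) *pinInfo {
	if pins[cid] == nil {
		pins[cid] = &pinInfo{}
	}
	return pins[cid]
}

// pinDirect marks a single block as directly pinned.
func pinDirect(pins map[string]*pinInfo, cid string) {
	getInfo(pins, cid).directPin = true
}

// pinRecursive increments refCounter on the root and on every descendant,
// assuming getChildren returns the blocks linked from a given block.
func pinRecursive(pins map[string]*pinInfo, root string, getChildren func(string) []string) {
	var walk func(cid string)
	walk = func(cid string) {
		getInfo(pins, cid).refCounter++
		for _, child := range getChildren(cid) {
			walk(child)
		}
	}
	walk(root)
}

// isPinned is the check described above.
func isPinned(pins map[string]*pinInfo, cid string) bool {
	p := pins[cid]
	return p != nil && (p.directPin || p.refCounter > 0)
}
```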

@whyrusleeping
Member

@jakobvarmose yeah, we used to do that. It was really slow and expensive. We have thousands to millions (or more) of blocks. Blocks may also be very small, so the overhead of that reference information would be significant. Every pin operation would need to iterate over every child block and update the entry for each one.

@jakobvarmose
Author

jakobvarmose commented Aug 17, 2016

@whyrusleeping Oh, I didn't know. Yeah, it would use quite a bit of memory. But I can't see how it would be much slower, as the current implementation also reads each child block from disk, and this is probably the slowest part. Unpinning will of course be slower, but pinning should be about the same speed. Or what am I missing?

@Kubuxu
Member

Kubuxu commented Aug 17, 2016

@jakobvarmose no, currently when you pin and unpin we only update the list of root hashes; reference counting would require iterating over the whole hash tree and updating the reference counts.
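
For contrast, a sketch of the cost structure of the current root-list approach (hypothetical names): pinning only records the root, but answering "is this block pinned?" means walking the DAG of every recursive pin.

```go
// isPinnedByWalk answers "is cid reachable from any recursive pin?" by
// walking each pinned DAG; cheap to maintain on pin/unpin, expensive to query.
func isPinnedByWalk(cid string, recursiveRoots []string, getChildren func(string) []string) bool {
	var walk func(c string) bool
	walk = func(c string) bool {
		if c == cid {
			return true
		}
		for _, child := range getChildren(c) {
			if walk(child) {
				return true
			}
		}
		return false
	}
	for _, root := range recursiveRoots {
		if walk(root) {
			return true
		}
	}
	return false
}
```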

@kevina
Contributor

kevina commented Aug 18, 2016

@whyrusleeping I was wondering why we don't use reference counting for indirect pins. How is that worse than what we have to do now? How is having to iterate over every child of a single recursive pin to update it worse than having to iterate over every child of every recursive pin just to check whether a block is pinned? The checking operation would seem to me to be more frequent than the updating operation.

Have we considered storing the indirectly pinned blocks individually in the datastore? That is, for each indirectly pinned block, have an entry under the name "/local/pins/indirect/" (see the sketch below). This will have disk-space overhead, but since it is no longer in memory it will scale well. The underlying leveldb is designed to be fast, so at this point I am having a hard time understanding how a mass update could be really slow.
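
A minimal sketch of that layout, assuming one datastore entry per indirectly pinned block, keyed under the /local/pins/indirect/ prefix with the block's hash appended, and storing a reference count as the value (the key layout and names are assumptions, not the actual go-ipfs schema):

```go
package gcsketch

import "encoding/binary"

// kvStore is a stand-in for the underlying datastore.
type kvStore interface {
	Get(key string) ([]byte, bool)
	Put(key string, value []byte)
	Delete(key string)
}

const indirectPrefix = "/local/pins/indirect/"

// incIndirectPin bumps the persisted reference count for one block.
func incIndirectPin(store kvStore, blockHash string) {
	key := indirectPrefix + blockHash
	var count uint64
	if raw, ok := store.Get(key); ok && len(raw) == 8 {
		count = binary.BigEndian.Uint64(raw)
	}
	buf := make([]byte, 8)
	binary.BigEndian.PutUint64(buf, count+1)
	store.Put(key, buf)
}

// decIndirectPin decrements the count and removes the entry at zero.
func decIndirectPin(store kvStore, blockHash string) {
	key := indirectPrefix + blockHash
	raw, ok := store.Get(key)
	if !ok || len(raw) != 8 {
		return
	}
	count := binary.BigEndian.Uint64(raw)
	if count <= 1 {
		store.Delete(key)
		return
	}
	buf := make([]byte, 8)
	binary.BigEndian.PutUint64(buf, count-1)
	store.Put(key, buf)
}
```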

The cost of checking pins will become more of an issue once "block rm" lands (#2962) and also in my filestore code (#2634).

A bloom filter will help in the case where a block is not pinned, but to make sure that it is pinned we will still have to iterate over every child of every single recursive pin.

@whyrusleeping
Member

@kevina we've been down this road before. The disk overhead is significant, and the cost to pin large objects becomes relatively obscene. If you want to dig into it more, go check out the old code and PRs.

@kevina
Contributor

kevina commented Aug 18, 2016

Some references for why we no longer store information on indirectly pinned blocks in the datastore: #1192 #1225 #1381 #1420

@jakobvarmose
Author

jakobvarmose commented Aug 18, 2016

@Kubuxu You are right that when unpinning, the current implementation runs very quickly. But when pinning, all child blocks are read from the datastore (see https://github.com/ipfs/go-ipfs/blob/master/pin/pin.go#L140-L144).

@jakobvarmose
Author

Another related idea: to make GC faster, instead of simply marking, count the number of references (you could call it count-and-sweep) and store the result to disk. Additionally, store incremental updates of recursive pins. The next time the garbage collector runs, it will read the results from last time and update them incrementally.
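
A minimal sketch of that incremental count-and-sweep idea (hypothetical names; in-memory maps stand in for state that would be persisted to disk between runs):

```go
// pinDelta records one recursive pin (+1) or unpin (-1) since the last GC run.
type pinDelta struct {
	root string
	inc  int
}

// applyDeltas patches the persisted reference counts with only the pins that
// changed, so unchanged subtrees are not re-walked.
func applyDeltas(counts map[string]int, deltas []pinDelta, getChildren func(string) []string) {
	for _, d := range deltas {
		var walk func(cid string)
		walk = func(cid string) {
			counts[cid] += d.inc
			for _, child := range getChildren(cid) {
				walk(child)
			}
		}
		walk(d.root)
	}
}

// sweep deletes every block whose count has dropped to zero and which is not
// directly pinned.
func sweep(counts map[string]int, directPins map[string]bool, rm func(cid string) error) error {
	for cid, c := range counts {
		if c <= 0 && !directPins[cid] {
			if err := rm(cid); err != nil {
				return err
			}
			delete(counts, cid)
		}
	}
	return nil
}
```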

@jbenet
Member

jbenet commented Aug 18, 2016

Maybe we should review proper GC algorithms and find something that matches our perf needs (re: algorithmic complexity).

@whyrusleeping added the need/community-input label on Aug 18, 2016
@whyrusleeping
Member

reference: #2030

@kevina
Contributor

kevina commented Aug 20, 2016

A rather long IRC discussion: https://botbot.me/freenode/ipfs/msg/71626736/ (starting around 9 pm PDT on Aug 19). No real consensus, but lots of ideas and background info.

@whyrusleeping added the status/deferred label on Sep 14, 2016
@whyrusleeping
Member

ref: #4149
