This repository has been archived by the owner on Apr 16, 2020. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 24
Make the Rabin Chunker perform well, or document why it's not fixable #142
Labels
Comments
@flyingzumwalt a good way to try this is to create file-format specific Rabin chunking.
Keywords: Content Defined, Chunking, Deduplication |
It might be good to do some research on FastCDC and Asymmetric Extremum, which has low computational overheads. |
This was referenced Feb 26, 2018
Sign up for free
to subscribe to this conversation on GitHub.
Already have an account?
Sign in.
Based on the tests in #137 the rabin chunker isn't actually providing any real deduplication benefits. It's also really slow.
The text was updated successfully, but these errors were encountered: