Skip to content

Latest commit

 

History

History
13 lines (12 loc) · 1.1 KB

README.md

File metadata and controls

13 lines (12 loc) · 1.1 KB

byteshuf is a linux tool that allows shuffling extremely large files or directories of files, using the algorithm behind the terashuf project. btyeshuf is a single standalone .c file, and can be built via make N=..., where N is the size of each byte chunk. Instead of shuffling lines of text, we shuffle groupings of N-bytes.

Arguments

Name Flag Purpose
verbose -v Verbose reporting on the progress being made
directory -d Directory strictly containing files we intend to shuffle
input -i Location of the sole file we intend to shuffle
output -o Output path. Each fileoutput file has a .%d suffix appended
per-file -n Number of N-byte samples to output per final output file
chunk-size -s Number of N-byte samples to output per intermediate output file
read-header -r Skip fixed-size headers, by skipping x-bytes on all inputs files
write-header -w Writes a zero'ed out header of size x-bytes to each final output file