New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

New binary file format #6

Merged

hltbra merged 13 commits into master from feature/new-zset-format

Nov 12, 2018

Commits on Nov 12, 2018

Create new sorted set score structure (binary)

The new format has an 8-byte header (count) followed by the elements data.
Each element is represented by a 4-byte header and a data section.
The 4-byte header contains the size of the data section.
The data section is the raw data (no serialization).

Example of a score file with 2 elements ("hello" and "world"):

    0x0002   0x05 hello     0x05 world
     count   size data      size data
      2       5   "hello"   5    "world"

hltbra committed Nov 12, 2018

5ff46f8

Change ZCARD performance test to use multiple scores

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for 10cd0df

Browse repository at this point
Copy the full SHA

10cd0df View commit details

Browse the repository at this point in the history
Optimize zcount to use new zset file format

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for a4ec792

Browse repository at this point
Copy the full SHA

a4ec792 View commit details

Browse the repository at this point in the history
Move ZSET encoding/decoding to ZSetEncoder class

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for d64afd8

Browse repository at this point
Copy the full SHA

d64afd8 View commit details

Browse the repository at this point in the history
Fix performance regression after aa12d3
```
The extra `seek()` calls were a performance regression
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for 1ff832b

Browse repository at this point
Copy the full SHA

1ff832b View commit details

Browse the repository at this point in the history
Remove unecessary JSON serialization
```
After the binary format for zsets, the JSON serialization
became useless
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for c942843

Browse repository at this point
Copy the full SHA

c942843 View commit details

Browse the repository at this point in the history
Always read files in binary mode
```
This is for consistency and to help
in the future Python 3 migration
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for 4563551

Browse repository at this point
Copy the full SHA

4563551 View commit details

Browse the repository at this point in the history
Avoid unnecessary seek on rewrite_content()

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for ba07af7

Browse repository at this point
Copy the full SHA

ba07af7 View commit details

Browse the repository at this point in the history
Extract method skip_header()

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for 93e0b0d

Browse repository at this point
Copy the full SHA

93e0b0d View commit details

Browse the repository at this point in the history
Simplify read_element by relying on file header
```
Try to read the number of elements found in the file header
instead of checking if 0 bytes were returned when reading
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for bee05af

Browse repository at this point
Copy the full SHA

bee05af View commit details

Browse the repository at this point in the history
Convert class methods into instance methods
```
All these methods now share the file object (instance attribute)
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for a9ccd48

Browse repository at this point
Copy the full SHA

a9ccd48 View commit details

Browse the repository at this point in the history
Update performance tests to have zset improvement results

hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for b0e6d58

Browse repository at this point
Copy the full SHA

b0e6d58 View commit details

Browse the repository at this point in the history
Add a string test to exercise unicode chars
```
Before the binary file manipulation, special unicode characters were not handled well
```
hltbra committed Nov 12, 2018
Configuration menu
View commit details

Copy full SHA for 8113842

Browse repository at this point
Copy the full SHA

8113842 View commit details

Browse the repository at this point in the history

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New binary file format #6

New binary file format #6

Commits on Nov 12, 2018