sstable: specialized index blockIter #2592

jbowens · 2023-06-05T14:02:40Z

Implementing a specialized blockIter for index blocks should reduce the CPU cost of seeks:

Index blocks are dense. They tend to contain smaller keys (because their keys may be shortened by Comparer.Separator) and relatively small values (an encoded BlockHandle). This density means seeking within an index block has a higher CPU cost, and optimizations here can have an outsized impact on overall seek CPU cost.
The Trailers of keys within index blocks are always ignored. There's no need to decode them.¹
The values of keys in index blocks are always inlined, so we can avoid the overhead associated with checking whether a value is inlined or out-of-band and overhead from returning base.LazyValues.
Related to ¹, index block keys do not need the obsolete bit checks or obsolete key kind transformations introduced in #2559 (sstable,db: introduce a sstable-internal ObsoleteBit in the key kind).
We write index blocks with a restart interval of 1. Besides encoding the offset of every key, this forces every key to be written in whole without any prefix compression. We can avoid the overhead of considering whether a key might be prefix-compressed.²
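
To make the points above concrete, here is a minimal sketch of what a specialized seek could look like. It is not Pebble's code: `buildIndexBlock`, `decodeAt` and `seekGE` are hypothetical names, the layout is the classic block encoding (shared, unshared and value-length varints, then key and value, followed by a uint32 restart array and restart count), and keys are assumed to be ordered bytewise. Because the restart interval is 1, every restart offset points at a whole key, so the seek is a plain binary search that skips the shared varint and never looks at the trailer.

```go
package main

import (
	"bytes"
	"encoding/binary"
	"sort"
)

// trailerLen is the size of the internal key trailer that follows each user key.
const trailerLen = 8

// buildIndexBlock is a hypothetical helper that encodes entries with a
// restart interval of 1, so every key is written whole (shared == 0).
func buildIndexBlock(userKeys, values [][]byte) []byte {
	var buf []byte
	restarts := make([]uint32, 0, len(userKeys))
	for i, k := range userKeys {
		restarts = append(restarts, uint32(len(buf)))
		buf = binary.AppendUvarint(buf, 0)                         // shared bytes: always 0
		buf = binary.AppendUvarint(buf, uint64(len(k)+trailerLen)) // unshared bytes
		buf = binary.AppendUvarint(buf, uint64(len(values[i])))    // value length
		buf = append(buf, k...)
		buf = append(buf, make([]byte, trailerLen)...) // zeroed trailer; never read below
		buf = append(buf, values[i]...)
	}
	for _, r := range restarts {
		buf = binary.LittleEndian.AppendUint32(buf, r)
	}
	return binary.LittleEndian.AppendUint32(buf, uint32(len(restarts)))
}

// decodeAt decodes the entry at off, relying on shared == 0 and ignoring
// the trailer entirely.
func decodeAt(block []byte, off uint32) (userKey, value []byte) {
	p := block[off+1:] // skip the shared varint, which is the single byte 0x00
	unshared, n := binary.Uvarint(p)
	p = p[n:]
	valueLen, n := binary.Uvarint(p)
	p = p[n:]
	return p[:unshared-trailerLen], p[unshared : unshared+valueLen]
}

// seekGE returns the first entry whose user key is >= target.
func seekGE(block, target []byte) (userKey, value []byte, ok bool) {
	numRestarts := int(binary.LittleEndian.Uint32(block[len(block)-4:]))
	restartsOff := len(block) - 4 - 4*numRestarts
	offsetOf := func(i int) uint32 {
		return binary.LittleEndian.Uint32(block[restartsOff+4*i:])
	}
	i := sort.Search(numRestarts, func(i int) bool {
		k, _ := decodeAt(block, offsetOf(i))
		return bytes.Compare(k, target) >= 0
	})
	if i == numRestarts {
		return nil, nil, false
	}
	userKey, value = decodeAt(block, offsetOf(i))
	return userKey, value, true
}
```

With keys `b`, `d`, `f`, a `seekGE` for `c` lands on `d` without a linear scan after the binary search, which is the step a general-purpose iterator needs when the restart interval is larger than 1.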

The downside is code duplication. But there are a few factors that make this more tolerable:

We use a subset of the blockIter interface with index blocks (e.g., no NextPrefix, no SeekLT), so it's not quite doubling the interface surface area.
The non-trivial code surrounding lazy-value handling and reading does not need to be duplicated.
The code around prefix sharing does not need to be duplicated.

Once we have a specialized index block iterator, it's easier to make changes to the index block format. There are two optimizations available right off the bat:

Since the key trailers are unused¹, we can omit them entirely.
Since prefix compression is unused², we can omit the shared bytes varint.
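
As a rough illustration of the space these two changes could save, the sketch below compares per-entry sizes. The "slim" layout is an assumption made for this example only (the issue does not specify a format), and `uvarintLen`, `currentEntrySize` and `slimEntrySize` are hypothetical names. It assumes the current entry is a shared varint, an unshared varint, a value-length varint, the user key, an 8-byte trailer and the value.

```go
package main

import "encoding/binary"

// uvarintLen reports how many bytes x occupies as a uvarint.
func uvarintLen(x uint64) int {
	var tmp [binary.MaxVarintLen64]byte
	return binary.PutUvarint(tmp[:], x)
}

// currentEntrySize is the encoded size of one index entry today:
// shared varint (always 0), unshared varint, value-length varint,
// user key, 8-byte trailer, value.
func currentEntrySize(userKeyLen, valueLen int) int {
	return uvarintLen(0) +
		uvarintLen(uint64(userKeyLen+8)) +
		uvarintLen(uint64(valueLen)) +
		userKeyLen + 8 + valueLen
}

// slimEntrySize is the size under the assumed layout that drops the
// shared varint and the trailer.
func slimEntrySize(userKeyLen, valueLen int) int {
	return uvarintLen(uint64(userKeyLen)) +
		uvarintLen(uint64(valueLen)) +
		userKeyLen + valueLen
}
```

For a 10-byte user key and a 5-byte value this gives 26 bytes versus 17, a saving of 9 bytes per entry: 8 for the trailer and 1 for the shared varint. The saving can be one byte larger when dropping the trailer also lets the key length fit in a shorter varint.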

Jira issue: PEBBLE-42


jbowens · 2023-12-25T02:17:36Z

Relates to #97

jbowens · 2023-12-25T02:25:14Z

We might also consider entirely different formats for index blocks, such as an adaptive radix tree. With sufficient space savings, could we remove support for two-level indexes and simplify the sstable iterator?

Somewhat relates to #2632 by achieving similar key compression within the index block.

jbowens · 2024-07-16T19:25:28Z

Moving into 'Next' category because this will fall out of the columnar block format work.

Add writer, reader and iterator types for index blocks. This commit considers and resolves most of cockroachdb#97 and cockroachdb#2592. Close cockroachdb#97. Close cockroachdb#2592.

jbowens added T-storage A-storage labels Jun 5, 2023

jbowens mentioned this issue Jun 5, 2023

sstable,db: introduce a sstable-internal ObsoleteBit in the key kind #2559

Merged

jbowens self-assigned this Jul 16, 2024

jbowens closed this as completed in 583859f Aug 23, 2024
