db: refactor Reader.Get #3507

jbowens · 2024-04-12T16:20:36Z

db: refactor Reader.Get

This commit refactors the implementation of getIter to be a little more
understandable and avoid the unnecessary use of levelIter. Current supported
format major versions guarantee that a user key is not split across sstables
within a level. This ensures that Get (which only retrieves one individual user
key) need only consult 1 sstable per level.
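For illustration, a minimal sketch of why one sstable per level suffices. It assumes a level's files are sorted and non-overlapping in user-key space; `fileMeta` and `findFile` are hypothetical names for this sketch, not Pebble's actual types or functions, and `bytes.Compare` stands in for the configured comparer.

```go
package main

import (
	"bytes"
	"fmt"
	"sort"
)

// fileMeta is a hypothetical stand-in for an sstable's metadata.
type fileMeta struct {
	name              string
	smallest, largest []byte // inclusive user-key bounds (assumption)
}

// findFile returns the single file in a sorted, non-overlapping level whose
// bounds include key, or nil if no file in the level can contain it.
func findFile(level []fileMeta, key []byte) *fileMeta {
	// Binary search for the first file whose largest bound is >= key.
	i := sort.Search(len(level), func(i int) bool {
		return bytes.Compare(level[i].largest, key) >= 0
	})
	if i == len(level) || bytes.Compare(level[i].smallest, key) > 0 {
		return nil
	}
	return &level[i]
}

func main() {
	level := []fileMeta{
		{"000001.sst", []byte("a"), []byte("c")},
		{"000002.sst", []byte("d"), []byte("f")},
		{"000003.sst", []byte("k"), []byte("m")},
	}
	for _, k := range []string{"e", "h"} {
		if f := findFile(level, []byte(k)); f != nil {
			fmt.Printf("%s -> %s\n", k, f.name)
		} else {
			fmt.Printf("%s -> no file in this level\n", k)
		}
	}
}
```

Because a user key cannot straddle two files in the level, the search never needs to look at a neighbouring file once a candidate is found.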

This is somewhat motivated by #2863. Removing getIter's dependency on levelIter
will make that refactor easier.

db: simplify levelIter skipEmptyFileForward

Apply a small simplification to levelIter.skipEmptyFileForward's condition for
when to interleave a synthetic boundary. Previously we needed to check whether
a file's largest point key was a range deletion to work around getIter's use of
the rangeDelIterPtr. Now that getIter no longer uses levelIter it's
unnecessary.

cockroach-teamcity · 2024-04-12T16:20:44Z

This change is

RaduBerinde

Awesome! Some minor comments.

Reviewable status: 0 of 6 files reviewed, 3 unresolved discussions (waiting on @itsbilal and @jbowens)

get_iter.go line 77 at r1 (raw file):

}

func (g *getIter) Next() *base.InternalKV {

Do we ever call Next() on this iterator? It seems strange, since we'd only care about the first result.

get_iter.go line 243 at r1 (raw file):

}

func (g *getIter) getSSTableIterators(

This could use a comment - we are getting iterators for the one sstable that might contain the key.

get_iter.go line 255 at r1 (raw file):

	// the file doesn't actually contain any point keys equal to `key`. We next
	// to keep searching for a file that actually contains point keys ≥ key.
	if m.LargestPointKey.IsExclusiveSentinel() && g.comparer.Equal(m.LargestPointKey.UserKey, g.key) {

Not related to this change, but I think we should make SeekGE take a base.UserKeyBoundary
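A rough sketch of the inclusive/exclusive distinction behind the check quoted above. `upperBound` and `admits` are hypothetical names invented for this illustration; they do not reproduce `base.UserKeyBoundary` or any Pebble API, and `bytes.Compare` stands in for the configured comparer.

```go
package main

import (
	"bytes"
	"fmt"
)

// upperBound is a hypothetical user-key upper bound that records whether the
// bound key itself is included.
type upperBound struct {
	key       []byte
	exclusive bool
}

// admits reports whether key falls at or below the bound, honoring
// exclusivity: an exclusive bound equal to key does not admit it.
func (b upperBound) admits(key []byte) bool {
	c := bytes.Compare(key, b.key)
	return c < 0 || (c == 0 && !b.exclusive)
}

func main() {
	key := []byte("k")
	fmt.Println(upperBound{[]byte("k"), false}.admits(key)) // inclusive bound
	fmt.Println(upperBound{[]byte("k"), true}.admits(key))  // exclusive bound
}
```

Carrying the exclusivity with the bound keeps the "largest key is an exclusive sentinel equal to `key`" case in one place, so callers need not repeat it.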


jbowens

TFTR!

Reviewable status: 0 of 6 files reviewed, 2 unresolved discussions (waiting on @itsbilal and @RaduBerinde)

get_iter.go line 77 at r1 (raw file):

Previously, RaduBerinde wrote…

Do we ever call Next() on this iterator? It seems strange, since we'd only care about the first result.

Yeah, but I think only when the first key is a MERGE. Added a comment.
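To illustrate why a point lookup might step past the first result, here is a minimal sketch under simplified assumptions: entries for one user key arrive newest first, and the merge operator is plain string concatenation. `entry`, `kind`, and `get` are hypothetical names for this sketch, not Pebble's types.

```go
package main

import "fmt"

type kind int

const (
	kindSet kind = iota
	kindDelete
	kindMerge
)

// entry is a hypothetical version of a single user key.
type entry struct {
	kind  kind
	value string
}

// get resolves the versions of one user key, ordered newest first. It keeps
// stepping to older versions only while it sees MERGE entries.
func get(versions []entry) (string, bool) {
	var operands []string // collected newest first
	for _, e := range versions {
		if e.kind == kindDelete {
			break // older versions are shadowed
		}
		operands = append(operands, e.value)
		if e.kind == kindSet {
			break // a SET is a base value; nothing older matters
		}
		// e is a MERGE: the loop advances to the next older version.
	}
	if len(operands) == 0 {
		return "", false
	}
	// Apply operands oldest first; concatenation is an illustrative merge.
	var out string
	for i := len(operands) - 1; i >= 0; i-- {
		out += operands[i]
	}
	return out, true
}

func main() {
	v, ok := get([]entry{{kindMerge, "c"}, {kindMerge, "b"}, {kindSet, "a"}, {kindSet, "old"}})
	fmt.Println(v, ok)
}
```

If the newest version is a SET or DELETE, the loop exits after one step, which matches the intuition that only the first result matters in the common case.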

get_iter.go line 243 at r1 (raw file):

Previously, RaduBerinde wrote…

This could use a comment - we are getting iterators for the one sstable that might contain the key.

Done.

get_iter.go line 255 at r1 (raw file):

Previously, RaduBerinde wrote…

Not related to this change, but I think we should make SeekGE take a base.UserKeyBoundary

yeah, makes sense

jbowens requested review from a team and itsbilal April 12, 2024 16:20

jbowens requested a review from RaduBerinde April 12, 2024 16:47

jbowens force-pushed the get-refac branch 3 times, most recently from 4742d22 to 888bd84 Compare April 12, 2024 17:33

RaduBerinde approved these changes Apr 12, 2024

jbowens added 2 commits April 12, 2024 16:40

jbowens force-pushed the get-refac branch from 888bd84 to 41f25cb Compare April 12, 2024 20:40

jbowens merged commit 1eab9d6 into cockroachdb:master Apr 12, 2024
11 checks passed

jbowens deleted the get-refac branch April 12, 2024 21:51
