A B-Tree is a self-balancing search tree, in which nodes can have more than two children.
A B-Tree of order n satisfies the following properties:
- Every node has at most 2n keys.
- Every node (except root) has at least n keys.
- Every non-leaf node with k keys has k+1 children.
- The keys in all nodes are sorted in increasing order.
- The subtree between two keys k and l of a non-leaf node contains all the keys between k and l.
- All leaves appear at the same level.
A second order B-Tree with keys from 1 to 20 looks like this:
class BTreeNode<Key: Comparable, Value> {
unowned var owner: BTree<Key, Value>
fileprivate var keys = [Key]()
var children: [BTreeNode]?
...
}
The main parts of a node are two arrays:
- An array containing the keys
- An array containing the children
Nodes also have a reference to the tree they belong to.
This is necessary because nodes have to know the order of the tree.
Note: The array containing the children is an Optional, because leaf nodes don't have children.
- Searching for a key
k
begins at the root node. - We perform a linear search on the keys of the node, until we find a key
l
that is not less thank
,
or reach the end of the array. - If
k == l
then we have found the key. - If
k < l
:- If the node we are on is not a leaf, then we go to the left child of
l
, and perform the steps 3 - 5 again. - If we are on a leaf, then
k
is not in the tree.
- If the node we are on is not a leaf, then we go to the left child of
- If we have reached the end of the array:
- If we are on a non-leaf node, then we go to the last child of the node, and perform the steps 3 - 5 again.
- If we are on a leaf, then
k
is not in the tree.
value(for:)
method searches for the given key and if it's in the tree,
it returns the value associated with it, else it returns nil
.
Keys can only be inserted to leaf nodes.
- Perform a search for the key
k
we want to insert. - If we haven't found it and we are on a leaf node, we can insert it.
- If after the search the key
l
which we are standing on is greater thank
:
We insertk
to the position beforel
. - Else:
We insertk
to the position afterl
.
After insertion we should check if the number of keys in the node is in the correct range.
If there are more keys in the node than order*2
, we need to split the node.
- Move up the middle key of the node we want to split, to its parent (if it has one).
- If it hasn't got a parent(it is the root), then create a new root and insert to it.
Also add the old root as the left child of the new root. - Split the node into two by moving the keys (and children, if it's a non-leaf) that were after the middle key to a new node.
- Add the new node as a right child for the key that we have moved up.
After splitting a node it is possible that the parent node will also contain too many keys, so we need to split it also.
In the worst case the splitting goes up to the root (in this case the height of the tree increases).
An insertion to a first order tree looks like this:
The method insert(_:for:)
does the insertion.
After it has inserted a key, as the recursion goes up every node checks the number of keys in its child.
if a node has too many keys, its parent calls the split(child:atIndex:)
method on it.
The root node is checked by the tree itself.
If the root has too many nodes after the insertion the tree calls the splitRoot()
method.
Keys can only be removed from leaf nodes.
- Perform a search for the key
k
we want to remove. - If we have found it:
- If we are on a leaf node:
We can remove the key. - Else:
We overwritek
with its inorder predecessorp
, then we removep
from the leaf node.
- If we are on a leaf node:
After a key have been removed from a node we should check that the node has enough keys.
If a node has fewer keys than the order of the tree, then we should move a key to it,
or merge it with one of its siblings.
If the problematic node has a nearest sibling that has more keys than the order of the tree,
we should perform this operation on the tree, else we should merge the node with one of its siblings.
Let's say the child we want to fix c1
is at index i
in its parent node's children array.
If the child c2
at index i-1
has more keys than the order of the tree:
- We move the key at index
i-1
from the parent node to the childc1
's keys array at index0
. - We move the last key from
c2
to the parent's keys array at indexi-1
. - (If
c1
is non-leaf) We move the last child fromc2
toc1
's children array at index 0.
Else:
- We move the key at index
i
from the parent node to the end of childc1
's keys array. - We move the first key from
c2
to the parent's keys array at indexi
. - (If
c1
isn't a leaf) We move the first child fromc2
to the end ofc1
's children array.
Let's say we want to merge the child c1
at index i
in its parent's children array.
If c1
has a left sibling c2
:
- We move the key from the parent at index
i-1
to the end ofc2
's keys array. - We move the keys and the children(if it's a non-leaf) from
c1
to the end ofc2
's keys and children array. - We remove the child at index
i-1
from the parent node.
Else if c1
only has a right sibling c2
:
- We move the key from the parent at index
i
to the beginning ofc2
's keys array. - We move the keys and the children(if it's a non-leaf) from
c1
to the beginning ofc2
's keys and children array. - We remove the child at index
i
from the parent node.
After merging it is possible that now the parent node contains too few keys,
so in the worst case merging also can go up to the root, in which case the height of the tree decreases.
-
remove(_:)
method removes the given key from the tree. After a key has been deleted,
every node checks the number of keys in its child. If a child has less nodes than the order of the tree, it calls thefix(childWithTooFewKeys:atIndex:)
method. -
fix(childWithTooFewKeys:atIndex:)
method decides which way to fix the child (by moving a key to it, or by merging it), then callsmove(keyAtIndex:to:from:at:)
ormerge(child:atIndex:to:)
method according to its choice.
Written for Swift Algorithm Club by Viktor Szilárd Simkó