Database interactions

Database object

class rocksdb.DB
__init__(db_name, Options opts, read_only=False)
Parameters:
  • db_name (unicode) – Name of the database to open

  • opts (rocksdb.Options) – Options for this specific database

  • read_only (bool) – If True the database is opened read-only. All DB calls which modify data will raise an Exception.

put(key, value, sync=False, disable_wal=False, ignore_missing_column_families=False, no_slowdown=False, low_pri=False)

Set the database entry for “key” to “value”.

Parameters:
  • key (bytes) – Name for this entry

  • value (bytes) – Data for this entry

  • sync (bool) –

    If True, the write will be flushed from the operating system buffer cache (by calling WritableFile::Sync()) before the write is considered complete. If this flag is true, writes will be slower.

    If this flag is False, and the machine crashes, some recent writes may be lost. Note that if it is just the process that crashes (i.e., the machine does not reboot), no writes will be lost even if sync == False.

    In other words, a DB write with sync == False has similar crash semantics as the “write()” system call. A DB write with sync == True has similar crash semantics to a “write()” system call followed by “fdatasync()”.

  • disable_wal (bool) – If True, writes will not first go to the write ahead log, and the write may got lost after a crash.

  • ignore_missing_column_families (bool) – If True and if user is trying to write to column families that don’t exist (they were dropped), ignore the write (don’t return an error). If there are multiple writes in a WriteBatch, other writes will succeed.

  • no_slowdown (bool) – If True and we need to wait or sleep for the write request, fails immediately with Status::Incomplete().

  • low_pri (bool) – If True, this write request is of lower priority if compaction is behind. In this case, no_slowdown = true, the request will be cancelled immediately with Status::Incomplete() returned. Otherwise, it will be slowed down. The slowdown value is determined by RocksDB to guarantee it introduces minimum impacts to high priority writes.

delete(key, sync=False, disable_wal=False, ignore_missing_column_families=False, no_slowdown=False, low_pri=False)

Remove the database entry for “key”.

Parameters:
Raises:

rocksdb.errors.NotFound – If the key did not exists

merge(key, value, sync=False, disable_wal=False, ignore_missing_column_families=False, no_slowdown=False, low_pri=False)

Merge the database entry for “key” with “value”. The semantics of this operation is determined by the user provided merge_operator when opening DB.

See rocksdb.DB.put() for the parameters

Raises:

rocksdb.errors.NotSupported if this is called and no rocksdb.Options.merge_operator was set at creation

write(batch, sync=False, disable_wal=False, ignore_missing_column_families=False, no_slowdown=False, low_pri=False)

Apply the specified updates to the database.

Parameters:
get(key, verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')
Parameters:
  • key (bytes) – Name to get

  • verify_checksums (bool) – If True, all data read from underlying storage will be verified against corresponding checksums.

  • fill_cache (bool) – Should the “data block”, “index block” or “filter block” read for this iteration be cached in memory? Callers may wish to set this field to False for bulk scans.

  • snapshot (rocksdb.Snapshot) – If not None, read as of the supplied snapshot (which must belong to the DB that is being read and which must not have been released). Is it None a implicit snapshot of the state at the beginning of this read operation is used

  • read_tier (string) –

    Specify if this read request should process data that ALREADY resides on a particular cache. If the required data is not found at the specified cache, then rocksdb.errors.Incomplete is raised.

    Use all if a fetch from disk is allowed.
    Use cache if only data from cache is allowed.

Returns:

None if not found, else the value for this key

multi_get(keys, verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')
Parameters:

keys (list of bytes) – Keys to fetch

For the other params see rocksdb.DB.get()

Returns:

A dict where the value is either bytes or None if not found

Raises:

If the fetch for a single key fails

Note

keys will not be “de-duplicated”. Duplicate keys will return duplicate values in order.

key_may_exist(key, fetch=False, verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')

If the key definitely does not exist in the database, then this method returns False, else True. If the caller wants to obtain value when the key is found in memory, fetch should be set to True. This check is potentially lighter-weight than invoking DB::get(). One way to make this lighter weight is to avoid doing any IOs.

Parameters:
  • key (bytes) – Key to check

  • fetch (bool) – Obtain also the value if found

For the other params see rocksdb.DB.get()

Returns:

  • (True, None) if key is found but value not in memory

  • (True, None) if key is found and fetch=False

  • (True, <data>) if key is found and value in memory and fetch=True

  • (False, None) if key is not found

iterkeys(verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')

Iterate over the keys

For other params see rocksdb.DB.get()

Returns:

A iterator object which is not valid yet. Call first one of the seek methods of the iterator to position it

Return type:

rocksdb.BaseIterator

itervalues(verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')

Iterate over the values

For other params see rocksdb.DB.get()

Returns:

A iterator object which is not valid yet. Call first one of the seek methods of the iterator to position it

Return type:

rocksdb.BaseIterator

iteritems(verify_checksums=False, fill_cache=True, snapshot=None, read_tier='all')

Iterate over the items

For other params see rocksdb.DB.get()

Returns:

A iterator object which is not valid yet. Call first one of the seek methods of the iterator to position it

Return type:

rocksdb.BaseIterator

snapshot()

Return a handle to the current DB state. Iterators created with this handle will all observe a stable snapshot of the current DB state.

Return type:

rocksdb.Snapshot

get_property(prop)

DB implementations can export properties about their state via this method. If “property” is a valid property understood by this DB implementation, a byte string with its value is returned. Otherwise None

Valid property names include:

  • b"rocksdb.num-files-at-level<N>": return the number of files at level <N>,

    where <N> is an ASCII representation of a level number (e.g. “0”).

  • b"rocksdb.stats": returns a multi-line byte string that describes statistics

    about the internal operation of the DB.

  • b"rocksdb.sstables": returns a multi-line byte string that describes all

    of the sstables that make up the db contents.

  • b"rocksdb.num-immutable-mem-table": Number of immutable mem tables.

  • b"rocksdb.mem-table-flush-pending": Returns 1 if mem table flush is pending, otherwise 0.

  • b"rocksdb.compaction-pending": Returns 1 if a compaction is pending, otherweise 0.

  • b"rocksdb.background-errors": Returns accumulated background errors encountered.

  • b"rocksdb.cur-size-active-mem-table": Returns current size of the active memtable.

get_live_files_metadata()

Returns a list of all table files.

It returns a list of dict’s were each dict has the following keys.

name

Name of the file

level

Level at which this file resides

size

File size in bytes

smallestkey

Smallest user defined key in the file

largestkey

Largest user defined key in the file

smallest_seqno

smallest seqno in file

largest_seqno

largest seqno in file

compact_range(begin=None, end=None, **options)

Compact the underlying storage for the key range [begin,end]. The actual compaction interval might be superset of [begin, end]. In particular, deleted and overwritten versions are discarded, and the data is rearranged to reduce the cost of operations needed to access the data.

This operation should typically only be invoked by users who understand the underlying implementation.

begin == None is treated as a key before all keys in the database. end == None is treated as a key after all keys in the database. Therefore the following call will compact the entire database: db.compact_range().

Note that after the entire database is compacted, all data are pushed down to the last level containing any data. If the total data size after compaction is reduced, that level might not be appropriate for hosting all the files. In this case, client could set change_level to True, to move the files back to the minimum level capable of holding the data set or a given level (specified by non-negative target_level).

Parameters:
  • begin (bytes) – Key where to start compaction. If None start at the beginning of the database.

  • end (bytes) – Key where to end compaction. If None end at the last key of the database.

  • change_level (bool) – If True, compacted files will be moved to the minimum level capable of holding the data or given level (specified by non-negative target_level). If False you may end with a bigger level than configured. Default is False.

  • target_level (int) – If change_level is true and target_level have non-negative value, compacted files will be moved to target_level. Default is -1.

  • bottommost_level_compaction (string) –

    For level based compaction, we can configure if we want to skip/force bottommost level compaction. By default level based compaction will only compact the bottommost level if there is a compaction filter. It can be set to the following values.

    skip

    Skip bottommost level compaction

    if_compaction_filter

    Only compact bottommost level if there is a compaction filter. This is the default.

    force

    Always compact bottommost level

options

Returns the associated rocksdb.Options instance.

Note

Changes to this object have no effect anymore. Consider this as read-only

Iterator

class rocksdb.BaseIterator

Base class for all iterators in this module. After creation a iterator is invalid. Call one of the seek methods first before starting iteration

seek_to_first()

Position at the first key in the source

seek_to_last()

Position at the last key in the source

seek(key)
Parameters:

key (bytes) – Position at the first key in the source that at or past

Methods to support the python iterator protocol

__iter__()
__next__()
__reversed__()

Snapshot

class rocksdb.Snapshot

Opaque handler for a single Snapshot. Snapshot is released if nobody holds a reference on it. Retrieved via rocksdb.DB.snapshot()

WriteBatch

class rocksdb.WriteBatch

WriteBatch holds a collection of updates to apply atomically to a DB.

The updates are applied in the order in which they are added to the WriteBatch. For example, the value of “key” will be “v3” after the following batch is written:

batch = rocksdb.WriteBatch()
batch.put(b"key", b"v1")
batch.delete(b"key")
batch.put(b"key", b"v2")
batch.put(b"key", b"v3")
__init__(data=None)

Creates a WriteBatch.

Parameters:

data (bytes) – A serialized version of a previous WriteBatch. As retrieved from a previous .data() call. If None a empty WriteBatch is generated

put(key, value)

Store the mapping “key->value” in the database.

Parameters:
  • key (bytes) – Name of the entry to store

  • value (bytes) – Data of this entry

merge(key, value)

Merge “value” with the existing value of “key” in the database.

Parameters:
  • key (bytes) – Name of the entry to merge

  • value (bytes) – Data to merge

delete(key)

If the database contains a mapping for “key”, erase it. Else do nothing.

Parameters:

key (bytes) – Key to erase

clear()

Clear all updates buffered in this batch.

Note

Don’t call this method if there is an outstanding iterator. Calling rocksdb.WriteBatch.clear() with outstanding iterator, leads to SEGFAULT.

data()

Retrieve the serialized version of this batch.

Return type:

bytes

count()

Returns the number of updates in the batch

Return type:

int

__iter__()

Returns an iterator over the current contents of the write batch.

If you add new items to the batch, they are not visible for this iterator. Create a new one if you need to see them.

Note

Calling rocksdb.WriteBatch.clear() on the write batch invalidates the iterator. Using a iterator where its corresponding write batch has been cleared, leads to SEGFAULT.

Return type:

rocksdb.WriteBatchIterator

WriteBatchIterator

class rocksdb.WriteBatchIterator
__iter__()

Returns self.

__next__()

Returns the next item inside the corresponding write batch. The return value is a tuple of always size three.

First item (Name of the operation):

  • "Put"

  • "Merge"

  • "Delete"

Second item (key):

Key for this operation.

Third item (value):

The value for this operation. Empty for "Delete".

Repair DB

repair_db(db_name, opts)
Parameters:
  • db_name (unicode) – Name of the database to open

  • opts (rocksdb.Options) – Options for this specific database

If a DB cannot be opened, you may attempt to call this method to resurrect as much of the contents of the database as possible. Some data may be lost, so be careful when calling this function on a database that contains important information.

Errors

exception rocksdb.errors.NotFound
exception rocksdb.errors.Corruption
exception rocksdb.errors.NotSupported
exception rocksdb.errors.InvalidArgument
exception rocksdb.errors.RocksIOError
exception rocksdb.errors.MergeInProgress
exception rocksdb.errors.Incomplete