Skip to content

Glossary

Shared terminology used across HyperbyteDB documentation.


Term Definition
Anti-entropy Deprecated. The cluster.anti_entropy_* config keys have no effect.
Arrow Apache Arrow — in-memory columnar format used by optional columnar MessagePack ingest (columnar-ingest feature).
Batching WAL Optional decorator over the RocksDB WAL that groups multiple append operations into a single write batch (group commit) for higher throughput.
Cardinality The number of unique values for a tag key or the number of unique measurements. High cardinality can degrade performance.
chDB Embedded ClickHouse — query engine and native storage backend via chdb-rust. Data lives in per-measurement ReplacingMergeTree tables under chdb.session_data_path.
Column Family (CF) A RocksDB concept. HyperbyteDB uses separate column families for WAL entries (wal), WAL metadata (wal_meta), general metadata (metadata), and replication state.
Composition Root bootstrap.rs — the single location where all adapters and services are wired together using dependency injection via Arc<dyn Trait>.
Continuous Query (CQ) A named query that runs automatically on a schedule, typically used for downsampling raw data into summary measurements.
Materialized View (MV) A TimeseriesQL downsampling definition backed by ClickHouse MATERIALIZED VIEW objects. Updates incrementally on each source flush.
Drain Graceful node removal procedure: stop accepting writes, flush WAL, wait for replication acks, notify peers, shut down.
Field A key-value pair in a data point that holds the actual measured value. Fields are not indexed. Field types are enforced after first write (Float, Integer, UInteger, String, Boolean).
Figment Configuration loading library used by HyperbyteDB. Merges defaults, TOML config file, and environment variables.
Flush Background process that reads WAL entries and writes them into chDB MergeTree tables via ChdbNativeAdapter. Runs every flush.interval_secs.
Hexagonal Architecture Design pattern where business logic depends only on abstract port traits, with concrete adapters plugged in at the composition root. Also called ports and adapters.
Hinted Handoff Mechanism that stores writes destined for unreachable peers in a local queue and replays them when the peer recovers.
TimeseriesQLesQL Query language compatible with InfluxDB 1.x. Supports SELECT with aggregates/transforms, SHOW commands, DDL, DELETE, and continuous queries.
Line Protocol InfluxDB's text-based wire format for writing time-series data: measurement,tag=val field=val timestamp.
Master-Master Replication Clustering model where every node independently accepts writes and replicates them to all peers asynchronously.
Measurement Analogous to a table in a relational database. Contains a set of tag keys and field keys.
Merkle Tree Not used in the current cluster sync model. Peers align via WAL replication and metadata/WAL sync.
Metadata Database definitions, measurement schemas (field types, tag keys), user accounts, tombstones, and CQ definitions. Stored in RocksDB.
OpenRaft Rust implementation of the Raft consensus protocol. Used by HyperbyteDB for schema mutation ordering in cluster mode.
Point A single data observation: measurement name, tag set, field set, and timestamp (nanoseconds since Unix epoch).
Port An abstract trait in the hexagonal architecture that defines a boundary between business logic and infrastructure (e.g., WalPort, QueryPort, PointsSinkPort).
Precision Timestamp unit for line protocol writes: ns (nanoseconds), us/u (microseconds), ms (milliseconds), s (seconds).
Raft Consensus algorithm used for schema mutations in cluster mode. Ensures all nodes apply CREATE/DROP/DELETE operations in the same order.
RecordBatch Arrow's in-memory columnar data container. Used by optional columnar MessagePack ingest.
Replication Log RocksDB-backed store tracking WAL and mutation acknowledgements from peers. Used for safe WAL truncation in cluster mode.
Retention Policy (RP) Configuration that controls how long data is kept. Each database has one or more RPs. The default RP is autogen with infinite duration.
RocksDB Embedded key-value store. Used for the WAL (durable write log), metadata store, replication log, and Raft state.
Series A unique combination of measurement name and tag set. Each series has its own time-ordered sequence of field values.
Statement Summary Ring buffer tracking recently executed TimeseriesQL statements with query digest, latency, and error status. Exposed via GET /api/v1/statements.
Tag A key-value pair in a data point used for indexing and grouping. Tags are always strings. Stored in metadata for SHOW TAG queries.
Tombstone A metadata record created by DELETE statements. Marks data for exclusion at query time.
WAL (Write-Ahead Log) Durable, ordered log where incoming writes are persisted before the client receives a response. Data in the WAL is flushed to chDB by the background flush service.