Write-ahead log

    Starting with version 2.2 ArangoDB stores all data-modification operation inits write-ahead log. The write-ahead log is sequence of append-only files containingall the write operations that were executed on the server.

    It is used to run data recovery after a server crash, and can also be used ina replication setup when slaves need to replay the same sequence of operations ason the master.

    By default, each write-ahead logfile is 32 MiB in size. This size is configurable via theoption —wal.logfile-size.When a write-ahead logfile is full, it is set to read-only, and following operations willbe written into the next write-ahead logfile. By default, ArangoDB will reserve somespare logfiles in the background so switching logfiles should be fast. How many reservelogfiles ArangoDB will try to keep available in the background can be controlled by theconfiguration option —wal.reserve-logfiles.

    Long-running transactions prevent write-ahead logfiles from being fully garbage-collectedbecause it is unclear whether a transaction will commit or abort. Long-running transactionscan thus block the garbage-collection progress and should therefore be avoided at all costs.

    On a system that acts as a replication master, it is useful to keep a few of the already collected write-ahead logfiles so replication slaves still can fetch data fromthem if required. How many collected logfiles will be kept before they get deleted isconfigurable via the option —wal.historic-logfiles.

    For all write-ahead log configuration options, please refer to the pageWrite-ahead log options.

    RocksDB WAL Details

    Note: In rocksdb the WAL options are all prefixed with .The options do have no effect.

    The individual RocksDB WAL files are per default about 64 MiB big.The size will always be proportionally sized to the value specified via. The value specifies the amount of data to buildup in memory (backed by the unsorted WAL on disk) before converting it to asorted on-disk file.

    Larger values can increase performance, especially during bulk loads.Up to write buffers may be held in memoryat the same time, so you may wish to adjust this parameter to control memoryusage. A larger write buffer will result in a longer recovery time the nexttime the database is opened.