README: split into sections, reformat for github.io

Split the rather large README into smaller sections with a pitch and a
ToC at the top. Move the sections into docs/ so that Github Pages can
read them. 'make doc' produces a local HTML tree.

Update the kernel bugs and gotchas list. Add some information that has
been accumulating in Github comments. Remove information about bugs in
kernels earlier than 4.14.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
docs/config.md (new file, 151 lines):

bees Configuration
==================

The only configuration parameter that *must* be provided is the hash
table size. Other parameters are optional or hardcoded, and the defaults
are reasonable in most cases.

Hash Table Sizing
-----------------

Hash table entries are 16 bytes per data block. The hash table stores
the most recently read unique hashes. Once the hash table is full,
each new entry in the table evicts an old entry.

Here are some numbers to estimate appropriate hash table sizes:

| unique data size | hash table size | average dedupe extent size |
|------------------|-----------------|----------------------------|
| 1TB              | 4GB             | 4K                         |
| 1TB              | 1GB             | 16K                        |
| 1TB              | 256MB           | 64K                        |
| 1TB              | 128MB           | 128K  <- recommended       |
| 1TB              | 16MB            | 1024K                      |
| 64TB             | 1GB             | 1024K                      |

Notes:

* If the hash table is too large, no extra dedupe efficiency is
  obtained, and the extra space just wastes RAM. Extra space can also
  slow bees down by preventing old data from being evicted, so bees
  wastes time looking for matching data that is no longer present on
  the filesystem.

* If the hash table is too small, bees extrapolates from matching
  blocks to find matching adjacent blocks in the filesystem that have
  been evicted from the hash table. In other words, bees only needs to
  find one block in common between two extents in order to be able to
  dedupe the entire extents. This provides significantly more dedupe
  hit rate per hash table byte than other dedupe tools.

* When counting unique data in compressed data blocks to estimate
  optimum hash table size, count the *uncompressed* size of the data.

* Another way to approach the hash table size is to simply decide how
  much RAM can be spared without too much discomfort, give bees that
  amount of RAM, and accept whatever dedupe hit rate occurs as a
  result. bees will do the best job it can with the RAM it is given
  (see the sizing sketch below).

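All of the rows in the table above follow from the same arithmetic:
roughly one 16-byte hash table entry must survive in the table per
dedupe extent of unique data. Below is a minimal shell sketch of that
calculation, reproducing the recommended row. The `beeshash.dat` note
at the end is an assumption drawn from the wider bees documentation,
not from this section; check the running instructions for your install.

```bash
# Sizing sketch: hash_table_bytes = unique_data_bytes / avg_extent_bytes * 16
unique_data_bytes=$(( 1024 * 1024 * 1024 * 1024 ))  # 1TB of unique data
avg_extent_bytes=$(( 128 * 1024 ))                  # 128K target extent (recommended row)
hash_table_bytes=$(( unique_data_bytes / avg_extent_bytes * 16 ))
echo "$(( hash_table_bytes / 1024 / 1024 )) MB"     # -> 128 MB

# Assumption, not stated in this section: a typical bees setup sets the
# table size via the size of the beeshash.dat file in $BEESHOME, e.g.:
#   truncate -s "$hash_table_bytes" "$BEESHOME/beeshash.dat"
```
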
Factors affecting optimal hash table size
-----------------------------------------

It is difficult to predict the net effect of data layout and access
patterns on dedupe effectiveness without performing deep inspection of
both the filesystem data and its structure--a task that is as expensive
as performing the deduplication.

* **Compression** on the filesystem reduces the average extent length
  compared to uncompressed filesystems. The maximum compressed extent
  length on btrfs is 128KB, while the maximum uncompressed extent
  length is 128MB. Longer extents decrease the optimum hash table size
  while shorter extents increase it, because the probability of a hash
  table entry being present (i.e. unevicted) in each extent is
  proportional to the extent length.

  As a rule of thumb, the optimal hash table size for a compressed
  filesystem is 2-4x larger than the optimal hash table size for the
  same data on an uncompressed filesystem (see the sketch after this
  list). Dedupe efficiency falls dramatically with hash tables smaller
  than 128MB/TB, because the average dedupe extent size implied by such
  a table is larger than the largest possible compressed extent size
  (128KB).

* **Short writes** also shorten the average extent length and increase
  the optimum hash table size. If a database writes to files randomly
  using 4K page writes, all of these extents will be 4K in length, and
  the hash table size must be increased to retain each one (or the
  user must accept a lower dedupe hit rate).

  Defragmenting files that have had many short writes increases the
  extent length and therefore reduces the optimum hash table size.

* **Time between duplicate writes** also affects the optimum hash
  table size. bees reads data blocks in logical order during its first
  pass, and after that new data blocks are read incrementally, a few
  seconds or minutes after they are written. bees finds more matching
  blocks when less data is written between the matching reads, i.e.
  when fewer blocks have been evicted from the hash table. If most
  identical writes to the filesystem occur near the same time, the
  optimum hash table size is smaller. If most identical writes occur
  over longer intervals of time, the optimum hash table size must be
  larger to avoid evicting hashes from the table before matches are
  found.

  For example, a build server normally writes out very similar source
  code files over and over, so it will need a smaller hash table than
  a backup server which has to refer to the oldest data on the
  filesystem every time a new client machine's data is added to the
  server.

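To make the compression rule of thumb concrete, here is a tiny sketch
scaling the 128MB-per-TB baseline from the sizing table by the quoted
2-4x factor. The multipliers are the only inputs; nothing here is
measured from a real filesystem.

```bash
# Scale the 128MB-per-TB baseline for a compressed filesystem (2-4x).
baseline_mb_per_tb=128
for factor in 2 4; do
    echo "${factor}x: $(( baseline_mb_per_tb * factor )) MB per TB of unique data"
done
# -> 256 MB and 512 MB per TB, the 2-4x range quoted above
```
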
Scanning modes for multiple subvols
-----------------------------------

The `--scan-mode` option affects how bees divides resources between
subvolumes. This is particularly relevant when there are snapshots,
as there are tradeoffs to be made depending on how snapshots are used
on the filesystem.

Note that if a filesystem has only one subvolume (i.e. the root,
subvol ID 5), then the `--scan-mode` option has no effect, as there is
only one subvolume to scan.

The default mode is mode 0, "lockstep". In this mode, each inode of
each subvol is scanned at the same time, before moving to the next
inode in each subvol. This maximizes the likelihood that all of the
references to a snapshot of a file are scanned at the same time, which
takes advantage of VFS caching in the Linux kernel. If snapshots are
created very often, bees will not make very good progress, because it
restarts the filesystem scan from the beginning each time a new
snapshot is created.

Scan mode 1, "independent", simply scans every subvol independently in
parallel. Each subvol's scanner shares time equally with all other
subvol scanners. Whenever a new subvol appears, a new scanner is
created, and the new subvol scanner doesn't affect the behavior of any
existing subvol scanner.

Scan mode 2, "sequential", processes each subvol completely before
proceeding to the next subvol. This is a good mode when using bees for
the first time on a filesystem that already has many existing
snapshots and a high rate of new snapshot creation. Short-lived
snapshots (e.g. those used for `btrfs send`) are effectively ignored,
and bees directs its efforts toward older subvols that are more likely
to be origin subvols for snapshots. By deduping origin subvols first,
bees ensures that future snapshots will already be deduplicated and do
not need to be deduplicated again.

If you are using bees for the first time on a filesystem with many
existing snapshots, you should read about [snapshot gotchas](gotchas.md).

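As a worked example, this is how the sequential mode recommended above
might be selected when starting bees by hand. The mount point is a
placeholder, and the direct `bees` invocation is an assumption; many
installs start bees through a wrapper script or service unit instead,
so adapt this to however bees is launched on your system.

```bash
# Hypothetical invocation: first run on a snapshot-heavy filesystem,
# using scan mode 2 ("sequential") as suggested above.
# /mnt/btrfs-root is a placeholder for your filesystem root mount.
bees --scan-mode 2 /mnt/btrfs-root
```
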
Threads and load management
---------------------------

By default, bees creates one worker thread for each CPU detected.
These threads then perform scanning and dedupe operations. The number
of worker threads can be set with the [`--thread-count` and
`--thread-factor` options](options.md).

If desired, bees can automatically increase or decrease the number of
worker threads in response to system load. This reduces the impact on
the rest of the system: bees pauses when other CPU- and IO-intensive
loads are active on the system, and resumes when the other loads are
inactive. This is configured with the [`--loadavg-target` and
`--thread-min` options](options.md).

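A sketch of how these flags might be combined follows. The threshold
and thread values are arbitrary illustrations, not recommendations,
and the mount point is again a placeholder.

```bash
# Hypothetical example: keep at least 1 worker thread, and let bees
# scale worker threads back up only while the load average stays
# below 5. Flag names are from options.md; values are illustrative.
bees --loadavg-target 5 --thread-min 1 /mnt/btrfs-root
```
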
Log verbosity
-------------

bees can be made less chatty with the [`--verbose` option](options.md).

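For instance, assuming (per options.md, not this section) that
`--verbose` takes a numeric log level where lower values produce less
output:

```bash
# Hypothetical example: lower the log level so only more important
# messages are emitted (assumes --verbose takes a numeric level as
# documented in options.md; check your version's --help).
bees --verbose 4 /mnt/btrfs-root
```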