GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2026-01-08 20:00:22 +01:00

Author	SHA1	Message	Date
Zygo Blaxell	a5d078d48b	docs: deprecate the `--workaround-btrfs-send` option Emphasize that the option is relevant to old kernels, older than the minimum supportable version threshold. De-emphasize the use case of "send-workaround" as a synonym for "exclude read-only". Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:56 -05:00
Zygo Blaxell	e2587cae9b	docs: expand "Threads and load management" to suggest not running bees so much One of the more obvious ways to reduce bees load is to simply not run it all the time. Explicitly state using maintenance windows as a load management option. SIGUSR1 and SIGUSR2 should have been documented somewhere else before now. Better late than never. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:56 -05:00
Zygo Blaxell	ac581273d3	docs: config.md updates The theories behind bees slowing down when presented with a larger has table turned out to be wrong. The real cause was a very old bug which submitted thousands of `LOGICAL_INO` requests when only a handful of requests were needed. "Compression on the filesystem" -> "Compression in files" Don't be so "dramatic". Be "rapid" instead. Remove "cannot avoid modifying read-only snapshots" as a distinction between subvol and extent scans. Both modes support send workaround and send waiting with no significant distinction. Emphasize extent scan's better handling of many snapshots. Also reflinks. Add some discussion of `--throttle-factor`. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:56 -05:00
Zygo Blaxell	7fcde97b70	docs: update the bug reporting and status instructions Thread names have changed. Document some of the newer ones. Don't jump immediately to blaming poor performance on qgroups or autodefrag. These do sometimes have kernel regressions but not all the time. Emphasize advantage of controlling bees deferred work requests at the source, before btrfs gets stuck committing them. Avoid asserting that it's OK for gdb to crash. Remove mention of lower-layer block device issues wrt corruption. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:55 -05:00
Zygo Blaxell	e457f502b7	docs: update kernel bugs page for January 2025 "Kernel" -> "Linux kernel". If you can run bees on a kernel that isn't Linux, congratulations! Emphasize the age of the data corruption warnings. Once 5.4 reaches EOL we can remove those. Simplify the discussion of old kernels and API levels. There's a new optional kernel API for `openat2` support at 5.6. The absolute minimum kernel version is still 4.2, and will not increase to 4.15 until the subvol scanners are removed. Remove discussion of bees support for kernels 4.19 (which recently reached EOL) and earlier. The `LOGICAL_INO` vs dedupe bug is actually a `LOGICAL_INO` vs clone bug. Dedupe isn't necessary to reproduce it. Remove a stray ')'. Strip out most of the discussion of slow backrefs, as they are no longer a concern on the range of supported kernel versions. Leave some description there because bees still has some vestigial workarounds. Remove `btrfs send` from the "Unfixed kernel bugs" section, which makes the section empty, so remove the section too. bees now handles send on a subvol reasonably well. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:55 -05:00
Zygo Blaxell	46815f1a9d	docs: update README.md Emphasize "large" is an upper bound on the size of filesystem bees can handle. New strengths: largest extent first for fixed maintenance windows, scans data only once (ish), recovers more space Removed weaknesses: less temporary space Need more caps than `CAP_SYS_ADMIN`. Emphasize DATA CORRUPTION WARNING is an old-kernel thing. Update copyright year. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:55 -05:00
Zygo Blaxell	0d251d30f4	docs: update feature interaction lists Tested on larger filesystems than 100T too, but let's use Fermi approximation. Next size is 1P. Removed interaction with block-level SSD caching subsystems. These are really btrfs metadata vs. a lower block layer, and have nothing to do with bees. Added mixed block groups to the tested list, as mixed block groups required explicit support in the extent scanner. Added btrfs-convert to the tested list. btrfs-convert has various problems with space allocation in general, but these can be solved by carefully ordered balances after conversion, and they have nothing to do with bees. In-kernel dedupe is dead and the stubs were removed years ago. Remove it from the list. btrfs send now plays nicely with bees on all supportable kernels, now that stable/linux-4.19.y is dead. Send workaround is only needed for kernels before v5.4 (technically v5.2, but nobody should ever mount a btrfs with kernel v5.1 to v5.3). bees will pause automatically when deduping a subvol that is currently running a send. bees will no longer gratuitously refragment data that was defragmented by autodefrag. Explicitly list all the RAID profiles tested so far, as there have been some new ones. Explicitly list other deduplicators tested. Sort the list of btrfs features alphabetically. Add scrub and balance, which have been tested with bees since the beginning. New tested btrfs features: block-group-tree, raid1c3, raid1c4. New untested btrfs features: squotas, raid-stripe-tree. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:55 -05:00
Zygo Blaxell	d4900cc5d5	docs: default throttle is zero Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	bd9dc0229b	docs: add `--throttle-factor` option Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	69e9bdfb0f	docs: post-5.7 toxic extent handling Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-01 00:17:52 -05:00
Zygo Blaxell	7b0ed6a411	docs: default scan mode is 4, "extent" Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-01 00:17:51 -05:00
Zygo Blaxell	d5a6c30623	docs: old missing features are not missing any more The extent scan mode has been implemented (partially, but close enough to win benchmarks). New features include several nuisance dedupe countermeasures. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-01 00:17:51 -05:00
Zygo Blaxell	25f7ced27b	docs: add scan mode 4, "extent" Extent is a different kind of scan mode, so introduce the concept of the two kinds of scan mode, and rearrange the description of scan modes along the new boundaries. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-01 00:17:51 -05:00
Zygo Blaxell	da32667e02	docs: add event counters for extent scan Add a section for all the new extent scan event counters. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-01 00:17:51 -05:00
Zygo Blaxell	e22653e2c6	docs: remove "matched_" prefix event counters We can no longer reliably determine the number of hash table matches, since we'll stop counting after the first one. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	54ed6e1cff	docs: event counter updates after fixing counter names and scan_one_extent improvements Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	5414c7344f	docs: resolve_overflow limit is only 655050 when BTRFS_MAX_EXTENT_REF_COUNT is Use the current header value in the doc. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	088cbc951a	docs: event counter updates after readahead sanity improvements Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	37f5b1bfa8	docs: add allocator regression in 6.0+ kernels Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	30a4fb52cb	Revert "context: add experimental code for avoiding tiny extents" because this problem is better solved elsewhere. This reverts commit `11fabd66a8`.	2024-11-30 23:30:33 -05:00
Zygo Blaxell	faac895568	docs: add the 6.10..6.12 delayed refs bug Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	124507232f	docs: add vmalloc bug to kernel bugs list The bug is: v6.3-rc6: f349b15e183d mm: vmalloc: avoid warn_alloc noise caused by fatal signal The fixes are: v6.4: 95a301eefa82 mm/vmalloc: do not output a spurious warning when huge vmalloc() fails v6.3.10: c189994b5dd3 mm/vmalloc: do not output a spurious warning when huge vmalloc() fails The bug has been backported to LTS, but the fix has not: v6.2.11: 61334bc29781 mm: vmalloc: avoid warn_alloc noise caused by fatal signal v6.1.24: ef6bd8f64ce0 mm: vmalloc: avoid warn_alloc noise caused by fatal signal v5.15.107: a184df0de132 mm: vmalloc: avoid warn_alloc noise caused by fatal signal Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-07-06 13:50:12 -04:00
Zygo Blaxell	3c5e13c885	context: log when LOGICAL_INO returns 0 refs There was a bug in kernel 6.3 where LOGICAL_INO with IGNORE_OFFSET sometimes fails to ignore the offset. That bug is now fixed, but LOGICAL_INO still returns 0 refs much more often than seems appropriate. This is most likely because bees frequently deletes extents while there is still work waiting for them in Task queues. In this case, LOGICAL_INO correctly returns an empty list, because every reference to some extent is deleted, but the new extent tree with that extent removed is not yet committed in btrfs. Add a DEBUG-level log message and an event counter to track these events. In the absence of a kernel bug, the debug message may indicate CPU time was wasted performing a search whose outcome could have been predicted. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-07-06 12:54:33 -04:00
Zygo Blaxell	a6ca2fa2f6	docs: add IGNORE_OFFSET regression in 6.2..6.3 to kernel bugs list This doesn't impact the current bees master, but it does break bees-next. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-07-06 12:49:36 -04:00
Zygo Blaxell	da3ef216b1	docs: working around `btrfs send` issues isn't really a feature The critical kernel bugs in send have been fixed for years. The limitations that remain aren't bugs, and bees has no sustainable workaround for them. Also update copyright year range. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-03-07 10:25:51 -05:00
Zygo Blaxell	b7665d49d9	docs: fill in missing LTS backports for "1119a72e223f btrfs: tree-checker: do not error out if extent ref hash doesn't match" Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-03-07 10:17:44 -05:00
Zygo Blaxell	9b60f2b94d	docs: add "missing" features that have been in development for some time already Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:42:42 -05:00
Zygo Blaxell	8978d63e75	docs: update GCC versions list and clarify markdown statement I don't know if anyone else is testing GCC versions before 8.0 any more, but I'm not. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:39:55 -05:00
Zygo Blaxell	82474b4ef4	docs: update front page At least one user was significantly confused by "designed for large filesystems". The btrfs send workarounds aren't new any more. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:38:50 -05:00
Zygo Blaxell	73834beb5a	docs: minor changes to how-it-works based on past user questions Clarify that "too large" and "too small" are some distance away from each other. The Goldilocks zone is _wide_. The interval between cache drops is now shorter. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:37:37 -05:00
Zygo Blaxell	c92ba117d8	docs: various gotcha updates Fixing the obviously wrong and out of date stuff. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:37:23 -05:00
Zygo Blaxell	c354e77634	docs: simplify the exit-with-SIGTERM description The description now matches the code again. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:36:44 -05:00
Zygo Blaxell	f21569e88c	docs: update the feature interactions page Fixing the obviously out-of-date and no-longer-tested things. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:34:22 -05:00
Zygo Blaxell	3d5ebe4d40	docs: update kernel bugs and workarounds list for 6.2.0 Remove some of the repetition to make the document easier to edit. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-25 03:32:52 -05:00
Zygo Blaxell	28ee2ae1a8	docs: fix broken link in options.md Links in docs/ are relative to docs/, not the top level. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-23 00:08:54 -05:00
Zygo Blaxell	9587c40677	docs: add crawl_again, drop crawl_restart Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-05 01:10:17 -05:00
Zygo Blaxell	c5889049f0	docs: remove duplicate (and wrong) default scan mode The default scan mode is found in config.md. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-05 01:10:17 -05:00
Adam Faiz	ecaed09128	docs: fix reference direction The Dependencies list is above the Packaging section, not below. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-29 06:25:33 -05:00
Zygo Blaxell	48dd2a45fe	docs: remove the line discussing 'max_transid' in recent scan mode This makes the doc match the code again. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-20 20:51:01 -05:00
Zygo Blaxell	984ceeb2a5	docs: update documentation for new 'recent' scan mode Also attempted to clarify the descriptions of the modes based on feedback and questions from users over the years. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-20 20:51:01 -05:00
Zygo Blaxell	84f91af503	context: don't let multiple worker Tasks get stuck on a single extent or inode When two Tasks attempt to lock the same extent, append the later Task to the earlier Task's post-exec work queue. This will guarantee that all Tasks which attempt to manipulate the same extent will execute sequentially, and free up threads to process other extents. Similarly, if two scanner threads operate on the same inode, any dedupe they perform will lock out other scanner threads in btrfs. Avoid this by serializing Task objects that reference the same file. This does theoretically use an unbounded amount of memory, but in practice a Task that encounters a contended extent or inode quickly stops spawning new Tasks that might increase the queue size, and all Tasks that might contend for the same lock(s) end up on a single FIFO queue. Note that the scope of inode locks is intentionally global, i.e. when an inode is locked, it locks every inode with the same number in every subvol. This avoids significant lock contention and task queue growth when the same inode with the same file extents appear in snapshots. Fixes: https://github.com/Zygo/bees/issues/158 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-20 20:51:00 -05:00
Zygo Blaxell	9cdeb608f5	bees: drop the balance/logical workaround that has been disabled for two years Kernels that needed the balance workaround frankly are too buggy to run bees at all. The workaround also makes the locking stories around logical_ino calls and process exit complicated, so get rid of it completely. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-20 20:50:58 -05:00
Zygo Blaxell	a32cd5247f	docs: update kernel bugs list for 5.18 ptvf fix Also correct my own style for the fixed version column. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-08-17 13:04:06 -04:00
Zygo Blaxell	9c68f15474	README: update copyright year 2022 It has been some years since the copyright statement was updated. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-07-29 22:20:02 -04:00
Zygo Blaxell	5f3cb9b374	docs: update kernel bugs list for 2022-07-29 * RAID1 device count problems fixed * log tree replay parent transid verify failure in 5.18 and 5.19 added, patches available but not upstream yet * flushoncommit issues fixed, discussion section removed * LOGICAL_INO vs dedupe hang added Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-07-29 22:07:26 -04:00
Zygo Blaxell	007067b83f	docs: add missing 'adjust_offset_hit' counter Reported by York-Simon Johannsen via github issue 208. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-12-19 15:10:02 -05:00
suorcd	bb5160987e	docs: spell "snapshot" correctly https://github.com/Zygo/bees/pull/209 Edited: regenerate docs for the downstream change in index.md. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-12-19 15:08:26 -05:00
Zygo Blaxell	670fce5be5	resolve: reword the too-many-duplicates exception message For one thing, it should _say_ that there are too many duplicates. We were making the user read the manual to find that out. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	7f67f55746	docs: remove some stray whitespace Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	eb2630dee6	docs: document resolve_overflow In commit `d9e3c0070b` "context: stop creating new refs when there are too many already" we added a new counter, but didn't document it. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00

1 2

95 Commits