GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2025-08-23 22:42:20 +02:00

Author	SHA1	Message	Date
Zygo Blaxell	ee5c971d77	fsync: fix signed comparison of stf.f_type Build fails on 32-bit Slackware because GCC 11's `-Werror=sign-compare` is stricter than necessary: cc -Wall -Wextra -Werror -O3 -I../include -D_FILE_OFFSET_BITS=64 -std=c99 -O2 -march=i586 -mtune=i686 -o bees-version.o -c bees-version.c bees.cc: In function 'void bees_fsync(int)': bees.cc:426:24: error: comparison of integer expressions of different signedness: '__fsword_t' {aka 'int'} and 'unsigned int' [-Werror=sign-compare] 426 | if (stf.f_type != BTRFS_SUPER_MAGIC) { | ^ To work around this, cast `stf.f_type` to the same type as `BTRFS_SUPER_MAGIC`, so it has the same number of bits that we're looking for in the magic value. Fixes: https://github.com/Zygo/bees/issues/317 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-07-03 21:48:40 -04:00
Zygo Blaxell	d37f916507	tempfile: don't need to update the inode if the flags don't change A small performance optimization, given that we are constantly clobbering the file with new content. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-29 23:34:10 -04:00
Zygo Blaxell	3a17a4dcdd	tempfile: make sure FS_COMPR_FL stays set btrfs will set the FS_NOCOMP_FL flag when all of the following are true: 1. The filesystem is not mounted with the `compress-force` option 2. Heuristic analysis of the data suggests the data is compressible 3. Compression fails to produce a result that is smaller than the original If the compression ratio is 40%, and the original data is 128K long, then compressed data will be about 52K long (rounded up to 4K), so item 3 is usually false; however, if the original data is 8K long, then the compressed data will be 8K long too, and btrfs will set FS_NOCOMP_FL. To work around that, keep setting FS_COMPR_FL and clearing FS_NOCOMP_FL every time a TempFile is reset. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-29 23:25:36 -04:00
Zygo Blaxell	4039ef229e	tempfile: clear FS_NOCOW_FL while setting FS_COMPR_FL FS_NOCOW_FL can be inherited from the subvol root directory, and it conflicts with FS_COMPR_FL. We can only dedupe when FS_NOCOW_FL is the same on src and dst, which means we can only dedupe when FS_NOCOW_FL is clear, so we should clear FS_NOCOW_FL on the temporary files we create for dedupe. Fixes: https://github.com/Zygo/bees/issues/314 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-29 23:24:55 -04:00
Zygo Blaxell	e9d4aa4586	roots: make the "idle" label useful Apply the "idle" label only when the crawl is finished _and_ its transid_max is up to date. This makes the keyword "idle" better reflect when bees is not only finished crawling, but also scanning the crawled extents in the queue. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 23:06:14 -04:00
Zygo Blaxell	504f4cda80	progress: move the "idle" cell to the next cycle ETA column When all extents within a size tier have been queued, and all the extents belong to the same file, the queue might take a long time to fully process. Also, any progress that is made will be obscured by the "idle" tag in the "point" column. Move "idle" to the next cycle ETA column, since the ETA duration will be zero, and no useful information is lost since we would have "-" there anyway. Since the "point" column can now display the maximum value, lower that maximum to 999999 so that we don't use an extra column. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 22:33:05 -04:00
Zygo Blaxell	6c36f4973f	extent scan: log the bfr when removing a prealloc extent With subvol scan, the crawl task name is the subvol/inode pair corresponding to the file offset in the log message. The identity of the file can be determined by looking up the subvol/inode pair in the log message. With extent scan, the crawl task name is the extent bytenr corresponding to the file offset in the log message. This extent is deleted when the log message is emitted, so a later lookup on the extent bytenr will not find any references to the extent, and the identity of the file cannot be determined. Log the bfr, which does a /proc lookup on the name of the fd, so the filename is logged. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 22:33:05 -04:00
Zygo Blaxell	337bbffac1	extent scan: drop a nonsense trace message This message appears only during exception backtraces, but it doesn't carry any useful information. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	527396e5cb	extent scan: integrate seeker debug output stream Send both tree_search ioctl and `seek_backward` debug logs to the same output stream, but only write that stream to the debug log if there is an exception. The feature remains disabled at compile time. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	bc7c35aa2d	extent scan: only write a detailed debug log when there's an exception Note that when enabled, the logs are still very CPU-intensive, but most of the logs will be discarded. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	0953160584	trace: export `exception_check` We need to call this from more than one place in bees. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	9a9644659c	trace: clean up the formatting around top-level exception log messages Fewer newlines. More consistent application of the "TRACE:" prefix. All at the same log level. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	fd53bff959	extent scan: drop out-of-date comment The comment describes an earlier version which submitted each extent ref as a separate Task, but now all extent refs are handled by the same Task to minimize the amount of time between processing the first and last reference to an extent. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	9439dad93a	extent scan: extra check to make sure no Tasks are started when throttled Previously `scan()` would run the extent scan loop once, and enqueue one extent, before checking for throttling. Do an extra check before that, and bail out so that zero extents are enqueued when throttled. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	ef9b4b3a50	extent scan: shorten task name for extent map Linux kernel thread names are hardcoded at 16 characters. Every character counts, and "0x" wastes two. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	8331f70db7	progress: fix ETA calculations The "tm_left" field was the estimated _total_ duration of the crawl, not the amount of time remaining. The ETA timestamp was then calculated based on the estimated time to run the crawl if it started _now_, not at the start timestamp. Fix the duration and ETA calculations. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-06-18 21:17:48 -04:00
Zygo Blaxell	47243aef14	hash: handle $BEESHOME on btrfs too The `_nothrow` variants of `do_ioctl` return true when they succeed, which is the opposite of what `ioctl` does. Fix the logic so bees can correctly identify its own hash table when it's on the same filesystem as the target. Fixes: `f6908420ad` ("hash: handle $BEESHOME on non-btrfs") Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-17 21:18:08 -05:00
Zygo Blaxell	a670aa5a71	extent scan: don't divide by zero if there were no loops Commit `183b6a5361` ("extent scan: refactor BeesCrawl, BeesScanMode*") moved some statistics calculations out of the loop in `find_next_extent`, but did not ensure that the statistics would not be calculated if the loop had not executed any iterations. In rare instances, the function returns without entering the loop at all, which results in divide by zero. Add a check just before doing that. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-13 23:59:42 -05:00
Zygo Blaxell	51b3bcdbe4	trace: deprecate BEESLOGTRACE, align trace logs with exception notices Exceptions were logged at level NOTICE while the stack traces were logged at level DEBUG. That produced useless noise in the output with `-v5` or `-v6`, where there were exception headings logged, but no details. Fix that by placing the exceptions and traces at level DEBUG, but prefix them with `TRACE:` for easy grepping. Most of the events associated with BEESLOGTRACE either never happen, or they are harmless (e.g. trying to open deleted files or subvols). Reassign them to ordinary BEESLOGDEBUG, with one exception for unrecognized Extent flags that should be debugged if any appear. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-13 23:59:42 -05:00
Zygo Blaxell	ae58401d53	trace: avoid one copy in every trace function While investigating https://github.com/Zygo/bees/issues/282 I noticed that we're doing at least one unnecessary extra copy of the functor in BEESTRACE. Get rid of it with a const reference. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-13 23:59:42 -05:00
Zygo Blaxell	3e7eb43b51	BeesStringFile: figure out when to call--or _not_ call--fsync Older kernel versions featured some bugs in btrfs `fsync`, which could leave behind "ghost dirents", orphan filename items that did not have a corresponding inode. These dirents were created during log replay during the first mount after a crash due to several different bugs in the log tree and its use over the years. The last known bug of this kind was fixed in kernel 5.16. As of this writing, no fixes for this bug have been backported to any earlier LTS kernel. Some filesystems, including btrfs, will flush the contents of a new file before renaming it over an old file. On paper, btrfs can do this very cheaply since the contents of the new file are not referenced, and the old file not dereferenced, until a tree commit which includes both actions atomically; however, in real life, btrfs provides `fsync`-like semantics and uses the log-tree infrastructure to implement them, which compromises performance and acts as a magnet for bugs. The benefit of this trade-off is that `rename` can be used as a synchronization point for data outside of the btrfs, which would not happen if everything `rename` does was simply deferred to the next tree commit. The cost of this trade-off is that for the first 8 years of its existence, bees would trigger the bug so often that the project recommended its users put $BEESHOME in its own subvol to make it easy to remove ghost dirents left behind by the bug. Some other filesystems, such as xfs, don't have any special semantics for `rename`, and require `fsync` to avoid garbage or missing data after a crash. Even filesystems which do have a special case for `rename` can be configured to turn it off. btrfs will silently delete data from files in the event that an unrecoverable data block write error occurs. 
Kernel version 6.2 adds important new and unexpected cases where this can happen on filesystems using raid56 data, but it also happens in all usable btrfs versions (the silent deletion behavior was introduced in kernel version 3.9). Unrecoverable write errors are currently reported to userspace only through `fsync`. Since the failed extents are deleted, they cannot be detected via csum failures or scrub after the fact--and it's too late by then, the data is already gone. `fsync` is the last opportunity to detect the write failure before the `rename`. If the error is not detected, the contents of the file will be silently discarded in btrfs. The impact on bees is that scans will abruptly restart from zero after a crash combined with some other reasonably common failures. Putting all of this together leads to a rather complex workaround: if the filesystem under $BEESHOME (specifically, the filesystem where BeesStringFile objects such as `beescrawl.dat` are written) is a btrfs filesystem, and the host kernel is a version prior to 5.16, then don't call `fsync` before `rename`. In all other cases, do call `fsync`, and prevent dependent writes (i.e. the following `rename`) in the event of errors. Since present kernel versions still require `fsync`, we don't need an upper bound on the kernel version check until someone fixes btrfs `rename` (or perhaps adds a flag to `renameat2` which prevents use of the log tree) in the kernel. Once that fix happens, we can drop the `fsync` call for kernels after that fixed version. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-10 21:04:20 -05:00
Zygo Blaxell	88b1e4ca6e	main: unconditionally enable workaround for the logical_ino-vs-clone kernel bug This obviously doesn't fix or prevent the kernel bug, but it does prevent bees from triggering the bug without assistance from another application. The bug can still be triggered by running bees at the same time as an application which uses clone or LOGICAL_INO. `btdu` uses LOGICAL_INO, while `cp` from coreutils (and many others) use clone (reflink copy). Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 23:14:16 -05:00
Zygo Blaxell	c1d7fa13a5	roots: drop unnecessary mutex unlock in stop_request In commit `31b2aa3c0d` ("context: speed up orderly process termination"), the stop request was split into two methods after the mutex unlock. Now that there's nothing after the mutex unlock in `stop_request`, there's no need for an explicit unlock to do what the destructor would have done anyway. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 23:14:16 -05:00
Zygo Blaxell	aa39bddb2d	extent scan: implement an experimental ordered scan mode Parallel scan runs each extent size tier in a separate thread. The threads compete to process extents within the tier's size range. Ordered scan processes each extent size tier completely before moving on to the next. In theory, this means large extents always get processed quickly, especially when new ones appear, and the queue does not fill up with small extents. In practice, the multi-threaded scanner massively outperforms the single-threaded scanner, unless the number of worker threads is very small (i.e. one). Disable most of the feature for now, but leave the code in place so it can be easily reactivated for future testing. Ordered scan introduces a parallelized extent mapper Task. Keep that in parallel scan mode, which further enhances the parallelism. The extent scan crawl threads now run at 'idle' priority while the map tasks run at normal priority, so the map tasks don't flood the task queue. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 23:14:16 -05:00
Zygo Blaxell	1aea2d2f96	crawl: deprecate use of BeesCrawl to search the extent tree BeesScanModeExtent can do that by itself now. Overloading the subvol crawl code resulted in an ugly, inefficient hack, and we definitely don't want to accidentally continue to use it. Remove the support for reading the extent tree and add some `assert`s to make sure it isn't still used somewhere. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:43:22 -05:00
Zygo Blaxell	183b6a5361	extent scan: refactor BeesCrawl, BeesScanMode* The main gains here are: * Move extent tree searches into BeesScanModeExtent so that they are not slowed down by the BeesCrawl code, which was designed for the much more specialized metadata in subvol trees. * Enable short extent skipping now that BeesCrawl is out of the way. * Stop enumerating btrfs subvols when in extent scan mode. All this gets rid of >99% of unnecessary extent tree searches. Incremental extent scan cycles now finish in milliseconds instead of minutes. BeesCrawl was never designed to cope with the structure and content of the extent tree. It would waste thousands of tree-search ioctl calls reading and ignoring metadata items. Performance was particularly bad when a binary search was involved, as any binary search probe that landed in a metadata block group would read and discard all the metadata items in the block group, sequentially, repeated for each level of the binary search. This was blocking implementation of short extent skipping optimization for large extent size tiers, because the skips were using thousands of tree searches to skip over only a few hundred extent items. Extent scan also had to read every extent item twice to do the transid filtering, because BeesCrawl's interface discarded the relevant information when it converted a `BtrfsTreeItem` into a `BeesFileRange`. The cost of this extra fetch was negligible, but it could have been zero. Fix this by: * Copy the equivalent of `fetch_extents` from BeesCrawl into `BeesScanModeExtent`, then give each of the extent scan crawlers its own `BtrfsDataExtentTreeFetcher` instance. This enables extent tree searches to avoid pure (non-mixed) metadata block groups. `BeesCrawl` is now used only for its interface to `BeesRoots` for saving state in `beescrawl.dat`, and never to determine the next extent tree item. 
* Move subvol-specific parts of `BeesRoots` into a new class `BeesScanModeSubvol` so that `BtrfsScanModeExtent` doesn't have to enable or support them. In particular, `bees -m4` no longer enumerates all of the _subvol_ crawlers. `BeesRoots` is still used to save and load crawl state. * Move several members from `BtrfsScanModeExtent` into a per-crawler state object `SizeTier` to eliminate the need for some locks and to maintain separate cache state for `BtrfsDataExtentTreeFetcher`. * Reuse the `BtrfsTreeItem` to get the generation field for the transid range filter. * Avoid a few corner cases when handling errors, where extent scan might drop an extent without scanning it, or fail to advance to the next extent. * Enable the extent-skipping algorithm for large size tiers, now that `BeesCrawl::fetch_extents` is no longer slowing it down. * Add a debug stream interface which developers can easily turn on when needed to inspect the decisions that extent scan is making. * Track metrics that are more useful, particularly searches per extent scanned, and fraction of extents that are skipped. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:43:22 -05:00
Zygo Blaxell	b6446d7316	roots: rework open_root_nocache to use btrfs-tree This gets rid of one open-coded btrfs tree search. Also reduce the log noise level for subvol open failures, and remove some ancient references to `BEESLOG`. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	440740201a	main: the base directory for `--strip-paths` should be root_fd, not cwd The cwd is where core dumps and various profiling and verification libraries want to write their data, whereas root_fd is the root of the target filesystem. These are often intentionally different. When they are different, `--strip-paths` sets the wrong prefix to strip from paths. Once the root fd has been established, we can set the path prefix to the string prefix that we'll get from future calls to `name_fd`. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	f6908420ad	hash: handle $BEESHOME on non-btrfs bees explicitly supports storing $BEESHOME on another filesystem, and does not require that filesystem to be btrfs; however, if $BEESHOME is on a non-btrfs filesystem, there is an exception on every startup when trying to identify the subvol root of the hash table file in order to blacklist it, because non-btrfs filesystems don't have subvol roots. Fix by checking not only whether $BEESHOME is on btrfs, but whether it is on the _same_ btrfs, as the bees root, without throwing an exception. The hash table is blacklisted only when both filesystems are btrfs and have the same fsid. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	30cd375d03	readahead: clean up the code, update docs Remove dubious comments and #if 0 section. Document new event counters, and add one for read failures. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	48b7fbda9c	progress: adjust minimum thresholds for ETA to 10 seconds and 1 GiB of data 1% is a lot of data on a petabyte filesystem, and a long time to wait for an ETA. After 1 GiB we should have some idea of how fast we're reading the data. Increase the time to 10 seconds to avoid a nonsense result just after a scan starts. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	874832dc58	openat2: log a warning when we fall back to openat This should occur only once per run, but it's worth leaving a note that it has happened. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-19 22:19:42 -05:00
Zygo Blaxell	5fe89d85c3	extent scan: make sure we run every extent crawler once per transaction There's a pathological case where all of the extent scan crawlers except one are at the end of a crawl cycle, but the one crawler that is still running is keeping the Task queue full. The result is that bees never starts the other extent scan crawlers, because the queue is always full at the instant a new transid triggers the start of a new scan. That's bad because it will result in bees falling behind when new data from the inactive size tiers appears. To fix this, check for throttling _after_ creating at least one scan task in each crawler. That will keep the crawlers running, and possibly allow them to claw back some space in the Task queue. It slightly overcommits the Task queue, so there will be a few more Tasks than nominally allowed. Also (re)introduce some hysteresis in the queue size limit and reduce it a little, so that bees isn't continually stopping and restarting crawls every time one task is created or completed, and so that we stay under the configured Task limit despite overcommitting. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-19 22:19:42 -05:00
Zygo Blaxell	a2b3e1e0c2	log: demote a lot of BEESLOGWARN to higher verbosity levels Toxic extent workarounds are going away because the underlying kernel bugs have been fixed. They are no longer worthy of spamming non-developer logs. INO_PATHS can return no paths if an inode has been deleted. It doesn't need a log message at all, much less one at WARN level. Dedupe failure can be INFO, the same level as dedupe itself, especially since the "NO dedupe" message doesn't mention what was [not] deduped. Inspired by Kai Krakow's "context: demote "abandoned toxic match" to debug log level". Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-19 01:08:28 -05:00
Kai Krakow	aaec931081	context: demote "abandoned toxic match" to debug log level This log message creates an overwhelming number of messages in the system journal, leading to write-back flushing storms under high activity. As it is a work-around message, it is probably only useful to developers, so demote it to debug level. This fixes latency spikes in desktop usage after adding a lot of new files, especially since systemd-journal starts to flush caches if it sees memory pressure. Signed-off-by: Kai Krakow <kai@kaishome.de>	2025-01-19 00:59:22 -05:00
Zygo Blaxell	d4a681c8a2	Revert "roots: use a non-idle task for next_transid" next_transid tasks don't respect queue selection very well, because they effectively end up spinning in a loop until all other worker threads become busy. Back this out, and fix the priority handling in the Task library. This reverts commit `58db4071de`.	2025-01-12 18:48:33 -05:00
Zygo Blaxell	b8dd9a2db0	progress: put a timestamp in the bottom row This records the time when the progress data was calculated, to help indicate when the data might be very old. While we're here, move "now" out of the loop so there's only one value. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-11 23:39:55 -05:00
Zygo Blaxell	2f2a68be3d	roots: use openat2 instead of openat when available This increases resistance to symlink and mount attacks. Previously, bees could follow a symlink or a mount point in a directory component of a subvol or file name. Once the file is opened, the open file descriptor would be checked to see if its subvol and inode matches the expected file in the target filesystem. Files that fail to match would be immediately closed. With openat2 resolve flags, symlinks and mount points terminate path resolution in the kernel. Paths that lead through symlinks or onto mount points cannot be opened at all. Fall back to openat() if openat2() returns ENOSYS, so bees will still run on kernels before v5.6. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-09 02:26:53 -05:00
Zygo Blaxell	82f1fd8054	process: replace crucible::gettid() with a weak symbol Since we're now using weak symbols for dodgy libc functions, we might as well do it for gettid() too. Use the ::gettid() global namespace and let libc override it. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-09 01:37:44 -05:00
Zygo Blaxell	613ddc3c71	progress: rename "ctime" -> "tm_left" "ctime", an abbreviation of "cycle time", collides with "ctime", an abbreviation of "st_ctime", a well-known filesystem term. "tm_left" fits in the column, so use that. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-06 12:50:50 -05:00
Zygo Blaxell	c3a39b7691	progress: rework the progress table after github discussion * Report position within cycle in units that cannot be mistaken for size or percentage * Put the total/maximum values in their own row * Add a start time column * Change column titles to reference "cycles" * Use "idle" instead of "finished" when a crawler is not running * Replace "transid" with "gen" because it's shorter Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:45:37 -05:00
Zygo Blaxell	58db4071de	roots: use a non-idle task for next_transid The scanners which finish early can become stuck behind scanners that are able to keep the queue full. Switch the next_transid task to the normal Task queues so that we force scanners to restart on every new transaction, possibly deferring already queued work to do so. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:36:53 -05:00
Zygo Blaxell	0d3e13cc5f	context: report time in scan_one_extent Add yet another field to the scan/skip report line: the wallclock time used to process the extent ref. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:36:53 -05:00
Zygo Blaxell	1af5fcdf34	roots: don't access a shared variable after releasing a lock Access the local copy of `m_root_crawl_map` instead. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:36:53 -05:00
Zygo Blaxell	87472b6086	extent scan: don't put non-data block groups in the data extent map The total data size should not include metadata or system block groups, and already does not; however, we still have these block groups in the map for mapping the crawl pointer to a logical offset within the filesystem. Rearrange a few lines around the `if` statement so that the map doesn't contain anything it should not. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:32:48 -05:00
Zygo Blaxell	ca351d389f	extent scan: pick the right block groups for mixed-bg filesystems The progress indicator was failing on a mixed-bg filesystem because those filesystems have block groups which have both _DATA and _METADATA bits, and the filesystem size calculation was excluding block groups that have _METADATA set. It should exclude block groups that have _DATA not set. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	1f0b8c623c	options: improve message when too many--or too few--path arguments given Running bees with no arguments complains about "Only one" path argument. Replace this with "Exactly one" which uses similar terminology to other btrfs tools. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	74296c644a	options: return EXIT_SUCCESS after displaying help message `getopt_long` already supplies a message when an option cannot be parsed, so there isn't a need to distinguish option parse failures from help requests. Fixes: https://github.com/Zygo/bees/pull/277 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	231593bfbc	throttle: don't hold the multilock during throttle Release the lock before entering the throttle sleep, so that other threads can still run. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
Zygo Blaxell	81bbf7e1d4	throttle: set default to 0.0 Longer latency testing runs are not showing a consistent gain from a throttle factor of 1.0. Make the default more conservative. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-01-03 23:15:37 -05:00
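
The `fsync`-before-`rename` rule described in commit 3e7eb43b51 can be read as a small predicate. The sketch below illustrates the rule as the commit message words it; it is not the code from the bees source. `KernelVersion`, `older_than`, and `should_fsync_before_rename` are hypothetical names, and the caller is assumed to have already determined the filesystem type and the running kernel version.

```cpp
#include <tuple>

// Hypothetical helper type (not from bees): a kernel version as major.minor.
struct KernelVersion {
    int major_no;
    int minor_no;
};

// True when `a` is an older kernel than `b` (lexicographic compare).
inline bool older_than(const KernelVersion &a, const KernelVersion &b) {
    return std::tie(a.major_no, a.minor_no) < std::tie(b.major_no, b.minor_no);
}

// Rule as stated in the commit message: skip fsync only when the file is on
// btrfs AND the kernel predates 5.16; in all other cases, call fsync.
inline bool should_fsync_before_rename(bool on_btrfs, KernelVersion running) {
    const KernelVersion fixed_in{5, 16};
    return !(on_btrfs && older_than(running, fixed_in));
}
```

There is deliberately no upper bound on the version check, matching the commit's note that present kernels still require `fsync`.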

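The block group selection change in commit ca351d389f comes down to which flag bit is tested. The following sketch takes the commit message literally; the real code may apply further conditions. `BG_DATA`, `BG_SYSTEM`, and `BG_METADATA` are local stand-ins that mirror the kernel's block group type bits, and both function names are hypothetical.

```cpp
#include <cstdint>

// Local stand-ins mirroring the kernel's block group type bits.
constexpr uint64_t BG_DATA     = 1ULL << 0;
constexpr uint64_t BG_SYSTEM   = 1ULL << 1;
constexpr uint64_t BG_METADATA = 1ULL << 2;

// Before the fix (as described): exclude anything with the metadata bit set.
// A mixed block group (DATA | METADATA) is wrongly dropped.
inline bool counted_before_fix(uint64_t flags) {
    return (flags & BG_METADATA) == 0;
}

// After the fix (as described): exclude only block groups without the data bit.
// Mixed block groups are counted; pure metadata and system groups are not.
inline bool counted_after_fix(uint64_t flags) {
    return (flags & BG_DATA) != 0;
}
```

On a mixed-bg filesystem every data-bearing block group also has the metadata bit set, so the first test would leave nothing to count. That is consistent with the failing progress indicator the commit describes.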

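The sign-compare fix in commit ee5c971d77 can be illustrated in isolation. This is a minimal sketch, not the bees code. `SUPER_MAGIC` is a local stand-in for the kernel's `BTRFS_SUPER_MAGIC` value, `is_btrfs` is a hypothetical name, and `f_type` is modeled as a 32-bit signed integer, as `__fsword_t` is on the 32-bit build in the commit.

```cpp
#include <cstdint>

// Local stand-in for BTRFS_SUPER_MAGIC; the value does not fit in a signed
// 32-bit int, so the constant is unsigned.
constexpr uint32_t SUPER_MAGIC = 0x9123683EU;

// Compare a signed f_type against the unsigned magic without a
// signed/unsigned mismatch: cast f_type to the magic's own type first,
// so both operands have the same width and signedness.
inline bool is_btrfs(int32_t f_type) {
    return static_cast<decltype(SUPER_MAGIC)>(f_type) == SUPER_MAGIC;
}
```

Held in a signed 32-bit field, the magic value appears negative, which is why the uncast comparison triggers `-Werror=sign-compare`.
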
369 Commits