We only use BtrfsExtentInfo when it's exactly equivalent to the
base, so drop the derived class.
While we're here, fix BtrfsExtentSame::add so it uses a btrfs-compatible
uint64_t instead of an off_t.
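A sketch of the shape of the change (the class layout here is an assumption; only the offset type is the point):

    #include <cstdint>

    class BtrfsExtentSame {
    public:
            // before: void add(int fd, off_t offset);
            //         off_t is signed and its width depends on the platform/ABI
            // after: use the unsigned 64-bit type the btrfs ioctl structs expect
            void add(int fd, uint64_t offset);
    };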
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
BEESTOOLONG was always reporting a size of zero, and the offset of the
end of the readahead region. Report the original size instead (and also
in BEESTRACE and BEESNOTE).
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Drop the crawl_restart counter; crawl restarts don't happen here (or anywhere else).
Add the crawl_again counter for extents that are restarted due to an
extent-level lock.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
libcrucible can deal with the Linux kernel and/or libc's thread name
limitations. No need to duplicate that work in bees.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
It turns out I've been using pthread_setname_np wrong the whole time:
* on Linux, the thread name length is 15 characters.
TASK_COMM_LEN is 16 bytes, and the last one is always 0.
This is now hardcoded in many places and cannot be changed.
* pthread_setname_np doesn't return -errno, so DIE_IF_MINUS_ERRNO
was the wrong macro. On the other hand, we never want to do anything
differently when pthread_setname_np fails, so we never needed to
check the return value.
Also, libc silently ignores attempts to set the thread name when it is too
long. That's almost certainly a libc bug, but libc probably suppresses
the error result for the same reasons I ignore the error result.
Wrap the pthread_setname function with a C++ std::string overload that
truncates the argument at 15 characters, so we at least get the first
part of the task name in the thread name field. Later commits can deal
with making the bees thread names shorter.
Also wrap pthread_getname for symmetry.
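A minimal sketch of the wrappers described above (the names and the exact error handling are illustrative, not necessarily what libcrucible ends up with):

    #include <pthread.h>
    #include <string>

    // Linux limits thread names to TASK_COMM_LEN (16) bytes including the
    // terminating NUL, so pass at most 15 characters.
    void pthread_setname(const std::string &name)
    {
            // Ignore the return value: there is nothing useful to do on failure.
            pthread_setname_np(pthread_self(), name.substr(0, 15).c_str());
    }

    std::string pthread_getname()
    {
            char buf[16] = { 0 };
            pthread_getname_np(pthread_self(), buf, sizeof(buf));
            return buf;
    }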
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
The caller of scan_forward has to stop advancing the BeesFileCrawl
position when an extent lock blocks a scan, so that it will resume
from the same position when the Task is scheduled again; otherwise,
bees simply skips over the extent and leaves it incompletely deduped.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Restart crawl_more (and update crawl roots and flush FD caches) every
time the transid changes, and only when the transid changes, but
not more often than a reasonable minimum poll interval.
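A sketch of the resulting poll loop (the getters and actions are passed in as parameters here because the real member names are not part of this message):

    #include <chrono>
    #include <cstdint>
    #include <functional>
    #include <thread>

    void poll_transid(std::function<uint64_t()> get_transid,
                      std::function<void()> restart_crawl,
                      std::chrono::seconds poll_interval)
    {
            uint64_t last_transid = 0;
            for (;;) {
                    const uint64_t transid = get_transid();
                    if (transid != last_transid) {
                            last_transid = transid;
                            // update crawl roots, flush FD caches,
                            // restart crawl_more
                            restart_crawl();
                    }
                    // Never poll more often than the minimum interval.
                    std::this_thread::sleep_for(poll_interval);
            }
    }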
Clean up the log message: use the proper thread name and remove
the wildly inaccurate estimate of when crawl will resume.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
We don't need to cache 65536 extent maps, especially if each one
can have almost 700K references.
Valgrind's massif tool points to the extent map cache as a very
large memory allocator, but test runs with memcg disagree.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
If we have loadavg targeting enabled, there may be no worker threads
available to respond to new subvols, so we should not bother updating
the subvols list.
Put insert_new_crawl into a Task so it only executes when a worker
is available.
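A sketch of the change, assuming a Task interface that takes a name and a callable:

    // Defer insert_new_crawl until a worker thread is free, instead of
    // running it inline on the transid-polling path.
    Task("insert_new_crawl", []() {
            insert_new_crawl();
    }).run();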
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
On large filesystems where the min_transid of all subvols gets stuck at 0,
bees may lose the ability to effectively track recent data. A secondary sort
by max_transid will allow scanning newer subvols that were created after bees
started running on the filesystem, but before bees completed the first scan
of all subvols.
On the other hand, the secondary sort does a reverse version of the
sequential scan mode, and the sequential scan mode is simply awful.
Disable the secondary sort for now.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Also attempted to clarify the descriptions of the modes based on
feedback and questions from users over the years.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Split each scan mode into two distinct phases:
1. A heavy discovery phase, where we search the entire filesystem
for something (new items in subvol trees in this case).
2. A light consuming phase, where we fetch extents to dedupe
from places that we found in the discovery phase.
Part 1 recomputes the subvol ordering every time there is a new transid.
For some scan modes this computation is quite expensive, far too costly
to pay for every extent, so we do it no more than once per transaction.
Part 2 is run every time a worker thread hits the crawl_more Task.
It simply pulls one extent from the first crawler off a sorted list,
removing the crawler from the list when the crawler runs out of data.
Part 1 creates a new structure and swaps it into place, while Part 2
continues to run using the previous structure. Neither of these
needs to block the other, so they don't.
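A sketch of the handoff between the two parts, using a shared_ptr swap so neither side waits on the other (the names are illustrative):

    #include <memory>
    #include <mutex>

    struct SortedCrawlList { /* crawlers ordered by the current scan mode */ };

    std::mutex                       s_list_mutex;
    std::shared_ptr<SortedCrawlList> s_list;

    // Part 1: rebuild the ordering once per new transid, then swap it in.
    void rebuild_crawl_list(std::shared_ptr<SortedCrawlList> new_list)
    {
            std::unique_lock<std::mutex> lock(s_list_mutex);
            s_list = new_list;   // the old list stays alive for running readers
    }

    // Part 2: crawl_more grabs the current list and pulls one extent from the
    // first crawler, without ever blocking Part 1.
    std::shared_ptr<SortedCrawlList> get_crawl_list()
    {
            std::unique_lock<std::mutex> lock(s_list_mutex);
            return s_list;
    }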
The separate class and base pointer also make it easier to add new scan
modes that are not based on subvol trees or that don't use BeesCrawl.
While we're here, fix up some method visibility in BeesRoots.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Set the constructor's default scan mode to an invalid mode, so if we
change the default, we don't have to update two places.
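A tiny sketch of the idea, with hypothetical names (the real enum and member differ):

    enum class ScanMode { LOCKSTEP, INDEPENDENT, SEQUENTIAL, RECENT, COUNT };

    struct Scanner {
            // Deliberately out of range; the real default is set in exactly
            // one other place, so changing it only touches that one place.
            ScanMode m_scan_mode = ScanMode::COUNT;
    };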
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Crawl mode 3 'recent' prioritizes data from new updates to previously
scanned subvols over subvols that have not been completely scanned yet.
If no such new data exists, it falls back to a variation of the 'lockstep'
scan mode.
This enables us to keep up with new data as it arrives, a key weakness
of all the other scan modes, and worth violating our unwritten "no new
scan modes until we have extent-tree dedupe working" policy for.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Inode-oriented scan workers must do all of their work sequentially,
so it's counterproductive to spawn a Task to do a background dedupe.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
When two Tasks attempt to lock the same extent, append the later Task
to the earlier Task's post-exec work queue. This will guarantee that
all Tasks which attempt to manipulate the same extent will execute
sequentially, and free up threads to process other extents.
Similarly, if two scanner threads operate on the same inode, any dedupe
they perform will lock out other scanner threads in btrfs. Avoid this
by serializing Task objects that reference the same file.
This does theoretically use an unbounded amount of memory, but in practice
a Task that encounters a contended extent or inode quickly stops spawning
new Tasks that might increase the queue size, and all Tasks that might
contend for the same lock(s) end up on a single FIFO queue.
Note that the scope of inode locks is intentionally global, i.e. when
an inode is locked, it locks every inode with the same number in every
subvol. This avoids significant lock contention and task queue growth
when the same inode with the same file extents appears in snapshots.
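A toy model of the mechanism, not the actual crucible classes: a lock that queues latecomers instead of blocking them, and hands the queue back when the owner releases the lock.

    #include <deque>
    #include <functional>
    #include <mutex>

    struct ExtentLock {
            std::mutex                        m_mutex;
            bool                              m_held = false;
            std::deque<std::function<void()>> m_waiters;  // Tasks to rerun

            // Returns true if the caller now owns the lock; otherwise the
            // caller's Task is queued and the worker thread moves on.
            bool try_lock(std::function<void()> task)
            {
                    std::unique_lock<std::mutex> lock(m_mutex);
                    if (m_held) {
                            m_waiters.push_back(std::move(task));
                            return false;
                    }
                    m_held = true;
                    return true;
            }

            void unlock()
            {
                    std::deque<std::function<void()>> waiters;
                    {
                            std::unique_lock<std::mutex> lock(m_mutex);
                            m_held = false;
                            waiters.swap(m_waiters);
                    }
                    for (auto &task : waiters)
                            task();   // in bees these are rescheduled Tasks,
                                      // not run inline
            }
    };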
Fixes: https://github.com/Zygo/bees/issues/158
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Split crawlers into two separate Tasks:
1. a Task which locates the next inode with a new data extent.
2. a Task which scans every new extent in that inode.
This simplifies some lock contention and execution ordering issues.
Files are read sequentially. Workers dynamically scale up or
down as needed, without creating thousands of deferred Task objects.
Workers obtain inode locks for different inodes in btrfs, so they
can work in parallel instead of waiting for each other.
This change in behavior comes with new names for the worker Tasks:
"crawl_master" is now "crawl_more", the singular Task which
creates inode-scanning Tasks.
"crawl_<subvol>" is now "crawl_<subvol>_<inode>".
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This was done on the development branch three years ago, and
has been creating annoying merge conflicts ever since. Sync
up the branches so they have the same names for these.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Drop the cache since we no longer have to open a file every time we
check a subvol's status.
Also stop counting workaround events at the root level twice.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
btrfs-tree provides classes for low-level access to btrfs tree objects.
An item class is provided to decode polymorphic btrfs item fields.
Several tree classes provide forward and backward iteration over raw
object items at different tree levels.
A csum tree class provides convenient access to csums by bytenr,
supporting all current btrfs csum types.
Wrapper classes for inode and subvol items provide direct access to
btrfs metadata fields without clumsy stat() wrappers or ioctls.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This template turns a forward search primitive (e.g. lower_bound, FIEMAP,
TREE_SEARCH_V2) into a backward search primitive.
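A sketch of the approach, for a fetch primitive that returns the position of the first item at or after a given position (the names, the sentinel, and the probing strategy are illustrative):

    #include <cstdint>

    // Returns the position of the last item <= target, or `none` if there is
    // no such item.  fetch(pos) returns the position of the first item at or
    // after pos, or `none`.
    template <class Fetch>
    uint64_t seek_backward_sketch(Fetch fetch, uint64_t target,
                                  uint64_t none = UINT64_MAX)
    {
            // Phase 1: move the probe backward with exponentially growing
            // steps until the forward search lands at or before the target.
            uint64_t step = 1;
            uint64_t probe = target;
            while (probe > 0 && fetch(probe) > target) {
                    probe = (probe > step) ? probe - step : 0;
                    step *= 2;
            }
            const uint64_t first = fetch(probe);
            if (first > target)
                    return none;          // nothing at or before target
            // Phase 2: walk forward to the last item still <= target.
            uint64_t found = first;
            for (;;) {
                    const uint64_t next = fetch(found + 1);
                    if (next > target)
                            return found;
                    found = next;
            }
    }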
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
We are using ByteVectors from multiple threads in some cases. Mostly
these are the status and progress threads which read the ByteVector
object references embedded in BEESNOTE macros.
Since it's not clear what the data race implications are, protect
the shared_ptr in ByteVector with a mutex for now.
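A minimal sketch of the guarded-accessor pattern (member and method names are illustrative):

    #include <cstdint>
    #include <memory>
    #include <mutex>

    class ByteVector {
            mutable std::mutex       m_mutex;
            std::shared_ptr<uint8_t> m_ptr;   // shared backing storage
    public:
            // Copy the shared_ptr under the lock so a reader (e.g. the status
            // thread walking a BEESNOTE) never races with a writer re-seating it.
            std::shared_ptr<uint8_t> data() const
            {
                    std::unique_lock<std::mutex> lock(m_mutex);
                    return m_ptr;
            }
    };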
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Kernels that needed the balance workaround frankly are too buggy
to run bees at all. The workaround also makes the locking stories
around logical_ino calls and process exit complicated, so get rid of
it completely.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
At some point BtrfsExtentWalker will be fully deprecated and removed from
bees. Might as well start with code that hasn't been built in 6 years.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Quite often bees exceeds its service timeout for termination because
it is waiting for a loop embedded in a Task to finish some long-running
btrfs operation. This can cause bees to be aborted by SIGKILL before
it can completely flush the hash table or save crawl state.
There are only two important things SIGTERM does when bees terminates:
1. Save crawl progress
2. Flush out the hash table
Everything else is automatically handled by the kernel when the process
is terminated by SIGKILL, so we don't have to bother doing it ourselves.
This can save considerable time at shutdown since we don't have to wait
for every thread to reach a point where it becomes idle, or force loops
to terminate by throwing exceptions, or check a condition every time we
access a pointer. Instead, we need only do the things in the list
above, and then call _exit() to clean up everything else.
Hash table and crawl state writeback can happen in their background
threads instead of the foreground one. Separate the "stop" method for
these classes into "stop_request" and "stop_wait" so that these writebacks
can run at the same time.
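A sketch of how the split is used at shutdown (the call sites are illustrative):

    // Request both writebacks first, then wait for both, so the hash table
    // and crawl state flush concurrently in their own background threads.
    hash_table->stop_request();
    roots->stop_request();
    hash_table->stop_wait();
    roots->stop_wait();
    _exit(EXIT_SUCCESS);   // the kernel cleans up everything else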
Deprecate and remove all references to the BeesHalt exception, and remove
several unnecessary checks for BeesContext::stop_requested.
Pause the task queue instead of cancelling it, which preserves the
crawl progress state and stops new Tasks from competing for iops and
CPU during writeback.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Dump the instantaneous load (last 5 seconds, extracted from load average)
and the computed target worker count (before rounding and truncation)
on the same status line as the task and worker thread count.
This should give better visibility into Task's thread count calculation
algorithm.
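For reference, the instantaneous load can be recovered from two consecutive 1-minute load average samples, since the kernel updates that average every 5 seconds as an exponential moving average. This is a sketch of the arithmetic, not necessarily the exact code:

    #include <cmath>

    // avg_old and avg_new are 1-minute load averages sampled 5 seconds apart.
    // The kernel updates the average as
    //     avg_new = avg_old * d + n * (1 - d),   d = exp(-5/60),
    // where n is the load over the last 5-second window, so:
    double instantaneous_load(double avg_old, double avg_new)
    {
            const double d = std::exp(-5.0 / 60.0);
            return (avg_new - avg_old * d) / (1.0 - d);
    }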
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Tasks are often running longer than 5 seconds (especially extents with
multiple references requiring copy operations), so the load tracking
algorithm needs to average several samples over a longer period of time
than 5 seconds. If the sample period is 60 seconds, we end up recomputing
the original load average from current_load, so skip the rounding error
and use the original load average value.
Arguably the real fix is to break up the more complex extent operations
over several downstream Task objects, but that's a more significant
design change.
Tweak the attack and decay rates so that threads are started a little
more slowly, but still stopped rapidly when load spikes up.
Remove the hysteresis to provide support for load average targets
below 1, or with fractional components, with a PWM-like effect.
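A sketch of the asymmetric smoothing (the constants are illustrative, not the values used in the code):

    // One smoothing step: approach the computed worker target slowly on the
    // way up (attack) and quickly on the way down (decay), so load spikes
    // shed workers fast while new workers ramp up gently.
    double smooth_target(double smoothed, double computed)
    {
            const double attack = 0.1;
            const double decay  = 0.5;
            const double rate   = (computed > smoothed) ? attack : decay;
            return smoothed + rate * (computed - smoothed);
    }
    // Without hysteresis, a fractional target such as 0.5 duty-cycles the
    // worker count between 0 and 1, like PWM.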
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
task1.append(task2) is supposed to run task2 after task1 is executed;
however, if task1 was just executed, and its last reference was owned by
a TaskConsumer, then task2 will be appended to a Task that will never
run again.
A similar problem arises in Exclusion, which can cause blocked tasks
to occasionally be dropped without executing them.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This was resulting in an assertion failure later on if a queue was
being rescued from a deleted task with only one post-exec queue.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>