Inode-oriented scan workers must do all of their work sequentially,
so it's counterproductive to spawn a Task to do a background dedupe.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
When two Tasks attempt to lock the same extent, append the later Task
to the earlier Task's post-exec work queue. This will guarantee that
all Tasks which attempt to manipulate the same extent will execute
sequentially, and free up threads to process other extents.
Similarly, if two scanner threads operate on the same inode, any dedupe
they perform will lock out other scanner threads in btrfs. Avoid this
by serializing Task objects that reference the same file.
This does theoretically use an unbounded amount of memory, but in practice
a Task that encounters a contended extent or inode quickly stops spawning
new Tasks that might increase the queue size, and all Tasks that might
contend for the same lock(s) end up on a single FIFO queue.
Note that the scope of inode locks is intentionally global, i.e. when
an inode is locked, it locks every inode with the same number in every
subvol. This avoids significant lock contention and task queue growth
when the same inode with the same file extents appears in snapshots.
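A minimal sketch of the queueing scheme, using illustrative names
rather than the real crucible Task API:

    #include <deque>
    #include <functional>
    #include <memory>
    #include <mutex>

    struct Task {
        std::function<void()> m_exec;
        std::mutex m_mutex;
        std::deque<std::shared_ptr<Task>> m_post_exec;  // FIFO of waiters

        // A contended Task joins the owner's queue instead of
        // blocking a worker thread.
        void append(std::shared_ptr<Task> later) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_post_exec.push_back(std::move(later));
        }

        void run() {
            m_exec();
            // Waiters execute strictly after this Task, in FIFO order
            // (the real code hands them back to worker threads rather
            // than running them inline).
            std::deque<std::shared_ptr<Task>> waiters;
            {
                std::unique_lock<std::mutex> lock(m_mutex);
                waiters.swap(m_post_exec);
            }
            for (auto &t : waiters) t->run();
        }
    };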
Fixes: https://github.com/Zygo/bees/issues/158
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Split crawlers into two separate Tasks:
1. a Task which locates the next inode with a new data extent.
2. a Task which scans every new extent in that inode.
This simplifies some lock contention and execution ordering issues.
Files are read sequentially. Workers dynamically scale up or
down as needed, without creating thousands of deferred Task objects.
Workers obtain inode locks for different inodes in btrfs, so they
can work in parallel instead of waiting for each other.
This change in behavior comes with new names for the worker Tasks:
"crawl_master" is now "crawl_more", the singular Task which
creates inode-scanning Tasks.
"crawl_<subvol>" is now "crawl_<subvol>_<inode>".
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This was done on the development branch three years ago, and
has been creating annoying merge conflicts ever since. Sync
up the branches so they have the same names for these.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Drop the cache since we no longer have to open a file every time we
check a subvol's status.
Also stop double-counting workaround events at the root level.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
btrfs-tree provides classes for low-level access to btrfs tree objects.
An item class is provided to decode polymorphic btrfs item fields.
Several tree classes provide forward and backward iteration over raw
object items at different tree levels.
A csum tree class provides convenient access to csums by bytenr,
supporting all current btrfs csum types.
Wrapper classes for inode and subvol items provide direct access to
btrfs metadata fields without clumsy stat() wrappers or ioctls.
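The decoding idea in miniature (names and the field offset are
illustrative; the real classes also do bounds checking and cover
every item type):

    #include <cstdint>
    #include <cstring>
    #include <vector>

    // Named accessors hide the offset/memcpy details of the raw
    // on-disk item bytes.  btrfs is little-endian on disk; assume a
    // little-endian host here for brevity.
    struct RawInodeItem {
        std::vector<uint8_t> m_bytes;

        uint64_t u64_at(size_t off) const {
            uint64_t v = 0;
            std::memcpy(&v, m_bytes.data() + off, sizeof(v));
            return v;
        }

        uint64_t size() const { return u64_at(16); }  // no stat() needed
    };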
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This template turns a forward search primitive (e.g. lower_bound, FIEMAP,
TREE_SEARCH_V2) into a backward search primitive.
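A sketch of the transformation, assuming only a lower_bound-style
fetch primitive and target < UINT64_MAX (the real template also
binary-searches the final interval instead of scanning forward):

    #include <algorithm>
    #include <cstdint>
    #include <functional>
    #include <vector>

    // fetch(pos) returns the position of the first item >= pos, or
    // UINT64_MAX when there is none.  Returns the position of the
    // last item <= target, or UINT64_MAX when no such item exists.
    uint64_t seek_backward(const std::function<uint64_t(uint64_t)> &fetch,
                           uint64_t target)
    {
        uint64_t probe = target;
        uint64_t step = 1;
        while (true) {
            const uint64_t found = fetch(probe);
            if (found <= target) {
                // An item exists in [probe, target]: walk forward
                // to the last one at or before target.
                uint64_t best = found;
                uint64_t next;
                while ((next = fetch(best + 1)) <= target) best = next;
                return best;
            }
            if (probe == 0) return UINT64_MAX;  // nothing <= target
            probe = probe > step ? probe - step : 0;
            step <<= 1;  // exponentially widen the backward probes
        }
    }

    // Example: turn std::lower_bound into a backward search.
    // seek_backward(fetch, 25) returns 20 here.
    uint64_t example() {
        const std::vector<uint64_t> v{10, 20, 30};
        auto fetch = [&v](uint64_t pos) -> uint64_t {
            auto it = std::lower_bound(v.begin(), v.end(), pos);
            return it == v.end() ? UINT64_MAX : *it;
        };
        return seek_backward(fetch, 25);
    }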
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
We are using ByteVectors from multiple threads in some cases. Mostly
these are the status and progress threads which read the ByteVector
object references embedded in BEESNOTE macros.
Since it's not clear what the data race implications are, protect
the shared_ptr in ByteVector with a mutex for now.
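The stopgap in sketch form (not the real ByteVector layout).  The
point is that concurrent reads and writes of the same shared_ptr
object are a data race even though its control block is thread-safe:

    #include <cstddef>
    #include <cstdint>
    #include <memory>
    #include <mutex>

    class ByteVector {
        mutable std::mutex m_mutex;
        std::shared_ptr<uint8_t> m_ptr;
        std::size_t m_size = 0;
    public:
        std::shared_ptr<uint8_t> data() const {
            std::unique_lock<std::mutex> lock(m_mutex);
            return m_ptr;  // copy the shared_ptr under the lock
        }
        void reset(std::shared_ptr<uint8_t> p, std::size_t size) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_ptr = std::move(p);
            m_size = size;
        }
    };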
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Kernels that needed the balance workaround are, frankly, too buggy
to run bees at all. The workaround also complicates the locking
around logical_ino calls and process exit, so get rid of it
completely.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
At some point BtrfsExtentWalker will be fully deprecated and removed from
bees. Might as well start with code that hasn't been built in 6 years.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Quite often bees exceeds its service timeout for termination because
it is waiting for a loop embedded in a Task to finish some long-running
btrfs operation. This can cause bees to be aborted by SIGKILL before
it can completely flush the hash table or save crawl state.
There are only two important things bees must do when it receives SIGTERM:
1. Save crawl progress
2. Flush out the hash table
Everything else is automatically handled by the kernel when the process
is terminated by SIGKILL, so we don't have to bother doing it ourselves.
This can save considerable time at shutdown since we don't have to wait
for every thread to reach a point where it becomes idle, or force loops
to terminate by throwing exceptions, or check a condition every time we
access a pointer. Instead, we need do only the things in the list
above, and then call _exit() to clean up everything else.
Hash table and crawl state writeback can happen in their background
threads instead of the foreground one. Separate the "stop" method for
these classes into "stop_request" and "stop_wait" so that these writebacks
can run at the same time.
Deprecate and remove all references to the BeesHalt exception, and remove
several unnecessary checks for BeesContext::stop_requested.
Pause the task queue instead of cancelling it, which preserves the
crawl progress state and stops new Tasks from competing for iops and
CPU during writeback.
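The shutdown sequence then reduces to something like this sketch
(member layout and names are illustrative):

    #include <atomic>
    #include <cstdlib>
    #include <thread>
    #include <unistd.h>

    struct BackgroundWriter {
        std::atomic<bool> m_stop{false};
        std::thread m_thread;

        void stop_request() { m_stop = true; }  // writer thread polls this
        void stop_wait() { if (m_thread.joinable()) m_thread.join(); }
    };

    void shutdown(BackgroundWriter &hash_table, BackgroundWriter &crawl) {
        // Request both stops first...
        hash_table.stop_request();
        crawl.stop_request();
        // ...then wait, so the two writebacks run concurrently instead
        // of back-to-back as a combined stop() would force.
        hash_table.stop_wait();
        crawl.stop_wait();
        _exit(EXIT_SUCCESS);  // the kernel cleans up everything else
    }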
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Dump the instantaneous load (last 5 seconds, extracted from load average)
and the computed target worker count (before rounding and truncation)
on the same status line as the task and worker thread count.
This should give better visibility into Task's thread count calculation
algorithm.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Tasks are often running longer than 5 seconds (especially extents with
multiple references requiring copy operations), so the load tracking
algorithm needs to average several samples over a longer period of time
than 5 seconds. If the sample period is 60 seconds, we end up recomputing
the original load average from current_load, so skip the rounding error
and use the original load average value.
Arguably the real fix is to break up the more complex extent operations
over several downstream Task objects, but that's a more significant
design change.
Tweak the attack and decay rates so that threads are started a little
more slowly, but still stopped rapidly when load spikes up.
Remove the hysteresis to provide support for load average targets
below 1, or with fractional components, with a PWM-like effect.
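A sketch of the resulting controller; the gain constants here are
placeholders, not the tuned values:

    #include <algorithm>

    // Nudge the thread target toward the load target with asymmetric
    // rates, and keep the fractional part so a target below 1
    // duty-cycles a worker on and off, PWM-style.
    double update_thread_target(double current_target, double loadavg,
                                double load_target, double max_threads)
    {
        const double error = load_target - loadavg;
        // Small gain starting threads, large gain stopping them.
        const double gain = error > 0 ? 0.1 : 0.5;
        return std::min(max_threads,
                        std::max(0.0, current_target + gain * error));
    }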
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
task1.append(task2) is supposed to run task2 after task1 is executed;
however, if task1 was just executed, and its last reference was owned by
a TaskConsumer, then task2 will be appended to a Task that will never
run again.
A similar problem arises in Exclusion, which can cause blocked tasks
to occasionally be dropped without executing them.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This was resulting in an assertion failure later on if a queue was
being rescued from a deleted task with only one post-exec queue.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
pause(true) stops the TaskMaster from processing any more Tasks,
but does not destroy any queued Tasks.
pause(false) re-enables Task processing.
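A minimal sketch of the semantics (not the TaskMaster internals):

    #include <deque>
    #include <functional>
    #include <mutex>

    class TaskQueue {
        std::mutex m_mutex;
        std::deque<std::function<void()>> m_queue;
        bool m_paused = false;
    public:
        void pause(bool paused) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_paused = paused;
        }
        void push(std::function<void()> f) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_queue.push_back(std::move(f));
        }
        // Returns false when paused or empty; queued Tasks are kept.
        bool run_one() {
            std::function<void()> f;
            {
                std::unique_lock<std::mutex> lock(m_mutex);
                if (m_paused || m_queue.empty()) return false;
                f = std::move(m_queue.front());
                m_queue.pop_front();
            }
            f();
            return true;
        }
    };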
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
If Barrier someday allows users to force execution of
its pending tasks prior to the destruction of the BarrierState object,
we'll be ready to submit those Tasks for execution without waiting for
the BarrierState mutex lock.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Exclusion was generating a new Task every time a lock was contended.
That resulted in thousands of otherwise empty Task objects, each
containing a single queued Task.
Get rid of ExclusionState. Exclusion is now a simple weak_ptr to a Task.
If the weak_ptr is expired, the Exclusion is unlocked. If the weak_ptr
is not expired, it points to the Task which owns the Exclusion.
try_lock now appends the Task attempting to lock the Exclusion directly
to the owning Task, eliminating the need for Exclusion to keep a Task
of its own.
This also removes the need to call insert_task separately, though
insert_task remains for other use cases.
With no ExclusionState there is no need for a string argument to
Exclusion's constructor, so get rid of that too.
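The new scheme, sketched with illustrative names:

    #include <deque>
    #include <memory>
    #include <mutex>

    struct Task {
        std::mutex m_mutex;
        std::deque<std::shared_ptr<Task>> m_post_exec;
        void append(std::shared_ptr<Task> later) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_post_exec.push_back(std::move(later));
        }
    };

    class Exclusion {
        std::mutex m_mutex;
        std::weak_ptr<Task> m_owner;  // expired == unlocked
    public:
        // Returns true if "t" now owns the lock.  On contention, "t"
        // is queued directly on the owning Task, so no empty Task is
        // created just to hold it.
        bool try_lock(const std::shared_ptr<Task> &t) {
            std::unique_lock<std::mutex> lock(m_mutex);
            const auto owner = m_owner.lock();
            if (!owner) {
                m_owner = t;
                return true;
            }
            owner->append(t);
            return false;
        }
    };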
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Make Barrier a single copiable class, so users don't have to wrap
it in a shared_ptr all the time.
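The design choice in sketch form: copiability via a shared pimpl.
Copies share one BarrierState, and the last copy destroyed releases
the pending Tasks:

    #include <memory>

    struct BarrierState {
        ~BarrierState() { /* release the pending Tasks here */ }
    };

    class Barrier {
        std::shared_ptr<BarrierState> m_state =
            std::make_shared<BarrierState>();
    public:
        Barrier() = default;  // copy and move both work out of the box
    };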
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
It seems that readahead() does not work on btrfs, or at least it has
no discernible effect. Enable the workaround instead.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
In current kernels there is a bug which leads to an infinite loop in
add_all_parents(). The bug is triggered by one thread running dedupe
while another runs logical_ino.
Work around this by ensuring that the bees process never runs the dedupe
and logical_ino ioctls at the same time. Any number of either can run
concurrently, but never both at once.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
For performance or workaround reasons we sometimes have to avoid doing
two conflicting operations at the same time, but we can still run any
number of non-conflicting operations in parallel.
MultiLocker (suggestions for a better class name welcome) blocks the
calling thread until there are no threads attempting to run a conflicting
operation.
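A sketch of the concept (the real class may differ in API and
fairness details):

    #include <condition_variable>
    #include <cstddef>
    #include <map>
    #include <mutex>
    #include <string>

    // Any number of threads may hold the lock for one operation type
    // at a time; other types wait until the active holders release.
    class MultiLocker {
        std::mutex m_mutex;
        std::condition_variable m_cv;
        std::map<std::string, std::size_t> m_holders;

        bool only_type_running(const std::string &type) const {
            for (const auto &p : m_holders)
                if (p.first != type && p.second > 0) return false;
            return true;
        }
    public:
        void lock(const std::string &type) {
            std::unique_lock<std::mutex> lock(m_mutex);
            m_cv.wait(lock, [&] { return only_type_running(type); });
            ++m_holders[type];
        }
        void unlock(const std::string &type) {  // must match a lock()
            std::unique_lock<std::mutex> lock(m_mutex);
            --m_holders[type];
            m_cv.notify_all();
        }
    };

    // e.g. lock("dedupe") and lock("logical_ino") exclude each other,
    // while many "dedupe" holders can run concurrently.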
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
bees_sync() was an exception-trapping wrapper around fsync() which is
not needed in any of the contexts from which it was called:
1. dedupe operations implicitly flush the src data, so there is
no need to call fsync() to flush it a second time.
2. crawl position is written to a temporary file and renamed
over the original (see the sketch after this list), which always
forces a flush when the original exists. On the first write, where
there is no original, a crash would result in starting over with
an empty or hole-filled beescrawl file, which is the initial state
of bees. There is also a long history of kernel bugs triggered by
fsync() in this case.
3. we use unreadahead to trigger writeback for flushing the
hash table to persistent storage. This is one place where we might
use fsync() after all, as part of bees_unreadahead's emulation of
POSIX_FADV_DONTNEED, but we need to get read-once behavior from
the scanner before we can use this capability.
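For reference, the write-then-rename pattern that item 2 relies on
(filenames illustrative): readers see either the complete old state
or the complete new state, never a partial write.

    #include <cstdio>
    #include <fstream>
    #include <string>

    void save_crawl_state(const std::string &text) {
        {
            std::ofstream out("beescrawl.dat.tmp");
            out << text;  // write the complete new state first
        }
        // Atomically replace the old state with the new.
        std::rename("beescrawl.dat.tmp", "beescrawl.dat");
    }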
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
If there's an error while writing the crawl state, the state should
remain dirty. If the crawl state is successfully written, the state
is only clean if there were no changes to crawl state since the write
was committed. We need to release the lock while writing the state but
correctly set the dirty flag when the state is written successfully.
Replace the bool with a version number counter. Track the last version
successfully saved and the current version of the crawl state. The state
is dirty if these counters disagree and clean if they agree.
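A sketch of the counter scheme:

    #include <cstdint>
    #include <mutex>

    struct CrawlState {
        std::mutex m_mutex;
        uint64_t m_version = 1;        // bumped on every modification
        uint64_t m_saved_version = 0;  // captured by the last good save

        bool write_state_to_disk() { return true; }  // hypothetical

        void modify() {
            std::unique_lock<std::mutex> lock(m_mutex);
            ++m_version;
        }

        bool dirty() {
            std::unique_lock<std::mutex> lock(m_mutex);
            return m_saved_version != m_version;
        }

        // Snapshot the version, drop the lock for the slow write, and
        // mark clean only up to the snapshot: if modify() raced with
        // the write, the counters still disagree and the state stays
        // dirty.  On a failed write m_saved_version is untouched, so
        // the state stays dirty then too.
        void save() {
            uint64_t snapshot;
            {
                std::unique_lock<std::mutex> lock(m_mutex);
                snapshot = m_version;
            }
            if (write_state_to_disk()) {
                std::unique_lock<std::mutex> lock(m_mutex);
                m_saved_version = snapshot;
            }
        }
    };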
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Document the overall purpose of the class and what some of the methods do,
particularly the ones with terrible names like 'insert_item' (which only
inserts an item after calling the Function).
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
We already had a function that was _similar_, so add decoding for compress
type NONE, give it a less specific name, and declare it in fs.h.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
It really needs to be uint64_t, but at least it now doesn't contradict
the definition in the earlier header.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
In commit 14ce81c08 "fs: get rid of silly base class that causes build
failures now" I neglected to set the dest_count field in the ioctl
arg structure, so bees master hasn't been deduping anything for about
three weeks.
I'd put a THROW_CHECK in here to catch this kind of bug in the future,
but it would be placed at exactly the point where this fix is.
Fixes: 14ce81c08
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
If we iterate over all roots to find the max transid, but the set of
all roots is empty, we'll get a nonsense number. Make sure that number
doesn't reach the crawling logic by killing it with an exception.
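A sketch of the guard (function name hypothetical):

    #include <cstdint>
    #include <set>
    #include <stdexcept>

    uint64_t max_transid(const std::set<uint64_t> &root_transids) {
        // An empty root set has no max transid; throw instead of
        // returning garbage to the crawlers.
        if (root_transids.empty())
            throw std::runtime_error("no roots, max transid undefined");
        return *root_transids.rbegin();
    }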
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
Yet another build failure of the form:
error: flexible array member fiemap... not at end of struct crucible::Fiemap...
bees doesn't use fiemap any more, so the fixes here are minimal changes
to make it build, not shining examples of C++ class design.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
This fixes another build failure of the form:
error: flexible array member btrfs_... not at end of struct crucible::Btrfs...
Fixes: https://github.com/Zygo/bees/issues/236
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
They're all public because it's a struct, so there's no need to make
them explicit. clang-14 deprecates these.
Signed-off-by: Zygo Blaxell <bees@furryterror.org>