GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2026-01-08 20:00:22 +01:00

Author	SHA1	Message	Date
Zygo Blaxell	6d6686eb5b	context: get rid of resolve (LOGICAL_INO) serializer There are kernel bugs in LOGICAL_INO from time to time; however, we can't avoid these bugs by serializing LOGICAL_INO calls. It hasn't been used for some time, so remove the code and less-than-completely-accurate comments. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-12-19 15:10:02 -05:00
Zygo Blaxell	670fce5be5	resolve: reword the too-many-duplicates exception message For one thing, it should _say_ that there are too many duplicates. We were making the user read the manual to find that out. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	ff3b5a7a1b	hash: drop bees_unreadahead Forcing the entire hash table into immediate writeback causes crippling write latencies at shutdown. Even discarding pages as they are read in at startup can trigger a writeback latency spike if the pages are dirty at read time. Better to let the VM subsystem handle this on its own. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	13ec4b5165	hash: add utsname fields to log output Putting this information in the logs saves us from having to ask for the kernel version and machine name every time. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	85c93c10e6	bees: clean up #include list No need for atomic, and sort the Linux headers. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	ba694b4881	hash: move the random generator out of bees-hash.cc We need random numbers in more places, so centralize the engines. Initialize with a proper random seed so every worker thread gets different behavior. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
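A minimal sketch of the idea of centralized, per-thread random engines. The names `sketch_thread_engine` and `sketch_random_below` are hypothetical and are not the bees API; the commit does not say which engine or seed source bees uses, so `std::mt19937_64` and `std::random_device` are assumptions.

```cpp
#include <cstdint>
#include <random>

// Hypothetical helper (not the bees API): one engine per thread, each seeded
// independently from std::random_device so worker threads do not share a
// sequence.
std::mt19937_64 &sketch_thread_engine() {
    thread_local std::mt19937_64 engine(std::random_device{}());
    return engine;
}

// Return a uniformly distributed value in [0, limit - 1].
// The caller must pass a nonzero limit.
uint64_t sketch_random_below(uint64_t limit) {
    std::uniform_int_distribution<uint64_t> dist(0, limit - 1);
    return dist(sketch_thread_engine());
}
```

Any code that needs a random number can call `sketch_random_below(n)` without owning an engine of its own.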
Zygo Blaxell	5e379b4c48	readahead: update comments to reflect bakeoff results It turns out that readahead() alone is fastest. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	c698fd7211	context: stop using deprecated memset_zero template Use ordinary literal initialization instead. The ioctl doesn't need initialization of args at all. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	ecf110f377	context: add a comment explaining why we are not adding bees_unreadahead At the end of scanning one extent, in theory we do not need that extent any more. In practice, it hurts benchmark scores if we drop the extents after reading them. Add a comment to note this where we put the bees_unreadahead call. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	7f7f919d08	context: fix the status message that will never be seen BEESNOTE can only be seen if the status thread is running at the time, making the log of activities during shutdown incomplete. Wake up the status thread early during shutdown so the logged sequence of shutdown actions is complete. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	11fabd66a8	context: add experimental code for avoiding tiny extents In the current architecture we can't directly measure the physical extent size, and we can't make good decisions with the extent data (reference) item alone. If the early return is enabled here, there is a small speedup and a large drop in dedupe hit rate, especially when extent splits occur. Leave the early return commented for now, but collect the event statistics. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-11-29 21:27:48 -05:00
Zygo Blaxell	7a8d98f94d	roots: use the new type argument to next_min Tree searches are all looking for specific item types. Skip over any item types we are not interested in when resetting the search key for the next search. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:59:09 -04:00
Zygo Blaxell	e861957632	roots: use default nr_items BtrfsIoctlSearchKeyV2's constructor now fills in nr_items = 1, so we don't need to set it explicitly any more. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:56:04 -04:00
Zygo Blaxell	14cd6ed033	bees: deprecate vector<uint8_t> and replace with ByteVector The vector<uint8_t> in the hash table doesn't hurt very much--only a few microseconds per 128K hash block. The vector<uint8_t> in BeesBlockData hurts a bit more--we run that constructor thousands of times per second. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	2f14a5a9c7	roots: reduce number of objects per TREE_SEARCH_V2, drop BEES_MAX_CRAWL_ITEMS and BEES_MAX_CRAWL_BYTES This makes better use of dynamic buffer sizing, and reduces the amount of stale data lying around. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	587870911f	roots: use const more Mark local variables that can be const const. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	d384f3eec0	roots: ignore subvol when it is read-only and send workaround is enabled Previously, when the bees send workaround is enabled, bees would immediately advance the subvol's crawl status as if the entire subvol had been scanned. If the subvol is later made read-write, or if the workaround is disabled, bees sees that the subvol has already been marked as scanned. This is an unfortunate result if the subvol is inadvertently marked read-only or if bees is inadvertently run with the send workaround disabled. Instead, (almost) completely ignore the subvol: don't advance the crawl pointer, don't consider the subvol in the list of searchable roots, and don't consider the subvol when calculating min_transid for new subvols. The "almost" part is: if the subvol scan has not yet started, keep its start timestamp current so it won't mess up subvol traversal performance metrics. Also handle exceptions while determining whether a subvol is read-only, as those apparently do happen. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	12e80658a8	fs: fix FIEMAP_MAX_OFFSET type silliness in fiemap.h In fiemap.h the members of struct fiemap are declared as __u64, but the FIEMAP_MAX_OFFSET macro is an unsigned long long value: $ grep FIEMAP_MAX_OFFSET -r /usr/include/ /usr/include/linux/fiemap.h:#define FIEMAP_MAX_OFFSET (~0ULL) $ grep fe_length -r /usr/include/ /usr/include/linux/fiemap.h: __u64 fe_length; /* length in bytes for this extent */ This results in a type mismatch error on architectures like ppc64le: fiemap.cc:31:35: note: deduced conflicting types for parameter 'const _Tp' ('long unsigned int' and 'long long unsigned int') 31 \| fm.fm_length = min(fm.fm_length, FIEMAP_MAX_OFFSET - fm.fm_start); \| ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Work around this by copying the macro into a uint64_t constant, and not using the macro any more. Fixes: https://github.com/Zygo/bees/issues/194 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-06 15:17:02 -04:00
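The workaround described above can be sketched roughly as follows. This is an illustration, not the actual bees patch: `SKETCH_FIEMAP_MAX_OFFSET` stands in for the kernel macro so the example builds without `<linux/fiemap.h>`, and `sketch_clamp_length` is a hypothetical name.

```cpp
#include <algorithm>
#include <cstdint>

// Stand-in for FIEMAP_MAX_OFFSET from <linux/fiemap.h> (assumption: copied
// here so the sketch does not depend on kernel headers).
#define SKETCH_FIEMAP_MAX_OFFSET (~0ULL)

// Copy the macro into a constant with the same type as the values it is
// compared against, so std::min deduces a single template parameter type.
static const uint64_t sketch_fiemap_max_offset = SKETCH_FIEMAP_MAX_OFFSET;

// Clamp a requested length so that start + length cannot exceed the maximum
// offset.
uint64_t sketch_clamp_length(uint64_t start, uint64_t length) {
    return std::min(length, sketch_fiemap_max_offset - start);
}
```

Calling `std::min` with the macro directly mixes `unsigned long long` with whatever type the other operand has, and template deduction fails on platforms where the two differ.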
Zygo Blaxell	a353d8cc6e	hash: use POSIX_FADV_WILLNEED and POSIX_FADV_DONTNEED The hash table is one of the few cases in bees where a non-trivial amount of page cache memory will be used in a predictable way, so we can advise the kernel about our IO demands in advance. Use WILLNEED to prefetch hash table pages at startup. Use DONTNEED to trigger writeback on hash table pages at shutdown. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-04 20:41:09 -04:00
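A minimal sketch of the two advice calls named in this commit, assuming a Linux or POSIX system. The wrapper names are hypothetical and the offsets and lengths are up to the caller; this is not the code bees uses for its hash table.

```cpp
#include <fcntl.h>
#include <sys/types.h>

// Hypothetical wrappers, not bees functions. Both return 0 on success or an
// error number on failure, which is how posix_fadvise itself reports errors.

// Ask the kernel to start reading [offset, offset + length) into the page
// cache.
int sketch_prefetch(int fd, off_t offset, off_t length) {
    return posix_fadvise(fd, offset, length, POSIX_FADV_WILLNEED);
}

// Tell the kernel the cached pages in the range are no longer needed.
int sketch_release(int fd, off_t offset, off_t length) {
    return posix_fadvise(fd, offset, length, POSIX_FADV_DONTNEED);
}
```

Both calls are advisory: the kernel is free to ignore them, and neither changes the contents of the file.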
Zygo Blaxell	97d70ef4c5	bees: readahead() in the kernel is posix_fadvise(..., POSIX_FADV_WILLNEED) In theory, we don't need the pread() loop, because the kernel will do a better job with readahead(). In practice, we might still need the pread() code, as the readahead will occur at idle IO priority, which could adversely affect bees performance. More testing is required. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-04 20:21:01 -04:00
Zygo Blaxell	522e52618e	context: calculate TOTAL RATES correctly The denominator for TOTAL RATES is the total running time, not the delta running time. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-08-30 18:23:42 -04:00
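The distinction between the two denominators can be illustrated with a small sketch. The function names and the numbers in the usage note are hypothetical; bees' own statistics code is not shown in this log.

```cpp
// Hypothetical names; counters and clock values are illustrative only.

// Average rate over the whole run: total events divided by total running
// time.
double sketch_total_rate(double total_count, double total_seconds) {
    return total_seconds > 0 ? total_count / total_seconds : 0.0;
}

// Rate over the most recent interval only: change in count divided by change
// in time.
double sketch_delta_rate(double prev_count, double cur_count,
                         double prev_seconds, double cur_seconds) {
    const double dt = cur_seconds - prev_seconds;
    return dt > 0 ? (cur_count - prev_count) / dt : 0.0;
}
```

For example, 600 events after 60 seconds of running is a total rate of 10 per second. Dividing the same 600 events by a 5-second delta would report 120 per second, which is the kind of error the commit describes.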
Kai Krakow	081a6af278	bees: Avoid unused result with -Werror=unused-result Fixes: commit `20b8f8ae0b` ("bees: use helper function for readahead") Signed-off-by: Kai Krakow <kai@kaishome.de>	2021-06-19 10:35:28 +02:00
Zygo Blaxell	3d95460eb7	fiemap: don't force flush so we can see the delalloc shenanigans Like filefrag, fiemap was defaulting to FIEMAP_FLAG_SYNC, and providing no option to turn it off. This prevents observation of delayed allocations, making fiemap less useful. Override the default flag setting so fiemap gets the current (i.e. unflushed) extent map state. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 21:09:14 -04:00
Zygo Blaxell	d9e3c0070b	context: stop creating new refs when there are too many already LOGICAL_INO_V2 has a maximum limit of 655050 references per extent. Although it no longer has a crippling performance problem, at roughly two seconds to process an extent, it's too slow to be useful. When an extent gains an absurd number of references, stop making any more. Returning zero extent refs will make bees believe the extent was deleted, and it will remove the block from the hash table. This helps speed processing of highly duplicated large files like VM images, at the cost of a slightly lower dedupe hit rate. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 21:05:55 -04:00
Zygo Blaxell	08899052ad	trace: current_exception() is not a replacement for uncaught_exception() In `15ab981d9e` "bees: replace uncaught_exception(), deprecated in C++17", uncaught_exception() was replaced with current_exception(); however, current_exception() is only valid after an exception has been captured by a catch block. BeesTracer wants to know about exceptions _before_ they are caught, so current_exception() is not useful here. Instead, conditionally compile using uncaught_exception() or uncaught_exceptions(), selected by C++ standard version, and make bees stack traces work again. Fixes: `15ab981d9e` "bees: replace uncaught_exception(), deprecated in C++17" Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
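A sketch of the conditional compilation described above, under the assumption that the selection is made on `__cplusplus`. `SketchTracer` is a hypothetical stand-in for a tracer object whose destructor needs to know whether it is running because of stack unwinding; it is not the BeesTracer implementation.

```cpp
#include <exception>
#include <stdexcept>

// Report whether an exception has been thrown but not yet caught.
bool sketch_exception_in_flight() {
#if __cplusplus >= 201703L
    return std::uncaught_exceptions() > 0;
#else
    return std::uncaught_exception();
#endif
}

// Hypothetical tracer: records in its destructor whether the scope is being
// left because of an exception.
class SketchTracer {
public:
    explicit SketchTracer(bool *seen) : m_seen(seen) {}
    ~SketchTracer() { *m_seen = sketch_exception_in_flight(); }
private:
    bool *m_seen;
};

// Run a scope that optionally throws and report what the tracer observed.
bool sketch_unwinding_detected(bool do_throw) {
    bool seen = false;
    try {
        SketchTracer tracer(&seen);
        if (do_throw) {
            throw std::runtime_error("example failure");
        }
    } catch (const std::exception &) {
        // The tracer destructor has already run by the time we get here.
    }
    return seen;
}
```

The destructor runs during unwinding, before any catch block is entered, which is why a check that only works inside a handler is not useful there.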
Zygo Blaxell	03532effed	trace: move BeesTrace and BeesNote into their own translation unit This allows these components to be used by test executables without pulling in all of bees, and more rapidly iterate their code. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	1fd26a03b2	tracer: annotate both ends of the stack trace Add a matching "--- BEGIN TRACE..." line to complement the "--- END TRACE..." line. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	b2d4a07c6f	roots: add a TRACE for transid_max search and crawl_transid thread Some users are hitting an exception somewhere in crawl_transid, which forces bees to return back to the transid_max calculation over and over. Also out-of-range transids. Add some BEESTRACE so we can see what we were doing in the exception handler. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	7008c74113	bees: trace and log improvements during roots and context startup Currently if crawl throws an exception, we don't have basic information about what was being crawled or even if the crawler was running at all. These traces also help identify the causes of early exception failures. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	5f0f7a8319	bees: increase StringFile size limit If we are going to dedupe thousands of subvols, we are going to need a bigger beescrawl.dat. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	ee86b585a5	bees: use a reserved symbol name in BEESLOG "c" could be a local variable name, which would do interesting things to some log messages. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	cf4b5417c9	context: remove unnecessary copies These were added while debugging a crash that was fixed years ago. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	77ef6a0638	roots: split constructor into separate start method This allows us to use the fd cache and inode resolve functions without starting crawler threads. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	0f0da21198	context: track record extent reference counts This might be interesting information, though most of the motivation for this evaporated when kernel 5.7 came out. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	8a70bca011	bees: misc comment updates These have been accumulating in unpublished bees commits. Squash them all into one. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	20b8f8ae0b	bees: use helper function for readahead There seem to be multiple ways to do readahead in Linux, and only some of them work. Hopefully reading the actual data is one of them. This is an attempt to avoid page-by-page reads in the generic dedupe code. We load both extents into the VFS cache (read sequentially) and hope they are still there by the time we call dedupe on them. We also call readahead(2) and hopefully that either helps or does nothing. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:54 -04:00
Zygo Blaxell	0afd2850f4	cache: emit log messages when clearing FD cache This enables us to correlate FD cache clears with external events such as btrfs inode eviction storms. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:56:46 -04:00
Zygo Blaxell	ffac407a9b	roots: clean up crawl_master Remove some broken #if 0 code, and take advantage of new Task non-repeating execution semantics. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:49:15 -04:00
Zygo Blaxell	4f032ab85b	context: report Task instance count Report the number of Task objects that currently exist as well as the number on the global work queue. THREADS (work queue 298 of 2385 tasks, 16 workers): This helps spot leaks, since Task objects that are blocked on other Task post-exec queues are otherwise invisible. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:49:15 -04:00
Zygo Blaxell	0bbaddd54c	docs: finally concede that the consensus spelling is "dedupe" Change documentation and comments to use the word "dedupe," not "dedup" as found in circa-3.15 kernel sources. No changes in code or program output--if they used "dedup" before, they will continue to be spelled "dedup" now. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:49:15 -04:00
Zygo Blaxell	fbd1091052	options: remove default 8 CPU thread limit Higher CPU core counts became more common, and kernel bugs became less common, since the arbitrary 8-thread limit was introduced. We can remove the limit now, and treat any remaining scaling inefficiency as a bug to be removed. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:49:15 -04:00
Zygo Blaxell	5b72f35657	src: bees depends on libcrucible.a The dependency was missing, so changes to the library would not trigger a rebuild of the bees binary. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-06-11 20:49:15 -04:00
Zygo Blaxell	80c69f1ce4	context: get rid of shared_ptr<BeesContext> in every single cached Fd object Support for multiple BeesContext objects sharing a FdCache was wasting significant space and atomic inc/dec memory cycles for no good reason since the shared-FdCache feature was deprecated. open_root and open_root_ino still need a BeesContext to work. Pass the BeesContext pointer through the function object instead of the cache key arguments. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-04-28 21:54:00 -04:00
Zygo Blaxell	db65031c2b	context: get rid of all instances of pthread_cancel pthread_cancel doesn't really work properly. It was only being used in bees to bring threads to a stop if the BeesContext is destroyed early. It is frequently implicated in core dump reports because of the fragility of the C++ iostream / C stdio / library infrastructure, particularly surrounding upgrades on the host running bees. The pthread_cancel call itself often simply fails even when it doesn't call terminate(). Defer creation of the status and progress threads until after the BeesContext::start method is invoked. At that point, the existing ask-threads-nicely-to-stop code is up and running, and normal condvars can be used to bring bees to a stop, without having to resort to pthread_cancel. Since we're deleting half of the BeesContext constructor in this change, let's remove the other half too, and put an end to the deprecated support for multiple BeesContexts sharing a process. It's still possible to run multiple BeesContexts, but they will not share a FD cache. This will allow the FD cache's keys to become smaller and hopefully save some memory later on. Fixes: #171 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-04-28 21:42:03 -04:00
Zygo Blaxell	bcf3e7de3e	uuid: drop dependency on uuid.h The weird things distros do to the path where uuid.h gets installed have broken bees builds for the last time. We were only using uuid to support a legacy feature that was removed over four years ago. Hypothetical users who are upgrading directly from bees v0.1 should probably restart all the crawlers anyway--there were bugs. Also, if any such users exist, I respect their tremendous patience with the horrible performance all these years--bees got about 30x faster since v0.1. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-04-23 08:16:50 -04:00
Zygo Blaxell	7f660f50b8	lib: fs: stop using libbtrfs-dev helper functions to re-enable buffer length checks The Linux kernel's btrfs headers are better than the libbtrfs-dev headers: - the libbtrfs-dev headers have C++ language compatibility issues - upstream version in Linux kernel is more accurate and up to date - macros in libbtrfs-dev's ctree.h hide information that would enable bees to perform runtime buffer length checking - enum types whose presence cannot be detected with #ifdef When accessing members of metadata items from the filesystem, we want to verify that the member we are accessing is within the boundaries of the item that was retrieved; otherwise, a memory access violation may occur or garbage may be returned to the caller. A simple C++ template, given a pointer to a structure member and a buffer, can determine that the buffer contains enough bytes to safely access a struct member. This was implemented back in 2016, but left unused due to ctree.h issues. Some btrfs metadata structures have variable length despite using a fixed-size in-memory structure. The members that appear earliest in the structure contain information about which following members of the structure are used. The item stored in the filesystem is truncated after the last used member, and all following members must not be accessed. 'btrfs_stack_*' accessor macros obscure the memory boundaries of the members they access, which makes it impossible for a C++ template to verify the memory access. If the template checks the length of the entire structure, it will find an access violation for variable-length metadata items because the item is rarely large enough for the entire structure. Get rid of all the libbtrfs-dev accessor macros and reimplement them with the necessary buffer length checks. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-02-22 20:06:43 -05:00
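The kind of template described above might look roughly like this. It is a sketch under stated assumptions, not the code in bees: `SketchItem` is a hypothetical layout rather than a real btrfs structure, `sketch_get_member` is an invented name, and the structure is assumed to be a plain standard-layout type.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <stdexcept>

// Hypothetical on-disk item layout, not a real btrfs structure.
struct SketchItem {
    uint64_t generation;
    uint64_t num_bytes;
    uint64_t optional_tail;  // absent when the stored item is truncated
};

// Copy one member of S out of the byte range [begin, end), throwing if the
// member does not lie entirely inside the range.
template <class S, class T>
T sketch_get_member(T S::*member, const uint8_t *begin, const uint8_t *end) {
    // Measure the member's offset using a local object of type S.
    S probe{};
    const std::size_t offset = static_cast<std::size_t>(
        reinterpret_cast<const char *>(&(probe.*member)) -
        reinterpret_cast<const char *>(&probe));
    const std::size_t available = static_cast<std::size_t>(end - begin);
    // Check only the bytes of this member, not sizeof(S), so that truncated
    // variable-length items remain readable up to their last stored member.
    if (offset + sizeof(T) > available) {
        throw std::out_of_range("item too short for requested member");
    }
    T value;
    std::memcpy(&value, begin + offset, sizeof(T));
    return value;
}
```

With a 16-byte buffer, reading `&SketchItem::num_bytes` succeeds while reading `&SketchItem::optional_tail` throws, even though the buffer is shorter than `sizeof(SketchItem)` in both cases.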
Zygo Blaxell	6eb7afa65c	build: include localconf everywhere Overriding makeflags did not work from localconf in the src, lib, or test directories. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-02-22 20:06:43 -05:00
Zygo Blaxell	10af3f9763	bees: remove si_addr_lsb from siginfo debug message to fix FTBFS Apparently it is missing in newer Linux headers, making builds fail. We don't need it, so remove it. Closes: #160 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-21 19:26:22 -05:00
Zygo Blaxell	636e69267e	resolve: add bees.h constants for balance and logical_ino serialization Make these workarounds configurable in src/bees.h instead of #if 0 code blocks. Someday we'll make the constants in bees.h configurable through a file or similar. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 18:07:36 -05:00
Zygo Blaxell	ab5316a3da	src: use correct flags for compiling .c files, fix missing dependencies fiewalk and fiemap depend on a lot of crucible, and incremental builds fail hard without proper dependency tracking. All binaries must be rebuilt when makeflags changes. This dependency exists already in lib and test, but src was missing. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 17:54:52 -05:00


224 Commits