GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2025-09-19 03:22:16 +02:00

Author	SHA1	Message	Date
Kai Krakow	17e1171464	Installation: Remove USR_PREFIX from Makefile This commit removes USR_PREFIX and introduces ETC_PREFIX instead. The purpose of PREFIX is the installation prefix in the system, not the installation destination. The latter one is what DESTDIR is used for. This should clear up the confusion. PREFIX was already mis-used as installation destination. But that doesn't mix well with how the make targets are designed. CC: Timofey Titovets <nefelim4ag@gmail.com> Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:59:52 +02:00
Kai Krakow	9069201036	Scripts: Fix systemd unit not being templated Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:21:08 +02:00
Kai Krakow	ace814321f	Makefile: Auto-detect systemd unit path This uses pkg-config to detect the system unit dir. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:21:08 +02:00
Kai Krakow	451f0ad9aa	Makefile: Allow installation of fiemap/fiewalk support tools There's now a new make target called "install_tools" which would not run by default on installation. One can add "OPTIONAL_INSTALL_TARGETS=install_tools" into localconf to install these by default. fiewalk would be installed to sbin, as only root can run it, the other goes to bin. Gentoo can use this to optionally install these tools as a package feature. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:20:59 +02:00
Kai Krakow	85f9265034	Makefile: make installing libs a separate target This will allow installing fiemap/fiewalk support tools as an optional install target. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:13:27 +02:00
Kai Krakow	5b28aad27f	Makefile: Run install tests only for default target "reallyall" Otherwise, tests would still run during "make install". Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:13:27 +02:00
Kai Krakow	6c47bb61c1	Makefile: remove tests from "make all" Instead, introduce "make reallyall" and make it the default target. Now, one can override the default target using localconf. Needed for preparing Gentoo ebuild test behavior. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-09-08 02:13:27 +02:00
Timofey Titovets	2d14fd90e4	Update options in sample config Signed-off-by: Timofey Titovets <nefelim4ag@gmail.com>	2018-08-29 11:44:25 +03:00
Timofey Titovets	e0f315d47a	Make beesd -h useful Signed-off-by: Timofey Titovets <nefelim4ag@gmail.com>	2018-08-29 11:44:25 +03:00
Zygo Blaxell	e564d27dda	README: update known bugs and issues list Also split "bad feature interactions" into "unknown" (which is what it really was before) and "bad" (which includes some filesystem-destroying problems). Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-05-18 00:16:09 -04:00
Zygo Blaxell	c3effe0a20	crawl: use custom order instead of (ab)using BeesFileRange::operator< This makes the code clearer and keeps changes to BeesFileRange ordering isolated. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-05-18 00:16:08 -04:00
Zygo Blaxell	f8c27f5c6a	bees: revert TOXIC_INTERVAL back to pre-4.14 levels Linux kernel 4.14, while resistant to extent toxicity, is not immune to it. Go back to the paranoid setting to avoid tying up filesystems in ridiculously long kernel loops in find_parent_nodes. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-05-18 00:16:08 -04:00
Zygo Blaxell	26039cd559	tempfile: update comments around bees_sync Deadlock reproduced on kernel 4.14.34. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-05-18 00:16:04 -04:00
Zygo Blaxell	e9aef89293	fs: fix FTBFS on GCC 8 The memset is just doing an assignment from one dereferenced pointer to another, so do an assignment to keep GCC 8 happy. Fixes: https://github.com/Zygo/bees/issues/64 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-05-18 00:15:37 -04:00
Zygo Blaxell	c21518d8ff	stats: rename "chase_wrong_data" to "chase_no_data" An empty BeesBlockData from the chasing algorithm used to mean that data was found at the expected location but it does not match; however, there are now other reasons for this and they occur much more often. The name is misleading. Change the name to report more correctly what happens: no data, without any guess about the reason. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-03-01 00:01:13 -05:00
Zygo Blaxell	082f04818f	BeesBlockData: fix data type issues Not sure if these cause any problems, but they are theoretically incorrect data types. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-28 23:58:28 -05:00
Zygo Blaxell	5bdad7fc93	crucible: progress: a progress tracker for worker queues The task queue can become very large with many subvols, requiring hours for the queue to clear. 'beescrawl.dat' saves in the meantime will save the work currently scheduled, not the work currently completed. Fix by tracking progress with ProgressTracker. ProgressTracker::begin() gives the last completed crawl position. ProgressTracker::end() gives the last scheduled crawl position. begin() does not advance if there is any item between begin() and end() is not yet completed. In between are crawled extents that are on the task queue but not yet processed. The file 'beescrawl.dat' saves the begin() position while the extent scanning task queue is fed from the end() position. Also remove an unused method crawl_state_get() and repurpose the operator<(BeesCrawlState) that nobody was using. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-28 23:49:39 -05:00
Zygo Blaxell	90c32c3f05	crucible: MAP_32BIT is not defined on ARM Also fix a stray #if that should be #ifdef. Closes: https://github.com/Zygo/bees/issues/59 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-25 10:08:44 -05:00
Zygo Blaxell	33d274eabd	resolve: break up long intra-extent dedup loops When both block candidates for dedup are located in the same extent, bees excludes them from deduplication because the dedup operation would not free any space (both blocks are still referenced, so neither is deleted). Candidates in other extents are still considered. Typically a few blocks are duplicated many thousands or even millions of times within a filesystem. Many of these blocks appear in the same extent as each other. In cases where an extent contains an extremely common duplicate block, it may appear multiple times in many extents. bees can get into a loop with a very bad worst-case running time: 32768 blocks per extent * 2560 bees reference limit * 256 distinct hash table entries = 21.5 billion iterations...squared, because this loop happens every time bees encounteres any of the references. Not an infinite number, but close enough. In each iteration of the loop, replace_dst detects that both src and dst block are part of the same btrfs extent data item and therefore should not be deduped; however, this occurs after the block has been allocated and read by chase_extent_ref. This dst is discarded, but the outer loop tries again with another reference to the same block and gets the same result. An easy fix for this problem is to stop the loop immediately when the same physical extent is found in both src and dst. The condition is rare enough to ignore the negligible space efficiency loss, and filesystem scan stops dead if the loop is allowed to proceed. An exception is thrown to terminate the loop at scan_one_extent from within replace_dst. It would be better to determine the extent bytenr of each candidate extent and filter them out in scan_one_extent (which reduces the number of LOGICAL_INO calls as a side-effect), but bees has no code capable of doing extent data tree lookups with backward iteration yet. Even better would be to change the hash table format so that the extent bytenr can be decoded directly from the hash table entry (this already exists for compressed extents). Both of these changes are too large for v0.6. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-25 10:08:42 -05:00
Zygo Blaxell	2ac94438bd	README: FD caches are now cleared every 10 transactions Also some other minor editorial changes. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-14 21:09:05 -05:00
Zygo Blaxell	9063c6442f	README: clarify that bees is not to be used on old kernels Also note that there is currently no released Linux kernel that is free of relevant bugs. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-14 20:54:48 -05:00
Zygo Blaxell	86afa69cd1	cache: release lock before clearing Clearing the FD cache could trigger a lot of inode evicts in the kernel, which will block the cache entry destructors called by map::clear(). This prevents any cache lookups or new file opens while it happens. Move the map to an auto variable and destroy it after releasing the mutex lock. This probably has the same net result (all the bees threads will be blocked in the kernel instead of on a bees mutex), but at least the problem is outside of userspace now. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-07 23:14:38 -05:00
Zygo Blaxell	8f0e88433e	roots: get rid of common error messages, add more error counters One very common case is losing a race to open a file that was deleted. No need to spam the logs with mere ENOENT reports. Other errors are more significant. Log those with errno, and add event counters to record them. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-07 23:12:01 -05:00
Zygo Blaxell	5c1b45d67c	extentwalker: remove wrong constraint check Extents that extend past EOF will have ipos = (file size rounded up to next block) and e.end() = (file size not rounded), which fails this constraint check. The constraint check is wrong. Remove it for now. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-02-07 00:07:57 -05:00
Zygo Blaxell	6aad124241	crawl: somebody should set max_transid The previous commit had both max_transid assigments commented out. It happens to work because we set max_transid in the constructor and it doesn't change after that, but it's cleaner to assign it explicitly. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-31 22:52:12 -05:00
Zygo Blaxell	087ec26c44	crawl: filter extents correctly When an extent ref is modified, all of the refs in the same metadata page get the same transid in the TREE_SEARCH_V2 header. This causes two problems: - Extents with generation < min_transid are included if they happen to be referenced by pages with generation >= min_transid. - Extent refs with generation > max_transid are excluded even if they reference extents with generation <= max_transid. Both of these are wrong: the first causes some extents to be repeatedly scanned, the second causes some extents to not be scanned at all. Change the TREE_SEARCH_V2 parameters so that Crawl sees all extents newer than min_transid (i.e. set max_transid to max). The TREE_SEARCH_V2 kernel logic already operates this way, i.e. it fetches every page with transid >= min_transid and discards newer items if they are too new for max_transid. Filter strictly by the extent reference generation field (i.e. the copy of the extent generation that is in the extent reference). Note this still scans extent data multiple times, but it should now be exactly once per extent reference. A proper fix for this requires extent-based scanning instead of extent-ref-based scanning. Formerly commit `5a8c655fc4` "roots: filter out obsolete extents from extent refs" which landed in the subvol-threads branch but not master. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-31 22:48:39 -05:00
Kai Krakow	408b6ae138	Code style: Fix wrong indentation This had spaces instead of tabs by accident. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Kai Krakow	e3c4a07216	Makefile: Unclutter "make test" output This adds a .txt Makefile target to create a text file which receives the test program output. In case the test failed, it will cat the contents and fail the target. Execution of each test itself is forced, so it would run every time make is invoked, thus no failing test would be missed. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Kai Krakow	d8241a7720	README: Add notes about packaging Give some pointers on how to package bees for a distribution. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Kai Krakow	5590fc0b13	Cmdline: Fix text alignment Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Kai Krakow	29d40ca359	Cmdline: Rename "relative-paths" to "strip-paths" The previous name didn't match what this option really does. Affects: #41 Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Kai Krakow	b164717a25	Cmdline: Rename "notimestamps" to "no-timestamps" That aligns better with the other options. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-29 21:37:40 -05:00
Zygo Blaxell	af250f7732	roots: determine transid_max without open()ing every subvol root Scan the roots tree directly for roots other than 5 (the FS root), and use btrfs_get_root_transid on root_fd for root 5. This avoids filling up the root FD cache every time we want a new transid_max. Now the only reason we open a subvol root FD is to open a file within the subvol. transid_max may be the same as the FS root's transid, in which case the search loop is not necessary. Place a counter (transid_max_miss) to see if we ever need to look at root items. If this counter never goes above zero, or does so very rarely, we can delete the search loop. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 21:37:39 -05:00
Zygo Blaxell	4f0bc78a4c	crawl: don't block a Task waiting for new transids Task should not block for extended periods of time. Remove the RateEstimator::wait_for() in crawl_roots. When crawl_roots runs out of data, let the last crawl_task end without rescheduling. Schedule crawl_task again on transid polls if it was not already running. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 21:37:39 -05:00
Zygo Blaxell	b67fba0acd	log: BEESLOGNOTE doesn't do what we think it does BEESLOGNOTE was intended to combine BEESLOG and BEESNOTE, i.e. write a log message and set the task status message from a single expression. With the log levels we would now need several more variants (BEESLOGNOTEDEBUG, BEESLOGNOTEERR...) or a parameter (BEESNOTELOG(DEBUG, ...)). Or we give up on the idea. This combination was used only 3 times so far. The log messages and the note message have different editorial styles. Remove the three instances of BEESLOGNOTE, and make the BEESLOGNOTE definition equvalent to BEESLOG at LOG_NOTICE level for consistency. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 21:37:38 -05:00
Zygo Blaxell	92fda34a68	task: allow user access to ID and default constructor The default constructor makes it more convenient to use Task as a class member. The ID is useful to disambiguate Task references. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:54:06 -05:00
Zygo Blaxell	2aacdcd95f	time: add update_monotonic to RateEstimator update_monotonic does not reset the counter if a new count is smaller than earlier counts. Useful when consuming an unsorted stream of eveent counts. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:51:13 -05:00
Zygo Blaxell	d367c6364c	context: improve toxic match logs Reword log message for discovery of new toxic extents vs. lookup of previously known toxic extents. Also add the block data (especially filename) to the discovery message. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:48:06 -05:00
Zygo Blaxell	591a44e59a	resolve: drop support for old-style compressed BeesAddr No public version of bees ever created old-style compressed hash table entries. Remove the code that supports them. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:48:06 -05:00
Zygo Blaxell	27125b8140	README: add scan-mode 2 and expand descriptions of modes 0 and 1 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:48:06 -05:00
Zygo Blaxell	636328fdc2	roots: add scan-mode 2 "oldest crawler first" Add a third scan mode with alternative trade-offs. Benefits: Good sequential read performance. Avoids race conditions described in https://github.com/Zygo/bees/issues/27. Avoids diverting scan resources into short-lived snapshots before their long-lived origin subvols are fully scanned. Drawbacks: Takes the longest time of the three implemented scan-modes to free space in extents that are shared between snapshots. Uses the maximum amount of temporary space. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-29 00:48:05 -05:00
Zygo Blaxell	ef44947145	roots: move common code for creating crawl Tasks into a method Duplicated code between the different scan modes has slowly been becoming less and less trivial. Move the code to a method and make both scan-modes call it. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-28 22:52:17 -05:00
Zygo Blaxell	72cc9c2b60	ExtentWalker: increase efficiency for typical btrfs extent sizes Perf was blaming more than 50% of cycles on TREE_SEARCH_V2. strace showed 4 TREE_SEARCH_V2 calls for every pread in grow_backward(). Fix by increasing the extent fetch batch size so it is more likely to include the desired items in the first fetch attempt. This removes TREE_SEARCH_V2 from the top 10 list of cycle consumers. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-28 22:52:07 -05:00
Zygo Blaxell	e74c0a9d80	scan: fix length mismatch exception for prealloc extents at EOF Prealloc extent sizes were taken from the Extent object and did not take the file size into account. If a file with a non-4K-aligned size is preallocated, the resulting dedup fails with an exception because the size of both ranges of the BeesRangePair do not match. Limit the size of the replacement hole extent to not extend past the end of the file. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-28 01:46:08 -05:00
Zygo Blaxell	762f833ab0	roots: poll every 10 transids Restartng scans for each transid is a bit aggressive. Scan every 10 transids for a polling rate close to the former BEES_COMMIT_INTERVAL. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00
Zygo Blaxell	48e78bbe82	roots: use RateEstimator as a transid_max cache and clean up logs transid_max is now measured at a single point in the crawl_transid thread. Move the Crawl deferred logic into BeesRoots so it restarts all crawls when transid_max increases. Gets rid of some messy time arithmetic. Change name of Crawl thread to "crawl_master" in both thread name and log messages. Replace "Next transid" with "Crawl started". Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00
Zygo Blaxell	ded26ff044	FdCache: clear cache on every new transid / crawl cycle The periodic cache age check was not protected by a lock, so multiple threads may decide to concurrently clear the cache. This led to duplicate log messages. Fix by moving the cache expiry trigger out of FdCache and into Roots, which knows when transids change and can perform cache clears at exactly the time they are most relevant, i.e. after something that was deleted becomes permanently so. This removes the last references to BEES_COMMIT_INTERVAL, so get rid of its definition too. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00
Zygo Blaxell	72857e84c0	crawl: combine two messages per crawl cycle into one Now that the polling interval is up to 30 times faster, next_transid seems too verbose again. Make it clearer that the interval quoted in the "Deferring..." message is the computed transaction polling interval. Combine "Next transid" and "Restarted crawl" into a single message. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00
Zygo Blaxell	0fdae37962	roots: use RateEstimator to track transids Make the crawl polling interval more closely track the commit interval on the btrfs filesystem. In the future this will provide opportunities to do things like clear FD caches and stop crawls on deleted subvols, but triggered by transaction commits instead of arbitrary time intervals. Rename the "crawl" thread so it no longer has the same name as the "crawl" task, and repurpose it for dedicated transid polling. Cancel the deletion of crawl_thread and repurpose it to trigger new crawls and wake up the main crawl Task when it runs out of data. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00
Zygo Blaxell	4694c7d250	time: add RateEstimator, a class for optimally polling irregular external events RateEstimator estimates the rate of external events by sampling a counter. Conversion functions are provided to predict the time when the event counter will be incremented to particular values based on past observations of the event counter. Synchronization functions are provided to block a thread until a specific counter value is reached. Event polling is supported using the history of previous event counts to determine the predicted time of the next event. A decay function emphasizes more recent event history. Polling delays are bounded by minimum and maximum values in the constructor parameters. wait_for() and wait_until() block the calling thread until the target event count is reached (or the counter is reset). These functions are not bounded by min_delay or max_delay, and require a separate tread to call update(). wait_for() waits for the counter to be incremented from its current value by the given count. wait_until() waits for the counter to reach an absolute value. update() counts external events and unblocks threads that are blocked in wait_for() or wait_until(). If the event counter decreases then it is reset to the new value. duration() and time_point() convert relative and absolute event counts into relative and absolute C++11 time quantities based on the last update time, last observed event count, and the observed event rate. Convenience functions seconds_for() and seconds_until() calculate polling delays for for the desired relative and absolute event counts respectively. These delays are bounded by max and min delay parameters. rate() and ratio() provide conversion factors based on the current estimated event rate. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-26 23:48:05 -05:00

... 2 3 4 5 6 ...

424 Commits