mirror of https://github.com/Zygo/bees.git synced 2025-06-17 10:06:16 +02:00
Commit Graph

74 Commits

Author SHA1 Message Date
96eb100ded bees: use readahead instead of posix_fadvise
Other btrfs utils use readahead() not posix_fadvise().

There does not appear to be a performance or correctness difference
between the three (none, posix_fadvise, or readahead()).

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-09-14 23:50:00 -04:00
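
A rough sketch of the two prefetch hints being compared, assuming a plain file descriptor and byte range (illustration only, not the code in this commit):

        #include <fcntl.h>       // posix_fadvise(); readahead() needs _GNU_SOURCE, which g++ defines
        #include <sys/types.h>   // off_t, size_t

        // Hypothetical helper: hint the kernel to prefetch [offset, offset + size)
        // of fd before the blocks are read and hashed.
        static void hint_readahead(int fd, off_t offset, size_t size)
        {
            // Linux-specific call used by other btrfs utilities.
            (void)readahead(fd, offset, size);

            // Portable alternative; per the commit, no measurable difference:
            // (void)posix_fadvise(fd, offset, size, POSIX_FADV_WILLNEED);
        }
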
b22db12390 context: log dedups with single unbroken log message
When BEESLOGINFO is called multiple times it generates separate log
records that can be mixed up when multiple threads dedup.

Use a single BEESLOGINFO call for each dedup to prevent this.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-09-14 23:50:00 -04:00
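
A minimal sketch of the single-call pattern, using a hypothetical LOG_INFO macro rather than the real BEESLOGINFO:

        #include <iostream>
        #include <mutex>
        #include <sstream>

        // Format the whole record first, then emit it under one lock, so
        // concurrent dedup threads cannot interleave fragments of each
        // other's lines.
        static std::mutex log_mutex;
        #define LOG_INFO(expr) do { \
            std::ostringstream oss_; \
            oss_ << expr; \
            std::lock_guard<std::mutex> lock_(log_mutex); \
            std::cout << oss_.str() << std::endl; \
        } while (0)

        // One call per dedup event, instead of separate calls for src, dst, and result:
        // LOG_INFO("dedup: src " << src_path << " dst " << dst_path << " len " << length);
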
33d274eabd resolve: break up long intra-extent dedup loops
When both block candidates for dedup are located in the same extent, bees
excludes them from deduplication because the dedup operation would not
free any space (both blocks are still referenced, so neither is deleted).
Candidates in other extents are still considered.

Typically a few blocks are duplicated many thousands or even millions
of times within a filesystem.  Many of these blocks appear in the same
extent as each other.  In cases where an extent contains an extremely
common duplicate block, it may appear multiple times in many extents.
bees can get into a loop with a very bad worst-case running time:  32768
blocks per extent * 2560 bees reference limit * 256 distinct hash table
entries = 21.5 *billion* iterations...squared, because this loop happens
every time bees encounters any of the references.  Not an infinite
number, but close enough.

In each iteration of the loop, replace_dst detects that both src and dst
block are part of the same btrfs extent data item and therefore should
not be deduped; however, this occurs after the block has been allocated
and read by chase_extent_ref.  This dst is discarded, but the outer
loop tries again with another reference to the same block and gets the
same result.

An easy fix for this problem is to stop the loop immediately when the
same physical extent is found in both src and dst.  The condition is rare
enough to ignore the negligible space efficiency loss, and filesystem
scan stops dead if the loop is allowed to proceed.  An exception is
thrown from within replace_dst and caught at scan_one_extent to terminate the loop.

It would be better to determine the extent bytenr of each candidate
extent and filter them out in scan_one_extent (which reduces the number
of LOGICAL_INO calls as a side-effect), but bees has no code capable of
doing extent data tree lookups with backward iteration yet.  Even better
would be to change the hash table format so that the extent bytenr can
be decoded directly from the hash table entry (this already exists for
compressed extents).  Both of these changes are too large for v0.6.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-02-25 10:08:42 -05:00
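
A sketch of the early-exit path, using hypothetical names rather than bees's real types:

        #include <cstdint>
        #include <stdexcept>

        // Once a candidate dst resolves to the same physical extent bytenr as
        // src, give up on this hash value instead of walking the remaining
        // references to the same extent (worst case ~21.5 billion iterations:
        // 32768 blocks * 2560 refs * 256 hash entries, squared).
        struct SameExtentException : std::runtime_error {
            SameExtentException() : std::runtime_error("src and dst share a physical extent") {}
        };

        inline void check_candidate(uint64_t src_extent_bytenr, uint64_t dst_extent_bytenr)
        {
            if (src_extent_bytenr == dst_extent_bytenr) {
                // Thrown from replace_dst, caught around scan_one_extent.
                throw SameExtentException();
            }
        }
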
d367c6364c context: improve toxic match logs
Reword log message for discovery of new toxic extents vs. lookup of
previously known toxic extents.  Also add the block data (especially
filename) to the discovery message.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-29 00:48:06 -05:00
e74c0a9d80 scan: fix length mismatch exception for prealloc extents at EOF
Prealloc extent sizes were taken from the Extent object and did not
take the file size into account.  If a file with a non-4K-aligned
size is preallocated, the resulting dedup fails with an exception
because the size of both ranges of the BeesRangePair do not match.

Limit the size of the replacement hole extent to not extend past the
end of the file.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-28 01:46:08 -05:00
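
A sketch of the clamping, with hypothetical names:

        #include <algorithm>
        #include <cstdint>

        // The hole that replaces a prealloc extent must not extend past EOF,
        // or the two sides of the range pair have different lengths and the
        // dedup throws.
        inline uint64_t clamped_hole_length(uint64_t extent_begin, uint64_t extent_length, uint64_t file_size)
        {
            uint64_t end = std::min(extent_begin + extent_length, file_size);
            return end > extent_begin ? end - extent_begin : 0;
        }
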
ded26ff044 FdCache: clear cache on every new transid / crawl cycle
The periodic cache age check was not protected by a lock, so multiple
threads may decide to concurrently clear the cache.  This led to
duplicate log messages.

Fix by moving the cache expiry trigger out of FdCache and into Roots,
which knows when transids change and can perform cache clears at exactly
the time they are most relevant, i.e. after something that was deleted
becomes permanently so.

This removes the last references to BEES_COMMIT_INTERVAL, so get rid
of its definition too.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-26 23:48:05 -05:00
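
A minimal sketch of the new trigger, with hypothetical names:

        #include <cstdint>
        #include <mutex>

        // Roots owns the "did the transid change?" check, so the FD caches are
        // cleared exactly once per new transaction instead of whenever a timer
        // happens to fire in several threads at once.
        template <class FdCacheType>
        void clear_cache_on_new_transid(std::mutex &m, uint64_t &last_transid,
                                        uint64_t current_transid, FdCacheType &fd_cache)
        {
            std::unique_lock<std::mutex> lock(m);
            if (current_transid == last_transid) return;
            last_transid = current_transid;
            lock.unlock();
            fd_cache.clear();   // deleted files are permanently unreachable now
        }
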
f6909dac17 bees: drop BEESINFO
Having too many "write a message to the log" primitives is confusing,
and having one that intermittently and silently discards output is even
_more_ confusing.

Replace all BEESINFO with appropriate BEESLOG*s.  Usually DEBUG.
Except for one or two that occur too often.  Just delete those.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-26 23:48:05 -05:00
677da5de45 Logging: Add log levels to output
This commit adds log levels to the output. Under systemd they produce colored
lines; otherwise the level is probably just a number. Bees is very chatty, so
this paves the way for log level filtering.

Signed-off-by: Kai Krakow <kai@kaishome.de>
2018-01-18 23:41:29 +01:00
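
A sketch of the convention this relies on under systemd (not the exact bees code): journald reads a leading "<N>" as the syslog priority, which is what drives the coloring and filtering.

        #include <iostream>
        #include <string>

        // Syslog priorities: 3 = err, 4 = warning, 6 = info, 7 = debug.
        enum LogLevel { LOG_LEVEL_ERR = 3, LOG_LEVEL_WARNING = 4, LOG_LEVEL_INFO = 6, LOG_LEVEL_DEBUG = 7 };

        inline void log_line(LogLevel level, const std::string &msg)
        {
            // Under systemd the "<N>" prefix is consumed by journald; on a
            // plain terminal it is just a number in front of the line.
            std::cerr << "<" << static_cast<int>(level) << ">" << msg << "\n";
        }
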
055c8d4c75 roots: scan in parallel using Tasks
Distribute incoming extents across a thread pool for faster execution
on multi-core, multi-disk environments.

Switch extent enumeration model to scan extent refs consecutively(ish).

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-17 22:52:00 -05:00
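
Not the bees Task library, but a minimal pool sketch of the model this moves to, where each unit of scan work becomes a queued task that any worker thread can pick up:

        #include <condition_variable>
        #include <functional>
        #include <mutex>
        #include <queue>
        #include <thread>
        #include <vector>

        class TaskPool {
            std::mutex m_mutex;
            std::condition_variable m_cond;
            std::queue<std::function<void()>> m_queue;
            std::vector<std::thread> m_workers;
            bool m_stop = false;

            void worker()
            {
                while (true) {
                    std::unique_lock<std::mutex> lock(m_mutex);
                    m_cond.wait(lock, [this] { return m_stop || !m_queue.empty(); });
                    if (m_queue.empty())
                        return;                  // m_stop set and nothing left to do
                    auto task = std::move(m_queue.front());
                    m_queue.pop();
                    lock.unlock();
                    task();                      // e.g. scan one extent ref
                }
            }

        public:
            explicit TaskPool(size_t nthreads)
            {
                for (size_t i = 0; i < nthreads; ++i)
                    m_workers.emplace_back(&TaskPool::worker, this);
            }

            void submit(std::function<void()> task)
            {
                {
                    std::lock_guard<std::mutex> lock(m_mutex);
                    m_queue.push(std::move(task));
                }
                m_cond.notify_one();
            }

            ~TaskPool()
            {
                {
                    std::lock_guard<std::mutex> lock(m_mutex);
                    m_stop = true;
                }
                m_cond.notify_all();
                for (auto &t : m_workers)
                    t.join();
            }
        };
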
a175ee0689 bees: clean up #if 0 ... fsync ... #endif code
Remove some dead code because dedup-related deadlocks have not been
observed since Linux kernel v4.11.

Preserve rationale of remaining #if 0 block (why we do write/rename
instead of write/fsync/rename) so that people don't try to replace the
"missing" fsync() there.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-17 22:30:07 -05:00
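
A sketch of the write/rename pattern whose rationale the remaining #if 0 block documents (hypothetical paths and helper):

        #include <cstdio>
        #include <fstream>
        #include <string>

        // Write the state to a temporary name, then rename over the old file,
        // so readers only ever see a complete file.  The fsync() between write
        // and rename is deliberately omitted; see the comment kept in the source.
        inline bool save_state(const std::string &path, const std::string &contents)
        {
            const std::string tmp = path + ".tmp";
            {
                std::ofstream out(tmp, std::ios::trunc);
                out << contents;
                if (!out) return false;
            }   // ofstream closed here; no fsync() on purpose
            return std::rename(tmp.c_str(), path.c_str()) == 0;
        }
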
8d3a27bf85 subvol-threads: increase resource and thread limits
With kernel 4.14 there is no sign of the previous LOGICAL_INO performance
problems, so there seems to be no need to throttle threads using this
ioctl.

Increase the FD cache size limits and scan thread count.  Let the kernel
figure out scheduling.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2018-01-17 22:30:07 -05:00
77614a0e99 scan: insert toxic matched extents into hash table as they are discovered
When a toxic extent is discovered, insert the offending hash/address/toxic
entry into the hash table.

When a previously discovered toxic extent is encountered, do nothing,
i.e. allow the offending hash/address/toxic entry in the hash table
to expire.

Previously both inserts were removed from the code, but the former one
is required.  The latter prevents bees from forgiving toxic extents
(or any hash matching one) should they be relocated, deleted, or simply
become non-toxic.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2017-12-21 13:56:15 -05:00
732896b471 log: simplify output for dedup and scan
With many threads it is inconvenient to reassemble the elided parts of
the dedup src/dst and scan filenames output.  Simply output them
unconditionally, and balance the line lengths.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2017-09-16 17:30:30 -04:00
5cc5a44661 bees: drop unused BeesWorkQueue classes
Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2017-09-16 17:30:22 -04:00
3901962379 bees: trace calls to BeesResolver
This helps identify causes of the "same physical address in dedup"
exception.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
(cherry picked from commit cc7b4f22b5)
2017-06-17 10:15:11 -04:00
3fdc217b4f bees: change formatting for physical bytenr ranges in dedup
Use a different character to make it easier to search for bytenr ranges
in the logs.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
(cherry picked from commit d43199e3d6)
2017-06-17 10:15:08 -04:00
6c8d2bf428 bees: limit FD cache size explicitly
This will allow the default size limit for cache objects to be changed
with impunity.

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
(cherry picked from commit 9daa51edaa)
2017-06-17 10:15:08 -04:00
dc00dce842 context: purge FD cache every COMMIT_INTERVAL
Holding file FDs open for long periods of time delays inode destruction.
For very large files this can lead to excessive delays while bees dedups
data that will cease to be reachable.

Use the same workaround for file FDs (in the root_ino cache) that
is used for subvols (in the root cache):  forcibly close all cached
FDs at regular intervals.  The FD cache will reacquire FDs from files
that still have existing paths, and will abandon FDs from files that
no longer have existing paths.  The non-existing-path case is not new
(bees has always been able to discover deleted inodes) so it is already
handled by existing code.

Fixes: https://github.com/Zygo/bees/issues/18

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2017-02-08 22:01:00 -05:00
db8ea92133 bees: fix further instances of copy-after-unlock bug
Before:

        unique_lock<mutex> lock(some_mutex);
        // run lock.~unique_lock() because return
        // return reference to unprotected heap
        return foo[bar];

After:

        unique_lock<mutex> lock(some_mutex);
        // make copy of object on heap protected by mutex lock
        auto tmp_copy = foo[bar];
        // run lock.~unique_lock() because return
        // pass locally allocated object to copy constructor
        return tmp_copy;

Signed-off-by: Zygo Blaxell <bees@furryterror.org>
2017-01-22 22:00:27 -05:00
eec80944cd roots: add a counter for crawl_ms, open_root and open_root_ino
Linux kernel commit 7f8e406 ("btrfs: improve delayed refs iterations")
seems to dramatically improve LOGICAL_INO performance.  Hopefully this
commit will find its way into mainline Linux soon.

This means that most of the time in Bees is now spent on block reading
(50-75%); however, there is still a big gap between block read and
the sum of everything else we are measuring with the "*_ms" counters.
This gap is about 30% of the run time, so it would be good to find out
what's in the gap.

Add ms counters around the crawl and open calls to capture where we are
spending all the time.
2016-12-08 23:55:39 -05:00
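
A sketch of the kind of "*_ms" counter added here, with hypothetical names:

        #include <atomic>
        #include <chrono>
        #include <cstdint>

        // RAII timer: adds the elapsed milliseconds of a crawl or open call to
        // a shared counter when it goes out of scope.
        class TimedCounter {
            std::atomic<uint64_t> &m_total_ms;
            std::chrono::steady_clock::time_point m_start;
        public:
            explicit TimedCounter(std::atomic<uint64_t> &total_ms)
                : m_total_ms(total_ms), m_start(std::chrono::steady_clock::now()) {}
            ~TimedCounter()
            {
                auto elapsed = std::chrono::steady_clock::now() - m_start;
                m_total_ms += std::chrono::duration_cast<std::chrono::milliseconds>(elapsed).count();
            }
        };

        // Usage:  static std::atomic<uint64_t> crawl_ms{0};
        //         { TimedCounter timer(crawl_ms); crawl_one_pass(); }
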
642581e89a hash: remove the experimental shared hash-table and shared mmap features
The experiments are over, and the results were not a success.

Having two filesystems cohabiting in the same hash table results in a
lot of false positives, each of which requires some heavy IO to resolve.

Using MAP_SHARED to share a beeshash.dat between processes results in
catastrophically bad performance.

These features were abandoned long ago, but some of the code--and even
worse, its documentation--still remains.

Bees wants a hash table false positive rate below 0.1%.  With a shared
hash table the FP rate is about the same as the dedup rate.  Typically
duplicate files on one filesystem are duplicate on many filesystems.

One or more of Linux VFS and the btrfs mmap(MAP_SHARED) implementation
produce extremely poor performance results.  A five-order-of-magnitude
speedup was achieved by implementing paging in userspace with worker
threads.  We no longer need the support code for the MAP_SHARED case.

It is still possible to run many BeesContexts in a single process,
but now the only thing contexts share is the FD cache.
2016-12-02 00:26:02 -05:00
fdfa78a81b context: default and relative BEESHOME
Allow relative paths with BEESHOME.  These paths will be relative
to the root of the dedup target filesystem.

BEESHOME is now optional.  If not specified, '.beeshome' is used.

We don't try to create BEESHOME if it doesn't exist.  BEESHOME might
not be on a btrfs filesystem, so we can't insist it be a subvol.
2016-12-02 00:22:18 -05:00
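
A sketch of the lookup rules described above (hypothetical helper, not the actual bees code):

        #include <cstdlib>
        #include <fcntl.h>
        #include <string>

        // BEESHOME defaults to ".beeshome"; a relative value is resolved
        // against the root of the dedup target filesystem via openat().
        inline int open_beeshome(int fs_root_fd)
        {
            const char *env = std::getenv("BEESHOME");
            std::string beeshome = (env && *env) ? env : ".beeshome";
            // An absolute BEESHOME ignores fs_root_fd, as openat() specifies.
            return openat(fs_root_fd, beeshome.c_str(), O_RDONLY | O_DIRECTORY);
        }
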
1303fb9da8 build: fix FTBFS on GCC 6.2
I'm not surprised that GCC 6 doesn't let me send an ostream ref to itself,
even inside an uninstantiated template specialization.  I am a little
surprised I was trying to, and 4.9 let me get away with it.

It's 2016.  auto_ptr is deprecated now.

Some things were including vector that don't any more.

https://github.com/Zygo/bees/issues/1
2016-11-24 22:20:11 -05:00
cca0ee26a8 bees: remove local cruft, throw at github 2016-11-17 12:12:13 -05:00