GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2025-08-24 15:02:21 +02:00

Author	SHA1	Message	Date
Kai Krakow	c17618c371	README: Some things are simply no longer true Environment variables are no longer the /only/ option. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:47:04 -05:00
Kai Krakow	dee6f189bb	README: Fix markdown syntax error Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:47:04 -05:00
Kai Krakow	de6d7d6f25	Makefile: Get rid of test for-loop Tests could now be run in parallel. Additionally, single tests can be run by simply using "make testname", i.e. "make chatter" would run the chatter test. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:44:27 -05:00
Kai Krakow	63f249f005	Makefile: force rebuilding tests when Makefile changed Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:43:37 -05:00
Kai Krakow	ca1a3bed12	Makefile: -lXXXXX is really a filename parameter According to gcc docs, -l is converted to a filename which makes it a filename parameter. Let's move it to the end. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:43:21 -05:00
Kai Krakow	d6312c338b	Logging: Improve text layout when discarding log timestamps When timestamps are removed from logging, the current text layout shows lines like tid 12345 thread_name: Example log Let's convert it to a more conforming layout: thread_name[12345]: Example log Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-20 14:42:49 -05:00
Zygo Blaxell	5533d09b3d	Merge remote-tracking branch 'kakra/proposal/prepare-for-more-libs'	2018-01-20 14:23:55 -05:00
Zygo Blaxell	4c05c53d28	roots: update Task print functions for new usage This restores the old "crawl" prefix in the case of Crawler log messages. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 14:00:52 -05:00
Zygo Blaxell	5063a635fc	logging: get Task names for log messages When a Task worker thread is executing a Task, the thread name is less useful than the Task description. Use the Task description instead of the thread name if the thread has no BeesThread name and the thread is currently executing a task. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 14:00:51 -05:00
Zygo Blaxell	fef7aed8fa	BeesNote: if thread name was not set, get it from Task or pthread_getname_np Threads from the Task module in libcrucible don't set BeesNote::tl_name. Even if they did, in Task context the thread name is unspecific to the point of meaninglessness. Use the Task::print method as the name for such threads, and be sure that future Task print functions are designed for that usage. The extra complexity in BeesNote::get_name() seems preferable to bombarding pthread_setname_np hundreds or thousands of times per second. FIXME: we are now calling Task::print() on every BeesNote, which is effectively unconditionally. Maybe we should have Task::print() and get_name() return a closure, or just evaluate Task::print() once and cache it in TaskState, or define Task's constructor with a string argument instead of the current print_fn closure. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 13:57:51 -05:00
Zygo Blaxell	3f60a0efde	task: allow external access to Task print function This enables bees' thread introspection to use task descriptions in status and log messages. BeesNote will be calling Task::current_task() from non-Task contexts, which means we need to allow Task's shared state pointer to be null. Remove some asserts that will ruin our day in that case. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 13:51:05 -05:00
Zygo Blaxell	e970ac6c02	crawl: make logging less verbose Silence the three(!) log messages per crawl increment an extra one at the end of the subvol. The three critical messages per subvol crawl cycle are: Next transid in BeesCrawlState <SUBVOL>:0 offset 0x0 transid <A>..<B> started <T> (<AGO>s ago) Subvol has been completely scanned and a new transaction range will be created. CrawlState is the state of the old subvol. Restarted crawl BeesCrawlState <SUBVOL>:0 offset 0x0 transid <B>..<C> started <T+AGO> (0s ago) Subvol has been restarted. CRawlState is the state of the new subvol. Deferring next transid in BeesCrawlState <SUBVOL>:0 offset 0x0 transid <B>..<C> started <T+AGO> (0s ago) Subvol has been completely scanned, but it is too soon to start a new scan. Fix the "Restart..." message to use the correct verb tense and to use the correct BeesCrawlState data. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 13:50:47 -05:00
Zygo Blaxell	38ccf5c921	counters: track pair growing time When we find a matching block we attempt to extend ("grow") the matched pair around the first matching block. This function takes the IO hit of reading the second extent from each duplicate extent pair. It's also very slow--too many allocations, too small reads, reads in the wrong order, an order of magnitude too many calls to TREE_SEARCH_V2, and it is usually in the top 3 most frequent PERFORMANCE warnings. Start tracking the running time of grows using the pairforward_ms and pairbackward_ms counters so that we can compare it to various replacements. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-20 13:04:56 -05:00
Kai Krakow	826b27fde2	Makefile: Fix some dependencies Some deps are already referenced by depends.mk, some where actually missing. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-19 01:50:13 +01:00
Kai Krakow	8a5f790a03	Makefile: Some cleanups Reorder and reformat some arguments so it looks more streamlined during the build process. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-19 01:50:13 +01:00
Kai Krakow	677da5de45	Logging: Add log levels to output This commit adds log levels to the output. In systemd, it makes colored lines, otherwise it's probably just a number. Bees is very chatty, so this paves the road for log level filtering. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 23:41:29 +01:00
Kai Krakow	d6b847db0d	Makefile: speedup dependency generation Dependencies can be generated in parallel which can be much faster. It also puts away the problem that for may fail multiple times in a row and leaving behind a broken intermediate file which would be picked up by successive runs. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	b8f933d360	Makefile: do not be verbose about mv A small left-over from me fixing the same problem as Zygo did in his merged branch. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	27b12821ee	Makefile: Generalize the .version.cc target This enables us to move the file around later. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	fdf434e8eb	Makefile: fix dependency generation Let's generalize the depends.mk target so we can easily move files around later. While doing it, let's also fix the "gcc -M" call to use explicit target names and not clobber it with preprocessor output. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	bc1b67fde1	Makefile: rename OBJS to CRUCIBLE_OBJS This paves the way for building different .so libs. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	4cfd5b43da	Makefile: generalize .so target We can generalize the .so target by moving its depends into rules without build instructions. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	4789445d7b	Makefile: .o already depends on its .h file We can remove the explicit depend on the .h file because that is covered by depends.mk. Let's instead depend on makeflags which makes more sense. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Kai Krakow	c8787fecd2	Makefile: depends.mk is not an optional include We really need depends.mk in the following Makefile reorganization. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-18 22:53:00 +01:00
Zygo Blaxell	4943a07cce	crucible: cache: linked-list LRU implementation We need a better cache expiration algorithm than "make a copy of the entire thing, sort it while holding a lock, and delete half the items in a single burst." Replace the Lamport clock with a double-linked list. Each insert or lookup operation moves the affected item to the head of the list. Each erase operation deletes one single item at the tail of the list. Also sort out some iterator invalidation nonsense by doing erases before inserts instead of "insert, erase, find the inserted item again because we invalidated the found iterator during the erase." The new implementation adds a second word-sized member to each Value as well as a copy of the Key. Hopefully the enlarged size is not a deal-breaker. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:58:44 -05:00
Zygo Blaxell	00d9b8ed76	hash: do the mlock after loading the table The mlock runs much faster, probably because the hash fetches are doing most of the work that mlock does. It makes bees startup latency for testing smaller, even if it takes more time in absolute terms. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:58:44 -05:00
Zygo Blaxell	e8b4ab54c6	README: describe the scanning mode (-m option) Include a brief description of the two algorithms without getting into too much detail for an ostensibly temporary feature. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:58:44 -05:00
Zygo Blaxell	56c23c4517	crawl: implement two crawler algorithms and adjust scheduling parameters There are two subvol scan algorithms implemented so far. The two modes are unimaginatively named 0 and 1. 0: sorts extents by (inode, subvol, offset), 1: scans extents round-robin from all subvols. Algorithm 0 scans references to the same extent at close to the same time, which is good for performance; however, whenever a snapshot is created, the scan of the entire filesystem restarts at the beginning of the new snapshot. Algorithm 1 makes continuous forward progress even when new snapshots are created, but it does not benefit from caching and will force the kernel to reread data multiple times when there are snapshots. The algorithm can be selected at run-time using the -m or --scan-mode option. We can collect some field data on these before replacing them with an extent-tree-based scanner. Alternatively, for pre-4.14 kernels, we can keep these two modes as non-default options. Currently these algorithms have terrible names. TODO: fix that, but also TODO: delete all that code and do scans directly from the extent tree instead. Augment the scan algorithms relative to their earlier implementation by batching multiple extents to scan from each subvol before switching to a different subvol. Sprinkle some BEESNOTEs on the Task objects so that they don't disappear from the thread status output. Adjust some timing constants to deal with the increased latency from competing threads. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:53:49 -05:00
Zygo Blaxell	055c8d4c75	roots: scan in parallel using Tasks Distribute incoming extents across a thread pool for faster execution on multi-core, multi-disk environments. Switch extent enumeration model to scan extent refs consecutively(ish). Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:52:00 -05:00
Zygo Blaxell	090d79e13b	crucible: remove unused TimeQueue and WorkQueue classes WorkQueue is superceded by Task. TimeQueue will be replaced by something based on Tasks. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:52:00 -05:00
Zygo Blaxell	796aaed7f8	roots: remove dead code and #if blocks In both instances the code contained within (or the conditional compilation surrounding it) is no longer controversial. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:52:00 -05:00
Zygo Blaxell	8849e57bf0	crucible: add Task class We need a mechanism for distributing work across processor cores and disks. Task implements a simple FIFO/LIFO queue model for executing closures. Some locking primitives are included (mutex and barrier). Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:51:59 -05:00
Zygo Blaxell	844a488157	README: update dependencies and Linux kernel bugs list Bees will someday rely on features available only in kernel v4.14. Let's start now by removing workarounds for bugs that were fixed in v4.11. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:51:59 -05:00
Zygo Blaxell	a175ee0689	bees: clean up #if 0 ... fsync ... #endif code Remove some dead code because dedup-related deadlocks have not been observed since Linux kernel v4.11. Preserve rationale of remaining #if 0 block (why we do write/rename instead of write/fsync/rename) so that people don't try to replace the "missing" fsync() there. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	f376b8e90d	test: add -lpthread to Makefile This resolves missing symbol build errors. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	3da755713a	Makefiles: don't append to depends.mk.new Fixes errors such as: depends.mk:765: *** multiple target patterns. Stop. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	8d3a27bf85	subvol-threads: increase resource and thread limits With kernel 4.14 there is no sign of the previous LOGICAL_INO performance problems, so there seems to be no need to throttle threads using this ioctl. Increase the FD cache size limits and scan thread count. Let the kernel figure out scheduling. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	42a6053229	roots: remove open_root_cache correctly BEESNOTE puts a message on the status message stack. BEESINFO logs a message with rate limiting. The message that was flooding the logs was coming from BEESINFO not BEESNOTE. Fix earlier commit which removed the wrong message. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	c477618924	crucible: resource: optimize map cleanup We were holding weak refs until the next time the resource ID was used. This is a bad thing if resource IDs are sparse (e.g. pointers or hashes) because we'll never see an ID twice. To fix, determine whether we released the last instance of a resource, and if so, free its weak ref immediately. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:07 -05:00
Zygo Blaxell	35100c2b9e	crucible: resource: remove excess locking The bugs in other parts of the code have been identified and fixed, so the overprotective locks around shared_ptr can be removed. Keep the other improvements to the Resource class. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:06 -05:00
Zygo Blaxell	116f15ace5	lockset: drop unused method wait_unlock This function is not used and does not appear to be useful. Remove it. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-17 22:30:06 -05:00
Zygo Blaxell	8a68b5f20b	crucible: add cleanup class Store a function (or closure) in an instance and invoke the function from the destructor. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-15 11:07:48 -05:00
Kai Krakow	6d6aedd8ec	Makefile: Fail gracefully if markdown is not installed Previously, MARKDOWN may end up empty. This commit should fix it. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 21:25:12 +01:00
Kai Krakow	025b14f38f	Installation: Depend Gentoo ebuild on markdown Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 20:56:48 +01:00
Kai Krakow	dd6d8caaa2	Installation: Remove superfluous cruft from Gentoo ebuild Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 20:56:34 +01:00
Zygo Blaxell	4bfb637b0e	Merge remote-tracking branch 'nefelim4ag/master'	2018-01-10 23:43:00 -05:00
Zygo Blaxell	4aa5978a89	hash: reduce mutex contention using one mutex per hash table extent This avoids PERFORMANCE warnings when large hash tables are used on slow CPUs or with lots of worker threads. It also simplifies the code (no locksets, only one object-wide mutex instead of two). Fixed a few minor bugs along the way (e.g. we were not setting the dirty flag on the right hash table extent when we detected hash table errors). Simplified error handling: IO errors on the hash table are ignored, instead of throwing an exception into the function that tried to use the hash table. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-01-10 23:25:45 -05:00
Kai Krakow	365a913a26	Installation: Add Gentoo ebuild This commit adds an ebuild for Gentoo. Version 9999 is building live from current git, currently using kakra:integration because it has some installation and build fixes important for Gentoo. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 03:03:17 +01:00
Kai Krakow	634a1d0bf6	Installation: -fPIC should not be used unconditionally According to Gentoo packaging guide, -fPIC should only be used on shared libraries, and not added unconditionally to every linker call. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 02:30:12 +01:00
Kai Krakow	3a24cd3010	Installation: Fix soname QA warning in Gentoo Gentoo warns about libs missing a proper soname during QA phase. Let's fix this. Signed-off-by: Kai Krakow <kai@kaishome.de>	2018-01-11 02:30:12 +01:00

... 6 7 8 9 10 ...

566 Commits