GGLinnk/bees - bees - Virtual World Git

mirror of https://github.com/Zygo/bees.git synced 2026-01-08 20:00:22 +01:00

Author	SHA1	Message	Date
Zygo Blaxell	e9e6870de8	fs: add btrfs_inode_flags_ntoa Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-07-21 21:21:54 -04:00
Zygo Blaxell	a3c0ba0d69	fs: add a runtime debug stream for btrfs tree searches This allows plugging in an ostream at run time so that we can audit all the search calls we are doing. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	c4ba6ec269	fs: add a ntoa function for chunk types Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	925b12823e	fs: add do_ioctl_nothrow and fsid methods to btrfs fs info Enable use of the ioctl to probe whether two fds refer to the same btrfs, without throwing an exception. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2025-02-06 22:42:15 -05:00
Zygo Blaxell	1dd96f20c6	fs: drop extra declaration of hexdump hexdump was moved into a template in its own header years ago, but the declaration of the implementation that used to be in fs.cc remains. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-12-04 11:17:44 -05:00
Zygo Blaxell	099ad2ce7c	fs: add some performance metrics for TREE_SEARCH_V2 calls These give some visibility into how efficiently bees is using the TREE_SEARCH_V2 ioctl. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2024-11-30 23:30:33 -05:00
Zygo Blaxell	7c764a73c8	fs: allow BtrfsIoctlLogicalInoArgs to be reused, remove virtual methods Some malloc implementations will try to mmap() and munmap() large buffers every time they are used, causing a severe loss of performance. Nothing ever overrode the virtual methods, and there was no virtual destructor, so they cause compiler warnings at build time when used with a template that tries to delete pointers to them. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-02-23 22:40:12 -05:00
Zygo Blaxell	bd336e81a6	fs: get rid of base class btrfs_ioctl_logical_ino_args Another instance of the pattern where we derived a crucible class from a btrfs struct. Make it an automatic variable instead. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-27 22:16:02 -05:00
Zygo Blaxell	ea17c89165	fs: remove duplicate BTRFS_COMPRESS_ definitions This was fixed in `7f660f50b` lib: fs: stop using libbtrfs-dev helper functions to re-enable buffer length checks but apparently some copies live on. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-27 22:16:02 -05:00
Zygo Blaxell	cb2c20ccc9	fs: get rid of base class btrfs_ioctl_same_extent_info We only use BtrfsExtentInfo when it's exactly equivalent to the base, so drop the derived class. While we're here, fix BtrfsExtentSame::add so it uses a btrfs-compatible uint64_t instead of an off_t. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2023-01-05 01:10:17 -05:00
Zygo Blaxell	30ece57116	fs: export btrfs_compress_type_ntoa We already had a function that was _similar_, so add decoding for compress type NONE, give it a less specific name, and declare it in fs.h. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-12-20 20:50:54 -05:00
Zygo Blaxell	5953ea6d3c	fs: update btrfs compatibility header: add csum types, BTRFS_FS_INFO_FLAG_GENERATION and _METADATA_UUID I guess this means it's "args_v3" now? Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-10-25 12:56:16 -04:00
Zygo Blaxell	972721016b	fs: get rid of base class fiemap Yet another build failure of the form: error: flexible array member fiemap... not at end of struct crucible::Fiemap... bees doesn't use fiemap any more, so the fixes here are minimal changes to make it build, not shining examples of C++ class design. Signer-off-by: Zygo Blaxell <bees@furryterror.org>	2022-10-25 12:56:16 -04:00
Zygo Blaxell	5040303f50	fs: get rid of base class btrfs_data_container This fixes another build failure of the form: error: flexible array member btrfs_... not at end of struct crucible::Btrfs... Fixes: https://github.com/Zygo/bees/issues/236 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-10-23 22:42:57 -04:00
Zygo Blaxell	14ce81c081	fs: get rid of silly base class that causes build failures now The base class thing was an ugly way to get around the lack of C99 compound literals in C++, and also to make the bare ioctls usable with the derived classes. Today, both clang and gcc have C99 compound literals, so there's no need to do crazy things with memset. We never used the derived classes for ioctls, and for this specific ioctl it would have been a very, very bad idea, so there's no need to support that either. We do need to jump through hoops for ostream& operator<<() but we had to do those anyway as there are other members in the derived type. So we can simply drop the base class, and build the args object on the stack in `do_ioctl`. This also removes the need to verify initialization. There's no bug here since the `info` member of the base class was never used in place by the derived class, but new compilers reject the flexible array member in the base class because the derived class makes `info` be not at the end of the struct any more: error: flexible array member btrfs_ioctl_same_args::info not at end of struct crucible::BtrfsExtentSame Fixes: https://github.com/Zygo/bees/issues/232 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2022-10-09 20:39:15 -04:00
Zygo Blaxell	fcd847bbf9	fs: add an item type parameter to next_min When we are searching the btrfs metadata trees, we usually want only one type of item. If the last item in a search result is not of the desired type, we can restart the search at the next possible key with that item type, potentially skipping over some uninteresting items we would otherwise have to fetch, process, and discard. Also remove a bug in the previous next_min code that would skip over items if the offset overflowed and the next objectid in the tree had a lower item type number than the previous objectid. This doesn't seem to be a bug that has ever happened, as it would require a file to roll over in the offset field. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:56:04 -04:00
Zygo Blaxell	bba6f4f183	fs: convert vector<uint8_t> and Spanner to ByteVector and rewrite TREE_SEARCH_V2 wrapper Switch various methods in fs to use ByteVector to cut down on the number of slow allocations and copies. Automatically determine the correct size for TREE_SEARCH_V2 buffers based on the number of items requested, and grow the buffer as needed. This eliminates the need to cache some objects that were heavy to create. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	ba1f3b93e4	fs: drop virtual do_ioctl methods for btrfs_ioctl_search_key These were never used, and they make the object very slightly heavier. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-31 19:42:01 -04:00
Zygo Blaxell	12e80658a8	fs: fix FIEMAP_MAX_OFFSET type silliness in fiemap.h In fiemap.h the members of struct fiemap are declared as __u64, but the FIEMAP_MAX_OFFSET macro is an unsigned long long value: $ grep FIEMAP_MAX_OFFSET -r /usr/include/ /usr/include/linux/fiemap.h:#define FIEMAP_MAX_OFFSET (~0ULL) $ grep fe_length -r /usr/include/ /usr/include/linux/fiemap.h: __u64 fe_length; /* length in bytes for this extent */ This results in a type mismatch error on architectures like ppc64le: fiemap.cc:31:35: note: deduced conflicting types for parameter 'const _Tp' ('long unsigned int' and 'long long unsigned int') 31 \| fm.fm_length = min(fm.fm_length, FIEMAP_MAX_OFFSET - fm.fm_start); \| ~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ Work around this by copying the macro into a uint64_t constant, and not using the macro any more. Fixes: https://github.com/Zygo/bees/issues/194 Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-10-06 15:17:02 -04:00
Zygo Blaxell	bcf3e7de3e	uuid: drop dependency on uuid.h The weird things distros do to the path where uuid.h gets installed have broken bees builds for the last time. We were only using uuid to support a legacy feature that was removed over four years ago. Hypothetical users who are upgrading directly from bees v0.1 should probably restart all the crawlers anyway--there were bugs. Also, if any such users exist, I respect their tremendous patience with the horrible performance all these years--bees got about 30x faster since v0.1. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-04-23 08:16:50 -04:00
Zygo Blaxell	7f660f50b8	lib: fs: stop using libbtrfs-dev helper functions to re-enable buffer length checks The Linux kernel's btrfs headers are better than the libbtrfs-dev headers: - the libbtrfs-dev headers have C++ language compatibility issues - upstream version in Linux kernel is more accurate and up to date - macros in libbtrfs-dev's ctree.h hide information that would enable bees to perform runtime buffer length checking - enum types whose presence cannot be detected with #ifdef When accessing members of metadata items from the filesystem, we want to verify that the member we are accessing is within the boundaries of the item that was retrieved; otherwise, a memory access violation may occur or garbage may be returned to the caller. A simple C++ template, given a pointer to a structure member and a buffer, can determine that the buffer contains enough bytes to safely access a struct member. This was implemented back in 2016, but left unused due to ctree.h issues. Some btrfs metadata structures have variable length despite using a fixed-size in-memory structure. The members that appear earliest in the structure contain information about which following members of the structure are used. The item stored in the filesystem is truncated after the last used member, and all following members must not be accessed. 'btrfs_stack_*' accessor macros obscure the memory boundaries of the members they access, which makes it impossible for a C++ template to verify the memory access. If the template checks the length of the entire structure, it will find an access violation for variable-length metadata items because the item is rarely large enough for the entire structure. Get rid of all the libbtrfs-dev accessor macros and reimplement them with the necessary buffer length checks. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2021-02-22 20:06:43 -05:00
Zygo Blaxell	c0149d72b7	fs: use Spanner to refer to ioctl arg buffer instead of making vector copies This avoids some allocations and copying. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 18:07:36 -05:00
Zygo Blaxell	9ca69bb7ff	fs: remove buffer overrun check in get_struct_ptr for non-copying containers When we are using non-copying containers, we can't call resize() on them. get_struct_ptr is essentially a pointer cast, so we will end up with a pointer to a struct that extends beyond the boundaries of the container. As long as the btrfs metadata is not corrupted, we should not have too many problems. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 18:07:36 -05:00
Zygo Blaxell	f45e379802	fs: deprecate vector<char> Use uint8_t when we mean uint8_t, i.e. vector<uint8_t> instead of vector<char>. Add a template parameter instead of vector so we can swap in a non-copying data type. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 18:07:36 -05:00
Zygo Blaxell	180bb60cde	fs: add support and workarounds for btrfs fs_info v2 Define a local copy of the header that has fields for the csum type and length, so we can build in places that haven't caught up to kernel 5.5 headers yet. The reason why the csum type and length are not unconditionally filled in eludes me. csum_length is necessarily non-zero, and the cost of the conditional is worse than the cost of the copy, so the whole flags dance is a WTF...but it's part of the kernel API now, so it's too late to NAK it. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 18:07:36 -05:00
Zygo Blaxell	459071597b	fs: make operator<() for search ioctl inline Perf blames this operator for >1% of instructions with -O2, and 70% of instructions without -O2. Let the compiler inline the function. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2020-12-17 17:54:51 -05:00
Zygo Blaxell	87e8a21c41	fs: do not emulate extent-same by clone It is not possible to emulate extent-same by clone in a safe way. EXTENT_SAME has been supported in btrfs since kernel 3.13, which is much too old to contemplate running bees on. Remove this dangerous and unused function. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2019-06-12 22:48:06 -04:00
Zygo Blaxell	a676928ed5	fs: remove thread_local storage If we are not zero-filling containers then the overhead of allocating them on each use is negligible. The effect that the thread_local containers were having on RAM usage was very non-negligible. Use dynamic containers (members or stack objects) for better control of object lifetimes and much lower peak RAM usage. They're a tiny bit faster, too. Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-11-08 23:55:13 -05:00
Zygo Blaxell	8cbd6fc67a	fs: support LOGICAL_INO_V2 Automatically fall back to LOGICAL_INO if LOGICAL_INO_V2 fails and no _V2 flags are used. Add methods to set the flags argument with build portability to older headers. Use thread_local storage for the somewhat large buffers used by LOGICAL_INO_V2 (and other users of BtrfsDataContainer like INO_PATHS). Signed-off-by: Zygo Blaxell <bees@furryterror.org>	2018-11-05 21:12:36 -05:00
Timofey Titovets	80e4302958	Update btrfs compression types, add ZSTD, drop LAST Signed-off-by: Timofey Titovets <nefelim4ag@gmail.com>	2018-01-04 20:32:04 +03:00
Zygo Blaxell	e835e8766e	crucible: use set instead of vector in BtrfsExtentWalker This gets rid of some more big memsets. It may replace them with a lot of tiny mallocs, though. If this turns out to be a bad idea then at least we can easily revert the change.	2016-12-13 21:46:41 -05:00
Zygo Blaxell	7782b79e4b	crucible: reduce buffer size and CPU overhead for BtrfsIoctlSearchKey We really do need some large buffers for BtrfsIoctlSearchKey in some cases, but we don't need to zero them out first. Don't do that so we save some CPU. Reduce the default buffer size to 4K because most BISK users don't get need much more than 1K. Set the buffer size explicitly to the product of the number of items and the desired item size in the places that really need a lot of items.	2016-12-13 21:46:35 -05:00
Zygo Blaxell	ec9d4a1d15	crucible: fs: use a much smaller default search buffer size It turns out we never use a value for m_buf_size that isn't the default, and we also never ask for more than a few thousand items; however, we do spend a ton of time memsetting the huge buffer to zero. I don't know what the ideal size is, but 16K is a far better guess than 1MB. Let's reduce it for some immediate CPU benefit, and determine what the size should be later. Reported at https://github.com/Zygo/bees/issues/11	2016-12-11 13:24:44 -05:00
Zygo Blaxell	38bb70f5d0	build: OK, maybe 32-bit machines could work I accidentally did a pre-push verification on a 32-bit build host. There were a surprisingly small number of problems, so fix them. Bees now builds on a 32-bit host. Let's not update README just yet, though: the 32-bit ioctl support fails immediately after startup on a 64-bit kernel.	2016-11-26 02:06:28 -05:00
Zygo Blaxell	cca0ee26a8	bees: remove local cruft, throw at github	2016-11-17 12:12:13 -05:00

35 Commits