krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-03-07 01:10:04 +02:00

Author	SHA1	Message	Date
Yann Collet	2949252923	fix minor conversion warnings	2025-02-05 17:01:19 -08:00
Yann Collet	e87d15938c	more %zu warnings fixes	2025-02-05 16:48:19 -08:00
Yann Collet	04a2a0219c	update type names naming convention: Type names should start with a Capital letter (after the prefix)	2024-12-29 14:25:33 -08:00
Yann Collet	433f4598ad	fixed minor conversion warnings on Visual	2024-10-23 11:50:56 -07:00
Yann Collet	80a912dec1	fixed zstreamtest	2024-10-23 11:50:56 -07:00
Yann Collet	61d08b0e42	fix test a margin of 4 is insufficient to guarantee compression success.	2024-10-17 09:37:23 -07:00
Dimitri Papadopoulos	2d736d9c50	Fix new typos found by codespell	2024-06-20 20:12:16 +02:00
Elliot Gorokhovsky	be6a182006	Unit test for external sequence producer + static CCtx + streaming (#4063 )	2024-06-03 12:42:27 -04:00
Elliot Gorokhovsky	7d970bd83c	Implement one-shot fallback for magicless format (#3971 )	2024-03-18 10:55:53 -04:00
Elliot Gorokhovsky	f65b9e27ce	Exercise ZSTD_findDecompressedSize() in the simple decompression fuzzer (#3959 ) * Improve decompression fuzzer * Fix legacy frame header fuzzer crash, add unit test	2024-03-12 17:07:06 -04:00
Elliot Gorokhovsky	c6cabf9441	Make offload API compatible with static CCtx (#3854 ) * Add ZSTD_CCtxParams_registerSequenceProducer() to public API * add unit test * add docs to zstd.h * nits * Add ZSTDLIB_STATIC_API prefix * Add asserts	2023-12-28 14:48:46 -05:00
Dimitri Papadopoulos	fe34776c20	Fix new typos found by codespell	2023-09-23 18:56:01 +02:00
Nick Terrell	61efb2a047	Add ZSTD_d_maxBlockSize parameter Reduces memory when blocks are guaranteed to be smaller than allowed by the format. This is useful for streaming compression in conjunction with ZSTD_c_maxBlockSize. This PR saves 2 * (formatMaxBlockSize - paramMaxBlockSize) when streaming. Once it is rebased on top of PR #3616 it will save 3 * (formatMaxBlockSize - paramMaxBlockSize).	2023-04-17 22:06:44 -07:00
Elliot Gorokhovsky	ff42ed1582	Rename "External Matchfinder" to "Block-Level Sequence Producer" (#3484 ) * change "external matchfinder" to "external sequence producer" * migrate contrib/ to new naming convention * fix contrib build * fix error message * update debug strings * fix def of invalid sequences in zstd.h * nit * update CHANGELOG * fix .gitignore	2023-02-09 17:01:17 -05:00
Elliot Gorokhovsky	64052ef57d	Guard against invalid sequences from external matchfinders (#3465 )	2023-01-31 13:55:48 -05:00
Danielle Rozenblit	66fae56c86	remove big test around large offset with small window size	2023-01-30 06:26:03 -08:00
Danielle Rozenblit	da589a134a	update CI	2023-01-27 14:18:29 -08:00
Danielle Rozenblit	9e4c66b9e9	record long offsets in ZSTD_symbolEncodingTypeStats_t + add test case	2023-01-27 12:04:29 -08:00
Nick Terrell	8957fef554	[huf] Add generic C versions of the fast decoding loops Add generic C versions of the fast decoding loops to serve architectures that don't have an assembly implementation. Also allow selecting the C decoding loop over the assembly decoding loop through a zstd decompression parameter `ZSTD_d_disableHuffmanAssembly`. I benchmarked on my Intel i9-9900K and my Macbook Air with an M1 processor. The benchmark command forces zstd to compress without any matches, using only literals compression, and measures only Huffman decompression speed: ``` zstd -b1e1 --compress-literals --zstd=tlen=131072 silesia.tar ``` The new fast decoding loops outperform the previous implementation uniformly, but don't beat the x86-64 assembly. Additionally, the fast C decoding loops suffer from the same stability problems that we've seen in the past, where the assembly version doesn't. So even though clang gets close to assembly on x86-64, it still has stability issues. \| Arch \| Function \| Compiler \| Default (MB/s) \| Assembly (MB/s) \| Fast (MB/s) \| \|---------\|----------------\|--------------\|----------------\|-----------------\|-------------\| \| x86-64 \| decompress 4X1 \| gcc-12.2.0 \| 1029.6 \| 1308.1 \| 1208.1 \| \| x86-64 \| decompress 4X1 \| clang-14.0.6 \| 1019.3 \| 1305.6 \| 1276.3 \| \| x86-64 \| decompress 4X2 \| gcc-12.2.0 \| 1348.5 \| 1657.0 \| 1374.1 \| \| x86-64 \| decompress 4X2 \| clang-14.0.6 \| 1027.6 \| 1659.9 \| 1468.1 \| \| aarch64 \| decompress 4X1 \| clang-12.0.5 \| 1081.0 \| N/A \| 1234.9 \| \| aarch64 \| decompress 4X2 \| clang-12.0.5 \| 1270.0 \| N/A \| 1516.6 \|	2023-01-25 13:47:51 -08:00
Danielle Rozenblit	7d600c628a	fix bound check for ZSTD_copySequencesToSeqStoreNoBlockDelim()	2023-01-24 06:40:40 -08:00
daniellerozenblit	9116000be6	Merge pull request #3439 from daniellerozenblit/sequence-validation-bug-fix Fix sequence validation and seqStore bounds check	2023-01-23 13:50:37 -05:00
Danielle Rozenblit	815d1d4eda	update external sequence error to fit error naming scheme	2023-01-23 09:58:34 -08:00
Danielle Rozenblit	1b65727e74	fix nits and add new error code for invalid external sequences	2023-01-23 07:59:02 -08:00
Nick Terrell	666944fbe6	Cap hashLog & chainLog to ensure that we only use 32 bits of hash * Cap shortCache chainLog to 24 * Cap row match finder hashLog so that rowLog <= 24 * Add unit tests to expose all cases. The row match finder unit tests are only run in 64-bit mode, because they allocate ~1GB. Fixes #3336	2023-01-20 14:05:26 -08:00
Danielle Rozenblit	aa385ece13	fix sequence validation and bounds check in ZSTD_copySequencesToSeqStore()	2023-01-20 10:32:35 -08:00
Elliot Gorokhovsky	bce0382c82	Bugfixes for the External Matchfinder API (#3433 ) * external matchfinder bugfixes + tests * small doc fix	2023-01-19 10:41:24 -05:00
daniellerozenblit	dc1c6cc5df	Merge pull request #3418 from daniellerozenblit/fuzz-max-block-size Fuzz on maxBlockSize	2023-01-19 08:18:04 -05:00
Danielle Rozenblit	06b096db47	additional tests and documentation updates + allow maxBlockSize to be set to 0 (goes to default)	2023-01-12 13:41:50 -08:00
Danielle Rozenblit	53eb5a758c	add simple test for maxBlockSize expected functionality	2023-01-12 08:55:39 -08:00
Yann Collet	8b130009e3	minor simplification refactoring for timefn `UTIL_getSpanTimeMicro()` can be factored in a generic way, reducing OS-dependent code.	2023-01-06 16:12:54 -08:00
Danielle Rozenblit	908e812733	initial commit	2023-01-04 13:01:54 -08:00
Elliot Gorokhovsky	2a402626dd	External matchfinder API (#3333 ) * First building commit with sample matchfinder * Set up ZSTD_externalMatchCtx struct * move seqBuffer to ZSTD_Sequence* * support non-contiguous dictionary * clean up parens * add clearExternalMatchfinder, handle allocation errors * Add useExternalMatchfinder cParam * validate useExternalMatchfinder cParam * Disable LDM + external matchfinder * Check for static CCtx * Validate mState and mStateDestructor * Improve LDM check to cover both branches * Error API with optional fallback * handle RLE properly for external matchfinder * nit * Move to a CDict-like model for resource ownership * Add hidden useExternalMatchfinder bool to CCtx_params_s * Eliminate malloc, move to cwksp allocation * Handle CCtx reset properly * Ensure seqStore has enough space for external sequences * fix capitalization * Add DEBUGLOG statements * Add compressionLevel param to matchfinder API * fix c99 issues and add a param combination error code * nits * Test external matchfinder API * C90 compat for simpleExternalMatchFinder * Fix some @nocommits and an ASAN bug * nit * nit * nits * forward declare copySequencesToSeqStore functions in zstd_compress_internal.h * nit * nit * nits * Update copyright headers * Fix CMake zstreamtest build * Fix copyright headers (again) * typo * Add externalMatchfinder demo program to make contrib * Reduce memory consumption for small blockSize * ZSTD_postProcessExternalMatchFinderResult nits * test sum(matchlen) + sum(litlen) == srcSize in debug builds * refExternalMatchFinder -> registerExternalMatchFinder * C90 nit * zstreamtest nits * contrib nits * contrib nits * allow block splitter + external matchfinder, refactor * add windowSize param * add contrib/externalMatchfinder/README.md * docs * go back to old RLE heuristic because of the first block issue * fix initializer element is not a constant expression * ref contrib from zstd.h * extremely pedantic compiler warning fix, meson fix, typo fix * Additional docs on API limitations * minor nits * Refactor maxNbSeq calculation into a helper function * Fix copyright	2022-12-28 16:45:14 -05:00
W. Felix Handte	5d693cc38c	Coalesce Almost All Copyright Notices to Standard Phrasing ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora -o -path ./tests/regression/data-cache -o -path ./tests/regression/cache $ -prune -o -type f); do sed -i '/Copyright .* $Yann Collet$\\|$Meta Platforms$/ s/Copyright ./Copyright (c) Meta Platforms, Inc. and affiliates./' $f; done git checkout HEAD -- build/VS2010/libzstd-dll/libzstd-dll.rc build/VS2010/zstd/zstd.rc tests/test-license.py contrib/linux-kernel/test/include/linux/xxhash.h examples/streaming_compression_thread_pool.c lib/legacy/zstd_v0.c lib/legacy/zstd_v0*.h nano ./programs/windres/zstd.rc nano ./build/VS2010/zstd/zstd.rc nano ./build/VS2010/libzstd-dll/libzstd-dll.rc ```	2022-12-20 12:52:34 -05:00
W. Felix Handte	8927f985ff	Update Copyright Headers 'Facebook' -> 'Meta Platforms' ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora $ -prune -o -type f); do sed -i 's/Facebook, Inc\./Meta Platforms, Inc. and affiliates./' $f; done ```	2022-12-20 12:37:57 -05:00
Nick Terrell	a91e7ec175	Fix corruption that rarely occurs in 32-bit mode with wlog=25 Fix an off-by-one error in the compressor that emits corrupt blocks if: * Zstd is compiled in 32-bit mode * The windowLog == 25 exactly * An offset of 2^25-3, 2^25-2, 2^25-1, or 2^25 is emitted * The bitstream had 7 bits leftover before writing the offset This bug has been present since before v1.0, but wasn't able to easily be triggered, since until somewhat recently zstd wasn't able to find matches that were within 128KB of the window size. Add a test case, and fix 2 bugs in `ZSTD_compressSequences()`: * The `ZSTD_isRLE()` check was incorrect. It wouldn't produce corruption, but it could waste CPU and not emit RLE even if the block was RLE * One windowSize was `1 << windowLog`, not `1u << windowLog` Thanks to @tansy for finding the issue, and giving us a reproducer! Fixes Issue #3350.	2022-12-15 14:41:50 -08:00
Elliot Gorokhovsky	3720910d06	Fix fuzzer failure	2022-11-21 16:09:04 -05:00
Yonatan Komornik	21bd8c3b3c	Removed unused variable (#3272 )	2022-09-22 08:20:46 -07:00
Danielle Rozenblit	a06e953db9	some additional comments, remove apt-get from clang jobs, better test titles	2022-09-08 18:30:07 -07:00
Danielle Rozenblit	282a955d33	added test that exposes zero offset to null pointer error when built with clang	2022-09-07 08:58:08 -07:00
Danielle Rozenblit	69022ad886	null decompress buffer test and ubsan flag added	2022-09-06 14:34:55 -07:00
Yann Collet	91aeade735	Streaming decompression can detect incorrect header ID sooner Streaming decompression used to wait for a minimum of 5 bytes before attempting decoding. This meant that, in the case that only a few bytes (<5) were provided, and assuming these bytes are incorrect, there would be no error reported. The streaming API would simply request more data, waiting for at least 5 bytes. This PR makes it possible to detect incorrect Frame IDs as soon as the first byte is provided. Fix #3169	2022-06-21 23:09:03 -07:00
Yann Collet	f2d9652ad8	more usage of new error code stabilityCondition_notRespected as suggested by @terrelln	2022-01-26 18:30:55 -08:00
Yann Collet	dda4c10f07	added ZSTD_compressStream2() + ZSTD_c_stableInBuffer test	2022-01-26 13:33:04 -08:00
Yann Collet	af3d9c506e	added streaming test starting from non-0 pos	2022-01-26 10:31:25 -08:00
Yann Collet	c1668a00d2	fix extended case combining stableInBuffer with continue() and flush() modes	2022-01-26 10:31:25 -08:00
Yann Collet	270f9bf005	better consistency in accessing @input as suggested by @terrelln. Also : commented zstreamtest more to ensure ZSTD_stableInBuffer is tested/	2022-01-26 10:31:24 -08:00
Yann Collet	27d336b099	minor behavior refinements specifically, there is no obligation to start streaming compression with pos=0. stableSrc mode is now compatible with this setup.	2022-01-26 10:31:24 -08:00
Yann Collet	37b87add7a	make stableSrc compatible with regular streaming API including flushStream(). Now the only condition is for `input.size` to continuously grow.	2022-01-26 10:31:24 -08:00
Nick Terrell	91f5891dd0	[CircleCI] Fix short-tests-0 short-tests-0 were silently failing. I think because of the && make clean construction. Switch to ; instead. Also fix all the test failures that were exposed. `make all` is failing on CircleCI because it is missing Docker. Move that test to GitHub actions, and switch the pedantic CircleCI test to `make allmost`.	2021-12-01 17:43:46 -08:00
Dimitris Apostolou	ebbd675998	Fix typos	2021-11-13 10:04:04 +02:00

1 2 3 4 5 ...

291 Commits