krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-03-07 01:10:04 +02:00

Author	SHA1	Message	Date
Yann Collet	796699c0bc	fix root cause of #3416 A minor change in 5434de0 changed a `<=` into a `<`, and as an indirect consequence allowed compression attempt of literals when there are only 6 literals to compress (previous limit was effectively 7 literals). This is not in itself a problem, as the threshold is merely an heuristic, but it emerged a bug that has always been there, and was just never triggered so far due to the previous limit. This bug would make the literal compressor believes that all literals are the same symbol, but for the exact case where nbLiterals==6, plus a pretty wild combination of other limit conditions, this outcome could be false, resulting in data corruption. Replaced the blind heuristic by an actual test for all limit cases, so that even if the threshold is changed again in the future, the detection of RLE mode will remain reliable.	2023-01-12 15:41:08 -08:00
Yann Collet	089b2797e3	Merge pull request #3398 from facebook/fix3316 spec update : require minimum nb of literals for 4-streams mode	2022-12-22 16:57:05 -08:00
Yann Collet	6a9c525903	spec update : require minimum nb of literals for 4-streams mode Reported by @shulib : the specification for 4-streams mode doesn't work when the amount of literals to compress is 5 bytes. Extending it, it also doesn't work for sizes 1 or 2. This patch updates the specification and the implementation to require a minimum of 6 literals to trigger or accept the 4-streams mode. The impact is expected to be a no-op : the 4-streams mode is never triggered for such small quantity of literals anyway, since it would be wasteful (it costs ~7.3 bytes more than single-stream mode). An informal lower limit is set at ~256 bytes, so the technical minimum is very far from this limit. This is just meant for completeness of the specification.	2022-12-22 16:14:34 -08:00
Yann Collet	ea2895cef4	Support decompression of compressed blocks of size ZSTD_BLOCKSIZE_MAX exactly	2022-12-22 12:40:27 -08:00
W. Felix Handte	5d693cc38c	Coalesce Almost All Copyright Notices to Standard Phrasing ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora -o -path ./tests/regression/data-cache -o -path ./tests/regression/cache $ -prune -o -type f); do sed -i '/Copyright .* $Yann Collet$\\|$Meta Platforms$/ s/Copyright ./Copyright (c) Meta Platforms, Inc. and affiliates./' $f; done git checkout HEAD -- build/VS2010/libzstd-dll/libzstd-dll.rc build/VS2010/zstd/zstd.rc tests/test-license.py contrib/linux-kernel/test/include/linux/xxhash.h examples/streaming_compression_thread_pool.c lib/legacy/zstd_v0.c lib/legacy/zstd_v0*.h nano ./programs/windres/zstd.rc nano ./build/VS2010/zstd/zstd.rc nano ./build/VS2010/libzstd-dll/libzstd-dll.rc ```	2022-12-20 12:52:34 -05:00
W. Felix Handte	8927f985ff	Update Copyright Headers 'Facebook' -> 'Meta Platforms' ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora $ -prune -o -type f); do sed -i 's/Facebook, Inc\./Meta Platforms, Inc. and affiliates./' $f; done ```	2022-12-20 12:37:57 -05:00
Yann Collet	ea24b88667	decompressBound() tests fixed an overflow in an intermediate result on 32-bit platform. Checked that the new test catch this bug in 32-bit mode.	2022-12-16 15:43:26 -08:00
Nick Terrell	f31b83ff34	[decompress] Fix nullptr addition & improve fuzzer Fix an instance of `NULL + 0` in `ZSTD_decompressStream()`. Also, improve our `stream_decompress` fuzzer to pass `NULL` in/out buffers to `ZSTD_decompressStream()`, and fix 2 issues that were immediately surfaced. Fixes #3351	2022-12-14 17:54:22 -08:00
Elliot Gorokhovsky	c43da3d605	Fix C90 compat	2022-12-13 18:01:32 -08:00
W. Felix Handte	d7841d150b	Make ZSTD_getDictID_fromDDict() Read DictID from DDict Currently this function actually reads the dict ID from the dictionary's header, via `ZSTD_getDictID_fromDict()`. But during decompression the decomp- ressor actually compares the dict ID in the frame header with the dict ID in the DDict. Now of course the dict ID in the dictionary contents and the dict ID in the DDict struct should be the same. But in cases of memory corrupt- ion, where they can drift out of sync, it's misleading for this function to read it again from the dict buffer rather then return the dict ID that will actually be used. Also doing it this way avoids rechecking the magic and so on and so it is a tiny bit more efficient.	2022-10-14 22:53:03 -04:00
Danielle Rozenblit	e46b12e1b4	fix indentation	2022-09-12 18:56:59 -07:00
Danielle Rozenblit	a1d89424c2	fuzzer error fix	2022-09-12 11:53:37 -07:00
Danielle Rozenblit	a06e953db9	some additional comments, remove apt-get from clang jobs, better test titles	2022-09-08 18:30:07 -07:00
Danielle Rozenblit	3d7f9a90df	skip flush operation in case where op is NULL	2022-09-08 13:53:13 -07:00
Danielle Rozenblit	f3ddaaddd6	ternary operator instead of if statement	2022-09-08 12:59:49 -07:00
Danielle Rozenblit	028842788b	fix zero offset to nullpointer errors	2022-09-07 17:52:26 -07:00
Nick Terrell	a70ca2bd7d	Fix off-by-one error in superblock mode (#3221 ) Fixes #3212. Long literal and match lengths had an off-by-one error in ZSTD_getSequenceLength. Fix the off-by-one error, and add a golden compression test that catches the bug. Also run all the golden tests in the cli-tests framework.	2022-08-03 11:28:39 -07:00
Jun He	ec5fdcde19	lib: add hint to generate more pipeline friendly code (#3138 ) With statistic data of test data files of silesia the chance of position beyond highThreshold is very low (~1.3%@L8 in most cases, all <2.5%), and is in "lowprob area". Add the branch hint so compiler can get better pipiline codegen. With this change it is observed ~1% of mozilla and xml, and slight (0.3%~0.8%) but consistent uplift on other files on Arm N1. Signed-off-by: Jun He <jun.he@arm.com> Change-Id: Id9ba1d5c767e975290b5c1bf0ecce906544f4ade	2022-07-29 10:28:04 -07:00
Jun He	558cf20d0d	decomp: add prefetch for matched seq on aarch64 (#3164 ) match is used for following sequence copy. It is only updated when extDict is needed, which is a low probability case. So it can be prefetched to reduce cache miss. The benchmarks on various Arm platforms showed uplift from 1% ~ 14% with gcc-11/clang-14. Signed-off-by: Jun He <jun.he@arm.com> Change-Id: If201af4799d2455d74c79f8387404439d7f684ae	2022-07-29 10:27:20 -07:00
udayanbapat	43f21a600e	Intial commit to address 3090. Added support to decompress empty block. (#3118 ) * Intial commit to address 3090. Added support to decompress empty block * Update zstd_decompress_block.c Addressed review comments for the case of 'set_basic' * Update lib/decompress/zstd_decompress_block.c Co-authored-by: Nick Terrell <nickrterrell@gmail.com> * Update lib/decompress/zstd_decompress_block.c Co-authored-by: Nick Terrell <nickrterrell@gmail.com> Co-authored-by: Nick Terrell <nickrterrell@gmail.com>	2022-07-14 11:54:34 -07:00
Yann Collet	91aeade735	Streaming decompression can detect incorrect header ID sooner Streaming decompression used to wait for a minimum of 5 bytes before attempting decoding. This meant that, in the case that only a few bytes (<5) were provided, and assuming these bytes are incorrect, there would be no error reported. The streaming API would simply request more data, waiting for at least 5 bytes. This PR makes it possible to detect incorrect Frame IDs as soon as the first byte is provided. Fix #3169	2022-06-21 23:09:03 -07:00
Jun He	2491c65937	dec: adjust seqSymbol load on aarch64 ZSTD_seqSymbol is a structure with total of 64 bits wide. So it can be loaded in one operation and extract its fields by simply shifting or extracting on aarch64. GCC doesn't recognize this and generates more unnecessary ldr/ldrb/ldrh operations that cause performance drop. With this change it is observed 2~4% uplift of silesia and 2.5~6% of cantrbry @L8 on Arm N1. Signed-off-by: Jun He <jun.he@arm.com> Change-Id: I7748909204cf78a17eb9d4f2333692d53239daa8	2022-05-30 22:01:38 +08:00
Dominique Pelle	b772f53952	Typo and grammar fixes	2022-03-12 08:58:04 +01:00
Elliot Gorokhovsky	529cd7b821	Fix nits	2022-02-14 14:24:50 -05:00
Elliot Gorokhovsky	db2f4a6532	Move bitwise builtins into bits.h	2022-02-14 11:16:03 -05:00
Yann Collet	2d154e627a	renamed HufLog into ZSTD_HUFFDTABLE_CAPACITY_LOG old name was not descriptive and actually misleading	2022-01-26 14:47:24 -08:00
Yann Collet	32a5d95dcb	moved HufLog to lib/decompress it's only used to size decompression tables	2022-01-26 14:47:24 -08:00
Yann Collet	7616e39f3b	adding traces to better track processing of literals	2022-01-26 14:47:21 -08:00
Wojciech Muła	e74ca7979e	Simplify HUF_decompress4X2_usingDTable_internal_bmi2_asm_loop Get rid of three divisions. The original expression was: opmin := min((oend0 - op0) / 10, (oend1 - op1) / 10, (oend2 - op2) / 10, (oend3 - op3) / 10) r15 := min(r15, opmin) The division by 10 can be moved outside the `min`: opmin := min(oend0 - op0, oend1 - op1, oend2 - op2, oend3 - op3) r15 := min(r15, opmin/10)	2022-01-19 18:38:46 +01:00
H.J. Lu	568c69a4eb	x86-64: Hide internal assembly functions Hide x86-64 internal assembly functions. Before $ nm -D lib/libzstd.so.1 \| grep usingDTable_internal_bmi2_asm_loop 00000000000c23c0 T _HUF_decompress4X1_usingDTable_internal_bmi2_asm_loop 00000000000c23c0 T HUF_decompress4X1_usingDTable_internal_bmi2_asm_loop 00000000000c283d T _HUF_decompress4X2_usingDTable_internal_bmi2_asm_loop 00000000000c283d T HUF_decompress4X2_usingDTable_internal_bmi2_asm_loop $ After $ nm -D lib/libzstd.so.1 \| grep usingDTable_internal_bmi2_asm_loop $ This fixes issue #2990.	2022-01-11 10:12:24 -08:00
Nick Terrell	c7b03c217c	[license] Fix license header of huf_decompress_amd64.S * Add the license header for `huf_decompress_amd64.S` * Add `.S` files to the `test-license.py` test	2022-01-07 09:35:27 -08:00
W. Felix Handte	ef1f9e80ff	Restrict `GNU-stack` Note to GNU Assemblers	2022-01-05 16:03:32 -05:00
W. Felix Handte	b12edddb37	Write `GNU-stack` Section on All ELF Architectures Previously we did this only on Linux, which missed other Unices.	2022-01-05 15:44:40 -05:00
W. Felix Handte	9a9d1ec6f4	Mark Huffman Decoder Assembly `noexecstack` on All Architectures Apparently, even when the assembly file is empty (because `ZSTD_ENABLE_ASM_X86_64_BMI2` is false), it still is marked as possibly needing an executable stack and so the whole library is marked as such. This commit applies a simple patch for this problem by moving the noexecstack indication outside the macro guard. This commit builds on #2857. This commit addresses #2963.	2021-12-29 17:47:12 -08:00
Norbert Lange	2fbb1d10c1	Reduce bit tables to 8bit This saves some 1.7Kb in rodata section (x86_64, zstd tool), while assembler code stays the same except the type of a few load/extend instructions. Should not have negative performance implications.	2021-12-14 23:47:57 +01:00
Yann Collet	e1ab2200ff	fixed x32 compatibility	2021-12-10 21:02:17 -08:00
Nick Terrell	c284569457	[asm] Share portability macros and restrict ASM further Move portability macros to `lib/common/portability_macros.h`. This file only contains platform/feature detection (e.g. 0/1 macros). This file is shared between C and ASM code, so it cannot include any C code. Rename `HUF_` ASM macros to be `ZSTD_` prefixed, and move to the new header. Restrict `ZSTD_ASM_SUPPORTED` to `__GNUC__`, because we need the GAS assembler. Finally, only include the ASM code if we are actually going to use it. This disables it on all Windows platforms, which should resolve the problem brought up in Issue #2789.	2021-12-02 16:58:04 -08:00
Nick Terrell	91f5891dd0	[CircleCI] Fix short-tests-0 short-tests-0 were silently failing. I think because of the && make clean construction. Switch to ; instead. Also fix all the test failures that were exposed. `make all` is failing on CircleCI because it is missing Docker. Move that test to GitHub actions, and switch the pedantic CircleCI test to `make allmost`.	2021-12-01 17:43:46 -08:00
Nick Terrell	5414dd7978	[bmi2] Add lzcnt and bmi target attributes * When dynamic dispatching to bmi2 add lzcnt and bmi to the TARGET_ATTRIBUTE. * Centralize the bmi2 TARGET_ATTRIBUTE definition to BMI2_TARGET_ATTRIBUTE so we can change it in the future. * Only enable bmi2 when both bmi1 & bmi2 are supported. There shouldn't be any cases where bmi2 is supported but bmi1 isn't. But, since we are using the instruction we should check bmi1 as well.	2021-11-30 17:54:56 -08:00
Yann Collet	a37a8df532	Merge pull request #2856 from rex4539/typos Fix typos	2021-11-17 13:04:30 -08:00
ko-zu	c67e07f34e	Remove executable flag from GNU_STACK section Putting stack marking into every assembly files is required to indicate that the stack does not need to be executable. Executable flag on stack conflicts with some security measures, Systemd MemoryDenyWriteExecute=yes for example.	2021-11-13 22:58:33 +09:00
Dimitris Apostolou	ebbd675998	Fix typos	2021-11-13 10:04:04 +02:00
Nick Terrell	d46995efeb	Backport zstd patch from LKML Credit to Nathan Chancellor for the bug fix and Nick Desaulniers for the bug report. Link: https://github.com/ClangBuiltLinux/linux/issues/1486 Link: https://lore.kernel.org/all/20211021202353.2356400-1-nathan@kernel.org/	2021-11-05 14:09:49 -07:00
binhdvo	04734ee84a	Fix oss fuzz test error (#2837 )	2021-10-29 10:29:50 -04:00
Yann Collet	518f06b281	added minimum for decoder buffer also : introduced macro BOUNDED()	2021-10-26 08:21:31 -07:00
Yann Collet	02be2a830f	build macro ZSTD_DECODER_INTERNAL_BUFFER just to make the topic more accessible for potential users.	2021-10-25 08:09:04 -07:00
binhdvo	6a7ede3dfc	Reduce size of dctx by reutilizing dst buffer (#2751 ) * Reduce size of dctx by reutilizing dst buffer Co-authored-by: Binh Vo <binhvo@fb.com>	2021-10-25 10:38:01 -04:00
Nick Terrell	abd717a5fa	[asm] Switch to C style comments Switch to C style comments for increased portability, and consistency.	2021-10-20 11:37:05 -07:00
Nick Terrell	31316cf158	[multiple-ddicts] Fix NULL checks The bug was reported by Dan Carpenter and found by Smatch static checker. https://lore.kernel.org/all/20211008063704.GA5370@kili/	2021-10-08 11:24:58 -07:00
Nick Terrell	3a4d421c0f	Merge pull request #2802 from solbjorn/fix-kernel-wundef [contrib][linux] Fix -Wundef inside Linux kernel tree	2021-09-29 09:48:17 -07:00

1 2 3 4 5 ...

709 Commits