krak/zstd - zstd - Gitea: Git with a cup of tea

krak/zstd

mirror of https://github.com/facebook/zstd.git synced 2025-03-06 08:49:28 +02:00

Author	SHA1	Message	Date
Dimitri Papadopoulos	fcf88ae39b	Fix new typos found by codespell	2024-11-26 11:15:39 +01:00
Yann Collet	2e02cd330d	inform manual users that it's automatically generated suggested by @Eugeny1	2024-10-31 15:06:48 -07:00
Yann Collet	d9553fd218	elevated ZSTD_getErrorCode() to stable status answering #4183	2024-10-31 14:15:50 -07:00
Yann Collet	3e7c66acd1	added ascending order example	2024-10-09 01:06:24 -07:00
Yann Collet	3b343dcfb1	refactor huffman prefix code paragraph	2024-10-07 17:15:07 -07:00
Yann Collet	a8b86d024a	refactor documentation of the FSE decoding table build process	2024-10-02 23:09:06 -07:00
Yann Collet	d2212c680a	Merge pull request #4013 from elasota/spec-clarify-offset-code-overflow Specify that decoders may reject non-zero probabilities for larger offset codes than implementation supports	2024-09-27 13:42:32 -07:00
Yann Collet	3de0541aef	Merge pull request #4079 from elasota/truncated-huff-state-error Throw error if Huffman weight initial states are truncated	2024-06-30 16:17:03 -07:00
elasota	0938308ff6	Throw error if Huffman weight initial states are truncated	2024-06-20 17:46:16 -04:00
Dimitri Papadopoulos	2d736d9c50	Fix new typos found by codespell	2024-06-20 20:12:16 +02:00
Quentin Boswank	f19c98228f	Fix $filter and Msys/Cygwin - switched the patter and input of $filter into the right places - added pattern wildcard to MSYS_NT & CYGWIN_NT as they change with windows versions - correctly identify MSYS2, even in an env like MINGW64	2024-06-05 18:37:27 +02:00
elasota	c54f4783d0	Specify that decoders may reject non-zero probabilities for larger offset codes than supported by the implementation	2024-04-01 20:13:48 -04:00
elasota	8cff66f2f5	Remove text specifying probability overflow as invalid, the variable-size value encoding scheme makes this impossible.	2024-04-01 20:08:42 -04:00
Yann Collet	c6e5257240	Merge pull request #3977 from facebook/doc_advanced Doc update	2024-03-21 12:33:15 -07:00
Elliot Gorokhovsky	741b87bbe1	Fuzzing and bugfixes for magicless-format decoding (#3976 ) * fuzzing and bugfixes for magicless format * reset dctx before each decompression * do not memcmp empty buffers * nit: decompressor errata	2024-03-20 19:22:34 -04:00
Yann Collet	c5da438dc0	fix typo	2024-03-18 12:33:22 -07:00
Yann Collet	3d18d9a9ce	updated API manual	2024-03-18 12:30:54 -07:00
Yann Collet	686e7e4b4b	updated version to v1.5.6	2024-03-14 15:38:14 -07:00
Yann Collet	eb5f7a7fa2	produced golden sample for the offset==0 decoder test is correctly detected as corrupted by new version, and is accepted (changed into offset==1) by older version. updated documentation accordingly, with an hexadecimal representation.	2024-03-09 00:33:44 -08:00
Yann Collet	d2f56ba442	update documentation	2024-03-08 15:55:30 -08:00
Yann Collet	e127139ceb	Merge pull request #3824 from elasota/specify-zero-offset Specify offset 0 as invalid and specify required fixup behavior	2024-03-08 15:25:48 -08:00
Yann Collet	478e5fedf9	Merge pull request #3816 from elasota/fix-state-table Fix state table formatting	2024-03-08 15:02:00 -08:00
Yann Collet	f77f634d41	update API documentation	2024-02-24 01:28:17 -08:00
Yann Collet	7971fd16f7	Merge pull request #3817 from elasota/oversized-probs-clarification Clarify that probability tables must not contain non-zero probabilities for invalid values	2024-01-13 11:37:54 -08:00
elasota	f06b18b3ff	Specify offset 0 as invalid	2023-12-28 16:47:09 -05:00
elasota	05059e5a48	Clarify that there must be at least 2 weights, i.e. encoding all weights as 0 is invalid	2023-11-24 16:49:40 -05:00
elasota	dc84e35138	Clarify that the presence of a value with weight 1 is required	2023-11-24 16:49:40 -05:00
elasota	c5bf96fb74	Clarify that a non-zero probability for an invalid symbol is invalid	2023-11-13 00:03:56 -05:00
elasota	52e41b9ac8	Fix malformed state table	2023-11-09 12:28:21 -05:00
elasota	e61e3ff152	Clarify that decoding too many Huffman weights is a failure condition	2023-11-08 20:06:58 -05:00
elasota	324cce4996	Add definition of "log2sup" function	2023-10-31 11:45:10 -04:00
elasota	b38d87b476	Clarify that the log2 of the largest possible symbol is the maximum number of bits consumed	2023-10-31 01:17:23 -04:00
Dimitri Papadopoulos	fe34776c20	Fix new typos found by codespell	2023-09-23 18:56:01 +02:00
Yann Collet	3732a08f5b	fixed decoder behavior when nbSeqs==0 is encoded using 2 bytes The sequence section starts with a number, which tells how sequences are present in the section. If this number if 0, the section automatically ends. The number 0 can be represented using the 1 byte or the 2 bytes formats. That's because the 2-bytes formats fully overlaps the 1 byte format. However, when 0 is represented using the 2-bytes format, the decoder was expecting the sequence section to continue, and was looking for FSE tables, which is incorrect. Fixed this behavior, in both the reference decoder and the educational behavior. In practice, this behavior never happens, because the encoder will always select the 1-byte format to represent 0, since this is more efficient. Completed the fix with a new golden sample for tests, a clarification of the specification, and a decoder errata paragraph.	2023-06-05 16:03:00 -07:00
Yann Collet	8030342eea	Merge pull request #3659 from facebook/fixHarness Fixed a bug in the educational decoder	2023-06-05 15:03:14 -04:00
Yann Collet	1f83b7cfc4	fix a minor inefficiency in compress_superblock and in `decodecorpus`: the specific case `nbSeq=127` can be represented using the 1-byte format. Note that both the 1-byte and the 2-bytes formats are valid to represent this case, so there was no "error", produced data remains valid, it's just that the 1-byte format is more efficient. fix #3667 Credit to @ip7z for finding this issue.	2023-06-05 09:51:52 -07:00
Yann Collet	5108c9ac97	Fixed a bug in the educational decoder Credit to Igor Pavlov	2023-05-27 11:22:30 -07:00
Yann Collet	0d6954b4cc	added golden file for the new decompressor erratum	2023-04-19 00:24:35 -07:00
Yann Collet	a29b6ed251	added decoder errata paragraph for compressed blocks of size exactly 128 KB which used to be disallowed by the spec but have become allowed in more recent version of the spec. While this limitation is fixed in decoders v1.5.4+, implementers should refrain from generating such block with their custom encoder as they could be misclassified as corrupted by older decoder versions.	2023-04-17 15:43:27 -07:00
Yann Collet	9f58241dcc	updated version number to v1.5.5 also : updated man pages	2023-03-31 23:02:08 -07:00
Yann Collet	64e8511b26	added clarifications for sizes of compressed huffman blocks and streams.	2023-03-08 15:31:36 -08:00
Yann Collet	832f559b0b	clarify zstd specification for Huffman blocks Following detailed comments from @dweiller in #3508.	2023-02-18 18:18:16 -08:00
Yann Collet	95ffc767f6	updated man pages	2023-02-09 14:40:39 -08:00
Yann Collet	39ceef27f9	bump version number to v1.5.4 start preparation for release	2023-01-30 19:06:39 -08:00
Yann Collet	6a9c525903	spec update : require minimum nb of literals for 4-streams mode Reported by @shulib : the specification for 4-streams mode doesn't work when the amount of literals to compress is 5 bytes. Extending it, it also doesn't work for sizes 1 or 2. This patch updates the specification and the implementation to require a minimum of 6 literals to trigger or accept the 4-streams mode. The impact is expected to be a no-op : the 4-streams mode is never triggered for such small quantity of literals anyway, since it would be wasteful (it costs ~7.3 bytes more than single-stream mode). An informal lower limit is set at ~256 bytes, so the technical minimum is very far from this limit. This is just meant for completeness of the specification.	2022-12-22 16:14:34 -08:00
W. Felix Handte	5d693cc38c	Coalesce Almost All Copyright Notices to Standard Phrasing ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora -o -path ./tests/regression/data-cache -o -path ./tests/regression/cache $ -prune -o -type f); do sed -i '/Copyright .* $Yann Collet$\\|$Meta Platforms$/ s/Copyright ./Copyright (c) Meta Platforms, Inc. and affiliates./' $f; done git checkout HEAD -- build/VS2010/libzstd-dll/libzstd-dll.rc build/VS2010/zstd/zstd.rc tests/test-license.py contrib/linux-kernel/test/include/linux/xxhash.h examples/streaming_compression_thread_pool.c lib/legacy/zstd_v0.c lib/legacy/zstd_v0*.h nano ./programs/windres/zstd.rc nano ./build/VS2010/zstd/zstd.rc nano ./build/VS2010/libzstd-dll/libzstd-dll.rc ```	2022-12-20 12:52:34 -05:00
W. Felix Handte	7f12f24cf4	Rewrite Copyright Date Ranges from `-present` to `-2022` Apparently it's better. Somehow. ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora -o -path ./tests/regression/data-cache -o -path ./tests/regression/cache $ -prune -o -type f); do echo $f; sed -i 's/\-present/-2022/' $f; done g co HEAD -- build/meson/ ```	2022-12-20 12:44:56 -05:00
W. Felix Handte	36d5c2f326	Update Copyright Year ('2021' -> 'present') ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora -o -path ./tests/regression/data-cache -o -path ./tests/regression/cache $ -prune -o -type f); do sed -i 's/\-2021/-present/' $f; done g co HEAD -- .github/workflows/dev-short-tests.yml # fix bad match ```	2022-12-20 12:42:50 -05:00
W. Felix Handte	8927f985ff	Update Copyright Headers 'Facebook' -> 'Meta Platforms' ``` for f in $(find . $ -path ./.git -o -path ./tests/fuzz/corpora $ -prune -o -type f); do sed -i 's/Facebook, Inc\./Meta Platforms, Inc. and affiliates./' $f; done ```	2022-12-20 12:37:57 -05:00
Yann Collet	1bc9dfe46e	Update documentation link to html format fix #3319	2022-12-15 15:57:29 -08:00

1 2 3 4 5 ...

354 Commits