FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-26 19:01:44 +02:00

Author	SHA1	Message	Date
James Almer	0d34473d8e	Merge commit 'dd5d4a0e1e3a30a254d1a57ecbdcedf230c6014b' * commit 'dd5d4a0e1e3a30a254d1a57ecbdcedf230c6014b': checkasm: aarch64: Don't clobber x29 in checkasm_stack_clobber Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 18:31:36 -03:00
James Almer	f23078904f	Merge commit '2816f8a8bb33bd67fec5e94f5d357918caf4e055' * commit '2816f8a8bb33bd67fec5e94f5d357918caf4e055': build: Drop arch-specific checkasm Makefiles Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 18:01:47 -03:00
James Almer	3ddae9eee9	Merge commit '93d5b022a9fd3a1a1f9c521a1eac7f0410e05b81' * commit '93d5b022a9fd3a1a1f9c521a1eac7f0410e05b81': build: Drop duplicate asm recipe Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 17:57:35 -03:00
James Almer	67b639b496	Merge commit 'c91d6a33f872574c95c8784277cf60ffcf6bff4f' * commit 'c91d6a33f872574c95c8784277cf60ffcf6bff4f': checkasm: aarch64: Add filler args to make sure all parameters are passed on the stack Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 17:38:20 -03:00
James Almer	a2d34cc51b	Merge commit 'f1b3e131385176c3c9d9783b25047856a0dcebf6' * commit 'f1b3e131385176c3c9d9783b25047856a0dcebf6': checkasm: aarch64: Clobber the stack before calling functions Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 17:36:53 -03:00
James Almer	cab4c7fa19	Merge commit 'a05cc56124b4f1237f6355784de821e3290ddb44' * commit 'a05cc56124b4f1237f6355784de821e3290ddb44': checkasm: arm/aarch64: Fix the amount of space reserved for stack parameters Merged-by: James Almer <jamrial@gmail.com>	2017-03-23 17:35:38 -03:00
Clément Bœsch	50bbb67472	Merge commit 'e3f941cb03b139b866a0ad6dc95fbe1b247d54af' * commit 'e3f941cb03b139b866a0ad6dc95fbe1b247d54af': checkasm: add a test for HEVC IDCT Merged-by: Clément Bœsch <u@pkh.me>	2017-03-23 12:17:39 +01:00
James Almer	30cadfe071	avcodec/lossless_videodsp: use ptrdiff_t for length parameters Signed-off-by: James Almer <jamrial@gmail.com>	2017-03-22 18:38:35 -03:00
Clément Bœsch	7c2a7f9c11	Merge commit '22c3ab18646924ce24dc6017a9e882ff69689e40' * commit '22c3ab18646924ce24dc6017a9e882ff69689e40': checkasm: Add test for huffyuvdsp add_bytes huffyuvdsp is renamed to llviddsp to be consistent with our codebase. Note: `af607b7e07` wasn't actually required for this test since this commit is not actually testing huffyuvdsp. Merged-by: Clément Bœsch <u@pkh.me>	2017-03-22 16:31:38 +01:00
Clément Bœsch	83cd80d10a	Merge commit '12004a9a7f20e44f4da2ee6c372d5e1794c8d6c5' * commit '12004a9a7f20e44f4da2ee6c372d5e1794c8d6c5': audiodsp/x86: yasmify vector_clipf_sse audiodsp: reorder arguments for vector_clipf Merged the version from Libav after a discussion with James Almer on IRC: 19:22 <ubitux> jamrial: opinion on 12004a9a7f20e44f4da2ee6c372d5e1794c8d6c5? 19:23 <ubitux> it was apparently yasmified differently 19:23 <ubitux> (it depends on the previous commit arg shuffle) 19:24 <ubitux> i don't see the magic movsxdifnidn in your port btw 19:24 <ubitux> it's a port from `1d36defe94` 19:25 <jamrial> seems better thanks to said arg shuffle 19:25 <jamrial> the loop is the same, but init is simpler 19:25 <jamrial> probably worth merging 19:25 <ubitux> OK 19:25 <ubitux> thanks 19:26 <jamrial> curious they didn't make len ptrdiff_t after the previous bunch of commits, heh 19:26 <ubitux> yeah indeed Both commits are merged at the same time to prevent a conflict with our existing yasmified ff_vector_clipf_sse. Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 22:35:07 +01:00
Clément Bœsch	8414755486	Merge commit 'e9ef6171396dc4106526aaa86b620c61ca3d1017' * commit 'e9ef6171396dc4106526aaa86b620c61ca3d1017': checkasm: add tests for audiodsp Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 19:10:56 +01:00
Clément Bœsch	c50b2164a6	Merge commit '2eb97af66af90ca3978229da151f0b8b3a5d9370' * commit '2eb97af66af90ca3978229da151f0b8b3a5d9370': checkasm: add a test for blockdsp Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 19:05:05 +01:00
Clément Bœsch	e07fa3008b	Merge commit 'de452e503734ebb0fdbce86e9d16693b3530fad3' * commit 'de452e503734ebb0fdbce86e9d16693b3530fad3': pixblockdsp: Change type of stride parameters to ptrdiff_t Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 15:58:32 +01:00
Clément Bœsch	3c8f7a8f6b	Merge commit 'e89cef40506d990a982aefedfde7d3ca4f88c524' * commit 'e89cef40506d990a982aefedfde7d3ca4f88c524': checkasm: Read the unsigned value as it should Merged-by: Clément Bœsch <u@pkh.me>	2017-03-20 11:55:20 +01:00
James Almer	e5623aafd8	Merge commit '87c6c78604e4dd16f1f45862b27ca006da010527' * commit '87c6c78604e4dd16f1f45862b27ca006da010527': vp8: Change type of stride parameters to ptrdiff_t Merged-by: James Almer <jamrial@gmail.com>	2017-03-19 15:11:44 -03:00
Clément Bœsch	8b13492c9e	Merge commit '40ad05bab206c932a32171d45581080c914b06ec' * commit '40ad05bab206c932a32171d45581080c914b06ec': checkasm: Cast unsigned to signed Merged-by: Clément Bœsch <cboesch@gopro.com>	2017-03-15 12:32:15 +01:00
Clément Bœsch	92cb9a3869	Merge commit '9064777dbb335ab4809ae09e3fdcc0245f925cdc' * commit '9064777dbb335ab4809ae09e3fdcc0245f925cdc': checkasm: add HEVC test for testing IDCT DC Merged-by: Clément Bœsch <cboesch@gopro.com>	2017-02-02 11:40:58 +01:00
Clément Bœsch	a0860b0a38	Merge commit '6f9e34baea4f6f484392e4e67f606a0835d07b73' * commit '6f9e34baea4f6f484392e4e67f606a0835d07b73': arm: Check for support for the .fpu directive Merged-by: Clément Bœsch <cboesch@gopro.com>	2017-02-02 11:22:04 +01:00
Clément Bœsch	9f1c81e5ec	Merge commit '71a0472114574993df7035f4de9aa007e03817b8' * commit '71a0472114574993df7035f4de9aa007e03817b8': checkasm: arm: report the first clobbered register in checkasm_checked_call Also includes `446353ea18`, `59aeed93e4`, and `37961044c6` to avoid breaking too much stuff. Merged-by: Clément Bœsch <u@pkh.me>	2017-01-24 19:21:29 +01:00
Martin Storsjö	388f6e6715	arm: vp9itxfm: Skip empty slices in the first pass of idct_idct 16x16 and 32x32
This work is sponsored by, and copyright, Google.
Previously all subpartitions except the eob=1 (DC) case ran with the same runtime:
                                      Cortex   A7       A8       A9      A53
vp9_inv_dct_dct_16x16_sub16_add_neon:      3188.1   2435.4   2499.0   1969.0
vp9_inv_dct_dct_32x32_sub32_add_neon:     18531.7  16582.3  14207.6  12000.3
By skipping individual 4x16 or 4x32 pixel slices in the first pass, we reduce the runtime of these functions like this:
vp9_inv_dct_dct_16x16_sub1_add_neon:        274.6    189.5    211.7    235.8
vp9_inv_dct_dct_16x16_sub2_add_neon:       2064.0   1534.8   1719.4   1248.7
vp9_inv_dct_dct_16x16_sub4_add_neon:       2135.0   1477.2   1736.3   1249.5
vp9_inv_dct_dct_16x16_sub8_add_neon:       2446.7   1828.7   1993.6   1494.7
vp9_inv_dct_dct_16x16_sub12_add_neon:      2832.4   2118.3   2266.5   1735.1
vp9_inv_dct_dct_16x16_sub16_add_neon:      3211.7   2475.3   2523.5   1983.1
vp9_inv_dct_dct_32x32_sub1_add_neon:        756.2    456.7    862.0    553.9
vp9_inv_dct_dct_32x32_sub2_add_neon:      10682.2   8190.4   8539.2   6762.5
vp9_inv_dct_dct_32x32_sub4_add_neon:      10813.5   8014.9   8518.3   6762.8
vp9_inv_dct_dct_32x32_sub8_add_neon:      11859.6   9313.0   9347.4   7514.5
vp9_inv_dct_dct_32x32_sub12_add_neon:     12946.6  10752.4  10192.2   8280.2
vp9_inv_dct_dct_32x32_sub16_add_neon:     14074.6  11946.5  11001.4   9008.6
vp9_inv_dct_dct_32x32_sub20_add_neon:     15269.9  13662.7  11816.1   9762.6
vp9_inv_dct_dct_32x32_sub24_add_neon:     16327.9  14940.1  12626.7  10516.0
vp9_inv_dct_dct_32x32_sub28_add_neon:     17462.7  15776.1  13446.2  11264.7
vp9_inv_dct_dct_32x32_sub32_add_neon:     18575.5  17157.0  14249.3  12015.1
I.e. in general a very minor overhead for the full subpartition case due to the additional loads and cmps, but a significant speedup for the cases when we only need to process a small part of the actual input data.
In common VP9 content in a few inspected clips, 70-90% of the non-dc-only 16x16 and 32x32 IDCTs only have nonzero coefficients in the upper left 8x8 or 16x16 subpartitions respectively.
This is cherrypicked from libav commit `9c8bc74c2b`. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2017-01-14 21:13:30 +01:00
Ronald S. Bultje	1c8fbd7b90	checkasm/vp9: benchmark all sub-IDCTs (but not WHT or ADST).	2016-12-27 10:02:33 -05:00
Hendrik Leppkes	286d8bae61	Merge commit '7b1ae0e73ab7f7c5eabc70dbe2e579127c6e154f' * commit '7b1ae0e73ab7f7c5eabc70dbe2e579127c6e154f': checkasm/arm: preserve the stack alignment checkasm_checked_call Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:21:32 +01:00
Hendrik Leppkes	c0af1ee90d	Merge commit '80fbb7becae530167373fe5178966b7d7604306e' * commit '80fbb7becae530167373fe5178966b7d7604306e': checkasm: vp8.mc: initialize the full src buffer after `ec32574209` Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:20:10 +01:00
Hendrik Leppkes	90b72f6bda	Merge commit '8c816c0c9b12fdefd9046415e97df299880bc9b8' * commit '8c816c0c9b12fdefd9046415e97df299880bc9b8': checkasm/arm: align the clobber check data properly for ldrd Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:06:10 +01:00
Hendrik Leppkes	4fe013fc70	Merge commit 'ec32574209f36467ef0d22c21a7e811ba98c15b6' * commit 'ec32574209f36467ef0d22c21a7e811ba98c15b6': checkasm: vp8: mc: test unequal width/height for partitions Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:05:25 +01:00
Hendrik Leppkes	47f75839e4	Merge commit 'f8d17d53957056c053a46f9320fa7ae6fe1479a5' * commit 'f8d17d53957056c053a46f9320fa7ae6fe1479a5': checkasm: Add tests for vp8dsp Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:29:08 +01:00
Hendrik Leppkes	f75035b06f	Merge commit 'e48746deec48e9ff195841bc3266b4e153a878cd' * commit 'e48746deec48e9ff195841bc3266b4e153a878cd': checkasm: h264dsp: Move the x and y variables into the randomize_buffer macro Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-13 23:02:39 +01:00
Martin Storsjö	dd5d4a0e1e	checkasm: aarch64: Don't clobber x29 in checkasm_stack_clobber x29 (FP) is a callee saved register and should be restored on return. Instead of backing up x29 and restoring it here, back up sp in a register that we are allowed to overwrite. This fixes crashes in checkasm on aarch64 since `f1b3e13138`. For some reason, gcc builds didn't crash, but clang builds do. Signed-off-by: Martin Storsjö <martin@martin.st>	2016-10-18 16:17:12 +03:00
Diego Biurrun	2816f8a8bb	build: Drop arch-specific checkasm Makefiles They only contain one line and will never contain more.	2016-10-17 16:25:38 +02:00
Diego Biurrun	93d5b022a9	build: Drop duplicate asm recipe And move the asm recipe to the top-level Makefile next to the other local pattern rules for .o files.	2016-10-17 16:25:35 +02:00
Martin Storsjö	c91d6a33f8	checkasm: aarch64: Add filler args to make sure all parameters are passed on the stack This, combined with clobbering the stack space prior to the call, increases the chances of finding cases where 32 bit parameters are erroneously treated as 64 bit. Signed-off-by: Martin Storsjö <martin@martin.st>	2016-10-16 23:26:33 +03:00
Martin Storsjö	f1b3e13138	checkasm: aarch64: Clobber the stack before calling functions Signed-off-by: Martin Storsjö <martin@martin.st>	2016-10-16 23:26:22 +03:00
Martin Storsjö	a05cc56124	checkasm: arm/aarch64: Fix the amount of space reserved for stack parameters Even if MAX_ARGS - 2 (for arm) or MAX_ARGS - 7 (for aarch64) parameters are passed on the stack to checkasm_checked_call, we actually only need to store MAX_ARGS - 4 (for arm) or MAX_ARGS - 8 (for aarch64) parameters on the stack when calling the tested function. Signed-off-by: Martin Storsjö <martin@martin.st>	2016-10-16 23:26:15 +03:00
Alexandra Hájková	e3f941cb03	checkasm: add a test for HEVC IDCT Signed-off-by: Anton Khirnov <anton@khirnov.net>	2016-10-11 18:15:40 +02:00
Hendrik Leppkes	6fc74934de	Merge commit 'dc7501e524dc3270335749302c7aa449973625f3' * commit 'dc7501e524dc3270335749302c7aa449973625f3': checkasm: Issue emms after benchmarking functions Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-10-07 13:18:05 +02:00
Ronald S. Bultje	c935b54bd6	checkasm: add VP9 loopfilter tests. The randomize_buffer() implementation assures that "most of the time", we'll do a good mix of wide16/wide8/hev/regular/no filters for complete code coverage. However, this is not mathematically assured because that would make the code either much more complex, or much less random. Some fixes and improvements by Rodger Combs <rodger.combs@gmail.com> Signed-off-by: Anton Khirnov <anton@khirnov.net>	2016-10-04 10:54:07 +02:00
Alexandra Hájková	22c3ab1864	checkasm: Add test for huffyuvdsp add_bytes Signed-off-by: Luca Barbato <lu_zero@gentoo.org>	2016-10-02 17:13:26 +02:00
Diego Biurrun	ba479f3daa	hevc: Change type of array stride parameters to ptrdiff_t ptrdiff_t is the correct type for array strides and similar.	2016-09-29 17:54:23 +02:00
Anton Khirnov	e9ef617139	checkasm: add tests for audiodsp	2016-09-22 09:47:52 +02:00
Anton Khirnov	2eb97af66a	checkasm: add a test for blockdsp	2016-09-22 09:47:52 +02:00
Anton Khirnov	683da86aab	audiodsp: reorder arguments for vector_clipf This will make the x86 asm simpler. ARM conversion by Martin Storsjö <martin@martin.st> and Janne Grunau <janne-libav@jannau.net>	2016-09-22 09:47:52 +02:00
Luca Barbato	e89cef4050	checkasm: Read the unsigned value as it should Reading a value larger than int using atoi() may give the wrong result.	2016-09-11 14:12:18 +02:00
Diego Biurrun	87c6c78604	vp8: Change type of stride parameters to ptrdiff_t ptrdiff_t is the correct type for array strides and similar.	2016-08-26 11:36:53 +02:00
Martin Storsjö	2e95054ebb	checkasm: h264dsp: Initialize the padding area This fixes valgrind warnings about conditional jumps based on uninitialized data (even though the uninitialized data only ever was compared with a direct copy of the same uninitialized data). Signed-off-by: Martin Storsjö <martin@martin.st> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-08-11 19:55:16 +02:00
Ronald S. Bultje	e99ecda550	checkasm: add vp9 MC tests. Signed-off-by: Anton Khirnov <anton@khirnov.net>	2016-08-03 11:07:01 +02:00
James Almer	54a0a52be1	checkasm/vp9dsp: use declare_func_emms in check_loopfilter Fixes checkasm failures on mmxext functions Signed-off-by: James Almer <jamrial@gmail.com>	2016-07-26 22:16:21 -03:00
Luca Barbato	40ad05bab2	checkasm: Cast unsigned to signed Avoid a warning for passing an unsigned value to abs(), some compilers might optimize away abs().	2016-07-23 08:27:32 +02:00
Alexandra Hájková	9064777dbb	checkasm: add HEVC test for testing IDCT DC Signed-off-by: Anton Khirnov <anton@khirnov.net>	2016-07-22 19:08:12 +02:00
Martin Storsjö	6f9e34baea	arm: Check for support for the .fpu directive When targeting COFF (windows), clang doesn't support this directive (while binutils supports it for all targets). Signed-off-by: Martin Storsjö <martin@martin.st>	2016-07-21 12:52:10 +03:00
Martin Storsjö	37961044c6	checkasm: arm: Ignore changes to bits 0-4 and 7 of FPSCR These bits are set by exceptions in NEON instructions. Also print the differing bits when FPSCR is clobbered, and use bic instead of lsl, for clearing the topmost bits. Signed-off-by: Martin Storsjö <martin@martin.st>	2016-07-17 21:48:17 +03:00
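Commit `37961044c6` above describes ignoring changes to bits 0-4 and 7 of FPSCR and printing the bits that differ. The real check lives in ARM assembly inside checkasm; the C fragment below is only a rough sketch of the masking idea. The names `FPSCR_IGNORED_BITS` and `fpscr_clobbered_bits` are hypothetical and do not come from the FFmpeg tree, and the sketch does not cover the additional clearing of the topmost bits that the commit mentions.

```c
#include <stdint.h>

/* Bits 0-4 and 7 of FPSCR are cumulative exception flags; the commit
 * message says changes to them should be ignored.
 * (0x1f covers bits 0-4, 0x80 is bit 7.) Hypothetical name. */
#define FPSCR_IGNORED_BITS ((uint32_t)(0x1f | 0x80))

/* Hypothetical helper: return the bits that differ between two FPSCR
 * snapshots, leaving out the ignored exception flags.
 * A result of 0 means the register counts as unchanged. */
static uint32_t fpscr_clobbered_bits(uint32_t before, uint32_t after)
{
    return (before ^ after) & ~FPSCR_IGNORED_BITS;
}
```

A caller would take a snapshot before and after the tested function and report the returned value, which is non-zero only when a bit outside the ignored set changed.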


177 Commits
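Commit `e89cef4050` in the log above notes that reading a value larger than `int` with `atoi()` may give the wrong result. As an illustration only, here is a minimal sketch of parsing an unsigned decimal value with `strtoul()` and explicit range checks. The helper name `parse_unsigned` is hypothetical and is not taken from checkasm, whose actual change may be simpler.

```c
#include <errno.h>
#include <limits.h>
#include <stdlib.h>

/* Hypothetical helper: parse a decimal string into an unsigned int.
 * Returns 0 on success and -1 if the string is empty, has trailing
 * characters, or does not fit into an unsigned int. */
static int parse_unsigned(const char *s, unsigned *out)
{
    char *end;
    unsigned long v;

    errno = 0;
    v = strtoul(s, &end, 10);
    if (end == s || *end != '\0')        /* no digits, or trailing junk */
        return -1;
    if (errno == ERANGE || v > UINT_MAX) /* value does not fit */
        return -1;

    *out = (unsigned)v;
    return 0;
}
```

With a 32-bit `int`, a value such as 3000000000 is outside the range `atoi()` can represent, while the sketch above accepts it as long as it fits into `unsigned`.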