FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-23 12:43:46 +02:00

Author	SHA1	Message	Date
Andreas Rheinhardt	fd72d8aea3	avcodec/blockdsp: Remove unused AVCodecContext parameter Possible since `be95df12bb`. Reviewed-by: Rémi Denis-Courmont <remi@remlab.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-21 20:24:40 +02:00
Andreas Rheinhardt	57f3ca20dc	avcodec/cavsdsp: Remove unused function parameter Reviewed-by: Rémi Denis-Courmont <remi@remlab.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-21 20:24:22 +02:00
Andreas Rheinhardt	6a288ada55	fate/lavf-*: Add missing dependency on pipe protocol Forgotten in `bf1337f99c`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-21 15:43:52 +02:00
Paul B Mahol	b9d71cdb9e	avcodec/mjpegbdec: use init_get_bits8()	2022-09-21 15:03:00 +02:00
Paul B Mahol	6a150fcdb9	avcodec/mlpenc: analyze only if there are samples	2022-09-21 14:32:07 +02:00
Thilo Borgmann	2ca2d46f0b	lavc/videotoolboxenc: Fix crash by uninitialized value If create_cv_pixel_buffer() fails, pixel_buffer_info might get into CFRelease() containing an arbitrary value.	2022-09-21 13:39:55 +02:00
Andreas Rheinhardt	5a7978a694	avfilter/avfilter: Make ff_command_queue_pop() static Only used here. Reviewed-by: Nicolas George <george@nsup.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-21 13:01:20 +02:00
Andreas Rheinhardt	55c35d84eb	avfilter/internal: Remove declaration of inexistent function ff_get_ref_perms_string() has been removed in `7e350379f8`. Reviewed-by: Nicolas George <george@nsup.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-21 13:00:52 +02:00
Hubert Mazur	b2732115dd	lavc/aarch64: Add neon implementation for pix_median_abs8 Provide optimized implementation for pix_median_abs8 function. Performance comparison tests are shown below. - median_sad_1_c: 277.0 - median_sad_1_neon: 82.0 Benchmarks and tests run with checkasm tool on AWS Graviton 3. Signed-off-by: Hubert Mazur <hum@semihalf.com> Signed-off-by: Martin Storsjö <martin@martin.st>	2022-09-21 12:57:56 +03:00
Hubert Mazur	e9a6170213	lavc/aarch64: Add neon implementation for vsad8_intra Provide optimized implementation for vsad8_intra function. Performance comparison tests are shown below. - vsad_5_c: 94.7 - vsad_5_neon: 20.7 Benchmarks and tests run with checkasm tool on AWS Graviton 3. Signed-off-by: Hubert Mazur <hum@semihalf.com> Signed-off-by: Martin Storsjö <martin@martin.st>	2022-09-21 12:57:56 +03:00
Hubert Mazur	0ee535b1db	lavc/aarch64: Add neon implementation for pix_median_abs16 Provide optimized implementation for pix_median_abs16 function. Performance comparison tests are shown below. - median_sad_0_c: 720.5 - median_sad_0_neon: 127.2 Benchmarks and tests run with checkasm tool on AWS Graviton 3. Signed-off-by: Hubert Mazur <hum@semihalf.com> Signed-off-by: Martin Storsjö <martin@martin.st>	2022-09-21 12:57:56 +03:00
Paul B Mahol	acbb2777e2	avcodec/ac3dec: add downmix support for mono and stereo for eac3 7.1	2022-09-21 09:22:08 +02:00
Wang Yaqiang	0744782de3	avformat/mov: get the correct fragment stsd_id when decrypting the sample When determining whether a packet should be decrypted, should use the stsd_id of the fragment where the current packet is located. Reviewed-by: Zhao Zhili <zhilizhao@tencent.com> Signed-off-by: Wang Yaqiang <wangyaqiang03@kuaishou.com>	2022-09-21 13:58:40 +08:00
Lynne	3ade6a8644	x86/lpc: implement a new Welch windowing function Old one was written with the assumption only even inputs would be given. This very messy replacement supports even and odd inputs, and supports AVX2 for extra speed. The buffers given are usually quite big (4k samples), so the speedup is worth it. The new SSE version is still faster than the old inline asm version by 33%. Also checkasm is provided to make sure this monstrosity works. This fixes some FATE tests.	2022-09-21 07:12:39 +02:00
Lynne	cc367a9b8a	lavc/lpc: do not explode when windowing a 1-length array Divided by 0.	2022-09-21 07:09:36 +02:00
Will Cassella	e601ec3c19	libavformat/riffec: Zero-initialize channels in ff_get_wav_header Clang's static analyzer complains that leaving the variable uninitialized could lead to a code path where the uninitialized value is written to at the end of this function. This patch simply zero-initializes that variable to avoid that. Signed-off-by: Will Cassella <cassew@google.com> Signed-off-by: James Almer <jamrial@gmail.com>	2022-09-20 18:58:23 -03:00
Andreas Rheinhardt	d438540d2b	avformat/spdifenc: Reorder struct members to make it smaller Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Andreas Rheinhardt	2e9fd627ed	fate/segafilm: Add tests for segafilm (de)muxer Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Andreas Rheinhardt	e27d67b24c	avcodec/rpzaenc: Avoid useless intermediate variable Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Andreas Rheinhardt	353108bfab	avcodec/smc: Move transient GetByteContext from context to stack Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Andreas Rheinhardt	6a8b3e7eb1	fate/ffmpeg: Set max_delay for shortest-sub The aim of this test is to show the interleavement of the file generated in the first pass; so make the interleavement queue in the framecrc muxer in the second pass as small as possible so that the framecrc muxer does not fix wrong interleavement of the input file behind our backs. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Andreas Rheinhardt	71364c54d4	fate/ffmpeg: Use transcode instead of enc_dec in shortest-sub test enc_dec is designed for raw input and output and computes the PSNR between these two. The input of the shortest-sub test is the idx file of a vobsub sub+idx combination and the output is the output of framecrc of said vobsub subtitle muxed into Matroska together with a synthesized video. Calculating the PSNR between these two files makes no sense, therefore switch to a transcode test, where the ref file file contains the output of framecrc directly, making the interleavement better visible in the ref file at the cost of a larger ref file (>400 lines). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 18:32:28 +02:00
Pierre-Anthony Lemieux	79845ce6cf	fate/mxf: add JPEG 2000 test Reviewed-by: Tomas Härdin <tjoppen@acc.umu.se> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 11:26:00 +02:00
Pierre-Anthony Lemieux	48fa27e77a	avformat/mxf: set stream frame rates for ST 422 essence containers The MXF demuxer does not currently set AVStream::avg_frame_rate and ::r_frame_rate when J2K essence is wrapped according to SMPTE ST 422. Reviewed-by: Tomas Härdin <tjoppen@acc.umu.se> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 11:25:54 +02:00
Andreas Rheinhardt	f08d529e11	fate/mxf: Add ProRes remux test Also covers writing mastering display metadata. Reviewed-by: Tomas Härdin <tjoppen@acc.umu.se> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 11:25:48 +02:00
Andreas Rheinhardt	56e29fcac2	tests/mxf: Fix test requirements Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 10:39:36 +02:00
James Almer	8f119b501e	tests/checkasm: add a test for VorbisDSPContext Signed-off-by: James Almer <jamrial@gmail.com>	2022-09-19 21:28:23 -03:00
Andreas Rheinhardt	118b36f418	avcodec/mjpegdec: Avoid copying data when flipping image Basically reverts `af15c17daa`. Flipping a picture by modifying the pointers is so common that even users of direct rendering should take it into account. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-20 01:55:29 +02:00
Andreas Rheinhardt	888a02a126	swscale/output: Don't call av_pix_fmt_desc_get() in a loop Up until now, libswscale/output.c used a macro to write an output pixel which involved a call to av_pix_fmt_desc_get() to find out whether the input pixel format is BE or LE despite this being known at compile-time (there are templates per pixfmt). Even worse, these calls are made in a loop, so that e.g. there are eight calls to av_pix_fmt_desc_get() for every pixel processed in yuv2rgba64_X_c_template() for 64bit RGB formats. This commit modifies these macros to ensure that isBE() is evaluated at compile-time. This saved 41184B of .text for me (GCC 11.2, -O3). Of course, it also improved performance. E.g. ffmpeg_g -f lavfi -i testsrc2,format=yuva420p -pix_fmt rgba64le \ -threads 1 -t 1:00 -f null - (which uses yuv2rgba64le_X_c, which is an invocation of yuv2rgba64_X_c_template() mentioned above), performance improved from 95589 to 41387 decicycles for one call to yuv2packedX; for the be variant the numbers went down from 76087 to 43024 decicycles. Reviewed-by: Anton Khirnov <anton@khirnov.net> Reviewed-by: Paul B Mahol <onemda@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 23:40:41 +02:00
Andreas Rheinhardt	4d7a1a4619	swscale/input: Avoid calls to av_pix_fmt_desc_get() Up until now, libswscale/input.c used a macro to read an input pixel which involved a call to av_pix_fmt_desc_get() to find out whether the input pixel format is BE or LE despite this being known at compile-time (there are templates per pixfmt). Even worse, these calls are made in a loop, so that e.g. there are six calls to av_pix_fmt_desc_get() for every pair of UV pixel processed in rgb64ToUV_half_c_template(). This commit modifies these macros to ensure that isBE() is evaluated at compile-time. This saved 9743B of .text for me (GCC 11.2, -O3). For a simple RGB64LE->YUV420P transformation like ffmpeg -f lavfi -i haldclutsrc,format=rgba64le -pix_fmt yuv420p \ -threads 1 -t 1:00 -f null - the amount of decicycles spent in rgb64LEToUV_half_c (which is created via the template mentioned above) decreases from 19751 to 5341; for RGBA64BE the number went down from 11945 to 5393. For shared builds (where the call to av_pix_fmt_desc_get() is indirect) the old numbers are 15230 for RGBA64BE and 27502 for RGBA64LE, whereas the numbers with this patch are indistinguishable from the numbers from a static build. Also make the macros that are touched conform to the usual convention of using uppercase names while just at it. Reviewed-by: Anton Khirnov <anton@khirnov.net> Reviewed-by: Paul B Mahol <onemda@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 23:40:41 +02:00
Andreas Rheinhardt	187cd27832	avutil/dict: Error out in case of key == NULL Up until now, using NULL as key in av_dict_get() on a non-empty AVDictionary would crash; using NULL as key in av_dict_set() would also crash for a non-empty AVDictionary unless AV_DICT_MULTIKEY was set; in case the dictionary was initially empty or AV_DICT_MULTIKEY was set, it was even possible for av_dict_set() to succeed when adding a NULL key, namely when one uses a value != NULL and the AV_DICT_DONT_STRDUP_VAL flag. Using av_dict_get() on such an AVDictionary will usually lead to crashes, though. Fix this by actually checking for key in both functions; error out if they are NULL. While just at it, also stop relying on av_strdup(NULL) to return NULL in av_dict_set(). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 23:39:58 +02:00
Andreas Rheinhardt	32129d6495	avcodec/vorbisdec: Move ff_vorbis_inverse_coupling() to vorbisdsp.c Only used there. Also make it static. Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 19:19:08 +02:00
Rémi Denis-Courmont	5c15cb138e	lavc/vorbisdec: use intermediate variables The compiler cannot infer that the two float vectors do not alias, causing unnecessary extra loads and serialisation. This patch caches the two input values in local variables so that compiler can optimise individual loop iterations.	2022-09-19 14:07:27 -03:00
Rémi Denis-Courmont	b52034270a	lavc/vorbisdsp: use ptrdiff_t rather than intptr_t ... for a difference between pointers.	2022-09-19 13:51:00 -03:00
Rémi Denis-Courmont	bbf045aa59	lavc/vorbisdec: use ptrdiff_t to iterate over intptr_t While this probably never overflows, we are better safe than sorry. The callback prototype should probably also use ptrdiff_t or size_t, but I diggress (this would affect the DSP callback prototype).	2022-09-19 13:51:00 -03:00
Andreas Rheinhardt	129cbbd7be	avcodec/aptxenc: Process data in complete blocks of four samples only Do this by setting AVCodecInternal.pad_samples. This prevents reading into the frame's padding and writing into the packet's padding. This actually happened in our FATE tests (where the number of samples is 2 mod 4), which therefore needed to be updated. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	017d976629	avcodec/encode: Enable encoders to control padding of last frame Some audio codecs work with atomic units that decode to a fixed number of audio samples with this number being so small that it is common to put multiple of these atoms into one packet. In these cases it makes no sense to pad the last frame to the big frame_size, so allow encoders to set the number of samples that they want the last frame to be padded to instead. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	dc7b664557	avcodec/encode: Redo checks for small last audio frame In particular, check that there is only one small last frame in case the encoder has the AV_CODEC_CAP_SMALL_LAST_FRAME set. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	e6bfb14223	avcodec/aptxdec: Process data in complete blocks only The APTX (HD) decoder decodes blocks of four (six) bytes to four output samples. It makes no sense to handle incomplete blocks: They would just lead to synchronization errors, in which case the complete frame is discarded. So only handle complete blocks. This also avoids reading from the packet's padding and writing into the frame's padding. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	c8707c105f	avformat/aptxdec: Don't set AV_PKT_FLAG_CORRUPT mistakenly Just because we try to put multiple units of block_align bytes (the atomic units for APTX and APTX HD) into one packet does not mean that packets with fewer units than the one we wanted are corrupt; only those packets that are not a multiple of block_align are. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	9d10d3a4ee	avformat/aptxdec: Don't set AVCodecParameters.frame_size This field was misunderstood: It gives the number of samples in a packet, not the number of bytes. Its usage was wrong for APTX HD. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	c0d483cecb	avcodec/aptx: Use AVCodecContext.frame_size according to the API Currently the APTX (HD) codecs set frame_size if unset and check whether it is divisible by block_size (corresponding to block_align as used by other codecs). But this is based upon a misunderstanding of the API: frame_size is not in bytes, but in samples. Said value is also not intended to be set by the user at all, but set by encoders and (possibly) decoders if the number of channels in a frame is constant. The latter condition is not fulfilled here, so only set it for encoders. Given that the encoder can handle any number of samples as long as it is divisible by four and given that it worked to set a custom frame size before, the encoders accept any multiple of four; otherwise the value is set to the value that it already had for APTX: 1024 samples (per channel). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	6f22d1965e	avcodec/utils: Support APTX (HD) in av_get_audio_frame_duration() APTX decodes four bytes of input to four stereo samples; APTX HD does the same with six bytes of input. So it can be easily supported in av_get_audio_frame_duration(). This fixes invalid durations and (derived) timestamps of demuxed APTX HD packets and therefore fixed the timestamp in the aptx-hd FATE test. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
Andreas Rheinhardt	9884d14711	fate/audio: Add tests for APTX (HD) We have de- and encoders for APTX and APTX HD, yet not FATE tests. This commit therefore adds a transcoding test to utilize them. Furthermore, during creating these tests it turned out that the duration is set incorrectly for APTX HD. This will be fixed in a future commit. (Thanks to Andriy Gelman for finding an issue in an earlier version that used a 192kHz input sample which does not work reliably accross platforms.) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 17:05:33 +02:00
James Almer	33cdf51a06	avcodec/avcodec: Use the new API fields to validate the layout returned by decoders This block was scheduled for removal, which means that no validation would have taken place after the old API was removed. It was algo going to mistakenly remove an unrelated bits_per_coded_sample check. Signed-off-by: James Almer <jamrial@gmail.com>	2022-09-19 11:59:15 -03:00
Anton Khirnov	8576c3c5d8	lavc/videotoolbox: deprecate write-only output_callback This field has never been used for anything, so stop setting it and deprecate it.	2022-09-19 15:18:20 +02:00
Anton Khirnov	d7f4ad88a0	lavc/videotoolbox: do not pass AVCodecContext to decoder output callback The opaque parameter for the callback is set in videotoolbox_start(), called when the hwaccel is initialized. When frame threading is used, avctx will be the context corresponding to the frame thread currently doing the decoding. Using this same codec context in all subsequent invocations of the decoder callback (even those triggered by a different frame thread) is unsafe, and broken after `cc867f2c09`, since each frame thread now cleans up its hwaccel state after decoding each frame. Fix this by passing hwaccel_priv_data as the opaque parameter, which exists in a single instance forwarded between all frame threads. The only other use of AVCodecContext in the decoder output callback is as a logging context. For this purpose, store a logging context in hwaccel_priv_data.	2022-09-19 15:18:20 +02:00
Lynne	4ba68639ca	x86/tx_float: add asm call versions of the 2pt and 4pt transforms Verified to be working.	2022-09-19 06:01:06 +02:00
Lynne	892548e6a1	x86/tx_float: fully support 128bit regs in LOAD64_LUT The gather path didn't support 128bit registers. It's not faster on Zen 3, but it's here for completeness.	2022-09-19 06:01:04 +02:00
Lynne	af42bb3d61	x86/tx_float: simplify and describe the intra-asm call convention	2022-09-19 06:01:02 +02:00

1 2 3 4 5 ...

108255 Commits