FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-11-23 21:54:53 +02:00

Author	SHA1	Message	Date
Zhao Zhili	a5cc0e5c9e	avfilter/vf_drawtext: fix call GET_UTF8 with invalid argument For GET_UTF8(val, GET_BYTE, ERROR), val has type of uint32_t, GET_BYTE must return an unsigned integer, otherwise signed extension happened due to val= (GET_BYTE), and GET_UTF8 went to the error path. This bug incidentally cancelled the bug where hb_buffer_add_utf8 was being called with incorrect argument, allowing drawtext to function correctly on x86 and macOS ARM, which defined char as signed. However, on Linux and Android ARM environments, because char is unsigned by default, GET_UTF8 now returns the correct return, which unexpectedly revealed issue #20906.	2025-11-19 17:46:06 +00:00
Zhao Zhili	9bc3c572ea	avfilter/vf_drawtext: fix incorrect text length From the doc of HarfBuzz, what hb_buffer_add_utf8 needs is the number of bytes, not Unicode character: hb_buffer_add_utf8(buf, text, strlen(text), 0, strlen(text)); Fix issue #20906.	2025-11-19 17:46:06 +00:00
Zhao Zhili	551e964e8a	avcodec/videotoolboxenc: remove redundant "Error: " in error message	2025-11-20 00:56:12 +08:00
Zhao Zhili	d4031984db	avcodec/videotoolboxenc: reorder and cleanup headers	2025-11-20 00:56:12 +08:00
Zhao Zhili	7049df14c8	avcodec/videotoolboxenc: fix crash with negative linesize	2025-11-20 00:56:05 +08:00
Zhao Zhili	0da15c93c8	avcodec/videotoolboxenc: improve Lock/Unlock BaseAddress error handling 1. Fix continue after CVPixelBufferLockBaseAddress. 2. Remove redundant "Error: " in error message.	2025-11-20 00:55:32 +08:00
GyanD	f283750ba8	doc/encoders: minor mediafoundation encoders updates	2025-11-19 04:48:11 +00:00
Harshitha	4e556b0c0c	doc/encoders: Document MediaFoundation encoders	2025-11-19 04:48:11 +00:00
Hendi	b399896046	avformat/dashdec: Fix urls with special characters in manifest This was especially a problem with ampersands, which occur frequently as part of query parameters.	2025-11-18 22:10:34 +00:00
Stefan Breunig	4c4ab2ec6f	fate/filter-video: add frei0r test where input is realigned An installation of frei0r-plugins is required to run the tests, which is usually seperate from the build headers. Some systems have it packaged (e.g. apt install frei0r-plugins). An upstream release extracted to FREI0R_PATH also works. The distort0r filter requires dimensions to be divisible by 8.	2025-11-18 21:26:36 +00:00
Stefan Breunig	f8bfc20281	avfilter/vf_frei0r: fix time when input is realigned av_frame_copy doesn't copy the input's PTS property, which resulted in the frei0r filter always receiving the same static time. Example that has a static distortion without patch: ffmpeg -filter_complex "testsrc2=s=328x240:d=5,frei0r=distort0r" out.mp4	2025-11-18 21:26:36 +00:00
Andreas Rheinhardt	5bf57a925c	avutil/x86/asm: Remove wrong comment, rename FF_REG_sp Before FFmpeg commit `531b0a316b`, FFmpeg used REG_SP as macro for the stack pointer, yet this clashed with a REG_SP define in Solaris system headers, so it was changed to REG_sp and a comment was added for this. Libav fixed it by adding an FF_ prefix to the macros in `1e9c5bf4c1`. FFmpeg switched to using these prefixes in `9eb3da2f99`, using FF_REG_sp instead of Libav's FF_REG_SP. In said commit the comment was changed to claim that Solaris system headers define FF_REG_SP, but this is (most likely) wrong. This commit removes the wrong comment and renames the (actually unused) macro to FF_REG_SP to make it consistent with FF_REG_BP. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:13 +01:00
Andreas Rheinhardt	99209c2876	avcodec/x86/mpegvideoenc_template: Reduce number of registers used qmat and bias always have a constant offset, so one can use one register to address both of them. This allows to remove the check for HAVE_6REGS (untested on a system where HAVE_6REGS is false). Also avoid FF_REG_a while at it. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:13 +01:00
Andreas Rheinhardt	b890cd0f73	avcodec/x86/mpegvideoenc_template: Avoid touching nonvolatile register xmm7 is nonvolatile on x64 Windows. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:13 +01:00
Andreas Rheinhardt	aeb138679a	avcodec/x86/mpegvideoencdsp: Port add_8x8basis_ssse3() to ASM Both GCC and Clang completely unroll the unlikely loop at -O3, leading to codesize bloat; their code is also suboptimal, as they don't make use of pmulhrsw (even with -mssse3). This commit therefore ports the whole function to external assembly. The new function occupies 176B here vs 1406B for GCC. Benchmarks for a testcase with huge qscale (notice that the C version is unrolled just like the unlikely loop in the SSSE3 version): add_8x8basis_c: 43.4 ( 1.00x) add_8x8basis_ssse3 (old): 43.6 ( 1.00x) add_8x8basis_ssse3 (new): 11.9 ( 3.63x) Reviewed-by: Kieran Kunhya <kieran@kunhya.com> Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Andreas Rheinhardt	0d3a88e55f	tests/checkasm/mpegvideoencdsp: Test denoise_dct Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Andreas Rheinhardt	1c00e09427	avcodec/mpegvideo_enc: Port denoise_dct to MpegvideoEncDSPContext It is very simple to remove the MPVEncContext from it. Notice that this also fixes a bug in x86/mpegvideoenc.c: It only used the SSE2 version of denoise_dct when dct_algo was auto or mmx (and it was therefore unused during FATE). Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Andreas Rheinhardt	d633fa0433	avcodec/x86/mpegvideoenc: Port denoise_dct_sse2 to external assembly Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Andreas Rheinhardt	2cfef7031c	avcodec/x86/mpegvideoenc: Reduce number of registers used Avoids a push+pop on x64 Windows. Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Andreas Rheinhardt	503afa40f7	avcodec/x86/mpegvideoenc: Remove check for MMX Reviewed-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-18 20:41:12 +01:00
Marvin Scholz	00ef656a85	.forgejo/CODEOWNERS: add myself to VideoToolbox and Icecast	2025-11-18 15:17:05 +01:00
Carl Hetherington via ffmpeg-devel	1eb2cbd865	avfilter/f_ebur128: Fix incorrect ebur128 peak calculation. Since `3b26b782ee` it would only look at the first channel. Signed-off-by: Carl Hetherington <cth@carlh.net> Reviewed-by: Niklas Haas <ffmpeg@haasn.xyz>	2025-11-18 08:40:08 +01:00
Gyan Doshi	f60db2e566	doc/fate: document setting of session-wide env variables	2025-11-18 04:19:06 +00:00
Kacper Michajłow	9b2162275b	configure: filter out -guard:signret from armasm flags While cl.exe supports -guard:signret, armasm64 complains about unknown flag. Note that -guard:ehcont is accepted by armasm64. Fixes: error A2029: unknown command-line argument or argument value -guard:signret Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2025-11-17 20:41:34 +00:00
Kacper Michajłow	523d688c2b	fate: add more configure flags to fate config Signed-off-by: Kacper Michajłow <kasper93@gmail.com>	2025-11-17 20:25:24 +00:00
Andreas Rheinhardt	ddf443f1e9	avfilter/vf_fsppdsp: Fix left shifts of negative numbers They are undefined behavior and UBSan warns about them (in the checkasm test). Put the shifts in the constants instead. This even gives a tiny speedup here: Old benchmarks: column_fidct_c: 3369.9 ( 1.00x) column_fidct_sse2: 829.1 ( 4.06x) New benchmarks: column_fidct_c: 3304.2 ( 1.00x) column_fidct_sse2: 827.9 ( 3.99x) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	f8bcea4946	avfilter/vf_fsppdsp: Remove pointless cast Also don't cast const away and use a smaller scope. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	0c556a6b09	avfilter/vf_fspp: Pre-reorder threshold table Avoids reordering at runtime. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	778ff97efa	avfilter/vf_fspp: Make output endian-independent Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	f442145729	avfilter/vf_fspp: Avoid casts, effective-type violations Maybe uint64_t has been used as a poor man's alignment specifier? Anyway, reading an uint64_t via an lvalue of type int16_t (as happens in the C versions of the dsp functions) is undefined behavior. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	c0648b2004	avfilter/x86/vf_spp: Fix comment Forgotten in `dcb28ed860`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	06b0dae51b	avfilter/vf_fsppdsp: Constify Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	cc97f1e276	avfilter/vf_fspp: Fix effective type violation Also don't use unnecessarily large alignment; it avoids having to align the stack. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	3cd452cbf1	avfilter/x86/vf_fspp: Avoid stack on x64 Possible due to the amount of registers. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:12 +01:00
Andreas Rheinhardt	ddd74276f8	avfilter/x86/vf_fspp: Port ff_column_fidct_mmx() to SSE2 It gains a lot because it has to operate on eight words; it also saves 608B of .text here. Old benchmarks: column_fidct_c: 3365.7 ( 1.00x) column_fidct_mmx: 1784.6 ( 1.89x) New benchmarks: column_fidct_c: 3361.5 ( 1.00x) column_fidct_sse2: 801.1 ( 4.20x) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:11 +01:00
Andreas Rheinhardt	68b11cde82	tests/checkasm/vf_fspp: Add test for column_fidct Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:11 +01:00
Andreas Rheinhardt	63493bf0e0	avfilter/x86/vf_fspp: Put shifts into constants This avoids some shift instructions and also gives us more headroom in the registers. In fact, I have proven to myself that everything that is supposed to fit into 16bits now actually does so. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:11 +01:00
Andreas Rheinhardt	66af18d06a	avfilter/x86/vf_fspp: Make ff_column_fidct_mmx() bitexact It currently is not, because the shortcut mode uses different rounding than the C code (as well as the non-shortcut code). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 12:18:11 +01:00
Andreas Rheinhardt	1049a5fba8	avfilter/vf_fsppdsp: Reduce discrepancies between C code and x86 asm The x86 assembly uses the following pattern to zero all the values with abs<threshold: x -= threshold; x satu+= threshold (unsigned saturated addition) x += threshold x satu-= threshold (unsigned saturated subtraction) The reference C code meanwhile zeroed everything with abs <= threshold. This commit makes the C code behave like the x86 assembly to reduce discrepancies between the two. An alternative would be to require SSSE3, so that one can use pabsw, pcmpgtw for abs>threshold, followed by a pand with the original data. Or one could modify the thresholds to make both equal. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	d19050a1ae	avfilter/vf_fsppdsp: Use restrict It is possible because the requirements are fulfilled; it is also beneficial performance and code-size wise. For GCC 14 (with -O3), this reduced codesize by 26750B here; for Clang 20, it was 432B. Old benchmarks: mul_thrmat_c: 4.3 ( 1.00x) mul_thrmat_sse2: 4.3 ( 1.00x) store_slice_c: 2810.8 ( 1.00x) store_slice_sse2: 542.5 ( 5.18x) store_slice2_c: 3817.0 ( 1.00x) store_slice2_sse2: 410.4 ( 9.30x) New benchmarks: mul_thrmat_c: 4.3 ( 1.00x) mul_thrmat_sse2: 4.3 ( 1.00x) store_slice_c: 1510.1 ( 1.00x) store_slice_sse2: 545.2 ( 2.77x) store_slice2_c: 1763.5 ( 1.00x) store_slice2_sse2: 408.3 ( 4.32x) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	ff85a20b7d	avfilter/x86/vf_fspp: Port store_slice to SSE2 Old benchmarks: store_slice_c: 2798.3 ( 1.00x) store_slice_mmx: 950.2 ( 2.94x) store_slice2_c: 3811.7 ( 1.00x) store_slice2_mmx: 682.3 ( 5.59x) New benchmarks: store_slice_c: 2797.2 ( 1.00x) store_slice_sse2: 543.5 ( 5.15x) store_slice2_c: 3817.0 ( 1.00x) store_slice2_sse2: 408.2 ( 9.35x) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	570f8fc6c9	tests/checkasm/vf_fspp: Test store_slice Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	e042f17e99	avfilter/vf_fsppdsp: Use standard clamping This is obviously what is intended and what the MMX code does; yet I cannot rule out that it changes the output for some inputs: I have observed individual src values which would lead to temp values just above 512 if they came in pairs (i.e. if both inputs were simultaneously huge). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	52ba2ac7bd	avfilter/x86/vf_fspp: Port mul_thrmat to SSE2 This fixes an ABI violation, as mul_thrmat did not issue emms. It seems that this ABI violation could reach the user, namely if ff_get_video_buffer() fails. Notice that ff_get_video_buffer() itself could fail because of this, namely if the allocator uses floating point registers. On x64 (where GCC already used SSE2 in the C version) mul_thrmat_c: 4.4 ( 1.00x) mul_thrmat_mmx: 8.6 ( 0.52x) mul_thrmat_sse2: 4.4 ( 1.00x) On 32bit (where SSE2 is not known to be available): mul_thrmat_c: 56.0 ( 1.00x) mul_thrmat_sse2: 6.0 ( 9.40x) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	70eb8a76a9	tests/checkasm: Add vf_fspp mul_thrmat test Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	9f4d5d818d	avfilter/x86/vf_fspp: Don't duplicate dither table Reuse the one from vf_fsppdsp.c; also don't overalign said table too much. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	1699de0955	avfilter/vf_fsppdsp: Use enum for constants It means that the compiler does not have to optimize the static const object away. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	9b34088c4d	avfilter/vf_fspp: Add DSPCtx, move DSP functions to file of their own This is in preparation for adding checkasm tests; without it, checkasm would pull all of libavfilter in. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 11:28:04 +01:00
Andreas Rheinhardt	57d6898730	configure: Only test for SSE2 intrinsics on x86 Reviewed-by: Kieran Kunhya <kieran@kunhya.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2025-11-17 10:36:54 +01:00
Artem Smorodin	e94439e49b	avformat/tee: fix the default onfail setting of the tee salves I found that the default value is not set for onfail option. I see that there is an attempt to set this value by default inside parse_slave_failure_policy_option. But look at the CONSUME_OPTION macro. If av_dict_get cannot find this option, then this function is not even called.	2025-11-17 00:01:42 +00:00

1 2 3 4 5 ...

121815 Commits