FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-07 11:13:41 +02:00

Author	SHA1	Message	Date
Paul B Mahol	3609d2b783	avcodec: add QOA decoder	2023-11-26 17:49:09 +01:00
Geoffrey McRae	93b5d9030b	libavcodec/mlpdec: add missing correction to ch_layout when downmixing This fixes corrupted audio for applications relying on ch_layout when codec downmixing is active. Signed-off-by: Geoffrey McRae <geoff@hostfission.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-26 10:18:33 -03:00
Geoffrey McRae	a8677bcc8f	libavcodec/dcadec: adjust the `ch_layout` when downmix is active Applications making use of this codec with the `downmix` option are segfaulting unless the `ch_layout` is overridden after `avcodec_open2` as can be seen in projects like MythTV[1] This patch fixes this by overriding the ch_layout as done in other decoders such as AC3. 1: `af6f362a14/mythtv/libs/libmythtv/decoders/avformatdecoder.cpp (L4607)` Signed-off-by: Geoffrey McRae <geoff@hostfission.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-26 10:18:33 -03:00
James Almer	72390dea00	mips/ac3dsp_mips: add missing stddef.h header include Fixes compilation failures after `567c67c6c8`. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-25 21:51:04 -03:00
James Almer	e40ea9f34b	x86/ac3dsp: add ff_float_to_fixed24_avx() Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-25 21:50:56 -03:00
James Almer	d8b1a34433	x86/ac3dsp: reduce instruction count inside the float_to_fixed24 loop Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-25 21:50:56 -03:00
Rémi Denis-Courmont	0fa421c8f1	lavc/llvidencdsp: add R-V V diff_bytes diff_bytes_c: 163.0 diff_bytes_rvv_i32: 52.7	2023-11-23 18:57:18 +02:00
Rémi Denis-Courmont	0183c2c830	lavc/aacpsdsp: use LMUL=2 and amortise strides The input is laid out in 16 segments, of which 13 actually need to be loaded. There are no really efficient ways to deal with this: 1) If we load 8 segments wit unit stride, then narrow to 16 segments with right shifts, we can only get one half-size vector per segment, or just 2 elements per vector (EMUL=1/2) - at least with 128-bit vectors. This ends up unsurprisingly about as fas as the C code. 2) The current approach is to load with strides. We keep that approach, but improve it using three 4-segmented loads instead of 12 single-segment loads. This divides the number of distinct loaded addresses by 4. 3) A potential third approach would be to avoid segmentation altogether and splat the scalar coefficient into vectors. Then we can use a unit-stride and maximum EMUL. But the downside then is that we have to multiply the 3 (of 16) unused segments with zero as part of the multiply-accumulate operations. In addition, we also reuse vectors mid-loop so as to increase the EMUL from 1 to 2, which also improves performance a little bit. Oeverall the gains are quite small with the device under test, as it does not deal with segmented loads very well. But at least the code is tidier, and should enjoy bigger speed-ups on better hardware implementation. Before: ps_hybrid_analysis_c: 1819.2 ps_hybrid_analysis_rvv_f32: 1037.0 (before) ps_hybrid_analysis_rvv_f32: 990.0 (after)	2023-11-23 18:57:18 +02:00
Rémi Denis-Courmont	b88d4058f9	lavc/g722dsp: optimise R-V V apply_qmf This stores the constant coefficients deinterleaved, so that they can be loaded directly with NF=0. Unfortunately, we cannot optimise loading the input, due to insufficient memory alignment (not 32-bit). Before: g722_apply_qmf_c: 82.5 g722_apply_qmf_rvv_i32: 78.2 After: g722_apply_qmf_c: 82.5 g722_apply_qmf_rvv_i32: 65.2	2023-11-23 18:57:18 +02:00
James Almer	567c67c6c8	avcodec/ac3dsp: make len a size_t in float_to_fixed24 Should simplify asm implementations, and prevent UB on at least win64. Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-22 18:33:00 -03:00
James Almer	2d9fd814d0	x86/: clear the high bits for order in scalarproduct_and_madd functions Should fix checkasm failures on win64. Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-22 14:18:42 -03:00
Zhao Zhili	e8a49b1424	avcodec/mmaldec: Fix build error Fix #10670. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 21:02:04 +08:00
Zhao Zhili	f27fce0c0c	avcodec/mediacodecdec: fix return EAGAIN after EOF Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 21:02:04 +08:00
Dmitry Rogozhkin	e9c93009fc	avcodec/decode: validate hw_frames_ctx when AVHWAccel.free_frame_priv is used Validate that a hw_frames_ctx is available before using it for the AVHWAccel.free_frame_priv callback, and don't require it to be present when the callback is not in use by the HWAccel. v2: check for free_frame_priv (Hendrik) v3: return EINVAL (Christoph Reiter) v4: better commit message (Hendrik) v5: fix typo with missed frames_ctx (Lynne) See[1]: https://github.com/msys2/MINGW-packages/pull/19050 Fixes: `be07145109` ("avcodec: add AVHWAccel.free_frame_priv callback") CC: Lynne <dev@lynne.ee> CC: Christoph Reiter <reiter.christoph@gmail.com> Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>	2023-11-22 05:01:16 +01:00
Zhao Zhili	aa3b857101	avcodec/h264_mp4toannexb_bsf: process new extradata For fate-h264_mp4toannexb_ticket5927 and fate-h264_mp4toannexb_ticket5927_2, they work by accident previously. The sample file has two 'avc1' entries, and video samples use the second one. It means packets should be decoded with new extradata in side data. Before this patch, only extradata was kept in the output, new extradata has been dropped. The output can be decoded because the two extradata are almost the same, except level indication. This patch fixed the issue, and add another fate test. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	d3aa0cd16f	avcodec/h264_mp4toannexb_bsf: fix missing PS before IDR frames If there is a single group of SPS/PPS before an IDR frame, but no SPS/PPS after that, we will miss the chance to reset idr_sps_seen/idr_pps_seen. No SPS/PPS are inserted afterwards. This patch saves in-band SPS/PPS and insert them before IDR frames when necessary. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	4c4b833abd	avcodec/h264_mp4toannexb_bsf: remove pass padding size as argument It's a fixed value. There is no use case to change that. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	91cbae2f6c	avcodec/h264_mp4toannexb_bsf: refactor start_code_size handling start_code_size depends on whether PS comes from out-of-band or in-band. Make the code more readable. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Michael Niedermayer	fb52070848	avcodec/h264dec: use BOOL for skip_gray, noref_gray Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-22 01:22:31 +01:00
Jun Zhao	c961ac4b0c	vulkan_decode: fix the print format of VkDeviceSize VkDeviceSize represents device memory size and offset values as uint64_t in Spec. Signed-off-by: Jun Zhao <barryjzhao@tencent.com>	2023-11-21 08:02:43 +08:00
James Almer	1258f99978	avcodec: bump version after EVC additions Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	cfe2947887	avcodec/evc_decoder: Provided support for EVC decoder - Added EVC decoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xevd wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	c59a96fd08	avcodec/evc_encoder: Provided support for EVC encoder - Added EVC encoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xeve wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Michael Niedermayer	e56d91f8a8	avcodec/h264dec: Support skipping frames that used gray gap frames Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	6364fa9e9a	avcodec/h264: Avoid using gray gap frames as references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	29f6c9b04d	avcodec/h264: keep track of which frames used gray references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:04 +01:00
Michael Niedermayer	e4337606e1	avcodec/h264dec: More elaborate documentation for frame_recovered Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	68e1cf204a	avcodec/h264: Use FRAME_RECOVERED_HEURISTIC instead of IDR/SEI This keeps IDR/SEI and heuristically detected recovery points cleaner seperated Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	3f4a1a24a5	avcodec/h264: Seperate SEI and IDR recovery handling This avoids SEI and IDR recovery flags affecting each other Also eliminate litteral numbers from recovery handling This should make the code clearer Improves: tickets/4738/tickets_cut.ts Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:29 +01:00
Rémi Denis-Courmont	fbc7adba67	lavc/llviddsp: R-V V add_bytes add_bytes_c: 2077.2 add_bytes_rvv_i32: 105.0	2023-11-18 22:07:14 +02:00
Rémi Denis-Courmont	ca664f2254	lavc/flacdsp: R-V V LPC16 function In this case, the inner loop computing the scalar product can be reduced to just one multiplication and one sum even with 128-bit vectors. The result is a lot simpler, but also brings more modest performance gains: flac_lpc_16_13_c: 15241.0 flac_lpc_16_13_rvv_i32: 11230.0 flac_lpc_16_16_c: 17884.0 flac_lpc_16_16_rvv_i32: 12125.7 flac_lpc_16_29_c: 27847.7 flac_lpc_16_29_rvv_i32: 10494.0 flac_lpc_16_32_c: 30051.5 flac_lpc_16_32_rvv_i32: 10355.0	2023-11-18 22:06:57 +02:00
Rémi Denis-Courmont	295092b46d	lavc/flacdsp: R-V V LPC32 The entire set of 32 coefficients and corresponding past 32 samples can fit in a single vector (with LMUL=8) exactly, but... since widening double the needed vector sizes, we still end up too short with 128-bit vectors. This adds a very simple version for future 256+-bit hardware, and for pred_orders values up to 16, and a bit more involved loop for for 128-bit hardware with pred_orders between 17 and 32. With 128-bit hardware, the benchmarks look like this: flac_lpc_32_13_c: 30152.0 flac_lpc_32_13_rvv_i32: 10244.7 flac_lpc_32_16_c: 37314.2 flac_lpc_32_16_rvv_i32: 10126.2 flac_lpc_32_29_c: 61910.0 flac_lpc_32_29_rvv_i32: 14495.2 flac_lpc_32_32_c: 68204.0 flac_lpc_32_32_rvv_i32: 13273.7	2023-11-18 22:05:43 +02:00
Diederik de Haas via ffmpeg-devel	c07ed10b0e	apply spelling fixes Fix spelling issue as reported by Debian's lintian tool: accomodate -> accommodate addtional -> additional auxillary -> auxiliary bellow -> below betweeen -> between Calulate -> Calculate coefficents -> coefficients Defalt -> Default defaul -> default higer -> higher neccesary -> necessary orignal -> original ouput -> output precison -> precision processsing -> processing substract -> subtract Transfered -> Transferred upto -> up to Also add several of them to the 'common typos' check in patcheck. Signed-off-by: Diederik de Haas <didi.debian@cknow.org>	2023-11-18 19:55:42 +01:00
Rémi Denis-Courmont	07c303b708	lavc/flacdsp: R-V V decorrelate_indep 16-bit packed flac_decorrelate_indep2_16_c: 981.7 flac_decorrelate_indep2_16_rvv_i32: 199.2 flac_decorrelate_indep4_16_c: 1749.7 flac_decorrelate_indep4_16_rvv_i32: 401.2 flac_decorrelate_indep6_16_c: 2517.7 flac_decorrelate_indep6_16_rvv_i32: 858.0 flac_decorrelate_indep8_16_c: 3285.7 flac_decorrelate_indep8_16_rvv_i32: 1123.5	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	fb0295e5fd	lavc/flacdsp: R-V V decorrelate_indep 32-bit packed flac_decorrelate_indep2_32_c: 981.7 flac_decorrelate_indep2_32_rvv_i32: 183.7 flac_decorrelate_indep4_32_c: 1749.7 flac_decorrelate_indep4_32_rvv_i32: 362.5 flac_decorrelate_indep6_32_c: 2517.7 flac_decorrelate_indep6_32_rvv_i32: 715.2 flac_decorrelate_indep8_32_c: 3285.7 flac_decorrelate_indep8_32_rvv_i32: 909.0	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	6183a69c0b	lavc/flacdsp: R-V V decorrelate_ms packed flac_decorrelate_ms_16_c: 585.5 flac_decorrelate_ms_16_rvv_i32: 263.0 flac_decorrelate_ms_32_c: 584.7 flac_decorrelate_ms_32_rvv_i32: 250.0	2023-11-17 23:59:23 +02:00
Rémi Denis-Courmont	636ae0e0bc	lavc/flacdsp: R-V V packed decorrelate_{l,r}s flac_decorrelate_ms_16_c: 457.2 flac_decorrelate_ms_16_rvv_i32: 203.0 flac_decorrelate_ms_32_c: 457.2 flac_decorrelate_ms_32_rvv_i32: 203.5 flac_decorrelate_rs_16_c: 456.2 flac_decorrelate_rs_16_rvv_i32: 207.0 flac_decorrelate_rs_32_c: 456.2 flac_decorrelate_rs_32_rvv_i32: 210.5	2023-11-17 23:59:22 +02:00
Rémi Denis-Courmont	d076517056	lavc/llauddsp: R-V V scalarproduct_and_madd_int32 scalarproduct_and_madd_int32_c: 10899.7 scalarproduct_and_madd_int32_rvv_i32: 1749.0	2023-11-16 16:53:44 +02:00
Rémi Denis-Courmont	45d0eb3f70	lavc/llauddsp: R-V V scalarproduct_and_madd_int16 scalarproduct_and_madd_int16_c: 10355.7 scalarproduct_and_madd_int16_rvv_i32: 1480.0	2023-11-16 16:53:44 +02:00
James Almer	78f55457c9	x86/flacds: clear the high bits from pred_order in lpc_32 functions Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-15 16:10:15 -03:00
Dai, Jianhui J	c9fe9fb863	avcodec/cbs_vp8: Add support for VP8 codec bitstream This commit adds support for VP8 bitstream read methods to the cbs codec. This enables the trace_headers bitstream filter to support VP8, in addition to AV1, H.264, H.265, and VP9. This can be useful for debugging VP8 stream issues. The CBS VP8 implements a simple VP8 boolean decoder using GetBitContext to read the bitstream. Only the read methods `read_unit` and `split_fragment` are implemented. The write methods `write_unit` and `assemble_fragment` return the error code AVERROR_PATCHWELCOME. This is because CBS VP8 write is unlikely to be used by any applications at the moment. The write methods can be added later if there is a real need for them. TESTS: ffmpeg -i fate-suite/vp8/frame_size_change.webm -vcodec copy -bsf:v trace_headers -f null - Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Dai, Jianhui J	5cb8accd09	avcodec/vp8: Export `vp8_token_update_probs` variable This commit exports the `vp8_token_update_probs` variable to internal library scope to facilitate its reuse within the library. Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Rémi Denis-Courmont	90a779bed6	lavc/huffyuvdsp: basic R-V V add_hfyu_left_pred_bgr32 Better performance can probably be achieved with a more intricate unrolled loop, but this is a start: add_hfyu_left_pred_bgr32_c: 15084.0 add_hfyu_left_pred_bgr32_rvv_i32: 10280.2 This would actually be cleaner with the RISC-V P extension, but that is not ratified yet (I think?) and usually not supported if V is supported.	2023-11-15 16:51:07 +02:00
James Almer	b360c91752	avcodec/codecpar: mention how to allocate coded_side_data Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-14 14:26:42 -03:00
Anton Khirnov	6dbde68cb5	lavc/8bps: fix exporting palette after `63767b79a5` It would be left empty on each frame whose packet does not come with palette attached.	2023-11-14 18:18:26 +01:00
Rémi Denis-Courmont	ce467421dc	lavc/exrdsp: unroll predictor With explicit unrolling, we can skip half of the sign bit flips, and the compiler is then better able to optimise the scalar loop: predictor_c: 31376.0 (before) predictor_c: 23703.0 (after)	2023-11-14 19:15:51 +02:00
Rémi Denis-Courmont	c536e92207	lavc/sbrdsp: R-V V hf_apply_noise functions This is restricted to 128-bit vectors as larger vector sizes could read past the end of the noise array. Support for future hardware with larger vector sizes is left for some other time. hf_apply_noise_0_c: 2319.7 hf_apply_noise_0_rvv_f32: 1229.0 hf_apply_noise_1_c: 2539.0 hf_apply_noise_1_rvv_f32: 1244.7 hf_apply_noise_2_c: 2319.7 hf_apply_noise_2_rvv_f32: 1232.7 hf_apply_noise_3_c: 2541.2 hf_apply_noise_3_rvv_f32: 1244.2	2023-11-13 18:34:29 +02:00
Rémi Denis-Courmont	5b33104fca	lavc/sbrdsp: R-V V hf_gen hf_gen_c: 2922.7 hf_gen_rvv_f32: 731.5	2023-11-13 18:33:02 +02:00
Gyan Doshi	67a2571a55	avcodec/libsvtav1: add version guard for external param Setting of external param 'force_key_frames' was added in `7bcc1b4eb8`. It is available since v1.1.0 but ffmpeg allows linking against v0.9.0.	2023-11-13 13:14:43 +05:30
Evgeny Pavlov	da3ce21f68	libavcodec/amfenc: Fix issue with missing headers in AV1 encoder This commit fixes issue with missing SPS/PPS headers in video encoded by AMF AV1 encoder. Missing headers leads to broken seek in MPV video player. Default value for property AV1_HEADER_INSERTION_MODE shouldn't be setup to NONE (no headers insertion). We need to skip definition of this property, because default value depends on USAGE property. Signed-off-by: Dmitrii Ovchinnikov <ovchinnikov.dmitrii@gmail.com>	2023-11-12 22:57:17 +01:00

1 2 3 4 5 ...

49082 Commits