FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-02 03:06:28 +02:00

Author	SHA1	Message	Date
Rémi Denis-Courmont	0183c2c830	lavc/aacpsdsp: use LMUL=2 and amortise strides The input is laid out in 16 segments, of which 13 actually need to be loaded. There are no really efficient ways to deal with this: 1) If we load 8 segments with unit stride, then narrow to 16 segments with right shifts, we can only get one half-size vector per segment, or just 2 elements per vector (EMUL=1/2) - at least with 128-bit vectors. This ends up unsurprisingly about as fast as the C code. 2) The current approach is to load with strides. We keep that approach, but improve it using three 4-segmented loads instead of 12 single-segment loads. This divides the number of distinct loaded addresses by 4. 3) A potential third approach would be to avoid segmentation altogether and splat the scalar coefficient into vectors. Then we can use a unit-stride and maximum EMUL. But the downside then is that we have to multiply the 3 (of 16) unused segments with zero as part of the multiply-accumulate operations. In addition, we also reuse vectors mid-loop so as to increase the EMUL from 1 to 2, which also improves performance a little bit. Overall the gains are quite small with the device under test, as it does not deal with segmented loads very well. But at least the code is tidier, and should enjoy bigger speed-ups on better hardware implementations. Before: ps_hybrid_analysis_c: 1819.2 ps_hybrid_analysis_rvv_f32: 1037.0 (before) ps_hybrid_analysis_rvv_f32: 990.0 (after)	2023-11-23 18:57:18 +02:00
Rémi Denis-Courmont	b88d4058f9	lavc/g722dsp: optimise R-V V apply_qmf This stores the constant coefficients deinterleaved, so that they can be loaded directly with NF=0. Unfortunately, we cannot optimise loading the input, due to insufficient memory alignment (not 32-bit). Before: g722_apply_qmf_c: 82.5 g722_apply_qmf_rvv_i32: 78.2 After: g722_apply_qmf_c: 82.5 g722_apply_qmf_rvv_i32: 65.2	2023-11-23 18:57:18 +02:00
James Almer	567c67c6c8	avcodec/ac3dsp: make len a size_t in float_to_fixed24 Should simplify asm implementations, and prevent UB on at least win64. Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-22 18:33:00 -03:00
James Almer	2d9fd814d0	x86/: clear the high bits for order in scalarproduct_and_madd functions Should fix checkasm failures on win64. Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-22 14:18:42 -03:00
Zhao Zhili	e8a49b1424	avcodec/mmaldec: Fix build error Fix #10670. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 21:02:04 +08:00
Zhao Zhili	f27fce0c0c	avcodec/mediacodecdec: fix return EAGAIN after EOF Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 21:02:04 +08:00
Dmitry Rogozhkin	e9c93009fc	avcodec/decode: validate hw_frames_ctx when AVHWAccel.free_frame_priv is used Validate that a hw_frames_ctx is available before using it for the AVHWAccel.free_frame_priv callback, and don't require it to be present when the callback is not in use by the HWAccel. v2: check for free_frame_priv (Hendrik) v3: return EINVAL (Christoph Reiter) v4: better commit message (Hendrik) v5: fix typo with missed frames_ctx (Lynne) See[1]: https://github.com/msys2/MINGW-packages/pull/19050 Fixes: `be07145109` ("avcodec: add AVHWAccel.free_frame_priv callback") CC: Lynne <dev@lynne.ee> CC: Christoph Reiter <reiter.christoph@gmail.com> Signed-off-by: Dmitry Rogozhkin <dmitry.v.rogozhkin@intel.com>	2023-11-22 05:01:16 +01:00
Zhao Zhili	aa3b857101	avcodec/h264_mp4toannexb_bsf: process new extradata The tests fate-h264_mp4toannexb_ticket5927 and fate-h264_mp4toannexb_ticket5927_2 previously worked by accident. The sample file has two 'avc1' entries, and video samples use the second one. This means packets should be decoded with the new extradata in side data. Before this patch, only the extradata was kept in the output; the new extradata was dropped. The output can be decoded because the two extradata are almost the same, except for the level indication. This patch fixes the issue and adds another FATE test. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	d3aa0cd16f	avcodec/h264_mp4toannexb_bsf: fix missing PS before IDR frames If there is a single group of SPS/PPS before an IDR frame, but no SPS/PPS after that, we will miss the chance to reset idr_sps_seen/idr_pps_seen. No SPS/PPS are inserted afterwards. This patch saves in-band SPS/PPS and inserts them before IDR frames when necessary. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	4c4b833abd	avcodec/h264_mp4toannexb_bsf: remove pass padding size as argument It's a fixed value. There is no use case to change that. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Zhao Zhili	91cbae2f6c	avcodec/h264_mp4toannexb_bsf: refactor start_code_size handling start_code_size depends on whether PS comes from out-of-band or in-band. Make the code more readable. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-11-22 19:42:14 +08:00
Michael Niedermayer	fb52070848	avcodec/h264dec: use BOOL for skip_gray, noref_gray Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-22 01:22:31 +01:00
Jun Zhao	c961ac4b0c	vulkan_decode: fix the print format of VkDeviceSize VkDeviceSize represents device memory size and offset values as uint64_t in Spec. Signed-off-by: Jun Zhao <barryjzhao@tencent.com>	2023-11-21 08:02:43 +08:00
James Almer	1258f99978	avcodec: bump version after EVC additions Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	cfe2947887	avcodec/evc_decoder: Provided support for EVC decoder - Added EVC decoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xevd wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	c59a96fd08	avcodec/evc_encoder: Provided support for EVC encoder - Added EVC encoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xeve wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Michael Niedermayer	e56d91f8a8	avcodec/h264dec: Support skipping frames that used gray gap frames Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	6364fa9e9a	avcodec/h264: Avoid using gray gap frames as references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	29f6c9b04d	avcodec/h264: keep track of which frames used gray references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:04 +01:00
Michael Niedermayer	e4337606e1	avcodec/h264dec: More elaborate documentation for frame_recovered Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	68e1cf204a	avcodec/h264: Use FRAME_RECOVERED_HEURISTIC instead of IDR/SEI This keeps IDR/SEI and heuristically detected recovery points more cleanly separated. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	3f4a1a24a5	avcodec/h264: Separate SEI and IDR recovery handling This avoids SEI and IDR recovery flags affecting each other. Also eliminate literal numbers from recovery handling. This should make the code clearer. Improves: tickets/4738/tickets_cut.ts Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:29 +01:00
Rémi Denis-Courmont	fbc7adba67	lavc/llviddsp: R-V V add_bytes add_bytes_c: 2077.2 add_bytes_rvv_i32: 105.0	2023-11-18 22:07:14 +02:00
Rémi Denis-Courmont	ca664f2254	lavc/flacdsp: R-V V LPC16 function In this case, the inner loop computing the scalar product can be reduced to just one multiplication and one sum even with 128-bit vectors. The result is a lot simpler, but also brings more modest performance gains: flac_lpc_16_13_c: 15241.0 flac_lpc_16_13_rvv_i32: 11230.0 flac_lpc_16_16_c: 17884.0 flac_lpc_16_16_rvv_i32: 12125.7 flac_lpc_16_29_c: 27847.7 flac_lpc_16_29_rvv_i32: 10494.0 flac_lpc_16_32_c: 30051.5 flac_lpc_16_32_rvv_i32: 10355.0	2023-11-18 22:06:57 +02:00
Rémi Denis-Courmont	295092b46d	lavc/flacdsp: R-V V LPC32 The entire set of 32 coefficients and corresponding past 32 samples can fit in a single vector (with LMUL=8) exactly, but... since widening doubles the needed vector sizes, we still end up too short with 128-bit vectors. This adds a very simple version for future 256+-bit hardware, and for pred_orders values up to 16, and a bit more involved loop for 128-bit hardware with pred_orders between 17 and 32. With 128-bit hardware, the benchmarks look like this: flac_lpc_32_13_c: 30152.0 flac_lpc_32_13_rvv_i32: 10244.7 flac_lpc_32_16_c: 37314.2 flac_lpc_32_16_rvv_i32: 10126.2 flac_lpc_32_29_c: 61910.0 flac_lpc_32_29_rvv_i32: 14495.2 flac_lpc_32_32_c: 68204.0 flac_lpc_32_32_rvv_i32: 13273.7	2023-11-18 22:05:43 +02:00
Diederik de Haas via ffmpeg-devel	c07ed10b0e	apply spelling fixes Fix spelling issue as reported by Debian's lintian tool: accomodate -> accommodate addtional -> additional auxillary -> auxiliary bellow -> below betweeen -> between Calulate -> Calculate coefficents -> coefficients Defalt -> Default defaul -> default higer -> higher neccesary -> necessary orignal -> original ouput -> output precison -> precision processsing -> processing substract -> subtract Transfered -> Transferred upto -> up to Also add several of them to the 'common typos' check in patcheck. Signed-off-by: Diederik de Haas <didi.debian@cknow.org>	2023-11-18 19:55:42 +01:00
Rémi Denis-Courmont	07c303b708	lavc/flacdsp: R-V V decorrelate_indep 16-bit packed flac_decorrelate_indep2_16_c: 981.7 flac_decorrelate_indep2_16_rvv_i32: 199.2 flac_decorrelate_indep4_16_c: 1749.7 flac_decorrelate_indep4_16_rvv_i32: 401.2 flac_decorrelate_indep6_16_c: 2517.7 flac_decorrelate_indep6_16_rvv_i32: 858.0 flac_decorrelate_indep8_16_c: 3285.7 flac_decorrelate_indep8_16_rvv_i32: 1123.5	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	fb0295e5fd	lavc/flacdsp: R-V V decorrelate_indep 32-bit packed flac_decorrelate_indep2_32_c: 981.7 flac_decorrelate_indep2_32_rvv_i32: 183.7 flac_decorrelate_indep4_32_c: 1749.7 flac_decorrelate_indep4_32_rvv_i32: 362.5 flac_decorrelate_indep6_32_c: 2517.7 flac_decorrelate_indep6_32_rvv_i32: 715.2 flac_decorrelate_indep8_32_c: 3285.7 flac_decorrelate_indep8_32_rvv_i32: 909.0	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	6183a69c0b	lavc/flacdsp: R-V V decorrelate_ms packed flac_decorrelate_ms_16_c: 585.5 flac_decorrelate_ms_16_rvv_i32: 263.0 flac_decorrelate_ms_32_c: 584.7 flac_decorrelate_ms_32_rvv_i32: 250.0	2023-11-17 23:59:23 +02:00
Rémi Denis-Courmont	636ae0e0bc	lavc/flacdsp: R-V V packed decorrelate_{l,r}s flac_decorrelate_ms_16_c: 457.2 flac_decorrelate_ms_16_rvv_i32: 203.0 flac_decorrelate_ms_32_c: 457.2 flac_decorrelate_ms_32_rvv_i32: 203.5 flac_decorrelate_rs_16_c: 456.2 flac_decorrelate_rs_16_rvv_i32: 207.0 flac_decorrelate_rs_32_c: 456.2 flac_decorrelate_rs_32_rvv_i32: 210.5	2023-11-17 23:59:22 +02:00
Rémi Denis-Courmont	d076517056	lavc/llauddsp: R-V V scalarproduct_and_madd_int32 scalarproduct_and_madd_int32_c: 10899.7 scalarproduct_and_madd_int32_rvv_i32: 1749.0	2023-11-16 16:53:44 +02:00
Rémi Denis-Courmont	45d0eb3f70	lavc/llauddsp: R-V V scalarproduct_and_madd_int16 scalarproduct_and_madd_int16_c: 10355.7 scalarproduct_and_madd_int16_rvv_i32: 1480.0	2023-11-16 16:53:44 +02:00
James Almer	78f55457c9	x86/flacds: clear the high bits from pred_order in lpc_32 functions Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-15 16:10:15 -03:00
Dai, Jianhui J	c9fe9fb863	avcodec/cbs_vp8: Add support for VP8 codec bitstream This commit adds support for VP8 bitstream read methods to the cbs codec. This enables the trace_headers bitstream filter to support VP8, in addition to AV1, H.264, H.265, and VP9. This can be useful for debugging VP8 stream issues. The CBS VP8 implements a simple VP8 boolean decoder using GetBitContext to read the bitstream. Only the read methods `read_unit` and `split_fragment` are implemented. The write methods `write_unit` and `assemble_fragment` return the error code AVERROR_PATCHWELCOME. This is because CBS VP8 write is unlikely to be used by any applications at the moment. The write methods can be added later if there is a real need for them. TESTS: ffmpeg -i fate-suite/vp8/frame_size_change.webm -vcodec copy -bsf:v trace_headers -f null - Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Dai, Jianhui J	5cb8accd09	avcodec/vp8: Export `vp8_token_update_probs` variable This commit exports the `vp8_token_update_probs` variable to internal library scope to facilitate its reuse within the library. Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Rémi Denis-Courmont	90a779bed6	lavc/huffyuvdsp: basic R-V V add_hfyu_left_pred_bgr32 Better performance can probably be achieved with a more intricate unrolled loop, but this is a start: add_hfyu_left_pred_bgr32_c: 15084.0 add_hfyu_left_pred_bgr32_rvv_i32: 10280.2 This would actually be cleaner with the RISC-V P extension, but that is not ratified yet (I think?) and usually not supported if V is supported.	2023-11-15 16:51:07 +02:00
James Almer	b360c91752	avcodec/codecpar: mention how to allocate coded_side_data Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-14 14:26:42 -03:00
Anton Khirnov	6dbde68cb5	lavc/8bps: fix exporting palette after `63767b79a5` It would be left empty on each frame whose packet does not come with palette attached.	2023-11-14 18:18:26 +01:00
Rémi Denis-Courmont	ce467421dc	lavc/exrdsp: unroll predictor With explicit unrolling, we can skip half of the sign bit flips, and the compiler is then better able to optimise the scalar loop: predictor_c: 31376.0 (before) predictor_c: 23703.0 (after)	2023-11-14 19:15:51 +02:00
Rémi Denis-Courmont	c536e92207	lavc/sbrdsp: R-V V hf_apply_noise functions This is restricted to 128-bit vectors as larger vector sizes could read past the end of the noise array. Support for future hardware with larger vector sizes is left for some other time. hf_apply_noise_0_c: 2319.7 hf_apply_noise_0_rvv_f32: 1229.0 hf_apply_noise_1_c: 2539.0 hf_apply_noise_1_rvv_f32: 1244.7 hf_apply_noise_2_c: 2319.7 hf_apply_noise_2_rvv_f32: 1232.7 hf_apply_noise_3_c: 2541.2 hf_apply_noise_3_rvv_f32: 1244.2	2023-11-13 18:34:29 +02:00
Rémi Denis-Courmont	5b33104fca	lavc/sbrdsp: R-V V hf_gen hf_gen_c: 2922.7 hf_gen_rvv_f32: 731.5	2023-11-13 18:33:02 +02:00
Gyan Doshi	67a2571a55	avcodec/libsvtav1: add version guard for external param Setting of external param 'force_key_frames' was added in `7bcc1b4eb8`. It is available since v1.1.0 but ffmpeg allows linking against v0.9.0.	2023-11-13 13:14:43 +05:30
Evgeny Pavlov	da3ce21f68	libavcodec/amfenc: Fix issue with missing headers in AV1 encoder This commit fixes an issue with missing SPS/PPS headers in video encoded by the AMF AV1 encoder. Missing headers lead to broken seeking in the MPV video player. The default value for the property AV1_HEADER_INSERTION_MODE shouldn't be set to NONE (no headers insertion). We need to skip the definition of this property, because the default value depends on the USAGE property. Signed-off-by: Dmitrii Ovchinnikov <ovchinnikov.dmitrii@gmail.com>	2023-11-12 22:57:17 +01:00
Sebastian Ramacher	250471ea17	avcoded/fft: Fix memory leak if ctx2 is used Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-12 14:47:56 -03:00
Sebastian Ramacher	a562cfee2e	avcodec/fft: Use av_mallocz to avoid invalid free/uninit Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-12 14:47:56 -03:00
Rémi Denis-Courmont	cd7b352c53	lavc/sbrdsp: R-V V autocorrelate With 5 accumulator vectors and 6 inputs, this can only use LMUL=2. Also the number of vector loop iterations is small, just 5 on 128-bit vector hardware. The vector loop is somewhat unusual in that it processes data in descending memory order, in order to save on vector slides: in descending order, we can extract elements to carry over to the next iteration from the bottom of the vectors directly. With ascending order (see in the Opus postfilter function), there are no ways to get the top elements directly. On the downside, this requires the use of separate shift and sub (the would-be SH3SUB instruction does not exist), with a small pipeline stall on the vector load address. The edge cases in scalar are done in scalar as this saves on loads and remains significantly faster than C. autocorrelate_c: 669.2 autocorrelate_rvv_f32: 421.0	2023-11-12 14:03:09 +02:00
Rémi Denis-Courmont	f576a0835b	lavc/aacpsdsp: rework R-V V hybrid_synthesis_deint Given the size of the data set, strided memory accesses cannot be avoided. We can still do better than the current code. ps_hybrid_synthesis_deint_c: 12065.5 ps_hybrid_synthesis_deint_rvv_i32: 13650.2 (before) ps_hybrid_synthesis_deint_rvv_i64: 8181.0 (after)	2023-11-12 14:03:09 +02:00
Rémi Denis-Courmont	eb508702a8	lavc/aacpsdsp: rework R-V V add_squares Segmented loads may be slower than not. So this advantageously uses a unit-strided load and narrowing shifts instead. Before: ps_add_squares_c: 60757.7 ps_add_squares_rvv_f32: 22242.5 After: ps_add_squares_c: 60516.0 ps_add_squares_rvv_i64: 17067.7	2023-11-12 14:03:09 +02:00
Paul B Mahol	10440a489a	avcodec/gif_parser: split correctly also bitstreams that do not have extension blocks	2023-11-12 02:19:53 +01:00
Nuo Mi	09f783692e	avcodec/cbs_h266: H266RawSliceHeader, expose curr_subpic_idx Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-11 11:53:21 -03:00
Michael Niedermayer	ac4e3e188a	avcodec/evc_parse: Check num_remaining_tiles_in_slice_minus1 Fixes: out of array access Fixes: 62467/clusterfuzz-testcase-minimized-ffmpeg_BSF_EVC_FRAME_MERGE_fuzzer-6092990982258688 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: "Dawid Kozinski/Multimedia (PLT) /SRPOL/Staff Engineer/Samsung Electronics" <d.kozinski@samsung.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-10 00:15:28 +01:00
Michael Niedermayer	bb0a684d93	avcodec/4xm: Check for cfrm exhaustion Fixes: index -1 out of bounds for type 'CFrameBuffer [100]' Fixes: 63877/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_FOURXM_fuzzer-5854263397711872 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-10 00:14:02 +01:00
Niklas Haas	96d2a40b9e	avcodec/pnm: explicitly tag color range PGMYUV seems to be always limited range. This was a format originally invented by FFmpeg at a time when YUVJ distinguished limited from full range YUV, and this codec never appeared to output YUVJ in any circumstance, so hard-coding limited range preserves the status quo. The other formats are explicitly documented to be full range RGB/gray formats. That said, don't tag them yet, due to outstanding bugs w.r.t grayscale formats and color range handling. This change in behavior updates a bunch of FATE tests in trivial ways (added tagging being the only difference).	2023-11-09 12:53:35 +01:00
Peter Ross	10869cd849	avcodec: LEAD MCMP decoder Partially fixes ticket #798 Reviewed-by: James Almer <jamrial@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Peter Ross <pross@xvid.org>	2023-11-08 17:37:58 +11:00
Rémi Denis-Courmont	adc87a5f7c	lavc/opusdsp: rewrite R-V V postfilter This uses a more traditional approach allowing processing of up to period minus two elements per iteration. This also allows the algorithm to work for all and any vector length. As the T-Head C908 device under test can load 16 elements loop, there is unsurprisingly a little performance drop when the period is minimal and the parallelism is capped at 13 elements: Before: postfilter_15_c: 21222.2 postfilter_15_rvv_f32: 22007.7 postfilter_512_c: 20189.7 postfilter_512_rvv_f32: 22004.2 postfilter_1022_c: 20189.7 postfilter_1022_rvv_f32: 22004.2 After: postfilter_15_c: 20189.5 postfilter_15_rvv_f32: 7057.2 postfilter_512_c: 20189.5 postfilter_512_rvv_f32: 5667.2 postfilter_1022_c: 20192.7 postfilter_1022_rvv_f32: 5667.2	2023-11-06 22:09:30 +02:00
Rémi Denis-Courmont	02594c8c01	lavc/pixblockdsp: rework R-V V get_pixels_unaligned As in the aligned case, we can use VLSE64.V, though the way of doing so gets more convoluted, so the performance gains are more modest: get_pixels_unaligned_c: 126.7 get_pixels_unaligned_rvv_i32: 145.5 (before) get_pixels_unaligned_rvv_i64: 62.2 (after) For the reference, those are the aligned benchmarks (unchanged) on the same T-Head C908 hardware: get_pixels_c: 126.7 get_pixels_rvi: 85.7 get_pixels_rvv_i64: 33.2	2023-11-06 19:42:49 +02:00
Rémi Denis-Courmont	f68ad5d2de	lavc/sbrdsp: R-V V sbr_hf_g_filt hf_g_filt_c: 1552.5 hf_g_filt_rvv_f32: 679.5	2023-11-06 19:42:49 +02:00
Andreas Rheinhardt	3f890fbfd9	avcodec/cbs_h2645: Fix leak of SPS VUI extension data Fixes: VUI extension leak Fixes: 63004/clusterfuzz-testcase-minimized-ffmpeg_BSF_VVC_METADATA_fuzzer-4928832253329408 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-04 01:27:41 +01:00
Andreas Rheinhardt	5935423e1e	avcodec/aactab: Deduplicate swb_offset_960 tabs swb_offset_960_48 and swb_offset_960_32 coincide. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-04 01:24:09 +01:00
Michael Niedermayer	03a4aa9699	avcodec/flicvideo: consider width in copy loops Fixes: out of array write Fixes: 63520/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_FLIC_fuzzer-4876198087622656 Regression since: `c7f8d42c12` (was not posted to ffmpeg-devel) Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: Sean McGovern <gseanmcg@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-03 22:16:33 +01:00
Rémi Denis-Courmont	d06fd18f8f	lavc/sbrdsp: R-V V neg_odd_64 With 128-bit vectors, this is mostly pointless but also harmless. Performance gains should be more noticeable with larger vector sizes. neg_odd_64_c: 76.2 neg_odd_64_rvv_i64: 74.7	2023-11-01 22:53:26 +02:00
Rémi Denis-Courmont	b0aba7dd0c	lavc/sbrdsp: R-V V sum_square sum_square_c: 803.5 sum_square_rvv_f32: 283.2	2023-11-01 22:53:26 +02:00
Rémi Denis-Courmont	86bee42473	lavc/sbrdsp: R-V V sum64x5 sum64x5_c: 385.0 sum64x5_rvv_f32: 116.0	2023-11-01 22:53:26 +02:00
Andreas Rheinhardt	eba73142ad	avcodec/vp9: Join extradata buffer pools Up until now each thread had its own buffer pool for extradata buffers when using frame-threading. Each thread can have at most three references to extradata and in the long run, each thread's bufferpool seems to fill up with three entries. But given that at any given time there can be at most 2 + number of threads entries used (the oldest thread can have two references to preceding frames that are not currently decoded and each thread has its own current frame, but there can be no references to any other frames), this is wasteful. This commit therefore uses a single buffer pool that is synced across threads. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:16:02 +01:00
Andreas Rheinhardt	0c44f63b02	avcodec/refstruct: Allow to share pools To do this, make FFRefStructPool itself refcounted according to the RefStruct API. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:15:54 +01:00
Andreas Rheinhardt	92abc7266b	avcodec/vaapi_encode: Use RefStruct pool API, stop abusing AVBuffer API Up until now, the VAAPI encoder uses fake data with the AVBuffer-API: The data pointer does not point to real memory, but is instead just a VABufferID converted to a pointer. This has probably been copied from the VAAPI-hwcontext-API (which presumably does it to avoid allocations). This commit changes this without causing additional allocations by switching to the RefStruct-pool API. This also fixes an unchecked av_buffer_ref(). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:14:22 +01:00
Andreas Rheinhardt	8c0350f57e	avcodec/vp9: Use RefStruct-pool API for extradata It avoids allocations and corresponding error checks. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:14:06 +01:00
Andreas Rheinhardt	090d9956fd	avcodec/refstruct: Allow to always return zeroed pool entries This is in preparation for the following commit. Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:13:40 +01:00
Andreas Rheinhardt	e01e30ede1	avcodec/nvdec: Use RefStruct-pool API for decoder pool It involves less allocations, in particular no allocations after the entry has been created. Therefore creating a new reference from an existing one can't fail and therefore need not be checked. It also avoids indirections and casts. Also note that nvdec_decoder_frame_init() (the callback to initialize new entries from the pool) does not use atomics to read and replace the number of entries currently used by the pool. This relies on nvdec (like most other hwaccels) not being run in a truly frame-threaded way. Tested-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:13:01 +01:00
Andreas Rheinhardt	fd2e65871c	avcodec/hevcdec: Use RefStruct-pool API instead of AVBufferPool API It involves less allocations and therefore has the nice property that deriving a reference from a reference can't fail, simplifying hevc_ref_frame(). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:10:20 +01:00
Andreas Rheinhardt	736b510fcc	avcodec/h264dec: Use RefStruct-pool API instead of AVBufferPool API It involves less allocations and therefore has the nice property that deriving a reference from a reference can't fail. This allows for considerable simplifications in ff_h264_(ref\|replace)_picture(). Switching to the RefStruct API also allows to make H264Picture smaller, because some AVBufferRef* pointers could be removed without replacement. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:07:56 +01:00
Andreas Rheinhardt	26c0a7321f	avcodec/refstruct: Add RefStruct pool API Very similar to the AVBufferPool API, but with some differences: 1. Reusing an already existing entry does not incur an allocation at all any more (the AVBufferPool API needs to allocate an AVBufferRef). 2. The tasks done while holding the lock are smaller; e.g. allocating new entries is now performed without holding the lock. The same goes for freeing. 3. The entries are freed as soon as possible (the AVBufferPool API frees them in two batches: The first in av_buffer_pool_uninit() and the second immediately before the pool is freed when the last outstanding entry is returned to the pool). 4. The API is designed for objects and not naked buffers and therefore has a reset callback. This is called whenever an object is returned to the pool. 5. Just like with the RefStruct API, custom allocators are not supported. (If desired, the FFRefStructPool struct itself could be made reference counted via the RefStruct API; an FFRefStructPool would then be freed via ff_refstruct_unref().) Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-01 20:07:23 +01:00
Rémi Denis-Courmont	92bcc6703a	lavc/pixblockdsp: remove R-V V get_pixels_16 In the aligned case, the existing RVI assembler is actually much faster. In the unaligned case, there is nothing much to gain over C.	2023-11-01 19:27:22 +02:00
Rémi Denis-Courmont	28840cf499	lavc/jpeg2000dsp: R-V V rct_int jpeg2000_rct_int_c: 2592.2 jpeg2000_rct_int_rvv_i32: 1154.2	2023-11-01 18:52:55 +02:00
Rémi Denis-Courmont	73dea2bb91	lavc/jpeg2000dsp: R-V V ict_float jpeg2000_ict_float_c: 3112.2 jpeg2000_ict_float_rvv_f32: 1225.0	2023-11-01 18:52:55 +02:00
Rémi Denis-Courmont	b2a441a3be	lavc/jpeg2000dsp: make coefficients extern This is so that they can be loaded from assembler, rather than duplicated.	2023-11-01 18:52:55 +02:00
Michael Niedermayer	a5259f326b	avcodec/vlc: Pass VLC_MULTI_ELEM directly not by pointer This makes the code more testable as uninitialized fields are 0 and not random values from the last call Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:22 +01:00
Michael Niedermayer	8516609edd	avcodec/vlc: Replace mysterious max computation code in multi vlc Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:21 +01:00
Michael Niedermayer	356b1ba765	avcodec/vlc: Skip subtable entries in multi VLC These entries do not correspond to VLC symbols that can be used; they do corrupt various variables like min/max bits. This also no longer assumes that there is a single non subtable entry. Probably fixes some infinite loops too. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:21 +01:00
Michael Niedermayer	2817efbba3	avcodec/dovi_rpu: Use 64 bit in get_us/se_coeff() Fixes: shift exponent 32 is too large for 32-bit type 'int' Fixes: 63151/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_HEVC_fuzzer-5067531154751488 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:20 +01:00
Michael Niedermayer	2def617787	avcodec/apedec: Fix integer overflow in predictor_decode_stereo_3950() Fixes: signed integer overflow: 1900031961 + 553590817 cannot be represented in type 'int' Fixes: 63061/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_APE_fuzzer-5166188298371072 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:20 +01:00
Michael Niedermayer	68cc1744db	avcodec/evc_parse: Check tid The check is based on not infinite looping. It is likely a more strict check can be done Fixes: Infinite loop Fixes: 62473/clusterfuzz-testcase-minimized-ffmpeg_BSF_EVC_FRAME_MERGE_fuzzer-5719883750703104 Fixes: 62765/clusterfuzz-testcase-minimized-ffmpeg_dem_EVC_fuzzer-6448531252314112 Fixes: 63378/clusterfuzz-testcase-minimized-ffmpeg_dem_MPEGPS_fuzzer-6504993844494336 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: "Dawid Kozinski/Multimedia (PLT) /SRPOL/Staff Engineer/Samsung Electronics" <d.kozinski@samsung.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:19 +01:00
Michael Niedermayer	d35eecd24f	avcodec/evc_parse: remove pow() and log2() The use of float based functions is both unneeded and wrong due to unpredictable rounding Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-01 16:40:03 +01:00
Andreas Rheinhardt	f2687a3b69	avcodec/wmavoice: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	5615f9dab4	avcodec/wmaprodec: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. And in most cases one can even avoid this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	7e2120c4d9	avcodec/mpeg12: Avoid unnecessary VLC structures Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying tables directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	c9aa80c313	avcodec/mpegaudiodec_common: Avoid superfluous VLC structures For some VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	5dc31bc67b	avcodec/aacps_common: Apply offset for VLCs during init This avoids having to apply it later after every get_vlc2(). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	40a8cb9e6c	avcodec/aacps_common: Combine huffman tables This allows to avoid the relocations inherent in an array to individual tables; it also reduces padding. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	774611a349	avcodec/aacps_common: Switch to ff_vlc_init_tables_from_lengths() It allows to replace codes of type uint16_t or uint32_t by symbols of type uint8_t. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	eb422c606a	avcodec/aacps_common: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	4fe91e3676	avcodec/aacps: Move initializing common stuff to aacdec_common.c ff_ps_init() initializes some tables for AAC parametric stereo and some of them are only valid for the fixed- or floating-point decoder, whereas others (namely VLCs) are valid for both. The latter are therefore initialized by ff_ps_init_common() and because the two versions of ff_ps_init() can be run concurrently, it is guarded by an AVOnce. Yet now that there is ff_aacdec_common_init_once() there is a better way to do this: Call ff_ps_init_common() from ff_aacdec_common_init_once(). That way there is no need to guard ff_ps_init_common() by an AVOnce any more. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	7f66d9d6c5	avcodec/aacdec_common: Apply offset for SBR VLCs during init This avoids having to apply it later after every get_vlc2(). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	1aca4e7fc5	avcodec/aacdec_common: Combine huffman tabs This allows to avoid the relocations inherent in a table to individual tables; it also reduces padding. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	2c131f126d	avcodec/aacdec_common: Switch to ff_vlc_init_tables_from_lengths() It allows to replace code tables of type uint32_t or uint16_t by symbols of type uint8_t. It is also faster. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	0b4e69cc87	avcodec/aacdec_common: Avoid superfluous VLC structures for SBR VLCs For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	22d60524d8	avcodec/aacsbr_template: Deduplicate VLCs The VLCs, their init code and the tables used for initialization are currently duplicated for the floating- and fixed-point decoders. This commit stops doing so and moves this stuff to aacdec_common.c. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Andreas Rheinhardt	4d6042e9d7	avcodec/aacdec_common: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. And in some cases one can even avoid this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 21:44:48 +01:00
Benjamin Cheng	4536de3769	vulkan_h264: fix long-term ref handling h->long_ref isn't guaranteed to be contiguously filled. Use the approach from both vaapi_h264 and vdpau_h264 which goes through the 16 frames in h->long_ref to find the LTR entries. Fixes MR2_MW_A.264 from JVT-AVC_V1.	2023-10-31 21:35:23 +01:00
Andreas Rheinhardt	1e63e24c76	avcodec/aactab: Improve included headers Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	30deaba97b	avcodec/aacdec_template: Don't init unused table for fixed-point decoder The fixed-point decoder actually does not use the floating-point tables initialized by ff_aac_tableinit() at all. So don't initialize them for it; instead merge initializing these tables into ff_aac_float_common_init() which is already the function for the common static initializations of the floating-point AAC decoder and the (also floating-point) AAC encoder. Doing so saves also one AVOnce. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	3b080fe7af	avcodec/aacdec_template: Deduplicate VLCs They (as well as their init code) are currently duplicated for the floating- and fixed-point decoders. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	1f15a7e9a8	avcodec/aacdectab: Deduplicate common decoder tables Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	8c1e71a811	avcodec/aacps: Pass logctx as void* instead of AVCodecContext* Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	70b5d9c569	avcodec/aacps: Remove unused AVCodecContext* parameter from ff_ps_apply Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	36b5f71b1f	avcodec/msmpeg4dec: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. And in some cases one can even avoid this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	5a694d62c5	avcodec/mpeg4videodec: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. And in some cases one can even avoid this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	8c39b2bca7	avcodec/mss4: Partially inline max_depth and nb_bits of VLC It is known at compile-time for the vec_entry_vlcs. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	05577d2c76	avcodec/indeo2: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	25b9ff2780	avcodec/4xm: Avoid unnecessary VLC structures Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	e5dcde620d	avcodec/vc1: Avoid superfluous VLC structures For all VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. And in some cases one can even avoid this. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	fd4cb6ebee	avcodec/speedhqdec: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	0a610e22c1	avcodec/lagarith: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	e3ad5b9784	avcodec/imm4: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	f6c5d04b6d	avcodec/mimic: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	827d0325a9	avcodec/mobiclip: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and only VLC.table needs to be retained. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	36e7f9b339	avcodec/vqcdec: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	99ed510d4b	avcodec/mv30: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	b60a3f70be	avcodec/wnv1: Avoid unnecessary VLC structure Everything besides VLC.table is basically write-only and even VLC.table can be removed by accessing the underlying table directly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	1ae750a16e	avcodec/rv34: Constify pointer to static object Said object is only allowed to be modified during its initialization and is immutable afterwards. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	716ddc8c62	avcodec/rv34: Avoid superfluous VLC structures For most VLCs here, the number of bits of the VLC is write-only, because it is hardcoded at the call site. Therefore one can replace these VLC structures with the only thing that is actually used: The pointer to the VLCElem table. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	73fa6d486d	avcodec/vp3: Reindent after the previous commits Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	75c6a253a4	avcodec/vp3: Avoid complete VLC struct, only use VLCElem* Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	6c7a344b65	avcodec/vp3: Share coefficient VLCs between threads These VLCs are very big: The VP3 ones have 164382 elements, but due to overallocation enough memory for 313344 elements is allocated (1.195 MiB with sizeof(VLCElem) == 4); for VP4 the numbers are very similar, namely 311296 and 164392 elements. Since `1f4cf92cfb`, each frame thread has its own copy of these VLCs. This commit fixes this by sharing these VLCs across threads. The approach used here will also make it easier to support stream reconfigurations in case of frame-multithreading in the future. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	7fee90efac	avcodec/imc: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	6fb96ef755	avcodec/atrac9dec: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	4c7e8b969e	avcodec/clearvideo: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	c95e123e8c	avcodec/intrax8: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	886fbec82f	avcodec/mpc7: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	e5e05fd3c8	avcodec/rv40: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	460c6ae597	avcodec/svq1dec: Increase size of VLC It allows to reduce the number of maximum reloads by one. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	7d542e26a9	avcodec/svq1dec: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:47:00 +01:00
Andreas Rheinhardt	7902c0df4c	avcodec/msmpeg4_vc1_data: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Also combine the ff_msmp4_dc_(luma|chroma)_vlcs as well as the tables used to generate them to simplify the code. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	ff886fc282	avcodec/ituh263dec: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	5a9e185dfc	avcodec/h261dec: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	363837de0e	avcodec/faxcompr: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	a99285aedf	avcodec/asvdec: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	ab8a8246c8	avcodec/h264_cavlc: Remove code duplication Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	bd4c778e19	avcodec/h264_cavlc: Avoid indirection for coefficient table VLCs Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	fe748ddf62	avcodec/h264_cavlc: Avoid superfluous VLC structures Of all these VLCs here, only VLC.table was really used after init, so use the ff_vlc_init_tables API to get rid of them. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	c630d76b27	avcodec/vp3: Increase some VLC tables These are quite small and therefore force reloads that can be avoided by modest increases in the number of bits used. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	1fee3a3dce	avcodec/vp3: Make VLC tables static where possible This is especially important for frame-threaded decoders like this one, because up until now each thread had an identical copy of all these VLC tables. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Andreas Rheinhardt	edc50658d9	avcodec/vlc: Add functions to init static VLCElem[] without VLC For lots of static VLCs, the number of bits is not read from VLC.bits, but rather a compile-constant that is hardcoded at the callsite of get_vlc2(). Only VLC.table is ever used and not using it directly is just an unnecessary indirection. This commit adds helper functions and macros to avoid the VLC structure when initializing VLC tables; there are 2x2 functions: Two choices for init_sparse or from_lengths and two choices for "overlong" initialization (as used when multiple VLCs are initialized that share the same underlying table). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-31 20:46:59 +01:00
Rémi Denis-Courmont	424c8ceb08	lavc/huffyuvdsp: R-V V add_int16 add_int16_128_c: 2390.5 add_int16_128_rvv_i32: 832.0 add_int16_rnd_width_c: 2390.2 add_int16_rnd_width_rvv_i32: 832.5	2023-10-31 21:33:25 +02:00
Rémi Denis-Courmont	7e1cdc69fb	lavc/utvideodsp: R-V V restore_rgb_planes10 restore_rgb_planes10_c: 185852.2 restore_rgb_planes10_rvv_i32: 90130.5	2023-10-31 21:33:25 +02:00
Rémi Denis-Courmont	4aea0da230	lavc/utvideodsp: R-V V restore_rgb_planes restore_rgb_planes_c: 133065.7 restore_rgb_planes_rvv_i32: 33317.2	2023-10-31 21:33:25 +02:00
Logan Lyu	55f28eb627	lavc/aarch64: new optimization for 8-bit hevc_qpel_hv checkasm bench: put_hevc_qpel_hv4_8_c: 422.1 put_hevc_qpel_hv4_8_i8mm: 101.6 put_hevc_qpel_hv6_8_c: 756.4 put_hevc_qpel_hv6_8_i8mm: 225.9 put_hevc_qpel_hv8_8_c: 1189.9 put_hevc_qpel_hv8_8_i8mm: 296.6 put_hevc_qpel_hv12_8_c: 2407.4 put_hevc_qpel_hv12_8_i8mm: 552.4 put_hevc_qpel_hv16_8_c: 4021.4 put_hevc_qpel_hv16_8_i8mm: 886.6 put_hevc_qpel_hv24_8_c: 8992.1 put_hevc_qpel_hv24_8_i8mm: 1968.9 put_hevc_qpel_hv32_8_c: 15197.9 put_hevc_qpel_hv32_8_i8mm: 3209.4 put_hevc_qpel_hv48_8_c: 32811.1 put_hevc_qpel_hv48_8_i8mm: 7442.1 put_hevc_qpel_hv64_8_c: 58106.1 put_hevc_qpel_hv64_8_i8mm: 12423.9 Co-Authored-By: J. Dekker <jdek@itanimul.li> Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-31 14:14:21 +02:00
Logan Lyu	97a9d12657	lavc/aarch64: new optimization for 8-bit hevc_qpel_v checkasm bench: put_hevc_qpel_v4_8_c: 138.1 put_hevc_qpel_v4_8_neon: 41.1 put_hevc_qpel_v6_8_c: 276.6 put_hevc_qpel_v6_8_neon: 60.9 put_hevc_qpel_v8_8_c: 478.9 put_hevc_qpel_v8_8_neon: 72.9 put_hevc_qpel_v12_8_c: 1072.6 put_hevc_qpel_v12_8_neon: 203.9 put_hevc_qpel_v16_8_c: 1852.1 put_hevc_qpel_v16_8_neon: 264.1 put_hevc_qpel_v24_8_c: 4137.6 put_hevc_qpel_v24_8_neon: 586.9 put_hevc_qpel_v32_8_c: 7579.1 put_hevc_qpel_v32_8_neon: 1036.6 put_hevc_qpel_v48_8_c: 16355.6 put_hevc_qpel_v48_8_neon: 2326.4 put_hevc_qpel_v64_8_c: 33545.1 put_hevc_qpel_v64_8_neon: 4126.4 Co-Authored-By: J. Dekker <jdek@itanimul.li> Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-31 14:14:21 +02:00
Logan Lyu	265450b89e	lavc/aarch64: new optimization for 8-bit hevc_epel_hv checkasm bench: put_hevc_epel_hv4_8_c: 213.7 put_hevc_epel_hv4_8_i8mm: 59.4 put_hevc_epel_hv6_8_c: 350.9 put_hevc_epel_hv6_8_i8mm: 130.2 put_hevc_epel_hv8_8_c: 548.7 put_hevc_epel_hv8_8_i8mm: 136.9 put_hevc_epel_hv12_8_c: 1126.7 put_hevc_epel_hv12_8_i8mm: 302.2 put_hevc_epel_hv16_8_c: 1925.2 put_hevc_epel_hv16_8_i8mm: 459.9 put_hevc_epel_hv24_8_c: 4301.9 put_hevc_epel_hv24_8_i8mm: 1024.9 put_hevc_epel_hv32_8_c: 7509.2 put_hevc_epel_hv32_8_i8mm: 1680.4 put_hevc_epel_hv48_8_c: 16566.9 put_hevc_epel_hv48_8_i8mm: 3945.4 put_hevc_epel_hv64_8_c: 29134.2 put_hevc_epel_hv64_8_i8mm: 6567.7 Co-Authored-By: J. Dekker <jdek@itanimul.li> Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-31 14:02:53 +02:00
Logan Lyu	22c7291506	lavc/aarch64: new optimization for 8-bit hevc_epel_v checkasm bench: put_hevc_epel_v4_8_c: 79.9 put_hevc_epel_v4_8_neon: 25.7 put_hevc_epel_v6_8_c: 151.4 put_hevc_epel_v6_8_neon: 46.4 put_hevc_epel_v8_8_c: 250.9 put_hevc_epel_v8_8_neon: 41.7 put_hevc_epel_v12_8_c: 542.7 put_hevc_epel_v12_8_neon: 108.7 put_hevc_epel_v16_8_c: 939.4 put_hevc_epel_v16_8_neon: 169.2 put_hevc_epel_v24_8_c: 2104.9 put_hevc_epel_v24_8_neon: 307.9 put_hevc_epel_v32_8_c: 3713.9 put_hevc_epel_v32_8_neon: 524.2 put_hevc_epel_v48_8_c: 8175.2 put_hevc_epel_v48_8_neon: 1197.2 put_hevc_epel_v64_8_c: 16049.4 put_hevc_epel_v64_8_neon: 2094.9 Co-Authored-By: J. Dekker <jdek@itanimul.li> Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-31 14:02:53 +02:00
Logan Lyu	772865717b	lavc/aarch64: new optimization for 8-bit hevc_epel_pixels and hevc_qpel_pixels checkasm bench: put_hevc_pel_pixels4_8_c: 33.7 put_hevc_pel_pixels4_8_neon: 20.2 put_hevc_pel_pixels6_8_c: 61.4 put_hevc_pel_pixels6_8_neon: 25.4 put_hevc_pel_pixels8_8_c: 121.4 put_hevc_pel_pixels8_8_neon: 16.9 put_hevc_pel_pixels12_8_c: 199.9 put_hevc_pel_pixels12_8_neon: 40.2 put_hevc_pel_pixels16_8_c: 355.9 put_hevc_pel_pixels16_8_neon: 43.4 put_hevc_pel_pixels24_8_c: 774.7 put_hevc_pel_pixels24_8_neon: 78.9 put_hevc_pel_pixels32_8_c: 1345.2 put_hevc_pel_pixels32_8_neon: 152.2 put_hevc_pel_pixels48_8_c: 2963.7 put_hevc_pel_pixels48_8_neon: 309.4 put_hevc_pel_pixels64_8_c: 5236.2 put_hevc_pel_pixels64_8_neon: 514.2 Co-Authored-By: J. Dekker <jdek@itanimul.li> Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-31 14:02:53 +02:00
Rémi Denis-Courmont	ae72412aa8	lavc/idctdsp: improve R-V V put_pixels_clamped	2023-10-30 18:14:16 +02:00
Rémi Denis-Courmont	d48810f3a5	lavc/idctdsp: improve R-V V add_pixels_clamped	2023-10-30 18:14:16 +02:00
Rémi Denis-Courmont	600c6f1b55	lavc/idctdsp: improve R-V V put_signed_pixels_clamped This follows the same idea as with pixblockdsp, but applied at the other end, whilst writing data at the end of the function.	2023-10-30 18:14:16 +02:00
Rémi Denis-Courmont	3ea2310e89	lavc/idctdsp: require Zve64x for R-V V functions This will be required for the following changesets.	2023-10-30 18:14:16 +02:00
Rémi Denis-Courmont	300ee8b02d	lavc/pixblockdsp: aligned R-V V 8-bit functions If the scan lines are aligned, we can load each row as a 64-bit value, thus avoiding segmentation. And then we can factor the conversion or subtraction. In principle, the same optimisation should be possible for high depth, but would require 128-bit elements, for which no FFmpeg CPU flag exists.	2023-10-30 18:14:16 +02:00
Rémi Denis-Courmont	722765687b	lavc/pixblockdsp: rename unaligned R-V V functions	2023-10-30 18:14:16 +02:00
Kieran Kunhya	2532e832d2	libavcodec/mpeg12: Reindent	2023-10-29 22:12:05 +00:00
Kieran Kunhya	7d497a1119	libavcodec/mpeg12: Remove "fast" mode	2023-10-29 22:12:02 +00:00
TADANO Tokumei	a824c6f2f6	lavc/libaribcaption: rename `replace_fullwidth_ascii` to `replace_msz_ascii` This should hopefully clarify that the option only affects MSZ full-width characters, and not all full-width ASCII. Additionally, this matches the prefix with the upstream option. Signed-off-by: TADANO Tokumei <aimingoff@pc.nifty.jp>	2023-10-29 18:21:05 +02:00
TADANO Tokumei	21bfadd9b4	lavc/libaribcaption: add MSZ character related options This patch adds two MSZ (Middle Size; half width) character related options, mapping against newly added upstream functionality: * `replace_msz_japanese`, which was introduced in version 1.0.1 of libaribcaption. * `replace_msz_glyph`, which was introduced in version 1.1.0 of libaribcaption. The latter option improves bitmap type rendering if specified fonts contain half-width glyphs (e.g., BIZ UDGothic), even if both ASCII and Japanese MSZ replacement options are set to false. As these options require newer versions of libaribcaption, the configure requirement has been bumped accordingly. Signed-off-by: TADANO Tokumei <aimingoff@pc.nifty.jp>	2023-10-29 18:20:43 +02:00
TADANO Tokumei	82faba8a6c	lavc/libaribcaption: switch all `bool` context variables to `int` On some environments, a `bool` variable is of smaller size than `int`. As AV_OPT_TYPE_BOOL is internally handled as sizeof(int), if a `bool` option was set on such an environment, the memory of following variables would be filled. Additionally, set values may be destroyed by av_opt_copy(). Signed-off-by: TADANO Tokumei <aimingoff@pc.nifty.jp>	2023-10-29 18:19:58 +02:00
Michael Niedermayer	47e784f881	Bump versions after 6.1 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-29 16:19:14 +01:00
Michael Niedermayer	9d3a7d30c4	Bump versions prior to 6.1 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-29 15:34:05 +01:00
Michael Niedermayer	88453250db	avcodec/jpeg2000dec: Check image offset Fixes: left shift of negative value -538967841 Fixes: 62447/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_JPEG2000_fuzzer-6427134337613824 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: Tomas Härdin <git@haerdin.se> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:47 +02:00
Michael Niedermayer	9690d71f11	avcodec/vlc: don't pass nb_elems into multi vlc code Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:46 +02:00
Michael Niedermayer	9b546a0717	avcodec/vlc: merge lost 16bit end of array check Also cleanup related code Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:46 +02:00
Michael Niedermayer	a23d527ec5	avcodec/magicyuv: remove redundant check in inner loop Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:46 +02:00
Michael Niedermayer	4ddf4f5001	avcodec/magicyuv: correct end of array check in multi VLC parsing Fixes: out of array write Fixes: 63390/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MAGICYUV_fuzzer-5144552979431424.fuzz Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:45 +02:00
Michael Niedermayer	ffac64a270	avcodec/bitstream_template: Basic documentation for read_vlc_multi() Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-27 18:10:28 +02:00
Paul B Mahol	36eb774ad4	avcodec/mlpenc: try different filter parameters in case of out of range output from LPC	2023-10-27 12:45:23 +02:00
Paul B Mahol	567af48fba	avcodec/mlpenc: add support for 4.0/4.1 ch layout	2023-10-27 12:45:23 +02:00
Paul B Mahol	210e844def	avcodec/mlpdec: support for truehd with channels not representable with 5bit field in second stream Fixes decoding for 4.0/4.1 layouts.	2023-10-27 12:45:23 +02:00
Paul B Mahol	deb4c28dcc	avcodec/mlpenc: add 3.1 ch layout support for truehd	2023-10-27 12:45:23 +02:00
Andreas Rheinhardt	ba6a5e7a3d	avcodec/hevcdec: Move collocated_ref to HEVCContext Only the collocated_ref of the current frame (i.e. HEVCContext.ref) is ever used, so move it to HEVCContext directly after ref. This goes so far that collocated_ref was not even synced across threads in case of frame-threading. Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-26 13:18:01 +02:00
Lynne	70864e6adb	vulkan_decode: correct flipped condition in image layout Changed by the previous commit. Caused validation issues on hardware with !reuse_dpb_dst but not layered_dpb.	2023-10-25 22:01:21 +02:00
Lynne	0b3616231d	vulkan_decode: fix another validation issue Surprising no one, the insane usage rule has a catch.	2023-10-25 20:51:55 +02:00
Lynne	467e411839	vulkan_decode: fix pedantic validation issue "Validation Error: [ VUID-VkImageViewCreateInfo-imageViewType-04974 ] Object 0: handle = 0x9f9b41000000003c, type = VK_OBJECT_TYPE_IMAGE; | MessageID = 0xc120e150 | vkCreateImageView(): Using pCreateInfo->viewType VK_IMAGE_VIEW_TYPE_2D and the subresourceRange.layerCount VK_REMAINING_ARRAY_LAYERS=(17) and must 1 (try looking into VK_IMAGE_VIEW_TYPE_*_ARRAY). The Vulkan spec states: If viewType is VK_IMAGE_VIEW_TYPE_1D, VK_IMAGE_VIEW_TYPE_2D, or VK_IMAGE_VIEW_TYPE_3D; and subresourceRange.layerCount is VK_REMAINING_ARRAY_LAYERS, then the remaining number of layers must be 1"	2023-10-25 20:51:54 +02:00
Lynne	9ee4f47c94	vulkan_decode: use coded_width/height instead of the non-coded width and height Partially fixes https://streams.videolan.org/issues/19938/20000_20180305-15.04.59.ts The stream is coded as 1920x1080, meant to be rendered at 1440x1080 with cropping, or 1680x1080 before cropping. Currently, the created DPB is 1440x1080, which results in the image being decoded incorrectly, as the decoder overwrites output memory. This commit fixes this.	2023-10-25 20:51:05 +02:00
Martin Storsjö	a4877f1ec1	aarch64: Only enable extensions in the intended files/regions This eases actual development of the assembly functions, by only allowing extension instructions within the sections that explicitly enable them, instead of having all extensions enabled everywhere. Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-24 14:46:20 +03:00
Martin Storsjö	1762975ba1	libavcodec/aarch64/hevc: Require consistent use of trailing semicolon Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-23 10:39:12 +03:00
Andreas Rheinhardt	6e4030a07b	avcodec/av1dec, vaapi_av1: Remove excessive logmessages Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-22 22:11:37 +02:00
Andreas Rheinhardt	315c956cbd	avcodec/pthread_frame: Remove ff_thread_release_buffer() It is unnecessary since the removal of non-thread-safe callbacks in `e0786a8eeb`. Since then, the AVCodecContext has only been used as logcontext. Removing ff_thread_release_buffer() allowed to remove AVCodecContext* parameters from several other functions (not only unref functions, but also e.g. ff_h264_ref_picture() which calls ff_h264_unref_picture() on error). Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-22 22:09:59 +02:00
Leo Izen	86ed68420d	avcodec/librsvgdec: fix memory leaks and deprecated functions At various points in the function librsvg_decode_frame, errors are returned immediately without deallocating any allocated structs. This patch fixes those leaks and also fixes the use of functions that are deprecated since librsvg version 2.52.0. The older calls are still used, guarded by #ifdefs, while the newer replacements are used if librsvg >= 2.52.0. One of the deprecated functions is used as a check for the configure shell script, so it was replaced with a different function. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2023-10-22 15:18:13 -04:00
Martin Storsjö	a76b409dd0	aarch64: Reindent all assembly to 8/24 column indentation libavcodec/aarch64/vc1dsp_neon.S is skipped here, as it intentionally uses a layered indentation style to visually show how different unrolled/interleaved phases fit together. Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-21 23:25:54 +03:00
Martin Storsjö	7f905f3672	aarch64: Make the indentation more consistent Some functions have slightly different indentation styles; try to match the surrounding code. libavcodec/aarch64/vc1dsp_neon.S is skipped here, as it intentionally uses a layered indentation style to visually show how different unrolled/interleaved phases fit together. Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-21 23:25:29 +03:00
Martin Storsjö	93cda5a9c2	aarch64: Lowercase UXTW/SXTW and similar flags Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-21 23:25:23 +03:00
Martin Storsjö	184103b310	aarch64: Consistently use lowercase for vector element specifiers Signed-off-by: Martin Storsjö <martin@martin.st>	2023-10-21 23:25:18 +03:00
Paul B Mahol	393d1ee541	avcodec/mlpenc: add 2.1 layout support for truehd	2023-10-20 23:29:45 +02:00
Paul B Mahol	79c568dd4e	avcodec/mlpenc: add proper support for output bit shift	2023-10-20 17:07:25 +02:00
Paul B Mahol	3f773d8d02	avcodec/mlpenc: add support for TrueHD substreams Add 3.0 channel layout support for truehd encoder.	2023-10-20 17:07:24 +02:00
Paul B Mahol	98857ece48	avcodec/mlpenc: use ctx->num_substreams when writing headers	2023-10-20 17:07:23 +02:00
Paul B Mahol	94abb4df32	avcodec/mlpenc: add helper function to derive TrueHD ch map from ch_layout	2023-10-20 17:07:22 +02:00
Michael Niedermayer	5feceed008	avcodec/hevc_ps: Check cpb_cnt_minus1 before storing it Fixes: index 32 out of bounds for type 'uint32_t [32]' Fixes: 63003/clusterfuzz-testcase-minimized-ffmpeg_DEMUXER_fuzzer-4685160840560640 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-19 20:46:55 +02:00
Elias Carotti	644b2235c5	avcodec/libx264: Add the SSE computation for libx264. Since libx264 only provides a per-frame per-channel PSNR, this is inverted to get back the SSE. Signed-off-by: Anton Khirnov <anton@khirnov.net>	2023-10-19 13:34:37 +02:00
Paul B Mahol	e7a6bba51a	avcodec/mlp*: merge flags used by encoder and decoder	2023-10-18 23:01:40 +02:00
Paul B Mahol	be2bbfe71d	avcodec/mlpenc: cleanup filtering	2023-10-18 23:01:39 +02:00
Paul B Mahol	c1053e2e35	avcodec/mlpenc: allow smaller shift for LPC	2023-10-18 23:01:38 +02:00
Paul B Mahol	b206056c82	avcodec/mlpenc: implement advanced stereo rematrix	2023-10-18 23:01:37 +02:00
Paul B Mahol	727ee32da7	avcodec/mlpenc: remove TODO comment, sample rate is always fixed	2023-10-18 23:01:36 +02:00
Paul B Mahol	9adc5d8bfe	avcodec/mlpenc: restructure code even more Implement lsb_bypass for lossless rematrix.	2023-10-18 23:01:35 +02:00
Zhao Zhili	2361970880	avcodec/videotoolboxenc: Check and set hevc profile 1. If the user doesn't specify the profile, set it to main10 when the pixel format is 10 bits. Before the patch, videotoolbox output a main profile bitstream with 10 bit input, which can be confusing. 2. Warn when the user sets the profile to main explicitly with 10 bit input. It works, but is not the best choice. 3. Reject the main 10 profile with 8 bit input, as it doesn't work. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-10-18 23:05:14 +08:00
Zhao Zhili	cbb6199ff8	avcodec/videotoolboxenc: add hw_configs Will be used in the following patch. With hw_config we can get avctx->hw_frames_ctx, and with avctx->hw_frames_ctx we get sw_pix_fmt. Otherwise sw_pix_fmt is none. I need sw_pix_fmt before getting the first frame to set the hevc encoder profile. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2023-10-18 23:02:12 +08:00
Leo Izen	bf814387f4	avcodec/jpegxl_parser: fix OOB read regression In `f7ac3512f5` the size of the dynamically allocated buffer was shrunk, but it was made too small for very small alphabet sizes. This patch restores the size to prevent an OOB read. Reported-by: Cole Dilorenzo <coolkingcole@gmail.com> Signed-off-by: Leo Izen <leo.izen@gmail.com>	2023-10-17 08:40:49 -04:00
Michael Niedermayer	5ddab49d48	avcodec/h2645_parse: Avoid EAGAIN EAGAIN causes an assertion failure when it is returned from the decoder Fixes: Assertion consumed != (-(11)) failed at libavcodec/decode.c:462 Fixes: assertion_IOT_instruction_decode_c_462/poc Found-by: Hardik Shah of Vehere (Dawn Treaders team) Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 01:16:19 +02:00
Michael Niedermayer	f7e5537dc1	avcodec/xvididct: Make c* unsigned to avoid undefined overflows Fixes: signed integer overflow: 1496950099 + 728014168 cannot be represented in type 'int' Fixes: 62667/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_MJPEGB_fuzzer-6511785170305024 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 01:14:10 +02:00
Michael Niedermayer	61b86add52	avcodec/cbs_h2645: Fix showing bits at the end in cbs_read_se_golomb() Fixes: Assertion n>0 && n<=25 failed at libavcodec/get_bits.h:375 Fixes: 62618/clusterfuzz-testcase-minimized-ffmpeg_BSF_H264_REDUNDANT_PPS_fuzzer-5145745046765568 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 01:14:09 +02:00
Michael Niedermayer	75eb698bdc	avcodec/cbs_h2645: Fix showing bits at the end in cbs_read_ue_golomb() Fixes: Assertion n>0 && n<=25 failed at libavcodec/get_bits.h:375 Fixes: 62617/clusterfuzz-testcase-minimized-ffmpeg_BSF_TRACE_HEADERS_fuzzer-5156555663998976 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 01:14:08 +02:00
Michael Niedermayer	cd66606a8f	avcodec/bonk: Fix undefined overflow in predictor_calc_error() Fixes: signed integer overflow: -2146469728 - 1488954 cannot be represented in type 'int' Fixes: 62490/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_BONK_fuzzer-5612782399389696 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 01:05:07 +02:00
Michael Niedermayer	ef3b42738b	avcodec/evc_ps: Check chroma_format_idc Fixes: out of array access Fixes: 62678/clusterfuzz-testcase-minimized-ffmpeg_DEMUXER_fuzzer-4858264984354816 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: Kieran Kunhya <kierank@obe.tv> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-16 00:54:37 +02:00
Andreas Rheinhardt	66908a43e2	avcodec/h261dec: Don't set write-only macroblock dimensions They are generally set in ff_mpv_init_context_frame() (mostly called by ff_mpv_common_init()); setting them somewhere else should be avoided. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-14 23:36:46 +02:00
Andreas Rheinhardt	4e6cf5e52b	avcodec/h264dec: Constify H.264 decoder Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-13 14:35:07 +02:00
Andreas Rheinhardt	8e1bb594fb	avcodec/h264idct_template: Don't include h264dec.h It is only needed for scan8 which is in h264_parse.h. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-13 14:35:07 +02:00
Andreas Rheinhardt	a7663c9604	avcodec/error_resilience: Constify ThreadFrame* Forgotten in `0eb399ac39`. While just at it, also use a forward declaration. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-12 22:50:34 +02:00
Timo Rothenpieler	68f9dfa5cc	avcodec/nvdec_hevc: fail to initialize on unsupported profiles	2023-10-12 20:57:35 +02:00
Andreas Rheinhardt	ab95338a20	avcodec/mpeg4video_parser: Don't set write-only current_picture_ptr It is unused by ff_mpeg4_decode_picture_header() (unsurprisingly given that when decoding this function is called before the context has been initialized). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-11 22:28:24 +02:00
Andreas Rheinhardt	b561dafd56	avcodec/h261dec: Discard whole packet when discarding (The return value doesn't really matter: For video decoders every return value >= 0 is treated as "consumed all of the input".) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-11 22:28:24 +02:00
Andreas Rheinhardt	c995311bcf	avcodec/h261dec: Don't set write-only picture_number Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-11 22:28:24 +02:00
John Mather	a2175ca861	avcodec/libkvazaar: Respect codec context color settings. This patch makes the libkvazaar encoder respect color settings that are present on the codec context, including color range, primaries, transfer function and colorspace.	2023-10-11 21:50:47 +03:00
Paul B Mahol	44dc42e4ac	avcodec/mlpenc: export lpc_coeff_precision option Change default precision from 11 to 15, improves compression.	2023-10-10 13:53:11 +02:00
Paul B Mahol	394106a138	avcodec/mlpenc: fix regression in encoding only zeroes Previously it would use more bits than necessary.	2023-10-10 10:31:37 +02:00
Andreas Rheinhardt	9c1294eadd	avcodec/vdpau_vc1: Fix indentation Forgotten after `af6e232ccf`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:22:55 +02:00
Andreas Rheinhardt	c77aee61b8	avcodec/mpeg(picture\|video_dec): Move comment to more appropriate place Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:22:35 +02:00
Andreas Rheinhardt	52509f63ce	avcodec/mpegpicture: Move caller-specific parts of function to callers Since at least commit `c954cf1e1b` (adding ff_encode_alloc_frame()), a large part of ff_alloc_picture() is completely separate for the two callers. Move the caller-specific parts out to the callers. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:22:19 +02:00
Andreas Rheinhardt	2a8ac5a780	avcodec/mpegvideo_enc: Don't call av_frame_copy_props() unnecessarily It is unnecessary in case of user-supplied frames, because it happens directly after an av_frame_ref() with the same src and dst. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:22:15 +02:00
Andreas Rheinhardt	22b0141d87	avcodec/mpegvideo_enc: Don't allocate buffers unnecessarily ff_alloc_picture() performs two tasks: a) In most instances, it allocates frame buffers and b) it allocates certain auxiliary buffers. The exception to a) is the case when the encoder can reuse user-supplied frames. And for these frames the auxiliary buffers are unused, because this frame will never be used as current_picture (and therefore also not as next_picture or last_picture); see select_input_picture(). This means that we can simply avoid calling ff_alloc_picture() with user-supplied frames at all. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:21:48 +02:00
Andreas Rheinhardt	d87c358ee6	avcodec/mpegvideo_enc: Remove dead block None of the mpegvideo encoders support anything but coded frames; and if this were to change, it is unclear whether they would need the adjustment here. So remove it. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:21:46 +02:00
Andreas Rheinhardt	070bc4d2c5	avcodec/mpegvideo_enc: Don't set write-only properties The frame is immediately reset in the ff_mpeg_unref_picture() call below. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:21:40 +02:00
Andreas Rheinhardt	3937a21f21	avcodec/mpegvideo_enc: Don't overallocate arrays Only entries 0..max_b_frames are ever used. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:21:20 +02:00
Andreas Rheinhardt	0524b4ec3e	avcodec/mpegvideo_enc: Don't reget known values Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:20:57 +02:00
Andreas Rheinhardt	18f7d8d880	avcodec/mpegvideo_enc: Don't pretend input to be non-refcounted Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:20:44 +02:00
Andreas Rheinhardt	f5220475de	avcodec/mpegvideo_enc: Reindentation Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:20:39 +02:00
Andreas Rheinhardt	5aaaa7dbee	avcodec/mpegvideo_enc: Remove always-false checks In case "!direct" we are not reusing the input buffers (due to e.g. insufficient alignment), but allocating new ones. These of course do not alias with the ones provided by the user, so these checks are always-false. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:20:09 +02:00
Andreas Rheinhardt	b96ba62bdd	avcodec/mpegvideo_enc: Fix abort on allocation errors mpegvideo_enc uses a fixed-size array of Pictures; a slot is considered taken if the Picture's AVFrame is set. When an error happens after a slot has been taken, this Picture has typically not been reset and is therefore not usable for future requests. The code aborts when one runs out of slots and this can happen in case of allocation failures. Fix this by always unreferencing a Picture in case of errors. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-10 00:19:52 +02:00
Paul B Mahol	78fa1cff70	avcodec/mlpenc: export max_interval option	2023-10-09 23:48:00 +02:00
Paul B Mahol	ee9fb28429	avcodec/mlpenc: export codebook_search option too	2023-10-09 23:47:58 +02:00
Paul B Mahol	1703bfa133	avcodec/hcadec: implement proper .flush callback	2023-10-09 21:23:25 +02:00
Timo Rothenpieler	e006680d8e	avcodec/nvenc: add option to control subsampling of packed rgb input	2023-10-09 20:17:44 +02:00
Rémi Denis-Courmont	3c6516330f	lavc/exrdsp: R-V V reorder_pixels	2023-10-09 19:52:51 +03:00
Paul B Mahol	8786b91607	avcodec/mlpenc: change flag for shorten_by in THD case	2023-10-09 18:42:43 +02:00
Paul B Mahol	27c623b3d5	avcodec/mlpenc: fix stereo decorrelation	2023-10-09 18:42:42 +02:00
Andreas Rheinhardt	12c4cf9f72	avcodec/refstruct: Inline ff_refstruct_allocz() Suggested by James Almer. Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-09 15:54:09 +02:00
Andreas Rheinhardt	fc880c7032	avcodec/h261dec: Remove pointless goto There is no need to parse the header twice; doing so does nothing. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-09 15:53:06 +02:00
Andreas Rheinhardt	cae30c5ed2	avcodec/wmv2dec: Parse extradata during init And stop setting picture_number which was only done to not parse extradata multiple times. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-09 15:52:23 +02:00
Michael Niedermayer	7fedbc7606	avcodec/h264_parser: saturate dts a bit Fixes: signed integer overflow: 0 - -9223372036854775808 cannot be represented in type 'long' Fixes: 51896/clusterfuzz-testcase-minimized-ffmpeg_IO_DEMUXER_fuzzer-6112289464123392 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-10-08 21:36:09 +02:00
Andreas Rheinhardt	c7fb4d0eb6	avcodec/nvdec: Use RefStruct API for decoder_ref Avoids allocations and error checks as well as the boilerplate code for creating an AVBuffer with a custom free callback. Also increases type safety. Reviewed-by: Anton Khirnov <anton@khirnov.net> Tested-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:36:21 +02:00
Andreas Rheinhardt	2ec62b1ca6	avcodec/pthread_frame: Use RefStruct API for ThreadFrame.progress Avoids allocations and error checks and allows removing cleanup code for earlier allocations. Also avoids casts and indirections. Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:36:04 +02:00
Andreas Rheinhardt	452089ee23	avcodec/hevcdec: Use RefStruct API for RefPicListTab buffer Given that the RefStruct API relies on the user to know the size of the objects and does not provide a way to get it, we need to store the number of elements allocated ourselves; but this is actually better than deriving it from the size in bytes. Reviewed-by: Anton Khirnov <anton@khirnov.net> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:35:56 +02:00
Andreas Rheinhardt	6695c0af0e	avcodec/vulkan_decode: Use RefStruct API for shared_ref Avoids allocations, error checks and indirections. Also increases type-safety. Reviewed-by: Lynne <dev@lynne.ee> Tested-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:35:50 +02:00
Andreas Rheinhardt	f8252d6ce3	avcodec/decode: Use RefStruct API for hwaccel_picture_private Avoids allocations and therefore error checks: Syncing hwaccel_picture_private across threads can't fail any more. Also gets rid of an unnecessary pointer in structures and in the parameter list of ff_hwaccel_frame_priv_alloc(). Reviewed-by: Anton Khirnov <anton@khirnov.net> Reviewed-by: Lynne <dev@lynne.ee> Tested-by: Lynne <dev@lynne.ee> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-10-07 22:35:22 +02:00

49275 Commits