FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-26 19:01:44 +02:00

Author	SHA1	Message	Date
Jun Zhao	c961ac4b0c	vulkan_decode: fix the print format of VkDeviceSize VkDeviceSize represents device memory size and offset values as uint64_t in Spec. Signed-off-by: Jun Zhao <barryjzhao@tencent.com>	2023-11-21 08:02:43 +08:00
James Almer	1258f99978	avcodec: bump version after EVC additions Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	cfe2947887	avcodec/evc_decoder: Provided support for EVC decoder - Added EVC decoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xevd wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Dawid Kozinski	c59a96fd08	avcodec/evc_encoder: Provided support for EVC encoder - Added EVC encoder wrapper - Changes in project configuration file and libavcodec Makefile - Added documentation for xeve wrapper Signed-off-by: Dawid Kozinski <d.kozinski@samsung.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-20 11:55:51 -03:00
Michael Niedermayer	e56d91f8a8	avcodec/h264dec: Support skipping frames that used gray gap frames Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	6364fa9e9a	avcodec/h264: Avoid using gray gap frames as references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:25 +01:00
Michael Niedermayer	29f6c9b04d	avcodec/h264: keep track of which frames used gray references Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:19:04 +01:00
Michael Niedermayer	e4337606e1	avcodec/h264dec: More elaborate documentation for frame_recovered Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	68e1cf204a	avcodec/h264: Use FRAME_RECOVERED_HEURISTIC instead of IDR/SEI This keeps IDR/SEI and heuristically detected recovery points cleaner seperated Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:30 +01:00
Michael Niedermayer	3f4a1a24a5	avcodec/h264: Seperate SEI and IDR recovery handling This avoids SEI and IDR recovery flags affecting each other Also eliminate litteral numbers from recovery handling This should make the code clearer Improves: tickets/4738/tickets_cut.ts Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-20 00:12:29 +01:00
Rémi Denis-Courmont	fbc7adba67	lavc/llviddsp: R-V V add_bytes add_bytes_c: 2077.2 add_bytes_rvv_i32: 105.0	2023-11-18 22:07:14 +02:00
Rémi Denis-Courmont	ca664f2254	lavc/flacdsp: R-V V LPC16 function In this case, the inner loop computing the scalar product can be reduced to just one multiplication and one sum even with 128-bit vectors. The result is a lot simpler, but also brings more modest performance gains: flac_lpc_16_13_c: 15241.0 flac_lpc_16_13_rvv_i32: 11230.0 flac_lpc_16_16_c: 17884.0 flac_lpc_16_16_rvv_i32: 12125.7 flac_lpc_16_29_c: 27847.7 flac_lpc_16_29_rvv_i32: 10494.0 flac_lpc_16_32_c: 30051.5 flac_lpc_16_32_rvv_i32: 10355.0	2023-11-18 22:06:57 +02:00
Rémi Denis-Courmont	295092b46d	lavc/flacdsp: R-V V LPC32 The entire set of 32 coefficients and corresponding past 32 samples can fit in a single vector (with LMUL=8) exactly, but... since widening double the needed vector sizes, we still end up too short with 128-bit vectors. This adds a very simple version for future 256+-bit hardware, and for pred_orders values up to 16, and a bit more involved loop for for 128-bit hardware with pred_orders between 17 and 32. With 128-bit hardware, the benchmarks look like this: flac_lpc_32_13_c: 30152.0 flac_lpc_32_13_rvv_i32: 10244.7 flac_lpc_32_16_c: 37314.2 flac_lpc_32_16_rvv_i32: 10126.2 flac_lpc_32_29_c: 61910.0 flac_lpc_32_29_rvv_i32: 14495.2 flac_lpc_32_32_c: 68204.0 flac_lpc_32_32_rvv_i32: 13273.7	2023-11-18 22:05:43 +02:00
Diederik de Haas via ffmpeg-devel	c07ed10b0e	apply spelling fixes Fix spelling issue as reported by Debian's lintian tool: accomodate -> accommodate addtional -> additional auxillary -> auxiliary bellow -> below betweeen -> between Calulate -> Calculate coefficents -> coefficients Defalt -> Default defaul -> default higer -> higher neccesary -> necessary orignal -> original ouput -> output precison -> precision processsing -> processing substract -> subtract Transfered -> Transferred upto -> up to Also add several of them to the 'common typos' check in patcheck. Signed-off-by: Diederik de Haas <didi.debian@cknow.org>	2023-11-18 19:55:42 +01:00
Rémi Denis-Courmont	07c303b708	lavc/flacdsp: R-V V decorrelate_indep 16-bit packed flac_decorrelate_indep2_16_c: 981.7 flac_decorrelate_indep2_16_rvv_i32: 199.2 flac_decorrelate_indep4_16_c: 1749.7 flac_decorrelate_indep4_16_rvv_i32: 401.2 flac_decorrelate_indep6_16_c: 2517.7 flac_decorrelate_indep6_16_rvv_i32: 858.0 flac_decorrelate_indep8_16_c: 3285.7 flac_decorrelate_indep8_16_rvv_i32: 1123.5	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	fb0295e5fd	lavc/flacdsp: R-V V decorrelate_indep 32-bit packed flac_decorrelate_indep2_32_c: 981.7 flac_decorrelate_indep2_32_rvv_i32: 183.7 flac_decorrelate_indep4_32_c: 1749.7 flac_decorrelate_indep4_32_rvv_i32: 362.5 flac_decorrelate_indep6_32_c: 2517.7 flac_decorrelate_indep6_32_rvv_i32: 715.2 flac_decorrelate_indep8_32_c: 3285.7 flac_decorrelate_indep8_32_rvv_i32: 909.0	2023-11-17 23:59:56 +02:00
Rémi Denis-Courmont	6183a69c0b	lavc/flacdsp: R-V V decorrelate_ms packed flac_decorrelate_ms_16_c: 585.5 flac_decorrelate_ms_16_rvv_i32: 263.0 flac_decorrelate_ms_32_c: 584.7 flac_decorrelate_ms_32_rvv_i32: 250.0	2023-11-17 23:59:23 +02:00
Rémi Denis-Courmont	636ae0e0bc	lavc/flacdsp: R-V V packed decorrelate_{l,r}s flac_decorrelate_ms_16_c: 457.2 flac_decorrelate_ms_16_rvv_i32: 203.0 flac_decorrelate_ms_32_c: 457.2 flac_decorrelate_ms_32_rvv_i32: 203.5 flac_decorrelate_rs_16_c: 456.2 flac_decorrelate_rs_16_rvv_i32: 207.0 flac_decorrelate_rs_32_c: 456.2 flac_decorrelate_rs_32_rvv_i32: 210.5	2023-11-17 23:59:22 +02:00
Rémi Denis-Courmont	d076517056	lavc/llauddsp: R-V V scalarproduct_and_madd_int32 scalarproduct_and_madd_int32_c: 10899.7 scalarproduct_and_madd_int32_rvv_i32: 1749.0	2023-11-16 16:53:44 +02:00
Rémi Denis-Courmont	45d0eb3f70	lavc/llauddsp: R-V V scalarproduct_and_madd_int16 scalarproduct_and_madd_int16_c: 10355.7 scalarproduct_and_madd_int16_rvv_i32: 1480.0	2023-11-16 16:53:44 +02:00
James Almer	78f55457c9	x86/flacds: clear the high bits from pred_order in lpc_32 functions Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-15 16:10:15 -03:00
Dai, Jianhui J	c9fe9fb863	avcodec/cbs_vp8: Add support for VP8 codec bitstream This commit adds support for VP8 bitstream read methods to the cbs codec. This enables the trace_headers bitstream filter to support VP8, in addition to AV1, H.264, H.265, and VP9. This can be useful for debugging VP8 stream issues. The CBS VP8 implements a simple VP8 boolean decoder using GetBitContext to read the bitstream. Only the read methods `read_unit` and `split_fragment` are implemented. The write methods `write_unit` and `assemble_fragment` return the error code AVERROR_PATCHWELCOME. This is because CBS VP8 write is unlikely to be used by any applications at the moment. The write methods can be added later if there is a real need for them. TESTS: ffmpeg -i fate-suite/vp8/frame_size_change.webm -vcodec copy -bsf:v trace_headers -f null - Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Dai, Jianhui J	5cb8accd09	avcodec/vp8: Export `vp8_token_update_probs` variable This commit exports the `vp8_token_update_probs` variable to internal library scope to facilitate its reuse within the library. Signed-off-by: Jianhui Dai <jianhui.j.dai@intel.com> Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2023-11-15 10:29:03 -05:00
Rémi Denis-Courmont	90a779bed6	lavc/huffyuvdsp: basic R-V V add_hfyu_left_pred_bgr32 Better performance can probably be achieved with a more intricate unrolled loop, but this is a start: add_hfyu_left_pred_bgr32_c: 15084.0 add_hfyu_left_pred_bgr32_rvv_i32: 10280.2 This would actually be cleaner with the RISC-V P extension, but that is not ratified yet (I think?) and usually not supported if V is supported.	2023-11-15 16:51:07 +02:00
James Almer	b360c91752	avcodec/codecpar: mention how to allocate coded_side_data Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-14 14:26:42 -03:00
Anton Khirnov	6dbde68cb5	lavc/8bps: fix exporting palette after `63767b79a5` It would be left empty on each frame whose packet does not come with palette attached.	2023-11-14 18:18:26 +01:00
Rémi Denis-Courmont	ce467421dc	lavc/exrdsp: unroll predictor With explicit unrolling, we can skip half of the sign bit flips, and the compiler is then better able to optimise the scalar loop: predictor_c: 31376.0 (before) predictor_c: 23703.0 (after)	2023-11-14 19:15:51 +02:00
Rémi Denis-Courmont	c536e92207	lavc/sbrdsp: R-V V hf_apply_noise functions This is restricted to 128-bit vectors as larger vector sizes could read past the end of the noise array. Support for future hardware with larger vector sizes is left for some other time. hf_apply_noise_0_c: 2319.7 hf_apply_noise_0_rvv_f32: 1229.0 hf_apply_noise_1_c: 2539.0 hf_apply_noise_1_rvv_f32: 1244.7 hf_apply_noise_2_c: 2319.7 hf_apply_noise_2_rvv_f32: 1232.7 hf_apply_noise_3_c: 2541.2 hf_apply_noise_3_rvv_f32: 1244.2	2023-11-13 18:34:29 +02:00
Rémi Denis-Courmont	5b33104fca	lavc/sbrdsp: R-V V hf_gen hf_gen_c: 2922.7 hf_gen_rvv_f32: 731.5	2023-11-13 18:33:02 +02:00
Gyan Doshi	67a2571a55	avcodec/libsvtav1: add version guard for external param Setting of external param 'force_key_frames' was added in `7bcc1b4eb8`. It is available since v1.1.0 but ffmpeg allows linking against v0.9.0.	2023-11-13 13:14:43 +05:30
Evgeny Pavlov	da3ce21f68	libavcodec/amfenc: Fix issue with missing headers in AV1 encoder This commit fixes issue with missing SPS/PPS headers in video encoded by AMF AV1 encoder. Missing headers leads to broken seek in MPV video player. Default value for property AV1_HEADER_INSERTION_MODE shouldn't be setup to NONE (no headers insertion). We need to skip definition of this property, because default value depends on USAGE property. Signed-off-by: Dmitrii Ovchinnikov <ovchinnikov.dmitrii@gmail.com>	2023-11-12 22:57:17 +01:00
Sebastian Ramacher	250471ea17	avcoded/fft: Fix memory leak if ctx2 is used Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-12 14:47:56 -03:00
Sebastian Ramacher	a562cfee2e	avcodec/fft: Use av_mallocz to avoid invalid free/uninit Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-12 14:47:56 -03:00
Rémi Denis-Courmont	cd7b352c53	lavc/sbrdsp: R-V V autocorrelate With 5 accumulator vectors and 6 inputs, this can only use LMUL=2. Also the number of vector loop iterations is small, just 5 on 128-bit vector hardware. The vector loop is somewhat unusual in that it processes data in descending memory order, in order to save on vector slides: in descending order, we can extract elements to carry over to the next iteration from the bottom of the vectors directly. With ascending order (see in the Opus postfilter function), there are no ways to get the top elements directly. On the downside, this requires the use of separate shift and sub (the would-be SH3SUB instruction does not exist), with a small pipeline stall on the vector load address. The edge cases in scalar are done in scalar as this saves on loads and remains significantly faster than C. autocorrelate_c: 669.2 autocorrelate_rvv_f32: 421.0	2023-11-12 14:03:09 +02:00
Rémi Denis-Courmont	f576a0835b	lavc/aacpsdsp: rework R-V V hybrid_synthesis_deint Given the size of the data set, strided memory accesses cannot be avoided. We can still do better than the current code. ps_hybrid_synthesis_deint_c: 12065.5 ps_hybrid_synthesis_deint_rvv_i32: 13650.2 (before) ps_hybrid_synthesis_deint_rvv_i64: 8181.0 (after)	2023-11-12 14:03:09 +02:00
Rémi Denis-Courmont	eb508702a8	lavc/aacpsdsp: rework R-V V add_squares Segmented loads may be slower than not. So this advantageously uses a unit-strided load and narrowing shifts instead. Before: ps_add_squares_c: 60757.7 ps_add_squares_rvv_f32: 22242.5 After: ps_add_squares_c: 60516.0 ps_add_squares_rvv_i64: 17067.7	2023-11-12 14:03:09 +02:00
Paul B Mahol	10440a489a	avcodec/gif_parser: split correctly also bitstreams that do not have extension blocks	2023-11-12 02:19:53 +01:00
Nuo Mi	09f783692e	avcodec/cbs_h266: H266RawSliceHeader, expose curr_subpic_idx Signed-off-by: James Almer <jamrial@gmail.com>	2023-11-11 11:53:21 -03:00
Michael Niedermayer	ac4e3e188a	avcodec/evc_parse: Check num_remaining_tiles_in_slice_minus1 Fixes: out of array access Fixes: 62467/clusterfuzz-testcase-minimized-ffmpeg_BSF_EVC_FRAME_MERGE_fuzzer-6092990982258688 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: "Dawid Kozinski/Multimedia (PLT) /SRPOL/Staff Engineer/Samsung Electronics" <d.kozinski@samsung.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-10 00:15:28 +01:00
Michael Niedermayer	bb0a684d93	avcodec/4xm: Check for cfrm exhaustion Fixes: index -1 out of bounds for type 'CFrameBuffer [100]' Fixes: 63877/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_FOURXM_fuzzer-5854263397711872 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-10 00:14:02 +01:00
Niklas Haas	96d2a40b9e	avcodec/pnm: explicitly tag color range PGMYUV seems to be always limited range. This was a format originally invented by FFmpeg at a time when YUVJ distinguished limited from full range YUV, and this codec never appeared to output YUVJ in any circumstance, so hard-coding limited range preserves the status quo. The other formats are explicitly documented to be full range RGB/gray formats. That said, don't tag them yet, due to outstanding bugs w.r.t grayscale formats and color range handling. This change in behavior updates a bunch of FATE tests in trivial ways (added tagging being the only difference).	2023-11-09 12:53:35 +01:00
Peter Ross	10869cd849	avcodec: LEAD MCMP decoder Partially fixes ticket #798 Reviewed-by: James Almer <jamrial@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Peter Ross <pross@xvid.org>	2023-11-08 17:37:58 +11:00
Rémi Denis-Courmont	adc87a5f7c	lavc/opusdsp: rewrite R-V V postfilter This uses a more traditional approach allowing up processing of up to period minus two elements per iteration. This also allows the algorithm to work for all and any vector length. As the T-Head C908 device under test can load 16 elements loop, there is unsurprisingly a little performance drop when the period is minimal and the parallelism is capped at 13 elements: Before: postfilter_15_c: 21222.2 postfilter_15_rvv_f32: 22007.7 postfilter_512_c: 20189.7 postfilter_512_rvv_f32: 22004.2 postfilter_1022_c: 20189.7 postfilter_1022_rvv_f32: 22004.2 After: postfilter_15_c: 20189.5 postfilter_15_rvv_f32: 7057.2 postfilter_512_c: 20189.5 postfilter_512_rvv_f32: 5667.2 postfilter_1022_c: 20192.7 postfilter_1022_rvv_f32: 5667.2	2023-11-06 22:09:30 +02:00
Rémi Denis-Courmont	02594c8c01	lavc/pixblockdsp: rework R-V V get_pixels_unaligned As in the aligned case, we can use VLSE64.V, though the way of doing so gets more convoluted, so the performance gains are more modest: get_pixels_unaligned_c: 126.7 get_pixels_unaligned_rvv_i32: 145.5 (before) get_pixels_unaligned_rvv_i64: 62.2 (after) For the reference, those are the aligned benchmarks (unchanged) on the same T-Head C908 hardware: get_pixels_c: 126.7 get_pixels_rvi: 85.7 get_pixels_rvv_i64: 33.2	2023-11-06 19:42:49 +02:00
Rémi Denis-Courmont	f68ad5d2de	lavc/sbrdsp: R-V V sbr_hf_g_filt hf_g_filt_c: 1552.5 hf_g_filt_rvv_f32: 679.5	2023-11-06 19:42:49 +02:00
Andreas Rheinhardt	3f890fbfd9	avcodec/cbs_h2645: Fix leak of SPS VUI extension data Fixes: VUI extension leak Fixes: 63004/clusterfuzz-testcase-minimized-ffmpeg_BSF_VVC_METADATA_fuzzer-4928832253329408 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-04 01:27:41 +01:00
Andreas Rheinhardt	5935423e1e	avcodec/aactab: Deduplicate swb_offset_960 tabs swb_offset_960_48 and swb_offset_960_32 coincide. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2023-11-04 01:24:09 +01:00
Michael Niedermayer	03a4aa9699	avcodec/flicvideo: consider width in copy loops Fixes: out of array write Fixes: 63520/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_FLIC_fuzzer-4876198087622656 Regression since: `c7f8d42c12` (was not posted to ffmpeg-devel) Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Reviewed-by: Sean McGovern <gseanmcg@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-11-03 22:16:33 +01:00
Rémi Denis-Courmont	d06fd18f8f	lavc/sbrdsp: R-V V neg_odd_64 With 128-bit vectors, this is mostly pointless but also harmless. Performance gains should be more noticeable with larger vector sizes. neg_odd_64_c: 76.2 neg_odd_64_rvv_i64: 74.7	2023-11-01 22:53:26 +02:00
Rémi Denis-Courmont	b0aba7dd0c	lavc/sbrdsp: R-V V sum_square sum_square_c: 803.5 sum_square_rvv_f32: 283.2	2023-11-01 22:53:26 +02:00

1 2 3 4 5 ...

49063 Commits