FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-02 03:06:28 +02:00

Author	SHA1	Message	Date
Kieran Kunhya	40c5c19eac	x86/h264_pred: Convert ff_pred8x8_vertical_8_mmx to ff_pred8x8_vertical_8_sse2	2024-02-13 21:17:06 +00:00
Kieran Kunhya	f43b5f1098	vp6dsp: Remove MMX code Missed from `6cb3ee8`	2024-02-13 20:47:16 +00:00
sunyuechi	fdebde817c	lavc/blockdsp: R-V V clear_blocks C908: blockdsp.clear_blocks_c: 128.2 blockdsp.clear_blocks_rvv_i64: 102.5 Signed-off-by: Rémi Denis-Courmont <remi@remlab.net>	2024-02-13 21:29:46 +02:00
sunyuechi	0748d2bbc7	lavc/blockdsp: R-V V clear_block C908: blockdsp.clear_block_c: 47.2 blockdsp.clear_block_rvv_i64: 28.5 Signed-off-by: Rémi Denis-Courmont <remi@remlab.net>	2024-02-12 22:00:03 +02:00
Michael Niedermayer	3be80ce299	avcodec/indeo3: Round dimensions up in allocate_frame_buffers() Fixes: Ticket6581 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-02-12 00:50:47 +01:00
Michael Niedermayer	66f60a2355	avcodec/ac3enc_template: add fbw_channels assert fbw_channels must be > 0 as teh code is only run if cpl_enabled is set and that requires mode >= AC3_CHMODE_STEREO CID 718138 Uninitialized scalar variable assumes this assert to be false Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-02-12 00:50:46 +01:00
James Almer	2c324fcce0	x86/h264_intrapred: convert ff_pred16x16_horizontal_8_mmxext to sse2 Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-11 22:23:37 +00:00
Nuo Mi	4f80441455	avcodec/hevc_mp4toannexb: more validations for nalu_len For a corrupted stream, the value of nalu_len read from the extradata is not reliable. We need to perform additional checks	2024-02-11 22:50:01 +08:00
Nuo Mi	f7a504a0df	avcodec/vvc_mp4toannexb: more validations for nalu_len For a corrupted stream, the value of nalu_len read from the extradata is not reliable. We need to perform additional checks Fixes: fuzzer timeout Fixes: 65253/clusterfuzz-testcase-minimized-ffmpeg_BSF_VVC_MP4TOANNEXB_fuzzer-4972412487467008 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-11 22:49:51 +08:00
Leo Izen	7894904141	avcodec/pngenc: write cLLi and mDVc chunks These chunks contain the Content Light Level Information and the Mastering Display Color Volume information that FFmpeg already supports as AVFrameSideData. This patch adds support for the png encoder to save this metadata as the corresponding chunks in the PNG stream. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2024-02-11 08:58:00 -05:00
Leo Izen	c7a57b0f70	avcodec/pngdec: read cLLi and mDVc chunks These chunks contain the Content Light Level Information and the Mastering Display Color Volume information that FFmpeg already supports as AVFrameSideData. This patch adds support for the png decoder to read these chunks if present and attach the corresponding side data to the decoded frame. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2024-02-11 08:57:40 -05:00
Leo Izen	eb4df2709e	avcodec/tiff: pass arguments to bytestream2_seek in the right order The function signature for bytestream2_seek is (gb, offset, whence); Before this patch, the code passed (gb, SEEK_SET, offset), which is incorrect. Siged-off-by: Leo Izen <leo.izen@gmail.com>	2024-02-11 08:57:11 -05:00
Akihiko Odaki	66231e5871	avcodec/vc1dec: Fix vc1_hwaccel_pixfmt_list_420 vc1_hwaccel_pixfmt_list_420 is referenced even if !(CONFIG_WMV3IMAGE_DECODER \|\| CONFIG_VC1IMAGE_DECODER) so move it out of the #if block. Signed-off-by: Akihiko Odaki <akihiko.odaki@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-11 11:15:15 +01:00
Connor Worley	afb630ce4d	lavc/dxv: remove ctx fields that can be derived from texdsp ctxs Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-11 00:40:06 +01:00
Connor Worley	2f9936e446	lavc/dvx: use texdsp funcs for texture block decompression Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-11 00:40:06 +01:00
Connor Worley	939bf30d82	lavc/dxv: move tag definitions to common header Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-11 00:40:06 +01:00
James Almer	81c2557691	avcodec: remove some references to avcodec_close Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-10 00:04:16 -03:00
Lynne	90adef99ca	avfft: avoid overreads with RDFT API users The new API requires an extra array member at the very end, which old API users did not do. This disables in-place RDFT transforms and instead does the transform out of place by copying once, there shouldn't be a significant loss of speed as our in-place FFT requires a reorder which is likely more expensive in the majority of cases to do.	2024-02-09 23:20:29 +01:00
Andreas Rheinhardt	ce7c90ff82	avcodec/dca_core: Remove unneeded emms.h inclusion Possible since `7ec2354c38`. Reviewed-by: Martin Storsjö <martin@martin.st> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 23:11:52 +01:00
Martin Storsjö	7ec2354c38	x86: Remove inline MMX assembly that clobbers the FPU state These inline implementations of AV_COPY64, AV_SWAP64 and AV_ZERO64 are known to clobber the FPU state - which has to be restored with the 'emms' instruction afterwards. This was known and signaled with the FF_COPY_SWAP_ZERO_USES_MMX define, which calling code seems to have been supposed to check, in order to call emms_c() after using them. See `0b1972d409`, `29c4c0886d` and `df215e5758` for history on earlier fixes in the same area. However, new code can use these AV_*64() macros without knowing about the need to call emms_c(). Just get rid of these dangerous inline assembly snippets; this doesn't make any difference for 64 bit architectures anyway. Signed-off-by: Martin Storsjö <martin@martin.st>	2024-02-09 23:55:52 +02:00
Connor Worley	d5aaed9d4c	lavc/dxv: align to 4x4 blocks instead of 16x16 The previous assumption that DXV needs to be aligned to 16x16 was erroneous. 4x4 works just as well, and FATE decoder tests pass for all texture formats. On the encoder side, we should reject input that isn't 4x4 aligned, like the HAP encoder does, and stop aligning to 16x16. This both solves the uninitialized reads causing current FATE tests to fail and produces smaller encoded outputs. With regard to correctness, I've checked the decoding path by encoding a real-world sample with git master, and decoding it with ffmpeg -i dxt1-master.mov -c:v rawvideo -f framecrc - The results are exactly the same between master and this patch. On the encoding side, I've encoded a real-world sample with both master and this patch, and decoded both versions with ffmpeg -i dxt1-{master,patch}.mov -c:v rawvideo -f framecrc - Under this patch, results for both inputs are exactly the same. In other words, the extra padding gained by 16x16 alignment over 4x4 alignment has no impact on decoded video. Signed-off-by: Connor Worley <connorbworley@gmail.com> Signed-off-by: Martin Storsjö <martin@martin.st>	2024-02-09 23:47:14 +02:00
James Almer	7c873fb298	lavc/refstruct: do not use max_align_t on MSVC It is not available there, even when C11/17 is requested. Signed-off-by: Anton Khirnov <anton@khirnov.net>	2024-02-09 16:24:50 +01:00
Anton Khirnov	1cc24d7495	lavc: deprecate avcodec_close() Its use has been discouraged since 2016, but now is no longer used in avformat, so there is no reason to keep it public.	2024-02-09 16:14:56 +01:00
Andreas Rheinhardt	2b0e9e278a	avcodec/h263dec: Remove AVCodec.pix_fmts arrays They are not intended for decoders (for which there is the get_format callback in case the user has a choice). Also note that the list was wrong for MPEG4, because it did not contain the high bit depth pixel formats used for studio profiles. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Andreas Rheinhardt	39cfd30bf1	avcodec/vc1dec: Remove AVCodec.pix_fmts arrays They are not intended for decoders (for which there is the get_format callback in case the user has a choice). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Andreas Rheinhardt	ca95863758	avcodec/vc1dec: Don't call ff_get_format() twice It is currently called once in the codecs' init function and once when (re)initializing the VC-1 decode context (which happens upon frame size changes as well as before decoding the first frame). The first one is unnecessary now that vc1_decode_frame() no longer requires avctx->hwaccel to be already set for hwaccel to work properly. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Andreas Rheinhardt	38f234c06e	avcodec/vc1dec: Set pointers for hwaccel even without hwaccel VC-1 uses a 0x03 escaping scheme like H.26x and our decoder unescapes data for this purpose, but hardware accelerations just want the data as-is and therefore get fed the original data. The pointers to the actual data are only setcorrectly if avctx->hwaccel is set (after all, they are only used in this case). There are two problems with this: The first is that the branch is pointless; the second is that it is harmful, because a hardware acceleration may be added after the packet has been parsed (in case there is a reconfiguration e.g. due to frame size changes) in which case decoding the first few frames won't work. So delete these branches. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Andreas Rheinhardt	687a287e14	avcodec/mmaldec: Avoid using AVCodec.pix_fmts It is entirely unnecessary to use it given that all decoders here share the same set of supported pixel formats. So just hardcode this list. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Andreas Rheinhardt	89995cfda1	avcodec: Remove redundant pix_fmts from decoders AVCodec.pix_fmts is only intended for encoders (decoders use the get_format callback to let the user choose a pix fmt). So remove them for the decoders for which this is possible without further complications; keep them for now in the codecs that actually use them (by passing avctx->codec->pix_fmts to ff_get_formatt()). Also notice that some of these lists were wrong; e.g. `317b7b06fd` added support for YUV444P16 for cuviddec, but forgot to add it to pix_fmts. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-09 08:16:25 +01:00
Connor Worley	1eeee68d8e	lavc/dxv: fix incorrect back-reference index calculation in DXT5 decoding This bug causes the DXT5 decoder to produce incorrect block texture data. After the fix, textures are visually correct and match data decoded by Resolume Alley (extracted with Nvida Nsight for comparison). Current FATE DXT5 samples did not cover this case. Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-08 20:36:15 +01:00
Connor Worley	3b6a515c5f	lavc/dxv: treat DXT5-tagged files as DXT4 DXV files seem to misnomer DXT5 and really encode DXT4 with premultiplied alpha. At least, this is what Resolume alley does. To check, encode some input with alpha as "Normal Quality, With Alpha" in Alley, then decode the output with this change -- results are true to the original input compared to git-master. Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-08 20:36:04 +01:00
Connor Worley	c4e9556cf5	lavc/texturedsp: fix premult2straight inversion This function should convert premultiplied alpha to straight, but does the opposite. Signed-off-by: Connor Worley <connorbworley@gmail.com>	2024-02-08 20:36:04 +01:00
Andreas Rheinhardt	9ae40f282d	avcodec/nvdec: Constify bitstream pointee Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-08 14:00:37 +01:00
Haihao Xiang	cd31eac999	lavc/qsvenc: Add workaround for VP9 keyframe The runtime doesn't set the frame type to MFX_FRAMETYPE_IDR on the returned mfx bitstream for a keyframe, it set the frame type to MFX_FRAMETYPE_I only. This patch added workaround for VP9 keyframe to make the coded stream seekable. Signed-off-by: Haihao Xiang <haihao.xiang@intel.com>	2024-02-08 10:34:02 +08:00
Tong Wu	82e8838165	avcodec/dxva2: fix different 'const' qualifiers warning Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-02-08 10:34:02 +08:00
Tong Wu	92ee7461c3	avcodec/d3d12va_decode: fix different 'const' qualifiers warning Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-02-08 10:34:02 +08:00
Aleksoid	336d59643a	avcodec/d3d12va_vc1: add support for D3D12_VIDEO_DECODE_PROFILE_VC1_D2010 guid. The VC1_D2010 profile, also known as VC1_VLD2010, has the same functionality and specification as the VC1_D profile. Support for this profile serves only as a positive indication that the accelerator has been designed with awareness of the modifications specified in the August 2010 version of this specification. Hardware accelerator drivers that expose support for this profile must not also expose the previously specified VC1_D GUID, unless the accelerator works properly with existing software decoders that use VC1_D and that do not incorporate the corrections added to the August 2010 version of this specification. As a result, we could give VC1_VLD2010 a higher priority and initialize it first. Signed-off-by: Aleksoid <Aleksoid1978@mail.ru> Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-02-08 10:34:02 +08:00
Wu Jianhua	3372876888	avcodec/x86/vvc/vvcdsp_init: fix unresolved external symbol on ARCH_X86_32 Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-07 22:53:15 +08:00
James Almer	7f92014aca	avcodec/nvdec: don't free NVDECContext->bitstream Ensure all hwaccels that allocate a buffer use NVDECContext->bitstream_internal instead. Otherwise, if FFHWAccel->end_frame() isn't called before FFHWAccel->uninit(), an attempt to free a stale pointer to memory not owned by the hwaccel could take place. Reviewed-by: Timo Rothenpieler <timo@rothenpieler.org> Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-07 11:31:33 -03:00
Frank Plowman	5076fa30ab	lavc/vvc: Validate alf_list indexes Signed-off-by: Frank Plowman <post@frankplowman.com>	2024-02-06 22:10:06 +08:00
James Almer	e7a9dd03ab	avcodec/libaomdec: print libaomdec version in verbose level info level will be too noisy if several instances of the decoder are fired at the same time, as will be the case with tiled AVIF. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-06 10:34:50 -03:00
James Almer	48f4a29bae	avcodec/libdav1d: print libdav1d version in verbose level info level will be too noisy if several instances of the decoder are fired at the same time, as will be the case with tiled AVIF. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-06 10:34:50 -03:00
Frank Plowman	a42f884cd2	lavc/vvc: Fix slice_idx out-of-bounds memset If the number of CTUs reduces between one picture and the next, the slice_idx table is reduced in size in the frame_context_for_each_tl call on vvcdec.c:321. When initialising the slice_idx table on vvcdec.c:325, the old code uses fc->tab.sz.ctu_count when calculating the table size. fc->tab.sz.ctu_count holds the old ctu count at this point however, not being updated to hold the new ctu count until vvcdec.c:342. This causes an out-of-bounds write. Patch fixes the problem by using pps->ctb_count, which was just used when allocating the table, in place of fc->tab.sz.ctu_count when initialising the table. Signed-off-by: Frank Plowman <post@frankplowman.com>	2024-02-06 19:45:43 +08:00
Leo Izen	c0de7ac520	avcodec/libjxlenc: support negative linesizes libjxl doesn't support negative strides, but JPEG XL has an orientation flag inside the codestream. We can use this to work around the library limitation, by taking the absolute value of the negative row stride, sending the image up-side-down, and telling the library that the image has a vertical-flip orientation. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2024-02-05 12:28:02 -05:00
Nuo Mi	88a040386a	avcodec/vvcdec: fix seeking for open GOP how to reproduce: wget https://media.xiph.org/video/derf/y4m/students_cif.y4m vvencapp --input students_cif.y4m --preset faster --output students.266 MP4Box -add students.266:fps=30000/1001:par=12:11 -new students.mp4 ffplay students.mp4	2024-02-05 21:43:18 +08:00
Mark Thompson	fa580a0f17	lavc/d3d12va: Improve behaviour on missing decoder support Distinguish between a decoder being entirely missing and a decoder which requires features which are not present in the incomplete implementation in libavcodec and therefore can't be used.	2024-02-04 19:18:58 +00:00
Damiano Galassi	45697e6a51	avcodec: add ambient viewing environment packet side data.	2024-02-04 13:36:21 -03:00
Andreas Rheinhardt	d525dbb41f	avcodec/vp8: Change criterion for calling ff_thread_finish_setup() The current criterion is to check for the existence of update_thread_context. Change this to check for whether we are actually decoding VP8 (and not VP7 or VP8-in-WebP). This is equivalent to the current criterion, but allows the WebP decoder to evolve and to get its own update_thread_context. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-04 13:58:38 +01:00
Andreas Rheinhardt	e37e9d58f8	avcodec/vp8: Remove write-only vp7 struct field This decoder always inlines whether it is VP7 or VP8. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-04 13:58:38 +01:00
Andreas Rheinhardt	4b8b1415ae	avcodec/vp8: Enforce key-frame only for WebP VP8-in-WebP only uses key frame encoding (see [1]), yet this is currently not enforced. This commit does so in order to make output reproducible with frame-threading as the VP8 decoder's update_thread_context is not called at all when using decoding VP8-in-WebP (as this is unnecessary for key frame-only streams). [1]: https://developers.google.com/speed/webp/docs/riff_container Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-04 13:58:38 +01:00
James Almer	cc774cd962	avcodec/cbs_h266_syntax_template: check aps_adaptation_parameter_set_id "When aps_params_type is equal to ALF_APS or SCALING_APS, the value of aps_adaptation_parameter_set_id shall be in the range of 0 to 7, inclusive. When aps_params_type is equal to LMCS_APS, the value of aps_adaptation_parameter_set_id shall be in the range of 0 to 3, inclusive." Fixes: out of array accesses Fixes: 65932/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_VVC_fuzzer-4563412340244480 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-02-03 23:49:25 +08:00
Andreas Rheinhardt	9d364fbdb0	avcodec/vlc: Remove unused macros Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-03 00:26:17 +01:00
Andreas Rheinhardt	648df1c250	avcodec/leaddec: Remove unnecessary VLC structures One only needs the VLCElem[]. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-02-03 00:26:17 +01:00
Anton Khirnov	aa3cfd4b5a	lavc/bsf: add a showinfo filter Analogous to the (a)showinfo lavfi filters, logs basic packet information. Mainly useful for debugging/testing/development.	2024-02-02 15:41:54 +01:00
James Almer	b181868aba	x86/h26x/h2656dsp: add missing preprocessor wrappers Fixes compilation on x86_32 targets. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-01 16:04:09 -03:00
James Almer	6b6eb7d74e	x86/Makefile: fix hevc and vvc dependency of h2656dsp.o And remove tabs while at it. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-01 16:02:50 -03:00
James Almer	2dc8221e66	x86/hevcdsp_init.c: fix preprocessor check HAVE_AVX2_EXTERNAL has a value, so check for it. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-01 10:47:53 -03:00
James Almer	78a7927df7	x86/vvc/vvc_mc: wrap the entire file in x86_64 and AVX2 checks Fixes compilation with old yasm. Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-01 10:23:42 -03:00
James Almer	bf62ddc7bf	x86/vvc/vvc_mc: set the correct number of used registers in vvc_w_avg functions Fixes crashes when running fate-vvc-conformance-WP_A_3 on Win64 targets Signed-off-by: James Almer <jamrial@gmail.com>	2024-02-01 10:04:14 -03:00
Wu Jianhua	326cc01de9	avcodec/x86/vvc: add avg and avg_w AVX2 optimizations The avg/avg_w is based on dav1d. See https://code.videolan.org/videolan/dav1d/-/blob/master/src/x86/mc_avx2.asm vvc_avg_8_2x2_c: 71.6 vvc_avg_8_2x2_avx2: 26.8 vvc_avg_8_2x4_c: 140.8 vvc_avg_8_2x4_avx2: 34.6 vvc_avg_8_2x8_c: 410.3 vvc_avg_8_2x8_avx2: 41.3 vvc_avg_8_2x16_c: 769.3 vvc_avg_8_2x16_avx2: 60.3 vvc_avg_8_2x32_c: 1669.6 vvc_avg_8_2x32_avx2: 105.1 vvc_avg_8_2x64_c: 1978.3 vvc_avg_8_2x64_avx2: 425.8 vvc_avg_8_2x128_c: 6536.8 vvc_avg_8_2x128_avx2: 1315.1 vvc_avg_8_4x2_c: 155.6 vvc_avg_8_4x2_avx2: 26.1 vvc_avg_8_4x4_c: 250.3 vvc_avg_8_4x4_avx2: 31.3 vvc_avg_8_4x8_c: 831.8 vvc_avg_8_4x8_avx2: 41.3 vvc_avg_8_4x16_c: 1461.1 vvc_avg_8_4x16_avx2: 57.1 vvc_avg_8_4x32_c: 2821.6 vvc_avg_8_4x32_avx2: 105.1 vvc_avg_8_4x64_c: 3615.8 vvc_avg_8_4x64_avx2: 412.6 vvc_avg_8_4x128_c: 11962.6 vvc_avg_8_4x128_avx2: 1274.3 vvc_avg_8_8x2_c: 215.8 vvc_avg_8_8x2_avx2: 29.1 vvc_avg_8_8x4_c: 430.6 vvc_avg_8_8x4_avx2: 37.6 vvc_avg_8_8x8_c: 1463.3 vvc_avg_8_8x8_avx2: 51.8 vvc_avg_8_8x16_c: 2630.1 vvc_avg_8_8x16_avx2: 97.6 vvc_avg_8_8x32_c: 5813.8 vvc_avg_8_8x32_avx2: 196.6 vvc_avg_8_8x64_c: 6687.3 vvc_avg_8_8x64_avx2: 487.8 vvc_avg_8_8x128_c: 13178.6 vvc_avg_8_8x128_avx2: 1290.6 vvc_avg_8_16x2_c: 443.8 vvc_avg_8_16x2_avx2: 28.3 vvc_avg_8_16x4_c: 1253.3 vvc_avg_8_16x4_avx2: 32.1 vvc_avg_8_16x8_c: 2236.3 vvc_avg_8_16x8_avx2: 44.3 vvc_avg_8_16x16_c: 5127.8 vvc_avg_8_16x16_avx2: 63.3 vvc_avg_8_16x32_c: 6573.3 vvc_avg_8_16x32_avx2: 223.6 vvc_avg_8_16x64_c: 30311.8 vvc_avg_8_16x64_avx2: 437.8 vvc_avg_8_16x128_c: 25693.3 vvc_avg_8_16x128_avx2: 1266.8 vvc_avg_8_32x2_c: 954.6 vvc_avg_8_32x2_avx2: 32.1 vvc_avg_8_32x4_c: 2359.6 vvc_avg_8_32x4_avx2: 39.6 vvc_avg_8_32x8_c: 5703.6 vvc_avg_8_32x8_avx2: 57.1 vvc_avg_8_32x16_c: 9967.6 vvc_avg_8_32x16_avx2: 107.1 vvc_avg_8_32x32_c: 21327.6 vvc_avg_8_32x32_avx2: 272.6 vvc_avg_8_32x64_c: 39240.8 vvc_avg_8_32x64_avx2: 529.6 vvc_avg_8_32x128_c: 52580.8 vvc_avg_8_32x128_avx2: 1338.8 vvc_avg_8_64x2_c: 1647.3 vvc_avg_8_64x2_avx2: 38.8 vvc_avg_8_64x4_c: 5130.1 vvc_avg_8_64x4_avx2: 58.8 vvc_avg_8_64x8_c: 6529.3 vvc_avg_8_64x8_avx2: 88.3 vvc_avg_8_64x16_c: 19913.6 vvc_avg_8_64x16_avx2: 162.3 vvc_avg_8_64x32_c: 39360.8 vvc_avg_8_64x32_avx2: 295.8 vvc_avg_8_64x64_c: 49658.3 vvc_avg_8_64x64_avx2: 784.1 vvc_avg_8_64x128_c: 108513.1 vvc_avg_8_64x128_avx2: 1977.1 vvc_avg_8_128x2_c: 3226.1 vvc_avg_8_128x2_avx2: 61.1 vvc_avg_8_128x4_c: 10280.3 vvc_avg_8_128x4_avx2: 94.6 vvc_avg_8_128x8_c: 18079.3 vvc_avg_8_128x8_avx2: 155.3 vvc_avg_8_128x16_c: 45121.8 vvc_avg_8_128x16_avx2: 285.3 vvc_avg_8_128x32_c: 48651.8 vvc_avg_8_128x32_avx2: 581.6 vvc_avg_8_128x64_c: 165078.6 vvc_avg_8_128x64_avx2: 1942.8 vvc_avg_8_128x128_c: 339103.1 vvc_avg_8_128x128_avx2: 4332.6 vvc_avg_10_2x2_c: 144.3 vvc_avg_10_2x2_avx2: 26.8 vvc_avg_10_2x4_c: 142.6 vvc_avg_10_2x4_avx2: 45.3 vvc_avg_10_2x8_c: 478.1 vvc_avg_10_2x8_avx2: 38.1 vvc_avg_10_2x16_c: 518.3 vvc_avg_10_2x16_avx2: 58.1 vvc_avg_10_2x32_c: 2059.8 vvc_avg_10_2x32_avx2: 93.1 vvc_avg_10_2x64_c: 2383.8 vvc_avg_10_2x64_avx2: 714.8 vvc_avg_10_2x128_c: 4498.3 vvc_avg_10_2x128_avx2: 1466.3 vvc_avg_10_4x2_c: 228.6 vvc_avg_10_4x2_avx2: 26.8 vvc_avg_10_4x4_c: 378.3 vvc_avg_10_4x4_avx2: 30.6 vvc_avg_10_4x8_c: 866.8 vvc_avg_10_4x8_avx2: 44.6 vvc_avg_10_4x16_c: 1018.1 vvc_avg_10_4x16_avx2: 58.1 vvc_avg_10_4x32_c: 3590.8 vvc_avg_10_4x32_avx2: 128.8 vvc_avg_10_4x64_c: 4200.8 vvc_avg_10_4x64_avx2: 663.6 vvc_avg_10_4x128_c: 8450.8 vvc_avg_10_4x128_avx2: 1531.8 vvc_avg_10_8x2_c: 369.3 vvc_avg_10_8x2_avx2: 28.3 vvc_avg_10_8x4_c: 513.8 vvc_avg_10_8x4_avx2: 32.1 vvc_avg_10_8x8_c: 1720.3 vvc_avg_10_8x8_avx2: 49.1 vvc_avg_10_8x16_c: 1894.8 vvc_avg_10_8x16_avx2: 71.6 vvc_avg_10_8x32_c: 3931.3 vvc_avg_10_8x32_avx2: 148.1 vvc_avg_10_8x64_c: 7964.3 vvc_avg_10_8x64_avx2: 613.1 vvc_avg_10_8x128_c: 15540.1 vvc_avg_10_8x128_avx2: 1585.1 vvc_avg_10_16x2_c: 877.3 vvc_avg_10_16x2_avx2: 27.6 vvc_avg_10_16x4_c: 955.8 vvc_avg_10_16x4_avx2: 29.8 vvc_avg_10_16x8_c: 3419.6 vvc_avg_10_16x8_avx2: 62.6 vvc_avg_10_16x16_c: 3826.8 vvc_avg_10_16x16_avx2: 54.3 vvc_avg_10_16x32_c: 7655.3 vvc_avg_10_16x32_avx2: 86.3 vvc_avg_10_16x64_c: 30011.1 vvc_avg_10_16x64_avx2: 692.6 vvc_avg_10_16x128_c: 47894.8 vvc_avg_10_16x128_avx2: 1580.3 vvc_avg_10_32x2_c: 944.3 vvc_avg_10_32x2_avx2: 29.8 vvc_avg_10_32x4_c: 2022.6 vvc_avg_10_32x4_avx2: 35.1 vvc_avg_10_32x8_c: 6148.8 vvc_avg_10_32x8_avx2: 51.3 vvc_avg_10_32x16_c: 12601.6 vvc_avg_10_32x16_avx2: 70.8 vvc_avg_10_32x32_c: 15958.6 vvc_avg_10_32x32_avx2: 124.3 vvc_avg_10_32x64_c: 31784.6 vvc_avg_10_32x64_avx2: 757.3 vvc_avg_10_32x128_c: 63892.8 vvc_avg_10_32x128_avx2: 1711.3 vvc_avg_10_64x2_c: 1890.8 vvc_avg_10_64x2_avx2: 34.3 vvc_avg_10_64x4_c: 6267.3 vvc_avg_10_64x4_avx2: 42.6 vvc_avg_10_64x8_c: 12778.1 vvc_avg_10_64x8_avx2: 67.8 vvc_avg_10_64x16_c: 22304.3 vvc_avg_10_64x16_avx2: 116.8 vvc_avg_10_64x32_c: 30777.1 vvc_avg_10_64x32_avx2: 201.1 vvc_avg_10_64x64_c: 60169.1 vvc_avg_10_64x64_avx2: 1454.3 vvc_avg_10_64x128_c: 124392.8 vvc_avg_10_64x128_avx2: 3648.6 vvc_avg_10_128x2_c: 3650.1 vvc_avg_10_128x2_avx2: 41.1 vvc_avg_10_128x4_c: 22887.8 vvc_avg_10_128x4_avx2: 64.1 vvc_avg_10_128x8_c: 14622.6 vvc_avg_10_128x8_avx2: 111.6 vvc_avg_10_128x16_c: 62207.6 vvc_avg_10_128x16_avx2: 186.3 vvc_avg_10_128x32_c: 59761.3 vvc_avg_10_128x32_avx2: 374.6 vvc_avg_10_128x64_c: 117504.3 vvc_avg_10_128x64_avx2: 2684.6 vvc_avg_10_128x128_c: 236767.6 vvc_avg_10_128x128_avx2: 15278.1 vvc_avg_12_2x2_c: 78.6 vvc_avg_12_2x2_avx2: 26.1 vvc_avg_12_2x4_c: 254.1 vvc_avg_12_2x4_avx2: 30.6 vvc_avg_12_2x8_c: 261.8 vvc_avg_12_2x8_avx2: 39.1 vvc_avg_12_2x16_c: 527.6 vvc_avg_12_2x16_avx2: 57.3 vvc_avg_12_2x32_c: 1089.1 vvc_avg_12_2x32_avx2: 93.8 vvc_avg_12_2x64_c: 2337.6 vvc_avg_12_2x64_avx2: 707.1 vvc_avg_12_2x128_c: 4582.1 vvc_avg_12_2x128_avx2: 1414.6 vvc_avg_12_4x2_c: 129.6 vvc_avg_12_4x2_avx2: 26.8 vvc_avg_12_4x4_c: 427.3 vvc_avg_12_4x4_avx2: 30.6 vvc_avg_12_4x8_c: 529.6 vvc_avg_12_4x8_avx2: 36.6 vvc_avg_12_4x16_c: 1022.1 vvc_avg_12_4x16_avx2: 57.3 vvc_avg_12_4x32_c: 1987.6 vvc_avg_12_4x32_avx2: 84.3 vvc_avg_12_4x64_c: 4147.6 vvc_avg_12_4x64_avx2: 706.3 vvc_avg_12_4x128_c: 8469.3 vvc_avg_12_4x128_avx2: 1448.3 vvc_avg_12_8x2_c: 253.6 vvc_avg_12_8x2_avx2: 27.6 vvc_avg_12_8x4_c: 836.3 vvc_avg_12_8x4_avx2: 32.1 vvc_avg_12_8x8_c: 1074.6 vvc_avg_12_8x8_avx2: 45.1 vvc_avg_12_8x16_c: 3616.8 vvc_avg_12_8x16_avx2: 71.6 vvc_avg_12_8x32_c: 3823.6 vvc_avg_12_8x32_avx2: 140.1 vvc_avg_12_8x64_c: 7764.8 vvc_avg_12_8x64_avx2: 656.1 vvc_avg_12_8x128_c: 15896.1 vvc_avg_12_8x128_avx2: 1232.8 vvc_avg_12_16x2_c: 462.1 vvc_avg_12_16x2_avx2: 26.8 vvc_avg_12_16x4_c: 1732.1 vvc_avg_12_16x4_avx2: 29.1 vvc_avg_12_16x8_c: 2097.6 vvc_avg_12_16x8_avx2: 62.6 vvc_avg_12_16x16_c: 6753.1 vvc_avg_12_16x16_avx2: 47.8 vvc_avg_12_16x32_c: 7373.1 vvc_avg_12_16x32_avx2: 80.8 vvc_avg_12_16x64_c: 15046.3 vvc_avg_12_16x64_avx2: 621.1 vvc_avg_12_16x128_c: 52574.6 vvc_avg_12_16x128_avx2: 1417.1 vvc_avg_12_32x2_c: 1712.1 vvc_avg_12_32x2_avx2: 29.8 vvc_avg_12_32x4_c: 2036.8 vvc_avg_12_32x4_avx2: 37.6 vvc_avg_12_32x8_c: 4017.6 vvc_avg_12_32x8_avx2: 44.1 vvc_avg_12_32x16_c: 8018.6 vvc_avg_12_32x16_avx2: 70.8 vvc_avg_12_32x32_c: 15637.6 vvc_avg_12_32x32_avx2: 124.3 vvc_avg_12_32x64_c: 31143.3 vvc_avg_12_32x64_avx2: 830.3 vvc_avg_12_32x128_c: 75706.8 vvc_avg_12_32x128_avx2: 1604.8 vvc_avg_12_64x2_c: 3230.3 vvc_avg_12_64x2_avx2: 33.6 vvc_avg_12_64x4_c: 4139.6 vvc_avg_12_64x4_avx2: 45.1 vvc_avg_12_64x8_c: 8201.6 vvc_avg_12_64x8_avx2: 67.1 vvc_avg_12_64x16_c: 25632.3 vvc_avg_12_64x16_avx2: 110.3 vvc_avg_12_64x32_c: 30744.3 vvc_avg_12_64x32_avx2: 200.3 vvc_avg_12_64x64_c: 105554.8 vvc_avg_12_64x64_avx2: 1325.6 vvc_avg_12_64x128_c: 235254.3 vvc_avg_12_64x128_avx2: 3132.6 vvc_avg_12_128x2_c: 6194.3 vvc_avg_12_128x2_avx2: 55.1 vvc_avg_12_128x4_c: 7583.8 vvc_avg_12_128x4_avx2: 79.3 vvc_avg_12_128x8_c: 14635.6 vvc_avg_12_128x8_avx2: 104.3 vvc_avg_12_128x16_c: 29270.8 vvc_avg_12_128x16_avx2: 194.3 vvc_avg_12_128x32_c: 60113.6 vvc_avg_12_128x32_avx2: 346.3 vvc_avg_12_128x64_c: 197030.3 vvc_avg_12_128x64_avx2: 2779.6 vvc_avg_12_128x128_c: 432809.6 vvc_avg_12_128x128_avx2: 5513.3 vvc_w_avg_8_2x2_c: 84.3 vvc_w_avg_8_2x2_avx2: 42.6 vvc_w_avg_8_2x4_c: 156.3 vvc_w_avg_8_2x4_avx2: 58.8 vvc_w_avg_8_2x8_c: 310.6 vvc_w_avg_8_2x8_avx2: 73.1 vvc_w_avg_8_2x16_c: 942.1 vvc_w_avg_8_2x16_avx2: 113.3 vvc_w_avg_8_2x32_c: 1098.8 vvc_w_avg_8_2x32_avx2: 202.6 vvc_w_avg_8_2x64_c: 2414.3 vvc_w_avg_8_2x64_avx2: 467.6 vvc_w_avg_8_2x128_c: 4763.8 vvc_w_avg_8_2x128_avx2: 1333.1 vvc_w_avg_8_4x2_c: 140.1 vvc_w_avg_8_4x2_avx2: 49.8 vvc_w_avg_8_4x4_c: 276.3 vvc_w_avg_8_4x4_avx2: 58.1 vvc_w_avg_8_4x8_c: 524.3 vvc_w_avg_8_4x8_avx2: 72.3 vvc_w_avg_8_4x16_c: 1108.1 vvc_w_avg_8_4x16_avx2: 111.8 vvc_w_avg_8_4x32_c: 2149.8 vvc_w_avg_8_4x32_avx2: 199.6 vvc_w_avg_8_4x64_c: 12288.1 vvc_w_avg_8_4x64_avx2: 509.3 vvc_w_avg_8_4x128_c: 8398.6 vvc_w_avg_8_4x128_avx2: 1319.6 vvc_w_avg_8_8x2_c: 271.1 vvc_w_avg_8_8x2_avx2: 44.1 vvc_w_avg_8_8x4_c: 503.3 vvc_w_avg_8_8x4_avx2: 61.8 vvc_w_avg_8_8x8_c: 1031.1 vvc_w_avg_8_8x8_avx2: 93.8 vvc_w_avg_8_8x16_c: 2009.8 vvc_w_avg_8_8x16_avx2: 163.1 vvc_w_avg_8_8x32_c: 4161.3 vvc_w_avg_8_8x32_avx2: 292.1 vvc_w_avg_8_8x64_c: 7940.6 vvc_w_avg_8_8x64_avx2: 592.1 vvc_w_avg_8_8x128_c: 16802.3 vvc_w_avg_8_8x128_avx2: 1287.6 vvc_w_avg_8_16x2_c: 762.6 vvc_w_avg_8_16x2_avx2: 53.6 vvc_w_avg_8_16x4_c: 1486.3 vvc_w_avg_8_16x4_avx2: 67.1 vvc_w_avg_8_16x8_c: 1907.8 vvc_w_avg_8_16x8_avx2: 96.8 vvc_w_avg_8_16x16_c: 3883.6 vvc_w_avg_8_16x16_avx2: 151.3 vvc_w_avg_8_16x32_c: 7974.8 vvc_w_avg_8_16x32_avx2: 285.8 vvc_w_avg_8_16x64_c: 25160.6 vvc_w_avg_8_16x64_avx2: 589.8 vvc_w_avg_8_16x128_c: 58328.1 vvc_w_avg_8_16x128_avx2: 1169.8 vvc_w_avg_8_32x2_c: 1009.1 vvc_w_avg_8_32x2_avx2: 65.6 vvc_w_avg_8_32x4_c: 2091.1 vvc_w_avg_8_32x4_avx2: 96.8 vvc_w_avg_8_32x8_c: 3997.8 vvc_w_avg_8_32x8_avx2: 156.3 vvc_w_avg_8_32x16_c: 8216.8 vvc_w_avg_8_32x16_avx2: 269.6 vvc_w_avg_8_32x32_c: 21746.1 vvc_w_avg_8_32x32_avx2: 635.3 vvc_w_avg_8_32x64_c: 31564.8 vvc_w_avg_8_32x64_avx2: 1010.6 vvc_w_avg_8_32x128_c: 114373.3 vvc_w_avg_8_32x128_avx2: 2013.6 vvc_w_avg_8_64x2_c: 2067.3 vvc_w_avg_8_64x2_avx2: 97.6 vvc_w_avg_8_64x4_c: 3901.1 vvc_w_avg_8_64x4_avx2: 154.8 vvc_w_avg_8_64x8_c: 7911.6 vvc_w_avg_8_64x8_avx2: 268.8 vvc_w_avg_8_64x16_c: 16508.8 vvc_w_avg_8_64x16_avx2: 501.8 vvc_w_avg_8_64x32_c: 38770.3 vvc_w_avg_8_64x32_avx2: 1287.6 vvc_w_avg_8_64x64_c: 110350.6 vvc_w_avg_8_64x64_avx2: 1890.8 vvc_w_avg_8_64x128_c: 141354.6 vvc_w_avg_8_64x128_avx2: 3839.6 vvc_w_avg_8_128x2_c: 7012.1 vvc_w_avg_8_128x2_avx2: 159.3 vvc_w_avg_8_128x4_c: 8146.8 vvc_w_avg_8_128x4_avx2: 272.6 vvc_w_avg_8_128x8_c: 24596.8 vvc_w_avg_8_128x8_avx2: 501.1 vvc_w_avg_8_128x16_c: 35918.1 vvc_w_avg_8_128x16_avx2: 948.8 vvc_w_avg_8_128x32_c: 68799.6 vvc_w_avg_8_128x32_avx2: 1963.1 vvc_w_avg_8_128x64_c: 133862.1 vvc_w_avg_8_128x64_avx2: 3833.6 vvc_w_avg_8_128x128_c: 348427.8 vvc_w_avg_8_128x128_avx2: 7682.8 vvc_w_avg_10_2x2_c: 118.6 vvc_w_avg_10_2x2_avx2: 73.1 vvc_w_avg_10_2x4_c: 189.1 vvc_w_avg_10_2x4_avx2: 89.3 vvc_w_avg_10_2x8_c: 382.8 vvc_w_avg_10_2x8_avx2: 179.8 vvc_w_avg_10_2x16_c: 658.3 vvc_w_avg_10_2x16_avx2: 185.1 vvc_w_avg_10_2x32_c: 1409.3 vvc_w_avg_10_2x32_avx2: 290.8 vvc_w_avg_10_2x64_c: 2906.8 vvc_w_avg_10_2x64_avx2: 793.1 vvc_w_avg_10_2x128_c: 6292.6 vvc_w_avg_10_2x128_avx2: 1696.8 vvc_w_avg_10_4x2_c: 178.8 vvc_w_avg_10_4x2_avx2: 80.1 vvc_w_avg_10_4x4_c: 581.6 vvc_w_avg_10_4x4_avx2: 97.6 vvc_w_avg_10_4x8_c: 693.3 vvc_w_avg_10_4x8_avx2: 128.1 vvc_w_avg_10_4x16_c: 1436.6 vvc_w_avg_10_4x16_avx2: 179.8 vvc_w_avg_10_4x32_c: 2409.1 vvc_w_avg_10_4x32_avx2: 292.3 vvc_w_avg_10_4x64_c: 4925.3 vvc_w_avg_10_4x64_avx2: 746.1 vvc_w_avg_10_4x128_c: 10664.6 vvc_w_avg_10_4x128_avx2: 1647.6 vvc_w_avg_10_8x2_c: 359.3 vvc_w_avg_10_8x2_avx2: 80.1 vvc_w_avg_10_8x4_c: 925.6 vvc_w_avg_10_8x4_avx2: 97.6 vvc_w_avg_10_8x8_c: 1360.6 vvc_w_avg_10_8x8_avx2: 121.8 vvc_w_avg_10_8x16_c: 3490.3 vvc_w_avg_10_8x16_avx2: 203.3 vvc_w_avg_10_8x32_c: 5266.1 vvc_w_avg_10_8x32_avx2: 325.8 vvc_w_avg_10_8x64_c: 11127.1 vvc_w_avg_10_8x64_avx2: 747.8 vvc_w_avg_10_8x128_c: 31058.3 vvc_w_avg_10_8x128_avx2: 1424.6 vvc_w_avg_10_16x2_c: 624.8 vvc_w_avg_10_16x2_avx2: 84.6 vvc_w_avg_10_16x4_c: 1389.6 vvc_w_avg_10_16x4_avx2: 109.1 vvc_w_avg_10_16x8_c: 2688.3 vvc_w_avg_10_16x8_avx2: 137.1 vvc_w_avg_10_16x16_c: 5387.1 vvc_w_avg_10_16x16_avx2: 224.6 vvc_w_avg_10_16x32_c: 10776.3 vvc_w_avg_10_16x32_avx2: 312.1 vvc_w_avg_10_16x64_c: 18069.1 vvc_w_avg_10_16x64_avx2: 858.6 vvc_w_avg_10_16x128_c: 43460.3 vvc_w_avg_10_16x128_avx2: 1411.6 vvc_w_avg_10_32x2_c: 1232.8 vvc_w_avg_10_32x2_avx2: 99.1 vvc_w_avg_10_32x4_c: 4017.6 vvc_w_avg_10_32x4_avx2: 134.1 vvc_w_avg_10_32x8_c: 9306.3 vvc_w_avg_10_32x8_avx2: 208.1 vvc_w_avg_10_32x16_c: 8424.6 vvc_w_avg_10_32x16_avx2: 349.3 vvc_w_avg_10_32x32_c: 20787.8 vvc_w_avg_10_32x32_avx2: 655.3 vvc_w_avg_10_32x64_c: 40972.1 vvc_w_avg_10_32x64_avx2: 904.8 vvc_w_avg_10_32x128_c: 85670.3 vvc_w_avg_10_32x128_avx2: 1751.6 vvc_w_avg_10_64x2_c: 2454.1 vvc_w_avg_10_64x2_avx2: 132.6 vvc_w_avg_10_64x4_c: 5012.6 vvc_w_avg_10_64x4_avx2: 215.6 vvc_w_avg_10_64x8_c: 10811.3 vvc_w_avg_10_64x8_avx2: 361.1 vvc_w_avg_10_64x16_c: 33349.1 vvc_w_avg_10_64x16_avx2: 904.1 vvc_w_avg_10_64x32_c: 41892.3 vvc_w_avg_10_64x32_avx2: 1220.6 vvc_w_avg_10_64x64_c: 66983.3 vvc_w_avg_10_64x64_avx2: 2622.1 vvc_w_avg_10_64x128_c: 246508.8 vvc_w_avg_10_64x128_avx2: 3316.8 vvc_w_avg_10_128x2_c: 7791.6 vvc_w_avg_10_128x2_avx2: 198.8 vvc_w_avg_10_128x4_c: 10534.3 vvc_w_avg_10_128x4_avx2: 337.3 vvc_w_avg_10_128x8_c: 21142.3 vvc_w_avg_10_128x8_avx2: 614.8 vvc_w_avg_10_128x16_c: 40968.6 vvc_w_avg_10_128x16_avx2: 1160.6 vvc_w_avg_10_128x32_c: 113043.3 vvc_w_avg_10_128x32_avx2: 1644.6 vvc_w_avg_10_128x64_c: 230658.3 vvc_w_avg_10_128x64_avx2: 5065.3 vvc_w_avg_10_128x128_c: 335236.3 vvc_w_avg_10_128x128_avx2: 6450.3 vvc_w_avg_12_2x2_c: 185.3 vvc_w_avg_12_2x2_avx2: 43.6 vvc_w_avg_12_2x4_c: 340.3 vvc_w_avg_12_2x4_avx2: 55.8 vvc_w_avg_12_2x8_c: 632.3 vvc_w_avg_12_2x8_avx2: 70.1 vvc_w_avg_12_2x16_c: 728.3 vvc_w_avg_12_2x16_avx2: 108.1 vvc_w_avg_12_2x32_c: 1392.6 vvc_w_avg_12_2x32_avx2: 176.8 vvc_w_avg_12_2x64_c: 2618.3 vvc_w_avg_12_2x64_avx2: 757.3 vvc_w_avg_12_2x128_c: 6408.8 vvc_w_avg_12_2x128_avx2: 1435.1 vvc_w_avg_12_4x2_c: 349.3 vvc_w_avg_12_4x2_avx2: 44.3 vvc_w_avg_12_4x4_c: 607.1 vvc_w_avg_12_4x4_avx2: 52.6 vvc_w_avg_12_4x8_c: 1134.8 vvc_w_avg_12_4x8_avx2: 70.1 vvc_w_avg_12_4x16_c: 1378.1 vvc_w_avg_12_4x16_avx2: 115.3 vvc_w_avg_12_4x32_c: 2599.3 vvc_w_avg_12_4x32_avx2: 174.3 vvc_w_avg_12_4x64_c: 4474.8 vvc_w_avg_12_4x64_avx2: 656.1 vvc_w_avg_12_4x128_c: 11319.6 vvc_w_avg_12_4x128_avx2: 1373.1 vvc_w_avg_12_8x2_c: 595.8 vvc_w_avg_12_8x2_avx2: 44.3 vvc_w_avg_12_8x4_c: 1164.3 vvc_w_avg_12_8x4_avx2: 56.6 vvc_w_avg_12_8x8_c: 2019.6 vvc_w_avg_12_8x8_avx2: 80.1 vvc_w_avg_12_8x16_c: 4071.6 vvc_w_avg_12_8x16_avx2: 139.3 vvc_w_avg_12_8x32_c: 4485.1 vvc_w_avg_12_8x32_avx2: 250.6 vvc_w_avg_12_8x64_c: 8404.8 vvc_w_avg_12_8x64_avx2: 735.8 vvc_w_avg_12_8x128_c: 35679.8 vvc_w_avg_12_8x128_avx2: 1252.6 vvc_w_avg_12_16x2_c: 1114.8 vvc_w_avg_12_16x2_avx2: 46.6 vvc_w_avg_12_16x4_c: 2240.1 vvc_w_avg_12_16x4_avx2: 62.6 vvc_w_avg_12_16x8_c: 13174.6 vvc_w_avg_12_16x8_avx2: 88.6 vvc_w_avg_12_16x16_c: 5334.6 vvc_w_avg_12_16x16_avx2: 144.3 vvc_w_avg_12_16x32_c: 8378.1 vvc_w_avg_12_16x32_avx2: 234.6 vvc_w_avg_12_16x64_c: 21300.8 vvc_w_avg_12_16x64_avx2: 761.8 vvc_w_avg_12_16x128_c: 32786.8 vvc_w_avg_12_16x128_avx2: 1432.8 vvc_w_avg_12_32x2_c: 2154.3 vvc_w_avg_12_32x2_avx2: 61.1 vvc_w_avg_12_32x4_c: 4299.8 vvc_w_avg_12_32x4_avx2: 83.1 vvc_w_avg_12_32x8_c: 7964.8 vvc_w_avg_12_32x8_avx2: 132.6 vvc_w_avg_12_32x16_c: 13321.6 vvc_w_avg_12_32x16_avx2: 234.6 vvc_w_avg_12_32x32_c: 21149.3 vvc_w_avg_12_32x32_avx2: 433.3 vvc_w_avg_12_32x64_c: 43666.6 vvc_w_avg_12_32x64_avx2: 876.6 vvc_w_avg_12_32x128_c: 83189.8 vvc_w_avg_12_32x128_avx2: 1756.6 vvc_w_avg_12_64x2_c: 3829.8 vvc_w_avg_12_64x2_avx2: 83.1 vvc_w_avg_12_64x4_c: 8588.1 vvc_w_avg_12_64x4_avx2: 127.1 vvc_w_avg_12_64x8_c: 17027.6 vvc_w_avg_12_64x8_avx2: 310.6 vvc_w_avg_12_64x16_c: 29797.8 vvc_w_avg_12_64x16_avx2: 415.6 vvc_w_avg_12_64x32_c: 43854.3 vvc_w_avg_12_64x32_avx2: 773.3 vvc_w_avg_12_64x64_c: 137767.3 vvc_w_avg_12_64x64_avx2: 1608.6 vvc_w_avg_12_64x128_c: 316428.3 vvc_w_avg_12_64x128_avx2: 3249.8 vvc_w_avg_12_128x2_c: 8824.6 vvc_w_avg_12_128x2_avx2: 130.3 vvc_w_avg_12_128x4_c: 17173.6 vvc_w_avg_12_128x4_avx2: 219.3 vvc_w_avg_12_128x8_c: 21997.8 vvc_w_avg_12_128x8_avx2: 397.3 vvc_w_avg_12_128x16_c: 43553.8 vvc_w_avg_12_128x16_avx2: 790.1 vvc_w_avg_12_128x32_c: 89792.1 vvc_w_avg_12_128x32_avx2: 1497.6 vvc_w_avg_12_128x64_c: 226573.3 vvc_w_avg_12_128x64_avx2: 3153.1 vvc_w_avg_12_128x128_c: 332090.1 vvc_w_avg_12_128x128_avx2: 6499.6 Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:29 +08:00
Wu Jianhua	70889620f2	avcodec/vvcdec: reuse h26x/2656_inter.asm to enable x86 optimizations Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:28 +08:00
Wu Jianhua	fc5ff6b0b8	avcodec/x86/h26x/h2656_inter: add dststride to put Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:28 +08:00
Wu Jianhua	7d9f1f5485	avcodec/x86/hevc_mc: move put/put_uni to h26x/h2656_inter.asm This enable that the asm optimization can be reused by VVC Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:28 +08:00
Wu Jianhua	04c2e246a3	avcodec/hevcdsp_template: reuse put/put_luma/put_chroma from h2656_inter_template Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:28 +08:00
Wu Jianhua	639b1820ce	avcodec/vvc/vvc_inter_template: move put/put_luma/put_chroma template to h2656_inter_template.c Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-02-01 19:54:28 +08:00
James Almer	fa469545ba	avcodec: move leb reading functions to its own header Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-31 11:19:16 -03:00
Andreas Rheinhardt	7252e4f8ee	avcodec/aac_defines: Remove unused AAC_RENAME_32 Unused since `fbe6a51b11`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-30 20:43:48 +01:00
Frank Plowman	36a986d9a1	lavc/vvc: Add check to num_multi_layer_olss Check that vps_each_layer_is_an_ols_flag, which indicates that "at least one OLS specified by the VPS contains more than one layer," is set if num_multi_layer_olss is non-zero. Fixes: 65160/clusterfuzz-testcase-minimized-ffmpeg_BSF_VVC_METADATA_fuzzer-4665241535119360 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Frank Plowman <post@frankplowman.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-30 09:24:03 -03:00
Zhao Zhili	bc944168db	avcodec/mpeg4videodec: Remove write-only variable Fix warning: variable 'time_incr' set but not used. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-30 18:25:52 +08:00
James Almer	66f028accb	avcodec/cbs_h266: fix logic setting num_layers_in_ols when vps_ols_mode_idc is 2 The old code did not follow the syntax from the spec. Reviewed-by: Frank Plowman <post@frankplowman.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-29 20:45:31 -03:00
Frank Plowman	85e031d5bf	lavc/vvc: Increase IntraEdgeParams buffer size The reference line buffers are used with indices in the range -MAX_TB_SIZE - 3 to refw + FFMAX(1, w/h) * ref_idx + 1, which is at most 5*MAX_TB_SIZE + 1. Fixes buffer overflows. http://fate.ffmpeg.org/report.cgi?slot=armv7-linux-gcc-9&time=20240124051736 Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-29 20:29:57 -03:00
Anton Khirnov	887a7817b6	lavc: move bitstream filters into bsf/ subdir	2024-01-29 18:40:14 +01:00
Andreas Rheinhardt	341d0419e1	avcodec/texturedsp: Factor common code out Namely calling avctx->execute2(avctx,...). Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 11:00:01 +01:00
Andreas Rheinhardt	4f58372c10	avcodec/hap: Avoid unnecessary opt.h inclusion It presumably exists because HapContext contains an AVClass. Yet AVClass is actually defined in log.h and even this inclusion can be avoided by struct AVClass. This avoids opt.h inclusions in hap.c and hapdec.c. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:59:36 +01:00
Andreas Rheinhardt	a3fc9fb9fb	avcodec/texturedsp: Add separate TextureDSPEncContext ff_texturedspenc_init() doesn't support most of the function types supported for decoding; add a separate context containing only pointers for the actually supported types. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:52:48 +01:00
Andreas Rheinhardt	8f791304f2	avcodec/dxvenc, hap(dec\|enc): Move TextureDSPContext to stack Only used during init. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:52:41 +01:00
Andreas Rheinhardt	e1d1304b4b	avcodec/texturedspenc: Remove unused rgtc1_u_alpha encoding func Effectively reverts `50a20de6b9`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:52:38 +01:00
Andreas Rheinhardt	916f016741	avcodec/dxvenc: Fix data races with slice threading The old code set a common struct from each thread; this only "worked" (but is still UB) because the values written are the same for each thread. Fix this by moving the assignments to the main thread. (This also avoids casting const away from a const AVFrame*.) Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:52:24 +01:00
Andreas Rheinhardt	555879ca7c	avcodec/dxvenc: Don't cast const away Reviewed-by: Connor Worley <connorbworley@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-28 10:52:11 +01:00
Frank Plowman	0c517fcbe8	lavc/vvc: Fix emulation prevention byte handling nal->skipped_bytes_pos contains the positions of errors relative to the start of the slice header, whereas the position they were tested against is relative to the start of the slice data, i.e. one byte after the end of the slice header. Patch fixes this by storing the size of the slice header in H266RawSlice and adding it to the position given by the GetBitContext before comparing to skipped_bytes_pos. This fixes AVERROR_INVALIDDATAs for various valid bitstreams, such as the LMCS_B_Dolby_2 conformance bitstream. Signed-off-by: Frank Plowman <post@frankplowman.com>	2024-01-27 11:29:40 -03:00
James Almer	eb4584f994	avcodec/vvc_ps: remove duplicated enum Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-26 19:20:48 -03:00
Frank Plowman	763e31a8d3	lavc/vvc: Clamp shift RHS Resolves the following undefined behavior sanitiser error: runtime error: shift exponent 32 is too large for 32-bit type 'int' Signed-off-by: Frank Plowman <post@frankplowman.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-26 15:47:41 -03:00
Frank Plowman	cb7b4ee024	lavc/vvc: Use av_log2 when destination is integer Signed-off-by: Frank Plowman <post@frankplowman.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-26 15:47:41 -03:00
Tong Wu	8c99a1429a	avcodec/d3d12va_mpeg2\|vc1: remove the unused macros These macros are no longer used. Remove them. Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-26 09:22:12 +08:00
Tong Wu	8b41e9cfbe	avcodec/d3d12va_decode: check existance before assigning a new index Fixes #10759. It can happen in H.264, MPEG2, VC1 that the current frame resource memory is already in ref_resource. For example, for a interlaced frame, the same curr memory is passed twice. For the second time it could possibly reference itself. When this happens the curr is already given an index and in ref_resources. When the reference frame index is required, we should check the existance in the ref_resources first before assigning a new index for it. Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-26 09:22:12 +08:00
Leo Izen	ac06190a5a	avcodec/libjxl.h: include version.h This file has been exported since our minimum required version (0.7.0), but it wasn't documented. Instead it was transitively included by <jxl/decode.h> (but not jxl/encode.h), which ffmpeg relied on. libjxl broke its API in libjxl/libjxl@66b9592393 by removing the transitive include of version.h, and they do not plan on adding it back. Instead they are choosing to leave the API backwards- incompatible with downstream callers written for some fairly recent versions of their API. As a result, we include <jxl/version.h> to continue to build against more recent versions of libjxl. The version macros removed are also present in that file, so we no longer need to redefine them. Signed-off-by: Leo Izen <leo.izen@gmail.com>	2024-01-25 11:07:28 -05:00
James Almer	45a2f2635d	avcodec/d3d12va: remove unused variables Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:34:28 -03:00
James Almer	49a7fe86fe	avcodec/d3d12va_vc1: cast mapped_data to void* before passing it to mapped_data() Fixes -Wincompatible-pointer-types warnings. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:28:05 -03:00
James Almer	9aa388e758	avcodec/d3d12va_hevc: cast mapped_data to void* before passing it to mapped_data() Fixes -Wincompatible-pointer-types warnings. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:28:05 -03:00
James Almer	342cc1792f	avcodec/d3d12va_h264: cast mapped_data to void* before passing it to mapped_data() Fixes -Wincompatible-pointer-types warnings. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:28:04 -03:00
James Almer	04332ca35e	avcodec/d3d12va_vp9.c: change the type for the ID3D12Resource_Map mapped_data argument Fixes -Wincompatible-pointer-types warnings. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:27:58 -03:00
James Almer	b4d871fdc8	avcodec/d3d12va_av1.c: change the type for the ID3D12Resource_Map mapped_data argument Fixes -Wincompatible-pointer-types warnings. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 17:27:40 -03:00
James Almer	cb6a488fba	avcodec/vvc_mvs: remove an unnecessary AV_ZERO64() call Should fix "member access within misaligned address 0xf00 for type 'const union av_alias64', which requires 8 byte alignment" errors as reported by GCC ubsan. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 12:41:01 -03:00
James Almer	bc1d8a9b76	avcodec/vvc_mvs: align local motion vector fields Should fix "member access within misaligned address 0xf00 for type 'const union av_alias64', which requires 8 byte alignment" errors as reported by GCC ubsan. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-24 12:41:01 -03:00
Andreas Rheinhardt	3435565e26	avcodec/d3d12va_(av1\|hevc\|vp9): Don't use deprecated FF_PROFILE_* Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2024-01-24 15:49:12 +01:00
Connor Worley	dfbbd11a4b	lavc/dxvenc: add DXV encoder with support for DXT1 texture format Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>	2024-01-23 21:31:22 +01:00
James Almer	1496ce8f6b	avcodec/vvc_ctu: align motion vector fields Should fix "member access within misaligned address 0xf00 for type 'const union av_alias64', which requires 8 byte alignment" errors as reported by GCC ubsan. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-23 17:24:15 -03:00
Frank Plowman	8157b5d405	lavc/vvc: Remove left shifts of negative values VVC specifies << as arithmetic left shift, i.e. x << y is equivalent to x * pow2(y). C's << on the other hand has UB if x is negative. This patch removes all UB resulting from this, mostly by replacing x << y with x * (1 << y), but there are also a couple places where the OOP was changed instead. Signed-off-by: Frank Plowman <post@frankplowman.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-23 11:17:05 -03:00
James Almer	ab39cc36c7	avcodec/speexdec: fix setting frame_size from extradata Finishes fixing vp5/potter512-400-partial.avi The fate-matroska-ms-mode test ref is updated to reflect that the Speex decoder can now read the stream. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-22 10:58:12 -03:00
James Almer	cad35f0a77	avcodec/speexdec: relax the extradata check for the speex string There could be bogus bytes at the start, as is the case of vp5/potter512-400-partial.avi from the FATE suite, which could be a case of bad remuxing from an OGG source. Partially fixes decoding of vp5/potter512-400-partial.avi Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-22 10:58:12 -03:00
Anton Khirnov	08bebeb1be	Revert "all: Don't set AVClass.item_name to its default value" Some callers assume that item_name is always set, so this may be considered an API break. This reverts commit `0c6203c97a`.	2024-01-20 10:34:48 +01:00
James Almer	0a5813fc68	avcodec/vvcdec: allocate and store structs on their own within the table list Fixes "runtime error: member access within misaligned address 0xf00 for type 'struct bar', which requires # byte alignment" errors under GCC ubsan. Reviewed-by: Nuo Mi <nuomi2021@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-19 08:53:32 -03:00
sunyuechi	8e23ebe6f9	lavc/svq1enc: R-V V ssd_int8_vs_int16 C908 ssd_int8_vs_int16_c: 207.7 ssd_int8_vs_int16_rvv_i32: 14.2 Signed-off-by: Rémi Denis-Courmont <remi@remlab.net>	2024-01-17 17:49:54 +02:00
Nuo Mi	d595e0a0b6	avcodec/vvcdec: misc, constify hor_ctu_edge Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-17 10:14:50 -03:00
Nuo Mi	375dcf469e	avcodec/vvcdec: deblock, fix uninitialized values see https://fate.ffmpeg.org/report.cgi?slot=x86_64-archlinux-gcc-valgrind&time=20240105201935 If tc is zero, the max_len_q, max_len_p are uninitialized. Reported-by: James Almer <jamrial@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-17 10:14:35 -03:00
aybe aybe	36b402f80d	avcodec/mdec: DC reading for STRv1 is like STRv2 As I understand, support for .STR files is broken for almost 10 years now (since `161442ff2c` it seems). Currently, ffmpeg fails with tons of errors like this on version 1 STRs, e.g. Wipeout 1: [mdec @ 00000000027c72c0] ac-tex damaged at 1 9 What happens is that only the audio is present in the video file. Anyway, that one character patch fixes the problem, video is now rendered. Signed-off-by: aybe <aybe@users.noreply.github.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-16 01:34:56 +01:00
sunyuechi	0befc1fca7	lvac/svqenc: add ff_svq1enc_init This is for clarity and use in testing, consistent with other parts of the code Signed-off-by: Rémi Denis-Courmont <remi@remlab.net>	2024-01-15 19:03:03 +02:00
Rémi Denis-Courmont	278b4b60d6	lavc/takdsp: R-V V decorrelate_sf decorrelate_sf_c: 259.2 decorrelate_sf_rvv_i32: 45.5	2024-01-15 19:00:25 +02:00
yuanhecai	a87a52ed0b	avcodec/hevc: Add ff_hevc_idct_32x32_lasx asm opt tests/checkasm/checkasm: C LSX LASX hevc_idct_32x32_8_c: 1243.0 211.7 101.7 Speedup of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads is 1fps(56fps-->57fps). Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
jinbo	9239081db3	avcodec/hevc: Add asm opt for the following functions tests/checkasm/checkasm: C LSX LASX put_hevc_qpel_uni_h4_8_c: 5.7 1.2 put_hevc_qpel_uni_h6_8_c: 12.2 2.7 put_hevc_qpel_uni_h8_8_c: 21.5 3.2 put_hevc_qpel_uni_h12_8_c: 47.2 9.2 7.2 put_hevc_qpel_uni_h16_8_c: 87.0 11.7 9.0 put_hevc_qpel_uni_h24_8_c: 188.2 27.5 21.0 put_hevc_qpel_uni_h32_8_c: 335.2 46.7 28.5 put_hevc_qpel_uni_h48_8_c: 772.5 104.5 65.2 put_hevc_qpel_uni_h64_8_c: 1383.2 142.2 109.0 put_hevc_epel_uni_w_v4_8_c: 5.0 1.5 put_hevc_epel_uni_w_v6_8_c: 10.7 3.5 2.5 put_hevc_epel_uni_w_v8_8_c: 18.2 3.7 3.0 put_hevc_epel_uni_w_v12_8_c: 40.2 10.7 7.5 put_hevc_epel_uni_w_v16_8_c: 70.2 13.0 9.2 put_hevc_epel_uni_w_v24_8_c: 158.2 30.2 22.5 put_hevc_epel_uni_w_v32_8_c: 281.0 52.0 36.5 put_hevc_epel_uni_w_v48_8_c: 631.7 116.7 82.7 put_hevc_epel_uni_w_v64_8_c: 1108.2 207.5 142.2 put_hevc_epel_uni_w_h4_8_c: 4.7 1.2 put_hevc_epel_uni_w_h6_8_c: 9.7 3.5 2.7 put_hevc_epel_uni_w_h8_8_c: 17.2 4.2 3.5 put_hevc_epel_uni_w_h12_8_c: 38.0 11.5 7.2 put_hevc_epel_uni_w_h16_8_c: 69.2 14.5 9.2 put_hevc_epel_uni_w_h24_8_c: 152.0 34.7 22.5 put_hevc_epel_uni_w_h32_8_c: 271.0 58.0 40.0 put_hevc_epel_uni_w_h48_8_c: 597.5 136.7 95.0 put_hevc_epel_uni_w_h64_8_c: 1074.0 252.2 168.0 put_hevc_epel_bi_h4_8_c: 4.5 0.7 put_hevc_epel_bi_h6_8_c: 9.0 1.5 put_hevc_epel_bi_h8_8_c: 15.2 1.7 put_hevc_epel_bi_h12_8_c: 33.5 4.2 3.7 put_hevc_epel_bi_h16_8_c: 59.7 5.2 4.7 put_hevc_epel_bi_h24_8_c: 132.2 11.0 put_hevc_epel_bi_h32_8_c: 232.7 20.2 13.2 put_hevc_epel_bi_h48_8_c: 521.7 45.2 31.2 put_hevc_epel_bi_h64_8_c: 949.0 71.5 51.0 After this patch, the peformance of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads improves 1fps(55fps-->56fsp). Change-Id: I8cc1e41daa63ca478039bc55d1ee8934a7423f51 Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
jinbo	1f642b99af	avcodec/hevc: Add epel_uni_w_hv4/6/8/12/16/24/32/48/64 asm opt tests/checkasm/checkasm: C LSX LASX put_hevc_epel_uni_w_hv4_8_c: 9.5 2.2 put_hevc_epel_uni_w_hv6_8_c: 18.5 5.0 3.7 put_hevc_epel_uni_w_hv8_8_c: 30.7 6.0 4.5 put_hevc_epel_uni_w_hv12_8_c: 63.7 14.0 10.7 put_hevc_epel_uni_w_hv16_8_c: 107.5 22.7 17.0 put_hevc_epel_uni_w_hv24_8_c: 236.7 50.2 31.7 put_hevc_epel_uni_w_hv32_8_c: 414.5 88.0 53.0 put_hevc_epel_uni_w_hv48_8_c: 917.5 197.7 118.5 put_hevc_epel_uni_w_hv64_8_c: 1617.0 349.5 203.0 After this patch, the peformance of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads improves 3fps (52fps-->55fsp). Change-Id: If067e394cec4685c62193e7adb829ac93ba4804d Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
jinbo	6c6bf18ce8	avcodec/hevc: Add qpel_uni_w_v\|h4/6/8/12/16/24/32/48/64 asm opt tests/checkasm/checkasm: C LSX LASX put_hevc_qpel_uni_w_h4_8_c: 6.5 1.7 1.2 put_hevc_qpel_uni_w_h6_8_c: 14.5 4.5 3.7 put_hevc_qpel_uni_w_h8_8_c: 24.5 5.7 4.5 put_hevc_qpel_uni_w_h12_8_c: 54.7 17.5 12.0 put_hevc_qpel_uni_w_h16_8_c: 96.5 22.7 13.2 put_hevc_qpel_uni_w_h24_8_c: 216.0 51.2 33.2 put_hevc_qpel_uni_w_h32_8_c: 385.7 87.0 53.2 put_hevc_qpel_uni_w_h48_8_c: 860.5 192.0 113.2 put_hevc_qpel_uni_w_h64_8_c: 1531.0 334.2 200.0 put_hevc_qpel_uni_w_v4_8_c: 8.0 1.7 put_hevc_qpel_uni_w_v6_8_c: 17.2 4.5 put_hevc_qpel_uni_w_v8_8_c: 29.5 6.0 5.2 put_hevc_qpel_uni_w_v12_8_c: 65.2 16.0 11.7 put_hevc_qpel_uni_w_v16_8_c: 116.5 20.5 14.0 put_hevc_qpel_uni_w_v24_8_c: 259.2 48.5 37.2 put_hevc_qpel_uni_w_v32_8_c: 459.5 80.5 56.0 put_hevc_qpel_uni_w_v48_8_c: 1028.5 180.2 126.5 put_hevc_qpel_uni_w_v64_8_c: 1831.2 319.2 224.2 Speedup of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads is 4fps(48fps-->52fps). Change-Id: I1178848541d90083869225ba98a02e6aa8bb8c5a Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
jinbo	a28eea2a27	avcodec/hevc: Add pel_uni_w_pixels4/6/8/12/16/24/32/48/64 asm opt tests/checkasm/checkasm: C LSX LASX put_hevc_pel_uni_w_pixels4_8_c: 2.7 1.0 put_hevc_pel_uni_w_pixels6_8_c: 6.2 2.0 1.5 put_hevc_pel_uni_w_pixels8_8_c: 10.7 2.5 1.7 put_hevc_pel_uni_w_pixels12_8_c: 23.0 5.5 5.0 put_hevc_pel_uni_w_pixels16_8_c: 41.0 8.2 5.0 put_hevc_pel_uni_w_pixels24_8_c: 91.0 19.7 13.2 put_hevc_pel_uni_w_pixels32_8_c: 161.7 32.5 16.2 put_hevc_pel_uni_w_pixels48_8_c: 354.5 73.7 43.0 put_hevc_pel_uni_w_pixels64_8_c: 641.5 130.0 64.2 Speedup of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads is 1fps(47fps-->48fps). Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
jinbo	cfbdda607d	avcodec/hevc: Add add_residual_4/8/16/32 asm opt After this patch, the peformance of decoding H265 4K 30FPS 30Mbps on 3A6000 with 8 threads improves 2fps (45fps-->47fsp). Reviewed-by: yinshiyou-hf@loongson.cn Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2024-01-12 23:35:40 +01:00
Zhao Zhili	13c1fea92f	avcodec/videotoolboxenc: fix setting avctx color_range doesn't work Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-12 10:49:18 +08:00
Nuo Mi	8d0dda8260	vvcdec: reuse h26x/h2656_deblock_template.c	2024-01-11 22:53:05 +08:00
Nuo Mi	ae0a83477b	hevcdec: move deblock template to h26x/h2656_deblock_template.c	2024-01-11 22:53:05 +08:00
Nuo Mi	69e179e8bf	vvcdec: reuse h26x/h2656_sao_template.c	2024-01-11 22:53:05 +08:00
Nuo Mi	d2fe23b835	hevcdec: move sao template to h26x/h2656_sao_template.c	2024-01-11 22:53:05 +08:00
Clément Bœsch	af509f9957	avcodec/proresenc_anatoliy: do not write into chroma reserved bitfields The layout for the frame flags is as follow: chroma_format u(2) reserved u(2) interlace_mode u(2) reserved u(2) chroma_format has 2 allowed values: 0: reserved 1: reserved 2: 4:2:2 3: 4:4:4 interlace_mode has 3 allowed values: 0: progressive 1: tff 2: bff 3: reserved 0x80 is what we expect for "422 not interlaced", and the extra 0x2 from 0x82 is actually writing into the reserved bits.	2024-01-10 23:33:02 +01:00
Clément Bœsch	21f7a814ea	avcodec/proresenc_anatoliy: do not write into alpha reserved bitfields This byte represents 4 reserved bits followed by 4 alpha_channel_type bits. alpha_channel_type currently has 3 differents defined values: 0 (no alpha), 1 (8b alpha), and 2 (16b alpha), all the other values are reserved. The 4 initial reserved bits are expected to be 0.	2024-01-10 23:33:02 +01:00
Clément Bœsch	6d35911667	avcodec/proresenc_kostya: do not write into alpha reserved bitfields This byte represents 4 reserved bits followed by 4 alpha_channel_type bits. alpha_channel_type currently has 3 differents defined values: 0 (no alpha), 1 (8b alpha), and 2 (16b alpha), all the other values are reserved. This part is correctly written (alpha_bits>>3 does the correct thing), but the 4 initial bits are reserved.	2024-01-10 23:33:02 +01:00
Clément Bœsch	aa7ccd0ce9	avcodec/proresenc_kostya: use a compatible bitstream version Quoting SMPTE RDD 36:2015: A decoder shall abort if it encounters a bitstream with an unsupported bitstream_version value. If 0, the value of the chroma_format syntax element shall be 2 (4:2:2 sampling) and the value of the alpha_channel_type element shall be 0 (no encoded alpha); if 1, any permissible value may be used for those syntax elements. So if we're not in 4:2:2 or if there is alpha, we are not allowed to use version 0.	2024-01-10 23:33:02 +01:00
Clément Bœsch	85cb1b9b20	avcodec/proresenc_anatoliy: use a compatible bitstream version Quoting SMPTE RDD 36:2015: A decoder shall abort if it encounters a bitstream with an unsupported bitstream_version value. If 0, the value of the chroma_format syntax element shall be 2 (4:2:2 sampling) and the value of the alpha_channel_type element shall be 0 (no encoded alpha); if 1, any permissible value may be used for those syntax elements. So if we're not in 4:2:2 or if there is alpha, we are not allowed to use version 0.	2024-01-10 23:33:02 +01:00
Clément Bœsch	1081bae94d	avcodec/proresenc_kostya: make a few cosmetics in encode_acs() Unify cosmetics with encode_acs() from proresenc_anatoliy.	2024-01-10 14:08:00 +01:00
Clément Bœsch	cc2206d142	avcodec/proresenc_anatoliy: make a few cosmetics in encode_acs() This makes the function pretty much identical to the function of the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	8fb2e96d7e	avcodec/proresenc_anatoliy: execute AC run/level FFMIN() at assignment This matches the logic from the function of the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	096a69ad43	avcodec/proresenc_anatoliy: rework inner loop in encode_acs() This matches the logic from the function of the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	25f28b9308	avcodec/proresenc_anatoliy: avoid using ff_ prefix in function arguments	2024-01-10 14:08:00 +01:00
Clément Bœsch	29fd3f75fe	avcodec/proresenc_anatoliy: rework encode_ac_coeffs() prototype This makes the prototype closer to the function of the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	3543100a05	avcodec/proresenc_anatoliy: replace get_level() with FFABS() This matches the code from proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	ed8692446c	avcodec/proresenc_anatoliy: cosmetics to make encode_dcs() identical to the one in Kostya encoder	2024-01-10 14:08:00 +01:00
Clément Bœsch	e87bc5641c	avcodec/proresenc_anatoliy: remove TO_GOLOMB2() A few cosmetics aside, this makes the function identical to the one with the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	a026f98f29	avcodec/proresenc_anatoliy: only pass down the first scale to encode_dcs() This matches encode_dcs() prototype from proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	1aa7d504ec	avcodec/proresenc_anatoliy: shuffle declarations around in encode_dcs() This makes the function closer to the same function in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	87ba89281c	avcodec/proresenc_anatoliy: rename TO_GOLOMB() to MAKE_CODE() This matches the name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	7af42088d7	avcodec/proresenc_kostya: add Anatoliy copyright Both encoders share a lot of code from both authors.	2024-01-10 14:08:00 +01:00
Clément Bœsch	d269f84199	avcodec/proresenc_anatoliy: remove IS_NEGATIVE() macro This makes the function closer to encode_acs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	9c7f6d89fd	avcodec/proresenc_anatoliy: rename new_dc to dc This makes the function closer to encode_dcs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	9258f4eaf9	avcodec/proresenc_anatoliy: compute sign only once This makes the function closer to encode_dcs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	17392ca84f	avcodec/proresenc_anatoliy: import GET_SIGN() macro from Kostya encoder and use it	2024-01-10 14:08:00 +01:00
Clément Bœsch	273f591a3d	avcodec/proresenc_anatoliy: directly work with blocks in encode_dcs() This makes the function closer to encode_dcs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	dadc5ac24a	avcodec/proresenc_anatoliy: reduce DC encoding function prototype differences with Kostya encoder	2024-01-10 14:08:00 +01:00
Clément Bœsch	8e42d3aba0	avcodec/proresenc_anatoliy: execute codebook FFMIN() at assignment This makes the function closer to encode_dcs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	43baba4647	avcodec/proresenc_anatoliy: rename new_code/code to code/codebook This makes the function closer to encode_dcs() in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	c44cd371ca	avcodec/proresenc_anatoliy: inline QSCALE() Also replaces 16384 with 0x4000. This makes the function slightly closer to same function in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	1574475033	avcodec/proresenc_anatoliy: rework encode_codeword() prototype This matches the function of the same name in proresenc_kostya.	2024-01-10 14:08:00 +01:00
Clément Bœsch	1832bd7838	avcodec/proresenc_anatoliy: shuffle encode_codeword() code to match Kostya encoder Code is functionally identical, it's just rename of variables, cosmetics and branch logic shuffling.	2024-01-10 14:08:00 +01:00
Clément Bœsch	3885d2493d	avcodec/proresenc_anatoliy: use FRAME_ID defined in proresdata.h	2024-01-10 14:08:00 +01:00
Clément Bœsch	d6e0fb7c92	avcodec/proresenc_kostya: simplify quantization matrix bytestream writing	2024-01-10 14:08:00 +01:00
Clément Bœsch	cbee015867	avcodec/proresenc_kostya: fix chroma quantisation matrix in frame header Most of the time the quantisation matrices are the same, it only matters with the proxy profile.	2024-01-10 14:08:00 +01:00
Clément Bœsch	631fa19ee0	avcodec/proresenc_kostya: save a few operations in DC encoding This matches the logic from proresenc_anatoliy.	2024-01-10 14:08:00 +01:00
Clément Bœsch	f06f2cf16a	avcodec/proresenc_anatoliy: move DC codebook LUT to shared proresdata This is going to be shared with proresenc_kostya in the upcoming commit.	2024-01-10 14:08:00 +01:00
Clément Bœsch	9f547e2f15	avcodec/proresenc_anatoliy: remove duplicated define This is already defined in proresdata.h	2024-01-10 14:08:00 +01:00
Clément Bœsch	c35733006a	avcodec/proresenc_kostya: remove one LUT indirection for run/level to codebook mapping This is following the same logic as proresenc_anatoliy.	2024-01-10 14:08:00 +01:00
Clément Bœsch	3ba52f18e4	avcodec/proresenc_anatoliy: move run/lev to codebook LUT to shared proresdata This is going to be shared with proresenc_kostya in the upcoming commit.	2024-01-10 14:08:00 +01:00
Clément Bœsch	e940baa65b	avcodec/proresenc_kostya: remove redundant codebook assignments This is already assigned at declaration.	2024-01-10 14:08:00 +01:00
Clément Bœsch	e453efcfbc	avcodec/proresenc_kostya: remove unused plane factor variables	2024-01-10 14:08:00 +01:00
Clément Bœsch	2ac88c1362	avcodec/proresenc_kostya: remove an unnecessary parenthesis level in MAKE_CODE() macro	2024-01-10 14:08:00 +01:00
Marton Balint	363b3ec98a	all: use av_channel_layout_describe_bprint instead of av_channel_layout_describe in a few places Where an AVBPrint buffer is used later anyway. Signed-off-by: Marton Balint <cus@passwd.hu>	2024-01-07 22:47:22 +01:00
James Almer	b95ccfcada	avcodec/vvc_thread: don't use an anonymous union Should fix compilation with old GCC. Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-06 23:28:03 -03:00
Nuo Mi	02d600c568	vvcdec: add TODO for combining transform, lmcs_scale_chroma, and add_residual Thanks for the suggestion from Lynne.	2024-01-07 09:01:04 +08:00
Nuo Mi	26769024d1	avcodec/vvcdec: decode extradata to support container formats For example: wget https://www.elecard.com/storage/video/NovosobornayaSquare_1920x1080.mp4 ./ffplay NovosobornayaSquare_1920x1080.mp4	2024-01-07 08:58:43 +08:00
Sam James	2f24f10d9c	libavcodec: fix -Wint-conversion in vulkan FIx warnings (soon to be errors in GCC 14, already so in Clang 15): ``` src/libavcodec/vulkan_av1.c: In function ‘vk_av1_create_params’: src/libavcodec/vulkan_av1.c:183:43: error: initialization of ‘long long unsigned int’ from ‘void *’ makes integer from pointer without a cast [-Wint-conversion] 183 \| .videoSessionParametersTemplate = NULL, \| ^~~~ src/libavcodec/vulkan_av1.c:183:43: note: (near initialization for ‘(anonymous).videoSessionParametersTemplate’) ``` Use Vulkan's VK_NULL_HANDLE instead of bare NULL. Fix Trac ticket #10724. Was reported downstream in Gentoo at https://bugs.gentoo.org/919067. Signed-off-by: Sam James <sam@gentoo.org>	2024-01-06 22:38:55 +01:00
Clément Bœsch	9109273e3b	avcodec/proresenc: fix alpha plane encoding bitstream These functions encode a slice of alpha (1 to 8 macroblocks) which are expected to be encoded as a repeated sequence of "[diff][run-1]", where diff is the running difference of the alpha value and run is how many times that value is expected to be duplicated (within the limit of a grand total of 2048 unpacked samples, corresponding to a slice of 8 MB). Even when run==0 (the run variable semantic is actually "run minus 1"), there is always a diff previously encoded that needs a counter of at least 1. This means we need to call put_alpha_run() unconditionally at the end of the bitstream to account for the last running diff. This commit fixes glitchy playbacks on QuickTime with M2 and M3 hardware (but not M1 for some mysterious reason) with files generated with commands such as: ffmpeg -f lavfi -i testsrc2=d=5:s=912x320,chromakey -c:v prores_aw -profile:v 4 -y aw.mov ffmpeg -f lavfi -i testsrc2=d=5:s=912x320,chromakey -c:v prores_ks -profile:v 4444 -y ks.mov The glitch expresses itself deterministically as blinking black rectangles on random frames (for example on frame 21, 54, 71, 79, ...). Even with the proresdec from FFmpeg, overreads actually happens while reading the run-minus-1 value (around val = get_bits(gb, 4) in unpack_alpha()). This doesn't seem to cause any particular issue because it simply overreads into the next slice, and because the decoder is resilient, but it's still a problem. The investigation leading to this fix was made possible because of paid work for Jitter (https://jitter.video). Fixes ticket #10255.	2024-01-06 17:29:59 +01:00
Clément Bœsch	2142141a16	avcodec/proresenc: make transparency honored in mov/QT In the mov muxer (in mov_write_video_tag()), bits_per_coded_sample will be written under certain conditions and is required to be 32 for the transparency to be honored in QuickTime. prores_kostya already has this setting but prores_anatoliy and prores_videotoolbox didn't.	2024-01-06 17:29:59 +01:00
Wu Jianhua	94949d4770	avcodec/d3d12va_decode: don't change the resource state if the referenced frame is the same as the current frame This commit removes the follow warning and error: D3D12 WARNING: ID3D12CommandList::ResourceBarrier: Called on the same subresource(s) of Resource(0x000002236E0E00D0:'Unnamed ID3D12Resource Object') in separate Barrier Descs which is inefficient and likely unintentional. Desc[0] and Desc[1] on (subresource : 4294967295). [RESOURCE_MANIPULATION WARNING #1008: RESOURCE_BARRIER_DUPLICATE_SUBRESOURCE_TRANSITIONS] D3D12 ERROR: ID3D12CommandList::ResourceBarrier: Before state (0x0: D3D12_RESOURCE_STATE_[COMMON\|PRESENT]) of resource (0x000002236E0E00D0:'Unnamed ID3D12Resource Object') (subresource: 0) specified by transition barrier does not match with the state (0x20000: D3D12_RESOURCE_STATE_VIDEO_DECODE_WRITE) specified in the previous call to ResourceBarrier [RESOURCE_MANIPULATION ERROR #527: RESOURCE_BARRIER_BEFORE_AFTER_MISMATCH] Tested-by: Tong Wu <tong1.wu@intel.com> Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-05 11:08:17 +08:00
Tong Wu	270cd14bbb	avcodec/dxva2(h264\|mpeg2\|vc1): use av_assert0 instead of assert Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-05 11:06:57 +08:00
Tong Wu	0f01581ccd	avcodec/d3d12va_decode\|dxva2: add a warning to replace assertion Previous assertion was not useful. Now a warning is added to replace it. For get_surface_index, we should return a zero index in case the index is not found. But a warning is necessary to notify. Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-05 11:06:57 +08:00
Tong Wu	56c671c3b0	avcodec/d3d12va_h264: replace assert with av_assert0 Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-05 11:06:57 +08:00
Tong Wu	83e0fcbe03	avcodec/d3d12va_decode: delete an empty line and fix a fuction alignment Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-05 11:06:57 +08:00
Tong Wu	d18ed2ddb5	avcodec/d3d12va_vp9: fix vp9 max_num_refs value Previous max_num_refs was based on pp.frame_refs plus 1 and it could possibly reaches the size limit. Actually it should be the size of pp.ref_frame_map plus 1. Signed-off-by: Tong Wu <tong1.wu@intel.com>	2024-01-05 11:06:57 +08:00
Zhao Zhili	33698ef891	avcodec/mpegutils: print axis in debug_info2 For example, ./ffmpeg -nostats -threads 1 -debug qp \ -export_side_data +venc_params \ -i reinit-small_420_9-to-small_420_8.h264 \ -frames 2 \ -f null - New frame, type: B 0 64 128 192 0 313131313131313131313131313129 16 292929292929292929292929292929 32 323232323232323232323232323232 48 323232323232323232323232323232 64 323232323232323232323232323232 80 323232323232323232323232323232 96 323232323030303030303030303030 112 303030303030303030303030303030 128 303030303030303030303030303028 144 313131312929292929292929292929 160 292929292929292929292929292929 176 292929292929292929292929292931 192 312831312631313131312730283131 Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-04 17:35:11 +08:00
Zhao Zhili	bd48c08f80	avcodec/mpegutils: make debug_info2 thread safe Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-04 17:33:56 +08:00
Zhao Zhili	7f900a737f	avcodec/videotoolbox: specify color range for hw frame ctx Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-04 17:33:38 +08:00
Zhao Zhili	0f824d792d	avcodec/hevc_parser: fix missing zero_byte at frame beginning The start code is matched against 0x000001, zero_byte was treated as last byte of last frame rather than the beginning of next frame. Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-05 01:14:33 +08:00
Zhao Zhili	b7ac1f9856	avcodec/hevc_mp4toannexb_bsf: use HEVCNALUnitType instead of integer literal Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-01-05 01:14:33 +08:00
Nuo Mi	301ed950d1	vvcdec: add vvc decoder vvc decoder plug-in to avcodec. split frames into slices/tiles and send them to vvc_thread for further decoding reorder and wait for the frame decoding to be done and output the frame Features: + Support I, P, B frames + Support 8/10/12 bits, chroma 400, 420, 422, and 444 and range extension + Support VVC new tools like MIP, CCLM, AFFINE, GPM, DMVR, PROF, BDOF, LMCS, ALF + 295 conformace clips passed - Not support RPR, IBC, PALETTE, and other minor features yet Performance: C code FPS on an i7-12700K (x86): BQTerrace_1920x1080_60_10_420_22_RA.vvc 93.0 Chimera_8bit_1080P_1000_frames.vvc 184.3 NovosobornayaSquare_1920x1080.bin 191.3 RitualDance_1920x1080_60_10_420_32_LD.266 150.7 RitualDance_1920x1080_60_10_420_37_RA.266 170.0 Tango2_3840x2160_60_10_420_27_LD.266 33.7 C code FPS on a M1 Mac Pro (ARM): BQTerrace_1920x1080_60_10_420_22_RA.vvc 58.7 Chimera_8bit_1080P_1000_frames.vvc 153.3 NovosobornayaSquare_1920x1080.bin 150.3 RitualDance_1920x1080_60_10_420_32_LD.266 105.0 RitualDance_1920x1080_60_10_420_37_RA.266 133.0 Tango2_3840x2160_60_10_420_27_LD.266 21.7 Asm optimizations still working in progress. please check https://github.com/ffvvc/FFmpeg/wiki#performance-data for the latest Contributors (based on code merge order): Nuo Mi <nuomi2021@gmail.com> Xu Mu <toxumu@outlook.com> Frank Plowman <post@frankplowman.com> Shaun Loo <shaunloo10@gmail.com> Wu Jianhua <toqsxw@outlook.com> Thank you for reporting issues and providing performance reports: Łukasz Czech <lukaszcz18@wp.pl> Xu Fulong <839789740@qq.com> Thank you for providing review comments: Ronald S. Bultje <rsbultje@gmail.com> James Almer <jamrial@gmail.com> Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	e7ef457d6b	vvcdec: add CTU thread logical This is the main entry point for the CTU (Coding Tree Unit) decoder. The code will divide the CTU decoder into several stages. It will check the stage dependencies and run the stage decoder. Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	07f75d5e02	vvcdec: add CTU parser Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	b49575f4cf	vvcdec: add dsp init and inv transform Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	02c1455b44	vvcdec: add LMCS, Deblocking, SAO, and ALF filters Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	c05ba94ce8	vvcdec: add intra prediction Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:12 +08:00
Nuo Mi	2592cc1f96	vvcdec: add inv transform 1d Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:11 +08:00
Nuo Mi	ea49c83bad	vvcdec: add inter prediction Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:11 +08:00
Nuo Mi	603d0bd171	vvcdec: add motion vector decoder Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:11 +08:00
Nuo Mi	c1a3d17491	vvcdec: add reference management Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:11 +08:00
Nuo Mi	976d3b7d69	vvcdec: add cabac decoder add Context-based Adaptive Binary Arithmetic Coding (CABAC) decoder Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 23:15:06 +08:00
Nuo Mi	e97a5bbb13	vvcdec: add parameter parser for sps, pps, ph, sh Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 16:31:59 +08:00
Nuo Mi	49db9fc171	vvcdec: add vvc_data Co-authored-by: Xu Mu <toxumu@outlook.com> Co-authored-by: Frank Plowman <post@frankplowman.com> Co-authored-by: Shaun Loo <shaunloo10@gmail.com> Co-authored-by: Wu Jianhua <toqsxw@outlook.com>	2024-01-03 16:31:59 +08:00
James Almer	85b8d59ec7	avcodec/d3d12va_mpeg2: change the type for the ID3D12Resource_Map input data argument Fixes -Wincompatible-pointer-types warnings. Reviewed-by: Wu Jianhua <toqsxw@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-01 09:46:41 -03:00
James Almer	e9722735fa	avcodec/d3d12va_mpeg2: remove unused variables Reviewed-by: Wu Jianhua <toqsxw@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2024-01-01 09:46:06 -03:00
Michael Niedermayer	e063c1d079	avcodec/mpegvideo_enc: Use ptrdiff_t for stride Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-30 21:50:05 +01:00
Michael Niedermayer	a066b8a809	avcodec/mpegvideo_enc: Dont copy beyond the image Fixes: out of array access Fixes: tickets/10754/poc17ffmpeg Discovered by Zeng Yunxiang. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-30 21:50:04 +01:00
Michael Niedermayer	bf1159774b	avcodec/vaapi_encode: Avoid double AVERRORS Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 21:36:02 +01:00
Michael Niedermayer	850ab8f6da	avcodec/jpegxl_parser: Check get_vlc2() Fixes: shift exponent -1 is negative Fixes: 63889/clusterfuzz-testcase-minimized-ffmpeg_DEMUXER_fuzzer-6009343056936960 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 19:21:26 +01:00
Michael Niedermayer	d909d8e5e0	avcodec/leaddec: Check remaining bits in decode_block() Fixes: Timeout Fixes: 64163/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_LEAD_fuzzer-6418925835124736 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 01:15:42 +01:00
Michael Niedermayer	5f88458bea	avcodec/jpegxl_parser: Add padding to cs_buffer Fixes: out of array access Fixes: 64081/clusterfuzz-testcase-minimized-ffmpeg_DEMUXER_fuzzer-6151006496620544 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 01:15:42 +01:00
Michael Niedermayer	c72a20f01a	avcodec/jpeglsdec: Check Jpeg-LS LSE Fixes: signed integer overflow: 2147478526 + 33924 cannot be represented in type 'int' Fixes: shift exponent 32 is too large for 32-bit type 'unsigned int' Fixes: 64243/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_JPEGLS_fuzzer-5195717848989696 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 01:00:48 +01:00
Michael Niedermayer	c75fccd1d4	avcodec/osq: Implement flush() Fixes: out of array access Fixes: 62164/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6227491892887552 Fixes: 62164/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6268561729126400 Fixes: 62164/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6414805046788096 Fixes: 62164/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6538151088488448 Fixes: 62164/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_OSQ_fuzzer-6608131540779008 Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2023-12-29 00:45:20 +01:00

... 2 3 4 5 6 ...

49499 Commits