FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-07 11:13:41 +02:00

Author	SHA1	Message	Date
Niklas Haas	2a2e0aced2	avutil/dovi_meta: document static vs dynamic ext blocks	2024-08-16 11:48:02 +02:00
Niklas Haas	ae31acd702	fate/scalechroma: switch to standard chroma location Replace the manually specified chroma location by one using standard notation, arbitrarily "bottomleft" as it is a less common path. Required if we want to phase out the use of manual chroma locations.	2024-08-16 11:43:37 +02:00
Niklas Haas	f1071dc634	avfilter/vf_zscale: remove unused fields	2024-08-16 11:43:37 +02:00
Niklas Haas	c8bc6fabd7	avfilter/vf_scale: fix 4:1:0 interlaced chroma pos The current logic hard-coded a check for v_sub == 1. We can extend this logic slightly to cover the case of interlaced 4:1:0 (which has v_sub == 2). Here is a diagram explaining this scenario (with center-siting): a a a a a a a a b b b b b b b b X X a a a a a a a a b b b b b b b b a a a a a a a a b b b b b b b b Y Y a a a a a a a a b b b b b b b b a = even luma rows b = odd luma rows X = even chroma sample Y = odd chroma sample In progressive mode, the chroma samples sit at (384, 384) respectively. Relative to the 8x4 grid of even luma samples (a), the X sample sits at: h_chr_pos = 384 v_chr_pos = 192 Relative to the 8x4 grid of odd luma samples (b), the Y sample sits at: h_chr_pos = 384 v_chr_pos = 576 The new code calculates the correct values in all circumstances.	2024-08-16 11:43:37 +02:00
Niklas Haas	15a67c0947	avfilter/vf_scale: add in/out_chroma_loc Currently, this just functions as a more principled and user-friendly replacement for the (undocumented and hard to use) _chr_pos fields. However, the goal is to automatically infer these values from the input frames' chroma location, and deprecate the manual use of _chr_pos altogether. (Indeed, my plans for an swscale replacement will most likely also end up limiting the set of legal chroma locations to those permissible by AVFrame properties)	2024-08-16 11:43:37 +02:00
Niklas Haas	18b9687308	avfilter/swscale: always fix interlaced chroma location The current logic only fixes it when the user does not explicitly specify the chroma location. However, this does not make a lot of sense. Since there is no way to specify this property per-field, it effectively prevents the user from being able to correctly scale interlaced frames with top-aligned chroma. It makes more sense to consider the user setting in the progressive case only, and automatically adapt it to the correct interlaced field positions, following the details of the MPEG specification.	2024-08-16 11:43:37 +02:00
Niklas Haas	6b40be941a	swscale/options: relax src/dst_h/v_chr_pos value range When dealing with 4x subsampling ratios (log2 == 2), such as can arise with 4:1:1 or 4:1:0, a value range of 512 is not enough to cover the range of possible scenarios. For example, bottom-sited chroma in 4:1:0 would require an offset of 768 (three luma rows). Simply double the limit to 1024. I don't see any place in initFilter() that would experience overflow as a result of this change, especially since get_local_pos() right-shifts it by the subsampling ratio again.	2024-08-16 11:43:37 +02:00
Niklas Haas	5d964df5da	avfilter/vf_setparams: remove unnecessary options bounds AV_OPT_TYPE_CONST does not use min/max, we can leave them as 0.	2024-08-16 11:43:37 +02:00
Niklas Haas	201f1cba15	avfilter/vf_setparams: allow setting chroma location Shockingly, there isn't currently _any_ filter for overriding this.	2024-08-16 11:43:37 +02:00
Niklas Haas	3e064f52eb	swscale: document SWS_FULL_CHR_H_* flags Based on my best understanding of what they do, given the source code.	2024-08-16 11:43:37 +02:00
Fei Wang	be7ab63552	lavc/qsvdec: Add vvc_mp4toannexb bsf for QSV VVC decoder Fix error: $ ffmpeg -hwaccel qsv -i input.mp4 -f null - .. [vvc_qsv @ 0000026890D966C0] Error decoding stream header: unknown error (-1) [vvc_qsv @ 0000026890D966C0] Error decoding header Signed-off-by: Fei Wang <fei.w.wang@intel.com>	2024-08-16 14:15:04 +08:00
Lynne	a797317ab1	vulkan_filter: don't require the storage flag for the base frames format We check for whether subformats support storage immediately below. Those are the ones we require storage for, rather than the base format itself. This permits better reuse of AVHWFrame contexts. The patch also removes an always-false check in the subformat check.	2024-08-16 01:22:17 +02:00
Lynne	b165f144e7	vulkan_filter: allow reusing frame contexts with DRM tiling There's no reason not to permit this, particularly if a user wants to manipulate images which will be exported back to DRM.	2024-08-16 01:22:17 +02:00
Lynne	604dfdb44c	hwcontext_vulkan: align host mapping size to minImportedHostPointerAlignment This was left out of the recent rewrite of the system.	2024-08-16 01:22:16 +02:00
Lynne	18d964fc2c	vulkan: enable encoding of images if video_maintenance1 is enabled Vulkan encoding was designed in a very... consolidated way. You had to know the exact codec and profile that the image was going to eventually be encoded as at... image creation time. Unfortunately, as good as our code is, glimpsing into the exact future isn't what its capable of. video_maintenance1 removed that requirement, which only then made encoding images practically possible.	2024-08-16 01:22:16 +02:00
Lynne	46c13834b6	hwcontext_vulkan: enable VK_KHR_video_maintenance1 We require it for encoding.	2024-08-16 01:22:15 +02:00
Lynne	97e947a2a7	hwcontext_vulkan: setup extensions before features The issue is that enabling features requires that the device extension is supported. The extensions bitfield was set later, so it was always 0, leading to no features being added.	2024-08-16 01:22:15 +02:00
Lynne	c3cbaf39bb	hwcontext_vulkan: don't enable deprecated VK_KHR_sampler_ycbcr_conversion extension It was added to Vulkan 1.1 a long time ago. Validation layer will warn if this is enabled.	2024-08-16 01:22:15 +02:00
Lynne	3f65d24075	hwcontext_vulkan: fix user layers, add support for different debug modes The validation layer option only supported GPU-assisted validation. This is mutually exclusive with shader debug printfs, so we need to differentiate between the two. This also fixes issues with user-given layers, and leaks in case of errors.	2024-08-16 01:22:14 +02:00
Lynne	869f4aec48	vulkan_decode: use the correct queue family for decoding ops In `680d969a30`, the new API was used to find a queue family for dispatch, but the found queue family was not used for decoding, just for dispatching.	2024-08-16 01:22:08 +02:00
Anton Khirnov	d566a37003	lavfi: move AVFilterLink.graph to FilterLink	2024-08-15 19:34:27 +02:00
Anton Khirnov	fb3efef1db	lavfi: move AVFilterLink.frame_wanted_out to FilterLinkInternal	2024-08-15 19:34:27 +02:00
Anton Khirnov	42cbf66fff	lavfi: move AVFilterLink.{frame,sample}_count_{in,out} to FilterLink	2024-08-15 19:34:27 +02:00
Anton Khirnov	a23d565ea7	lavfi: move AVFilterLink.frame_rate to FilterLink Co-developed-by: James Almer <jamrial@gmail.com>	2024-08-15 19:34:27 +02:00
Anton Khirnov	d6318a244d	lavfi: move AVFilterLink.current_pts(_us) to FilterLink	2024-08-15 19:34:27 +02:00
Anton Khirnov	7f17e0e6dd	lavfi: move AVFilterLink.hw_frames_ctx to FilterLink	2024-08-15 19:34:24 +02:00
Anton Khirnov	ce24b5ba8f	lavfi/vf_*_cuda: do not access hw contexts before checking they exist The checks are performed in init_processing_chain().	2024-08-15 19:27:01 +02:00
Anton Khirnov	99ee7a948f	lavfi: move AVFilterLink.m{ax,in}_samples to FilterLink Also, document who sets these fields and when.	2024-08-15 19:27:01 +02:00
Anton Khirnov	54754eec1e	lavfi: add a new struct for private link properties Specifically those that should be visible to filters, but hidden from API callers. Such properties are currently located at the end of the public AVFilterLink struct, demarcated by a comment marking them as private. However it is generally better to hide them explicitly, using the same pattern already employed in avformat or avcodec. The new struct is currently trivial, but will become more useful in following commits.	2024-08-15 19:27:01 +02:00
Anton Khirnov	426e33c758	lavfi: set AVFilterLink.graph on link creation There is no reason to delay this.	2024-08-15 19:27:01 +02:00
Wu Jianhua	ca5c9e810a	avcodec/vvc/dsp: prefix TxType and TxSize with VVC See https://patchwork.ffmpeg.org/project/ffmpeg/patch/TYSPR06MB64337C4A9ADF5312E6648543AA62A@TYSPR06MB6433.apcprd06.prod.outlook.com/#81892 Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-08-15 20:52:14 +08:00
Wu Jianhua	ae1a9cfd52	avcodec/vvc_parser: move avctx->has_b_frames initialization to dec From Jun Zhao <mypopydev@gmail.com>: > Should we relocate this to the decoder? Other codecs typically set this > parameter in the decoder. Signed-off-by: Wu Jianhua <toqsxw@outlook.com>	2024-08-15 20:50:24 +08:00
Nuo Mi	80af195804	avcodec/vvcdec: move frame tab memset from the main thread to worker threads memset tables in the main thread can become a bottleneck for the decoder. For example, if it takes 1% of the processing time for one core, the maximum achievable FPS will be 100. Move the memeset to worker threads will fix the issue.	2024-08-15 20:33:57 +08:00
Nuo Mi	daf6fcd816	avcodec/vvcdec: do not zero frame qp table For luma, qp can only change at the CU level, so the qp tab size is related to the CU. For chroma, considering the joint CbCr, the QP tab size is related to the TU.	2024-08-15 20:33:57 +08:00
Nuo Mi	ca2caeb21d	avcodec/vvcdec: do not zero frame msf mmi table	2024-08-15 20:33:57 +08:00
Nuo Mi	0c7106f618	avcodec/vvcdec: do not zero frame cpm table	2024-08-15 20:33:57 +08:00
Nuo Mi	3e497249eb	avcodec/vvcdec: check_available, use && instead of &= for shortcut evaluation	2024-08-15 20:33:57 +08:00
Nuo Mi	4f882c453e	avcodec/vvcdec: do not zero frame mvf table	2024-08-15 20:33:57 +08:00
Nuo Mi	93b0d5f9ce	avcodec/vvcdec: refact out is_available from is_a0_available	2024-08-15 20:33:57 +08:00
Nuo Mi	e8ac761293	avcodec/vvcdec: split ctu table to zero init and no zero init parts cus need to init to zero, other parts are not	2024-08-15 20:19:45 +08:00
Nuo Mi	4f60961498	avcodec/vvcdec: remove unnecessary perframe initializations deblock, sao, alf skip, imtf, ipm, cqt_depth, cb_pos_x, cb_pos_y, cb_height, cp_mv, tb_pos_x0, tb_pos_y0, tb_width, tb_height	2024-08-15 20:19:45 +08:00
Nuo Mi	210bf1b154	avcodec/vvcdec: refact, combine bs tab with tu tab	2024-08-15 20:19:45 +08:00
Nuo Mi	bdb79fe60a	avcodec/vvcdec: thread, ensure the parse stage gets the highest priority The parser stage is not parallelizable. We need to schedule it as soon as possible to create later stages, which are more parallelizable clips \| before \| after \| delta --------------------------------------------\|--------\|-------\|------ RitualDance_1920x1080_60_10_420_37_RA.266 \| 342.7 \| 365.3 \| 6.59% NovosobornayaSquare_1920x1080.bin \| 321.7 \| 400 \| 24.34% Tango2_3840x2160_60_10_420_27_LD.266 \| 82.3 \| 91.7 \| 11.42% RitualDance_1920x1080_60_10_420_32_LD.266 \| 323.7 \| 319.3 \| -1.36% Chimera_8bit_1080P_1000_frames.vvc \| 364 \| 411.3 \| 12.99% BQTerrace_1920x1080_60_10_420_22_RA.vvc \| 162.7 \| 185.7 \| 14.14%	2024-08-15 20:19:45 +08:00
Nuo Mi	7eb1df44ae	checkasm: add tests for vvc dmvr dmvr_8_12x20_c: 186.2 dmvr_8_12x20_avx2: 25.7 dmvr_8_20x12_c: 181.7 dmvr_8_20x12_avx2: 25.2 dmvr_8_20x20_c: 283.2 dmvr_8_20x20_avx2: 32.0 dmvr_10_12x20_c: 90.0 dmvr_10_12x20_avx2: 15.7 dmvr_10_20x12_c: 41.0 dmvr_10_20x12_avx2: 14.7 dmvr_10_20x20_c: 81.5 dmvr_10_20x20_avx2: 26.7 dmvr_12_12x20_c: 190.7 dmvr_12_12x20_avx2: 20.2 dmvr_12_20x12_c: 187.2 dmvr_12_20x12_avx2: 20.2 dmvr_12_20x20_c: 292.7 dmvr_12_20x20_avx2: 27.2 dmvr_h_8_12x20_c: 317.0 dmvr_h_8_12x20_avx2: 37.0 dmvr_h_8_20x12_c: 340.0 dmvr_h_8_20x12_avx2: 41.0 dmvr_h_8_20x20_c: 540.7 dmvr_h_8_20x20_avx2: 64.0 dmvr_h_10_12x20_c: 322.7 dmvr_h_10_12x20_avx2: 30.7 dmvr_h_10_20x12_c: 344.2 dmvr_h_10_20x12_avx2: 34.0 dmvr_h_10_20x20_c: 529.0 dmvr_h_10_20x20_avx2: 51.5 dmvr_h_12_12x20_c: 326.7 dmvr_h_12_12x20_avx2: 33.5 dmvr_h_12_20x12_c: 331.7 dmvr_h_12_20x12_avx2: 51.2 dmvr_h_12_20x20_c: 534.0 dmvr_h_12_20x20_avx2: 62.7 dmvr_hv_8_12x20_c: 650.0 dmvr_hv_8_12x20_avx2: 57.2 dmvr_hv_8_20x12_c: 676.2 dmvr_hv_8_20x12_avx2: 70.0 dmvr_hv_8_20x20_c: 1068.5 dmvr_hv_8_20x20_avx2: 103.2 dmvr_hv_10_12x20_c: 649.0 dmvr_hv_10_12x20_avx2: 48.2 dmvr_hv_10_20x12_c: 677.7 dmvr_hv_10_20x12_avx2: 59.7 dmvr_hv_10_20x20_c: 1093.5 dmvr_hv_10_20x20_avx2: 91.7 dmvr_hv_12_12x20_c: 660.0 dmvr_hv_12_12x20_avx2: 58.7 dmvr_hv_12_20x12_c: 682.7 dmvr_hv_12_20x12_avx2: 72.0 dmvr_hv_12_20x20_c: 1094.0 dmvr_hv_12_20x20_avx2: 113.2 dmvr_v_8_12x20_c: 325.7 dmvr_v_8_12x20_avx2: 31.2 dmvr_v_8_20x12_c: 326.2 dmvr_v_8_20x12_avx2: 38.5 dmvr_v_8_20x20_c: 538.5 dmvr_v_8_20x20_avx2: 54.2 dmvr_v_10_12x20_c: 318.5 dmvr_v_10_12x20_avx2: 23.7 dmvr_v_10_20x12_c: 330.7 dmvr_v_10_20x12_avx2: 40.5 dmvr_v_10_20x20_c: 567.5 dmvr_v_10_20x20_avx2: 48.0 dmvr_v_12_12x20_c: 335.2 dmvr_v_12_12x20_avx2: 30.0 dmvr_v_12_20x12_c: 330.2 dmvr_v_12_20x12_avx2: 39.5 dmvr_v_12_20x20_c: 535.2 dmvr_v_12_20x20_avx2: 60.0	2024-08-15 20:19:45 +08:00
Nuo Mi	6464c157d5	x86/vvcdec: add dmvr avx2 code Decoder-Side Motion Vector Refinement is about 4~8% CPU usage for some clips here is the test result for one time clips \| before\| after \| delta ------------------------------------------\|-------\|-------\|------ RitualDance_1920x1080_60_10_420_37_RA.266 \| 338.7 \| 354.3 \|4.61% NovosobornayaSquare_1920x1080.bin \| 320.3 \| 329.3 \|2.81% Tango2_3840x2160_60_10_420_27_LD.266 \| 83.3 \| 83.7 \|0.48% RitualDance_1920x1080_60_10_420_32_LD.266 \| 320.7 \| 327.3 \|2.06% Chimera_8bit_1080P_1000_frames.vvc \| 360.7 \| 381.0 \|5.63% BQTerrace_1920x1080_60_10_420_22_RA.vvc \| 161.7 \| 163.0 \|0.80%	2024-08-15 20:19:45 +08:00
Nuo Mi	1c17520e79	avcodec/vvcdec: Use av_image_copy_plane for DMVR 10-bit integer pixels It's no need to shift and interpolate for 10-bit integer pixels, av_image_copy_plane is enough	2024-08-15 20:19:45 +08:00
eaphone	7897b0beed	libavdevice/gdigrab: change hwnd tail check fail logic to !=null Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-08-15 20:10:33 +08:00
gnattu	a1976e963f	avutil/hwcontext_videotoolbox: silence warning for RGB Hardware frames with RGB colorspace will not have a YCbCrMatrixKey. Currently, it will spam the console with warning if rgb frame is uploaded. Signed-off-by: Gnattu OC <gnattuoc@me.com> Reviewed-by: Marvin Scholz <epirat07@gmail.com> Signed-off-by: Zhao Zhili <zhilizhao@tencent.com>	2024-08-15 20:10:33 +08:00
Araz Iusubov	42a40538f3	avcodec/amfenc: new encoder features support Implemented: New usage modes for AV1 encoder. Latency mode for H264, HEVC and AV1 encoders. Adaptive Quantization (AQ) mode in AV1 encoder. Signed-off-by: Dmitrii Ovchinnikov <ovchinnikov.dmitrii@gmail.com>	2024-08-15 14:01:07 +02:00
Gyan Doshi	1f801dfdb5	lavc/libx265: unbreak build for X265_BUILD >= 210 x265 added support for alpha starting with build 210. While doing so, x265_encoder_encode() changed its fifth arg to an array of pointers to x265_picture. This broke building lavc/libx265.c This patch simply unbreaks the build and maintains existing single-layer non-alpha encoding support. Fixes #11130	2024-08-15 15:17:14 +05:30

... 2 3 4 5 6 ...

116847 Commits