fix problem when x is set to an odd number for nv12 frames in overlay_cuda
test steps:
1. ffmpeg -f lavfi -i testsrc2=s=176x144 -pix_fmt nv12 -t 1 output_overlay.yuv
2. ffmpeg -f lavfi -i testsrc2=s=352x288 -pix_fmt nv12 -t 1 output_main.yuv
before this patch:
overlay_cuda=x=0:y=0 renders correctly,
overlay_cuda=x=3:y=0 renders incorrectly;
both render correctly after this patch.
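For reference, a command along these lines can then run the overlay on the
two generated files (the device setup and the download step are
illustrative, not part of the patch):
./ffmpeg -init_hw_device cuda=gpu -filter_hw_device gpu \
    -f rawvideo -video_size 352x288 -pix_fmt nv12 -i output_main.yuv \
    -f rawvideo -video_size 176x144 -pix_fmt nv12 -i output_overlay.yuv \
    -filter_complex "[0:v]hwupload[main];[1:v]hwupload[over];\
[main][over]overlay_cuda=x=3:y=0,hwdownload,format=nv12" \
    -y out.yuv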
Signed-off-by: Steven Liu <liuqi05@kuaishou.com>
Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>
This commit corrects the pointer type used for elements popped from the
inference queue in ff_dnn_free_model_ov.
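A minimal sketch of the corrected cleanup loop, assuming the dnn queue
helpers (ff_queue_size/ff_queue_pop_front/ff_queue_destroy) and an
ov_model context that owns the queue; the point is that the popped
elements are InferenceItem, not TaskItem:
    while (ff_queue_size(ov_model->inference_queue) != 0) {
        /* the queue holds InferenceItem entries, not TaskItem */
        InferenceItem *item = ff_queue_pop_front(ov_model->inference_queue);
        av_freep(&item);
    }
    ff_queue_destroy(ov_model->inference_queue);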
Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>
This fixes an issue where the yadif filter could cause the timebase denominator to overflow.
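A sketch of the kind of change involved: yadif halves the output frame
duration by doubling the timebase denominator, and doing that through
av_mul_q() keeps the fraction reduced instead of blindly multiplying the
denominator (the exact code here is illustrative):
    /* instead of: outlink->time_base.den = inlink->time_base.den * 2; */
    outlink->time_base = av_mul_q(inlink->time_base, (AVRational){1, 2});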
Signed-off-by: Tom Boshoven <tom@jwplayer.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
This feature can be used with dnn detection by setting vf_drawtext's option
text_source=side_data_detection_bboxes, for example:
./ffmpeg -i face.jpeg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:\
input=data:output=detection_out:labels=face-detection-adas-0001.label,drawbox=box_source=
side_data_detection_bboxes,drawtext=text_source=side_data_detection_bboxes:fontcolor=green:\
fontsize=40 -y face_detect.jpeg
Please note that the default fontsize of vf_drawtext is 12, which may be
too small to be seen clearly.
Signed-off-by: Ting Fu <ting.fu@intel.com>
This feature can be used with dnn detection by setting vf_drawbox's
option box_source=side_data_detection_bboxes, for example:
./ffmpeg -i face.jpeg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:\
input=data:output=detection_out:labels=face-detection-adas-0001.label,\
drawbox=box_source=side_data_detection_bboxes -y face_detect.jpeg
Signed-off-by: Ting Fu <ting.fu@intel.com>
ref_frame is owned by the framesync structure and should therefore not
be modified; furthermore, the copied properties do not seem to be used
at all, so copying is unnecessary. Finally, copying when the destination
frame is NULL gives a guaranteed segfault.
Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>
Fixes: CID1398579 Dereference before null check
Reviewed-by: Nicolas George <george@nsup.org>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Two modes are supported in the guided filter: basic mode and fast mode.
Basic mode is the guided filter as originally submitted, without
optimization. Fast mode builds on the basic one by applying a
sub-sampling method. The sub-sampling ratio, which can be set by the
user, controls the algorithm complexity: the larger the sub-sampling
ratio, the lower the complexity.
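As an illustration, assuming the mode and sub-sampling ratio are exposed
as the 'mode' and 'sub' options, fast mode could be selected like this:
./ffmpeg -i input.png -vf guided=mode=fast:sub=4 -y output.png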
Signed-off-by: Xuewei Meng <xwmeng96@gmail.com>
Reviewed-by: Steven Liu <liuqi05@kuaishou.com>
Duplicate the ff_hex_to_data() function from avformat and rename it to
hex_to_data() as a static function.
Reviewed-by: Guo, Yejun <yejun.guo@intel.com>
Signed-off-by: Limin Wang <lance.lmwang@gmail.com>
CID: 1482090
av_frame_get_side_data() can return NULL, and sd->data is used after the
call, so the return value must be checked for NULL.
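A minimal sketch of the fixed pattern (the frame variable and the
side-data type here are illustrative):
    AVFrameSideData *sd = av_frame_get_side_data(frame, AV_FRAME_DATA_DETECTION_BBOXES);
    if (!sd)
        return 0; /* nothing to do without the side data */
    /* only now is sd->data safe to dereference */
    header = (AVDetectionBBoxHeader *)sd->data;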
Signed-off-by: Steven Liu <liuqi05@kuaishou.com>
The testing model is a TensorFlow official model from its GitHub repo; please refer to
https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/tf2_detection_zoo.md
to download the detection model you need.
For example, local testing was carried out with 'ssd_mobilenet_v2_coco_2018_03_29.tar.gz' and
one image of a dog from
https://github.com/tensorflow/models/blob/master/research/object_detection/test_images/image1.jpg
Testing command is:
./ffmpeg -i image1.jpg -vf dnn_detect=dnn_backend=tensorflow:input=image_tensor:output=\
"num_detections&detection_scores&detection_classes&detection_boxes":model=ssd_mobilenet_v2_coco.pb,\
showinfo -f null -
We will see a result similar to the one below:
[Parsed_showinfo_1 @ 0x33e65f0] side data - detection bounding boxes:
[Parsed_showinfo_1 @ 0x33e65f0] source: ssd_mobilenet_v2_coco.pb
[Parsed_showinfo_1 @ 0x33e65f0] index: 0, region: (382, 60) -> (1005, 593), label: 18, confidence: 9834/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 1, region: (12, 8) -> (328, 549), label: 18, confidence: 8555/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 2, region: (293, 7) -> (682, 458), label: 1, confidence: 8033/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 3, region: (342, 0) -> (690, 325), label: 1, confidence: 5878/10000.
There are two boxes of dog with scores 98.34% & 85.55% and two boxes of person with scores 80.33% & 58.78%.
Signed-off-by: Ting Fu <ting.fu@intel.com>
Signed-off-by: Guo, Yejun <yejun.guo@intel.com>
Add examples on how to use this filter, and improve the code style.
Implement slice-level parallelism for the guided filter.
Add the basic version of the guided filter.
Signed-off-by: Xuewei Meng <xwmeng96@gmail.com>
Reviewed-by: Steven Liu <liuqi05@kuaishou.com>
Classification is done on every detection bounding box in the frame's
side data, which is the result of object detection (the dnn_detect
filter).
Please refer to the commit log of dnn_detect for the detection material,
and see below for classification.
- download material for classification:
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.bin
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.xml
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.label
- run command as:
./ffmpeg -i cici.jpg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:\
input=data:output=detection_out:confidence=0.6:labels=face-detection-adas-0001.label,\
dnn_classify=dnn_backend=openvino:model=emotions-recognition-retail-0003.xml:input=data:\
output=prob_emotion:confidence=0.3:labels=emotions-recognition-retail-0003.label:target=face,\
showinfo -f null -
We'll see the detection & classification results as below:
[Parsed_showinfo_2 @ 0x55b7d25e77c0] side data - detection bounding boxes:
[Parsed_showinfo_2 @ 0x55b7d25e77c0] source: face-detection-adas-0001.xml, emotions-recognition-retail-0003.xml
[Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 0, region: (1005, 813) -> (1086, 905), label: face, confidence: 10000/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0] classify: label: happy, confidence: 6757/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 1, region: (888, 839) -> (967, 926), label: face, confidence: 6917/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0] classify: label: anger, confidence: 4320/10000.
Signed-off-by: Guo, Yejun <yejun.guo@intel.com>
Different function types of model require different parameters; for
example, object detection detects lots of objects (cat/dog/...) in the
frame, while classification needs to know which object (cat or dog) it
is going to classify.
With the current interface, every new requirement would need a new
function with more parameters. With this change, we can instead add a
new struct (for example DNNExecClassifyParams) based on
DNNExecBaseParams, and continue to use the current execute_model
interface with only the params changed, as sketched below.
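A minimal sketch of the idea, embedding the base struct as the first
member so a pointer to the extended params can be passed wherever
DNNExecBaseParams is expected (the field names are illustrative):
    typedef struct DNNExecBaseParams {
        const char *input_name;
        const char **output_names;
        uint32_t nb_output;
        AVFrame *in_frame;
        AVFrame *out_frame;
    } DNNExecBaseParams;

    typedef struct DNNExecClassifyParams {
        DNNExecBaseParams base; /* must be first so it can be cast back */
        const char *target;     /* which detected label to classify, e.g. "face" */
    } DNNExecClassifyParams;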
There is one task item for one function call from the dnn interface,
and one request item for one call to openvino. For classification, one
task might need multiple inferences, one per bounding box, so add
InferenceItem.
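A sketch of the relationship, with illustrative fields: each
InferenceItem points back to its TaskItem and records which bounding box
it covers, so one task can fan out into several inferences:
    typedef struct InferenceItem {
        TaskItem *task;      /* the dnn interface call this inference belongs to */
        uint32_t bbox_index; /* which detection bounding box to classify */
    } InferenceItem;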