FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-01-13 21:28:01 +02:00

Author	SHA1	Message	Date
Ting Fu	7a879cce37	libavfilter: vf_drawtext filter support draw text with detection bounding boxes in side_data This feature can be used with dnn detection by setting vf_drawtext's option text_source=side_data_detection_bboxes, for example: ./ffmpeg -i face.jpeg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:labels=face-detection-adas-0001.label,drawbox=box_source=side_data_detection_bboxes,drawtext=text_source=side_data_detection_bboxes:fontcolor=green:fontsize=40, -y face_detect.jpeg Please note, the default fontsize of vf_drawtext is 12, which may be too small to be seen clearly. Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-26 08:58:27 +08:00
Ting Fu	f444be643e	libavfilter: vf_drawbox filter support draw box with detection bounding boxes in side_data This feature can be used with dnn detection by setting vf_drawbox's option box_source=side_data_detection_bboxes, for example: ./ffmpeg -i face.jpeg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:labels=face-detection-adas-0001.label,drawbox=box_source=side_data_detection_bboxes -y face_detect.jpeg Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-26 08:58:27 +08:00
Ting Fu	9921ae8a5d	lavfi/drawbox: refine code Extract common code of filter_frame() and drawgrid_filter_frame() to draw_region(). Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-26 08:58:27 +08:00
Guo, Yejun	4c705a2775	lavfi/dnn: refine code to separate processing and detection in backends	2021-05-24 09:09:34 +08:00
Guo, Yejun	cde6d0288f	lavfi/dnn_filter_common.h: make filter option 'options' as deprecated we'd use 'backend_configs' to avoid confusion.	2021-05-24 08:44:58 +08:00
Andreas Rheinhardt	a0ab83bf93	avfilter/vf_guided: Don't needlessly copy properties, fix potential NPD ref_frame is owned by the framesync structure and should therefore not be modified; furthermore, these properties that are copied don't seem to be used at all, so copying is unnecessary. Finally copying when the destination frame is NULL gives a guaranteed segfault. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-05-23 13:42:16 +02:00
Andreas Rheinhardt	376e80ad74	avfilter/vf_guided: Fix leak of frames Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-05-23 13:42:16 +02:00
Andreas Rheinhardt	618d186b8c	avfilter/vf_guided: Don't free frame we don't own Reviewed-by: Steven Liu <lq@chinaffmpeg.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-05-23 13:42:04 +02:00
Michael Niedermayer	1642d8188d	avfilter/avfiltergraph: Remove NULL checks after dereferences Fixes: CID1398579 Dereference before null check Reviewed-by: Nicolas George <george@nsup.org> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2021-05-19 16:58:25 +02:00
Guo, Yejun	4718d74c58	lavfi/vf_dnn_processing.c: fix CID 1460603 CID 1460603 (#1 of 1): Improper use of negative value (NEGATIVE_RETURNS)	2021-05-18 09:20:08 +08:00
Guo, Yejun	3fb1d2e71c	lavfi/dnn/dnn_io_proc.c: fix Improper use of negative value (NEGATIVE_RETURNS) fix coverity CID 1473511 and 1473566	2021-05-18 09:20:08 +08:00
Guo, Yejun	bd6ea9ed1d	lavfi/dnn/dnn_io_proc.c: Fix Out-of-bounds access (ARRAY_VS_SINGLETON) fix coverity CID 1473571, 1473577 and 1482089	2021-05-18 09:20:08 +08:00
Shubhanshu Saxena	11b489d592	lavfi/dnn_backend_native_layer_mathunary.h: Documentation Add documentation for Unary Math Layer Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-17 09:33:40 +08:00
Shubhanshu Saxena	57fe5c1412	lavfi/dnn_backend_native_layer_depth2space.h: Documentation Add documentation for Depth to Space Layer Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-17 09:33:40 +08:00
Shubhanshu Saxena	58de2b9eb3	lavfi/dnn_backend_native_layer_dense.h: Documentation Add documentation for Dense Layer Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-17 09:33:40 +08:00
Shubhanshu Saxena	a61b7654a2	lavfi/dnn_backend_native_layer_conv2d.h: Documentation Add documentation for 2D Convolution Layer Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-17 09:33:40 +08:00
Gyan Doshi	f53414a038	avfilter/metadata: add intuitive labels for metadata values	2021-05-16 10:24:27 +05:30
Gyan Doshi	234e719194	avfilter/guided: reindent after `93ddb9b617`	2021-05-14 15:37:45 +05:30
Gyan Doshi	93ddb9b617	avfilter/guided: simplify subsampling assignment. Reduce option ranges to effective values. Signed-off-by: Gyan Doshi <ffmpeg@gyani.pro> Reviewed-by: Steven Liu <liuqi05@kuaishou.com>	2021-05-14 15:33:30 +05:30
Shubhanshu Saxena	0bdd677c5f	lavfi/dnn_backend_native_layer_avgpool.h: Documentation Add documentation for Average Pool Layer Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-14 10:21:15 +08:00
Xuewei Meng	43d70feb78	GSoC: Support fast guided filter. Two modes are supported in guided filter, basic mode and fast mode. Basic mode is the initial pushed guided filter without optimization. Fast mode is implemented based on the basic one by sub-sampling method. The sub-sampling ratio which can be defined by users controls the algorithm complexity. The larger the sub-sampling ratio, the lower the algorithm complexity. Signed-off-by: Xuewei Meng <xwmeng96@gmail.com> Reviewed-by: Steven Liu <liuqi05@kuaishou.com>	2021-05-13 11:59:11 +08:00
Limin Wang	2899fb61d2	avfilter/dnn/dnn_backend_tf: fix cross library usage duplicate ff_hex_to_data() function from avformat and rename it to hex_to_data() as static function. Reviewed-by: Guo, Yejun <yejun.guo@intel.com> Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-05-11 18:46:14 +08:00
Steven Liu	7ce0f246f4	avfilter/vf_dnn_classify: add result check for av_frame_get_side_data CID: 1482090 av_frame_get_side_data can return NULL, and sd->data is used after the call, so the return value should be checked for NULL. Signed-off-by: Steven Liu <liuqi05@kuaishou.com>	2021-05-11 10:49:33 +08:00
Ting Fu	c38bc5634d	dnn/vf_dnn_detect.c: add tensorflow output parse support Testing model is tensorflow official model in github repo, please refer to https://github.com/tensorflow/models/blob/master/research/object_detection/g3doc/tf2_detection_zoo.md to download the detect model as you need. For example, local testing was carried out with 'ssd_mobilenet_v2_coco_2018_03_29.tar.gz', and used one image of dog in https://github.com/tensorflow/models/blob/master/research/object_detection/test_images/image1.jpg Testing command is: ./ffmpeg -i image1.jpg -vf dnn_detect=dnn_backend=tensorflow:input=image_tensor:output="num_detections&detection_scores&detection_classes&detection_boxes":model=ssd_mobilenet_v2_coco.pb,showinfo -f null - We will see a result similar to the one below: [Parsed_showinfo_1 @ 0x33e65f0] side data - detection bounding boxes: [Parsed_showinfo_1 @ 0x33e65f0] source: ssd_mobilenet_v2_coco.pb [Parsed_showinfo_1 @ 0x33e65f0] index: 0, region: (382, 60) -> (1005, 593), label: 18, confidence: 9834/10000. [Parsed_showinfo_1 @ 0x33e65f0] index: 1, region: (12, 8) -> (328, 549), label: 18, confidence: 8555/10000. [Parsed_showinfo_1 @ 0x33e65f0] index: 2, region: (293, 7) -> (682, 458), label: 1, confidence: 8033/10000. [Parsed_showinfo_1 @ 0x33e65f0] index: 3, region: (342, 0) -> (690, 325), label: 1, confidence: 5878/10000. There are two boxes of dog with scores 98.34% & 85.55% and two boxes of person with scores 80.33% & 58.78%. Signed-off-by: Ting Fu <ting.fu@intel.com> Signed-off-by: Guo, Yejun <yejun.guo@intel.com>	2021-05-11 10:38:36 +08:00
Ting Fu	e42125edab	lavfi/dnn_backend_tensorflow: support detect model Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-11 10:28:35 +08:00
Ting Fu	1b1064054c	lavfi/dnn_backend_tensorflow: add multiple outputs support Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-11 10:28:35 +08:00
Ting Fu	f02928eb5a	dnn: add DCO_RGB color order to enum DNNColorOrder Adding DCO_RGB color order to DNNColorOrder, since tensorflow model needs this kind of color order as input. Signed-off-by: Ting Fu <ting.fu@intel.com>	2021-05-11 10:28:35 +08:00
Andreas Rheinhardt	7fac6efa97	avfilter/vf_guided: Add missing const Forgotten in `f8d910e90f`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-05-11 00:29:16 +02:00
Xuewei Meng	f8d910e90f	GSoC: Add guided filter Add examples on how to use this filter, and improve the code style. Implement the slice-level parallelism for guided filter. Add the basic version of guided filter. Signed-off-by: Xuewei Meng <xwmeng96@gmail.com> Reviewed-by: Steven Liu <liuqi05@kuaishou.com>	2021-05-10 13:34:29 +08:00
Guo, Yejun	41ef57fdb2	lavfi/dnn_classify: add filter dnn_classify for classification based on detection bounding boxes classification is done on every detection bounding box in frame's side data, which are the results of object detection (filter dnn_detect). Please refer to commit log of dnn_detect for the material for detection, and see below for classification. - download material for classification: wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.bin wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.xml wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.label - run command as: ./ffmpeg -i cici.jpg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=face-detection-adas-0001.label,dnn_classify=dnn_backend=openvino:model=emotions-recognition-retail-0003.xml:input=data:output=prob_emotion:confidence=0.3:labels=emotions-recognition-retail-0003.label:target=face,showinfo -f null - We'll see the detect&classify result as below: [Parsed_showinfo_2 @ 0x55b7d25e77c0] side data - detection bounding boxes: [Parsed_showinfo_2 @ 0x55b7d25e77c0] source: face-detection-adas-0001.xml, emotions-recognition-retail-0003.xml [Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 0, region: (1005, 813) -> (1086, 905), label: face, confidence: 10000/10000. [Parsed_showinfo_2 @ 0x55b7d25e77c0] classify: label: happy, confidence: 6757/10000. [Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 1, region: (888, 839) -> (967, 926), label: face, confidence: 6917/10000. [Parsed_showinfo_2 @ 0x55b7d25e77c0] classify: label: anger, confidence: 4320/10000. Signed-off-by: Guo, Yejun <yejun.guo@intel.com>	2021-05-06 10:50:44 +08:00
Guo, Yejun	fc26dca64e	lavfi/dnn: add classify support with openvino backend Signed-off-by: Guo, Yejun <yejun.guo@intel.com>	2021-05-06 10:50:44 +08:00
Guo, Yejun	a3b74651a0	lavfi/dnn: refine dnn interface to add DNNExecBaseParams Different function types of model require different parameters, for example, object detection detects lots of objects (cat/dog/...) in the frame, and classification needs to know which object (cat or dog) it is going to classify. The current interface would need a new function with more parameters to support each new requirement; with this change, we can just add a new struct (for example DNNExecClassifyParams) based on DNNExecBaseParams, and so we can continue to use the current interface execute_model just with params changed.	2021-05-06 10:50:44 +08:00
Guo, Yejun	7eb9accc37	lavfi/dnn_backend_openvino.c: move the logic for batch mode earlier	2021-05-06 10:50:44 +08:00
Guo, Yejun	e37cc72387	lavfi/dnn_backend_openvino.c: add InferenceItem between TaskItem and RequestItem There's one task item for one function call from dnn interface, there's one request item for one call to openvino. For classify, one task might need multiple inference for classification on every bounding box, so add InferenceItem.	2021-05-06 10:50:44 +08:00
Guo, Yejun	1b5dc712cd	lavfi/dnn_backend_openvino.c: unify code for infer request for sync/async	2021-05-06 10:50:44 +08:00
Shubhanshu Saxena	26d3fe1a52	lavfi/dnn_backend_native_layer_avgpool.c: Correct Spelling of Pixel Correct spelling of word `pixel` from `pxiels` Signed-off-by: Shubhanshu Saxena <shubhanshu.e01@gmail.com>	2021-05-06 10:17:57 +08:00
Limin Wang	c7c138e411	avfilter/vf_identity: fix typo Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-05-01 08:45:30 +08:00
Limin Wang	d150a9eb44	avfilter/vf_identity: remove unnecessary check Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-05-01 08:45:30 +08:00
Limin Wang	8410000f17	avfilter/vf_psnr: remove unnecessary check Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-05-01 08:45:30 +08:00
Limin Wang	fd3dabe68e	avfilter/vf_ssim: remove unnecessary check The pointer has already been checked in the previous few lines of code. Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-05-01 08:45:30 +08:00
James Almer	92769f260d	avfilter/vf_scale: store the offset in a local variable before adding it Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-30 19:35:56 -03:00
Limin Wang	f183d6555e	avfilter/dnn/dnn_backend_tf: simplify the code with ff_hex_to_data please use tools/python/tf_sess_config.py to get the sess_config after that. note the byte order of session config is in normal order. bump the MICRO version for the config change. Signed-off-by: Limin Wang <lance.lmwang@gmail.com>	2021-04-29 20:02:29 +08:00
Andreas Rheinhardt	a04ad248a0	avfilter: Constify all AVFilters This is possible now that the next-API is gone. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 11:48:05 -03:00
Anton Khirnov	85ba17f36d	Bump major versions of all libraries. Signed-off-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-04-27 11:48:05 -03:00
James Almer	90262f3fb4	avfilter/buffersrc: postpone removal of sws_param It was deprecated less than two years ago. Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 11:48:04 -03:00
James Almer	0bf3a7361d	avutil: remove deprecated AVClass.child_class_next Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 11:48:04 -03:00
Andreas Rheinhardt	420cedd497	libavresample: Remove deprecated library Deprecated in `c29038f304`. The resample filter based upon this library has been removed as well. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 10:43:13 -03:00
Andreas Rheinhardt	ef6a9e5e31	avutil/buffer: Switch AVBuffer API to size_t Announced in `14040a1d91`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 10:43:13 -03:00
Andreas Rheinhardt	985c0dac67	avutil/pixdesc: Remove deprecated AV_PIX_FMT_FLAG_PSEUDOPAL Deprecated in `d6fc031caf`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 10:43:13 -03:00
Andreas Rheinhardt	3b56fa85e8	avutil/frame: Remove deprecated AVFrame.error Deprecated in `1aa24df74c`. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@gmail.com> Signed-off-by: James Almer <jamrial@gmail.com>	2021-04-27 10:43:12 -03:00
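
The showinfo lines quoted in the message of commit c38bc5634d report each detection's confidence as a fraction such as 9834/10000. A minimal sketch for turning those lines into percentages follows. The regular expression is an assumption inferred only from the sample output quoted in that commit message, not a documented log format, and the names `BBOX_RE`, `parse_bboxes` and `SAMPLE` are hypothetical.

```python
import re

# Assumed line shape, taken from the sample output quoted in commit c38bc5634d;
# real showinfo output may differ between FFmpeg versions.
BBOX_RE = re.compile(
    r"index: (?P<index>\d+), "
    r"region: \((?P<x0>\d+), (?P<y0>\d+)\) -> \((?P<x1>\d+), (?P<y1>\d+)\), "
    r"label: (?P<label>[^,]+), "
    r"confidence: (?P<num>\d+)/(?P<den>\d+)\."
)


def parse_bboxes(log_text):
    """Return one dict per bounding-box line found in log_text."""
    boxes = []
    for m in BBOX_RE.finditer(log_text):
        boxes.append({
            "index": int(m["index"]),
            "region": (int(m["x0"]), int(m["y0"]), int(m["x1"]), int(m["y1"])),
            "label": m["label"],
            # Convert the num/den fraction into a percentage.
            "confidence_pct": 100.0 * int(m["num"]) / int(m["den"]),
        })
    return boxes


# The four bounding-box lines quoted in the commit message.
SAMPLE = """\
[Parsed_showinfo_1 @ 0x33e65f0] index: 0, region: (382, 60) -> (1005, 593), label: 18, confidence: 9834/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 1, region: (12, 8) -> (328, 549), label: 18, confidence: 8555/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 2, region: (293, 7) -> (682, 458), label: 1, confidence: 8033/10000.
[Parsed_showinfo_1 @ 0x33e65f0] index: 3, region: (342, 0) -> (690, 325), label: 1, confidence: 5878/10000.
"""

for box in parse_bboxes(SAMPLE):
    print(f"box {box['index']}: label {box['label']}, {box['confidence_pct']:.2f}%")
```

Labels are kept as strings because the same field holds numeric class ids (18, 1) in this commit and names such as face in the dnn_classify example.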


9148 Commits