mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-08-15 14:13:16 +02:00

Go to file

Guo, Yejun 41ef57fdb2 lavfi/dnn_classify: add filter dnn_classify for classification based on detection bounding boxes

classification is done on every detection bounding box in frame's side data,
which are the results of object detection (filter dnn_detect).

Please refer to commit log of dnn_detect for the material for detection,
and see below for classification.

- download material for classifcation:
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.bin
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.xml
wget https://github.com/guoyejun/ffmpeg_dnn/raw/main/models/openvino/2021.1/emotions-recognition-retail-0003.label

- run command as:
./ffmpeg -i cici.jpg -vf dnn_detect=dnn_backend=openvino:model=face-detection-adas-0001.xml:input=data:output=detection_out:confidence=0.6:labels=face-detection-adas-0001.label,dnn_classify=dnn_backend=openvino:model=emotions-recognition-retail-0003.xml:input=data:output=prob_emotion:confidence=0.3:labels=emotions-recognition-retail-0003.label:target=face,showinfo -f null -

We'll see the detect&classify result as below:
[Parsed_showinfo_2 @ 0x55b7d25e77c0]   side data - detection bounding boxes:
[Parsed_showinfo_2 @ 0x55b7d25e77c0] source: face-detection-adas-0001.xml, emotions-recognition-retail-0003.xml
[Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 0,  region: (1005, 813) -> (1086, 905), label: face, confidence: 10000/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0]            classify:  label: happy, confidence: 6757/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0] index: 1,  region: (888, 839) -> (967, 926), label: face, confidence: 6917/10000.
[Parsed_showinfo_2 @ 0x55b7d25e77c0]            classify:  label: anger, confidence: 4320/10000.

Signed-off-by: Guo, Yejun <yejun.guo@intel.com>

2021-05-06 10:50:44 +08:00

compat

atomics: Fix the win32 atomic_exchange function

2021-04-04 11:06:08 +03:00

doc

lavfi/dnn_classify: add filter dnn_classify for classification based on detection bounding boxes

2021-05-06 10:50:44 +08:00

ffbuild

libavresample: Remove deprecated library

2021-04-27 10:43:13 -03:00

fftools

ffprobe: add option to control optional fields display

2021-05-05 15:04:54 +05:30

libavcodec

avcodec/decode: stop trying to initialize palette values in avcodec_default_get_buffer2()

2021-05-05 16:39:52 -03:00

libavdevice

avdevice: Constify all devices

2021-04-27 11:48:05 -03:00

libavfilter

lavfi/dnn_classify: add filter dnn_classify for classification based on detection bounding boxes

2021-05-06 10:50:44 +08:00

libavformat

avformat/rpl: cosmetics

2021-05-06 11:06:43 +10:00

libavutil

avutil/mem: Also poison new av_realloc-allocated blocks

2021-04-30 10:24:32 +02:00

libpostproc

Bump major versions of all libraries.

2021-04-27 11:48:05 -03:00

libswresample

Bump major versions of all libraries.

2021-04-27 11:48:05 -03:00

libswscale

Bump major versions of all libraries.

2021-04-27 11:48:05 -03:00

presets

…

tests

tests/image: remove colorspace conversion from jpegls tests

2021-05-03 18:32:01 -03:00

tools

avfilter/dnn/dnn_backend_tf: simplify the code with ff_hex_to_data

2021-04-29 20:02:29 +08:00

.gitattributes

…

.gitignore

tools/python: add script to convert TensorFlow model (.pb) to native model (.model)

2019-07-01 10:23:47 -03:00

.mailmap

mailmap: add entry for myself

2021-03-09 02:09:55 +00:00

.travis.yml

Merge commit '899ee03088d55152a48830df0899887f055da1de'

2019-03-14 15:53:16 -03:00

Changelog

avformat/westwoodaudenc: Adds muxer for Westwood AUD format.

2021-04-26 19:56:33 +10:00

configure

lavfi/dnn_classify: add filter dnn_classify for classification based on detection bounding boxes

2021-05-06 10:50:44 +08:00

CONTRIBUTING.md

…

COPYING.GPLv2

…

COPYING.GPLv3

…

COPYING.LGPLv2.1

…

COPYING.LGPLv3

…

CREDITS

…

INSTALL.md

…

LICENSE.md

avfilter/vf_geq: Relicense to LGPL

2019-12-28 11:20:48 +01:00

MAINTAINERS

MAINTAINERS: add myself as adpcm maintainer

2021-03-25 12:51:10 +10:00

Makefile

libavresample: Remove deprecated library

2021-04-27 10:43:13 -03:00

README.md

…

RELEASE

Bump Versions before release/4.4 branch

2021-03-20 01:01:12 +01:00

README.md

FFmpeg README

FFmpeg is a collection of libraries and tools to process multimedia content such as audio, video, subtitles and related metadata.

Libraries

libavcodec provides implementation of a wider range of codecs.
libavformat implements streaming protocols, container formats and basic I/O access.
libavutil includes hashers, decompressors and miscellaneous utility functions.
libavfilter provides a mean to alter decoded Audio and Video through chain of filters.
libavdevice provides an abstraction to access capture and playback devices.
libswresample implements audio mixing and resampling routines.
libswscale implements color conversion and scaling routines.

Tools

ffmpeg is a command line toolbox to manipulate, convert and stream multimedia content.
ffplay is a minimalistic multimedia player.
ffprobe is a simple analysis tool to inspect multimedia content.
Additional small tools such as aviocat, ismindex and qt-faststart.

Documentation

The offline documentation is available in the doc/ directory.

The online documentation is available in the main website and in the wiki.

Examples

Coding examples are available in the doc/examples directory.

License

FFmpeg codebase is mainly LGPL-licensed with optional components licensed under GPL. Please refer to the LICENSE file for detailed information.

Contributing

Patches should be submitted to the ffmpeg-devel mailing list using git format-patch or git send-email. Github pull requests should be avoided because they are not part of our review process and will be ignored.

Languages

C 90.1%

Assembly 7.9%

Makefile 1.3%

C++ 0.2%

Objective-C 0.2%

Other 0.1%