mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-08-10 06:10:52 +02:00

Go to file

Guo, Yejun 37d24a6c8f vf_dnn_processing: add support for more formats gray8 and grayf32

The following is a python script to halve the value of the gray
image. It demos how to setup and execute dnn model with python+tensorflow.
It also generates .pb file which will be used by ffmpeg.

import tensorflow as tf
import numpy as np
from skimage import color
from skimage import io
in_img = io.imread('input.jpg')
in_img = color.rgb2gray(in_img)
io.imsave('ori_gray.jpg', np.squeeze(in_img))
in_data = np.expand_dims(in_img, axis=0)
in_data = np.expand_dims(in_data, axis=3)
filter_data = np.array([0.5]).reshape(1,1,1,1).astype(np.float32)
filter = tf.Variable(filter_data)
x = tf.placeholder(tf.float32, shape=[1, None, None, 1], name='dnn_in')
y = tf.nn.conv2d(x, filter, strides=[1, 1, 1, 1], padding='VALID', name='dnn_out')
sess=tf.Session()
sess.run(tf.global_variables_initializer())
graph_def = tf.graph_util.convert_variables_to_constants(sess, sess.graph_def, ['dnn_out'])
tf.train.write_graph(graph_def, '.', 'halve_gray_float.pb', as_text=False)
print("halve_gray_float.pb generated, please use \
path_to_ffmpeg/tools/python/convert.py to generate halve_gray_float.model\n")
output = sess.run(y, feed_dict={x: in_data})
output = output * 255.0
output = output.astype(np.uint8)
io.imsave("out.jpg", np.squeeze(output))

To do the same thing with ffmpeg:
- generate halve_gray_float.pb with the above script
- generate halve_gray_float.model with tools/python/convert.py
- try with following commands
  ./ffmpeg -i input.jpg -vf format=grayf32,dnn_processing=model=halve_gray_float.model:input=dnn_in:output=dnn_out:dnn_backend=native out.native.png
  ./ffmpeg -i input.jpg -vf format=grayf32,dnn_processing=model=halve_gray_float.pb:input=dnn_in:output=dnn_out:dnn_backend=tensorflow out.tf.png

Signed-off-by: Guo, Yejun <yejun.guo@intel.com>
Signed-off-by: Pedro Arthur <bygrandao@gmail.com>

2020-01-07 10:51:38 -03:00

compat

compat/avisynth: Fix unicode compilation.

2020-01-01 16:38:28 +01:00

doc

vf_dnn_processing: add support for more formats gray8 and grayf32

2020-01-07 10:51:38 -03:00

ffbuild

build: add support for building CUDA files with clang

2019-08-04 19:08:08 +02:00

fftools

ffmpeg: don't force source-tracked keyframes for duplicates

2020-01-05 12:01:46 +05:30

libavcodec

avcodec/libvpxenc,cosmetics: prefer sizeof(var)

2020-01-06 22:35:38 -08:00

libavdevice

Silence "string-plus-int" warning shown by clang.

2020-01-06 22:38:56 +01:00

libavfilter

vf_dnn_processing: add support for more formats gray8 and grayf32

2020-01-07 10:51:38 -03:00

libavformat

avformat/aviobuf: Honor avio_open[2] documentation

2020-01-07 02:44:05 +01:00

libavresample

avresample: remove deprecated attribute from the AVAudioResampleContext struct

2018-01-09 10:56:53 -03:00

libavutil

Silence "string-plus-int" warning shown by clang.

2020-01-06 22:38:56 +01:00

libpostproc

Silence "string-plus-int" warning shown by clang.

2020-01-06 22:38:56 +01:00

libswresample

Silence "string-plus-int" warning shown by clang.

2020-01-06 22:38:56 +01:00

libswscale

Silence "string-plus-int" warning shown by clang.

2020-01-06 22:38:56 +01:00

presets

…

tests

avformat/matroskadec: Fix lzo decompression

2019-12-28 22:40:13 -03:00

tools

tools/target_dec_fuzzer: Stop negative block_align and sampling rate

2019-12-31 12:37:47 +01:00

.gitattributes

fate: add SCC test

2017-01-27 17:06:42 +01:00

.gitignore

tools/python: add script to convert TensorFlow model (.pb) to native model (.model)

2019-07-01 10:23:47 -03:00

.travis.yml

Merge commit '899ee03088d55152a48830df0899887f055da1de'

2019-03-14 15:53:16 -03:00

Changelog

avfilter: add thistogram video filter

2019-12-29 15:33:55 +01:00

configure

configure: Change the configure check for tonemap_vaapi

2020-01-07 00:04:50 +00:00

CONTRIBUTING.md

Add CONTRIBUTING.md

2016-09-18 10:02:13 +01:00

COPYING.GPLv2

…

COPYING.GPLv3

…

COPYING.LGPLv2.1

…

COPYING.LGPLv3

…

CREDITS

…

INSTALL.md

INSTALL.md: Fix Markdown formatting

2019-01-31 10:29:16 -09:00

LICENSE.md

avfilter/vf_geq: Relicense to LGPL

2019-12-28 11:20:48 +01:00

MAINTAINERS

MAINTAINERS: add myself as libxavs2 maintainer

2019-12-07 19:34:57 +01:00

Makefile

tools: add a fuzzer tool for bitstream filters

2019-12-05 20:49:15 -03:00

README.md

Remove the ffserver program

2018-01-06 18:31:37 +00:00

RELEASE

RELEASE: Update value for post 4.2 branch

2019-07-21 18:37:32 +02:00

README.md

FFmpeg README

FFmpeg is a collection of libraries and tools to process multimedia content such as audio, video, subtitles and related metadata.

Libraries

libavcodec provides implementation of a wider range of codecs.
libavformat implements streaming protocols, container formats and basic I/O access.
libavutil includes hashers, decompressors and miscellaneous utility functions.
libavfilter provides a mean to alter decoded Audio and Video through chain of filters.
libavdevice provides an abstraction to access capture and playback devices.
libswresample implements audio mixing and resampling routines.
libswscale implements color conversion and scaling routines.

Tools

ffmpeg is a command line toolbox to manipulate, convert and stream multimedia content.
ffplay is a minimalistic multimedia player.
ffprobe is a simple analysis tool to inspect multimedia content.
Additional small tools such as aviocat, ismindex and qt-faststart.

Documentation

The offline documentation is available in the doc/ directory.

The online documentation is available in the main website and in the wiki.

Examples

Coding examples are available in the doc/examples directory.

License

FFmpeg codebase is mainly LGPL-licensed with optional components licensed under GPL. Please refer to the LICENSE file for detailed information.

Contributing

Patches should be submitted to the ffmpeg-devel mailing list using git format-patch or git send-email. Github pull requests should be avoided because they are not part of our review process and will be ignored.

Languages

C 90.1%

Assembly 7.9%

Makefile 1.3%

C++ 0.2%

Objective-C 0.2%

Other 0.1%