mirror of
https://github.com/FFmpeg/FFmpeg.git
synced 2024-12-07 11:13:41 +02:00
697533e76d
Add an optional filter_line3 to the available optimisations. filter_line3 is equivalent to filter_line, memcpy, filter_line. filter_line shares quite a number of loads and some calculations in common with its next iteration, and testing shows that with aarch64 neon, filter_line3's performance is 30% better than two filter_lines and a memcpy. Adds a test for vf_bwdif filter_line3 to checkasm. Rounds job start lines down to a multiple of 4. This means that if filter_line3 exists then filter_line will not sometimes be called once at the end of a slice depending on thread count. The final slice may do up to 3 extra lines but filter_edge is faster than filter_line so it is unlikely to create any noticeable thread load variation. Signed-off-by: John Cox <jc@kynesim.co.uk> Signed-off-by: Martin Storsjö <martin@martin.st> |
||
---|---|---|
aarch64 | ||
arm | ||
riscv | ||
x86 | ||
.gitignore | ||
aacpsdsp.c | ||
af_afir.c | ||
alacdsp.c | ||
audiodsp.c | ||
av_tx.c | ||
blockdsp.c | ||
bswapdsp.c | ||
checkasm.c | ||
checkasm.h | ||
exrdsp.c | ||
fixed_dsp.c | ||
flacdsp.c | ||
float_dsp.c | ||
fmtconvert.c | ||
g722dsp.c | ||
h264chroma.c | ||
h264dsp.c | ||
h264pred.c | ||
h264qpel.c | ||
hevc_add_res.c | ||
hevc_deblock.c | ||
hevc_idct.c | ||
hevc_pel.c | ||
hevc_sao.c | ||
huffyuvdsp.c | ||
idctdsp.c | ||
jpeg2000dsp.c | ||
llviddsp.c | ||
llviddspenc.c | ||
lpc.c | ||
Makefile | ||
motion.c | ||
opusdsp.c | ||
pixblockdsp.c | ||
sbrdsp.c | ||
sw_gbrp.c | ||
sw_rgb.c | ||
sw_scale.c | ||
synth_filter.c | ||
utvideodsp.c | ||
v210dec.c | ||
v210enc.c | ||
vc1dsp.c | ||
vf_blend.c | ||
vf_bwdif.c | ||
vf_colorspace.c | ||
vf_convolution.c | ||
vf_eq.c | ||
vf_gblur.c | ||
vf_hflip.c | ||
vf_nlmeans.c | ||
vf_threshold.c | ||
videodsp.c | ||
vorbisdsp.c | ||
vp8dsp.c | ||
vp9dsp.c |
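
The commit message above describes two ideas: filter_line3 produces the same three output lines as filter_line, memcpy, filter_line, and each job's start line is rounded down to a multiple of 4. The following is a minimal sketch of both, not the code from vf_bwdif. `toy_filter_line`, `toy_filter_line3` and `slice_bounds` are hypothetical names, the toy kernel (a plain average of the lines above and below) stands in for the real bwdif filter, and the exact rounding expression is an assumption based on the description.

```c
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for the real bwdif kernel: interpolates one
 * missing line as the rounded average of the lines above and below. */
static void toy_filter_line(uint8_t *dst, const uint8_t *cur, int w, int stride)
{
    for (int x = 0; x < w; x++)
        dst[x] = (uint8_t)((cur[x - stride] + cur[x + stride] + 1) >> 1);
}

/* Reference form of a 3-line filter: filtered line, copied line,
 * filtered line. An optimised version can share loads between the
 * two filtered lines, but should produce the same output. */
static void toy_filter_line3(uint8_t *dst, int d_stride,
                             const uint8_t *cur, int s_stride, int w)
{
    toy_filter_line(dst, cur, w, s_stride);
    memcpy(dst + d_stride, cur + s_stride, (size_t)w);
    toy_filter_line(dst + 2 * d_stride, cur + 2 * s_stride, w, s_stride);
}

/* Assumed slice split: every boundary except the end of the frame is
 * rounded down to a multiple of 4, so only the final slice can have a
 * line count that is not a multiple of 4. */
static void slice_bounds(int h, int jobnr, int nb_jobs, int *start, int *end)
{
    *start = (h * jobnr / nb_jobs) & ~3;
    *end   = (jobnr == nb_jobs - 1) ? h : ((h * (jobnr + 1) / nb_jobs) & ~3);
}
```

With a 30-line frame split across 4 jobs, this gives slices [0, 4), [4, 12), [12, 20) and [20, 30). Rounding the boundaries down pushes the leftover lines into the final slice, which is the extra work the commit message mentions.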