mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-01-19 05:49:09 +02:00
FFmpeg/libavfilter
Wu Jianhua 4041c1029b libavfilter/x86/vf_gblur: add localbuf and ff_horiz_slice_avx2/512()
We introduce ff_horiz_slice_avx2/512(), implemented with a new algorithm.
In a nutshell, the new algorithm does three things: it gathers data from
8/16 rows, blurs the data, and scatters the data back to the image
buffer. Here we use a customized 8x8/16x16 transpose to avoid the huge
overhead of the gather and scatter instructions; the transpose relies on
a newly added temporary buffer called localbuf.
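
The three steps above can be illustrated with a minimal scalar sketch in C. It is not the assembly from this commit: the names horiz_slice_ref, horiz_slice_blocked and ROWS are hypothetical, the per-row recursion is assumed to match the existing C horizontal pass, and the height is assumed to be a multiple of 8. The point is only the data layout: after the transpose into localbuf, the 8 values that share a column index are contiguous, so one vector load could fetch them without a gather instruction.

```c
#include <assert.h>
#include <stddef.h>

#define ROWS 8 /* rows handled together; 8 floats fill one 256-bit register */

/* Reference: recursive blur applied to one row at a time (assumed form). */
static void horiz_slice_ref(float *buffer, int width, int height,
                            int steps, float nu, float bscale)
{
    for (int y = 0; y < height; y++) {
        float *ptr = buffer + (size_t)width * y;
        for (int step = 0; step < steps; step++) {
            ptr[0] *= bscale;
            for (int x = 1; x < width; x++)      /* filter rightwards */
                ptr[x] += nu * ptr[x - 1];
            ptr[width - 1] *= bscale;
            for (int x = width - 1; x > 0; x--)  /* filter leftwards */
                ptr[x - 1] += nu * ptr[x];
        }
    }
}

/* Sketch: transpose ROWS rows into localbuf, blur, transpose back.
 * localbuf must hold width * ROWS floats. */
static void horiz_slice_blocked(float *buffer, int width, int height,
                                int steps, float nu, float bscale,
                                float *localbuf)
{
    assert(height % ROWS == 0); /* simplification made by this sketch */

    for (int y = 0; y < height; y += ROWS) {
        float *base = buffer + (size_t)width * y;

        /* 1. "Gather": localbuf[x * ROWS + r] holds row r, column x. */
        for (int r = 0; r < ROWS; r++)
            for (int x = 0; x < width; x++)
                localbuf[x * ROWS + r] = base[(size_t)r * width + x];

        /* 2. Blur: each inner loop over r maps to one vector operation. */
        for (int step = 0; step < steps; step++) {
            for (int r = 0; r < ROWS; r++)
                localbuf[r] *= bscale;
            for (int x = 1; x < width; x++)
                for (int r = 0; r < ROWS; r++)
                    localbuf[x * ROWS + r] += nu * localbuf[(x - 1) * ROWS + r];
            for (int r = 0; r < ROWS; r++)
                localbuf[(width - 1) * ROWS + r] *= bscale;
            for (int x = width - 1; x > 0; x--)
                for (int r = 0; r < ROWS; r++)
                    localbuf[(x - 1) * ROWS + r] += nu * localbuf[x * ROWS + r];
        }

        /* 3. "Scatter": transpose the blurred block back into the image. */
        for (int r = 0; r < ROWS; r++)
            for (int x = 0; x < width; x++)
                base[(size_t)r * width + x] = localbuf[x * ROWS + r];
    }
}
```

Both functions should produce the same image; the blocked variant only reorders memory so that the recursion along x can advance 8 rows per step.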

Performance data:
ff_horiz_slice_avx2(old): 109.89
ff_horiz_slice_avx2(new): 666.67
ff_horiz_slice_avx512: 1000

Co-authored-by: Cheng Yanfei <yanfei.cheng@intel.com>
Co-authored-by: Jin Jun <jun.i.jin@intel.com>
Signed-off-by: Wu Jianhua <jianhua.wu@intel.com>
2021-08-29 19:58:33 +02:00