FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-21 10:55:51 +02:00

History

Hubert Mazur 1e9cfa5bb0 sw_scale: Add specializations for hscale 8 to 19 Add arm64 neon implementations for hscale 8 to 19 with filter sizes 4, 4X and 8. Both implementations are based on very similar ones dedicated to hscale 8 to 15. The major changes refer to saving the data - instead of writing the result as int16_t it is done with int32_t. These functions are heavily inspired on patches provided by J. Swinney and M. Storsjö for hscale8to15 which were slightly adapted for hscale8to19. The tests and benchmarks run on AWS Graviton 2 instances. The results from a checkasm tool shown below. hscale_8_to_19__fs_4_dstW_512_c: 5663.2 hscale_8_to_19__fs_4_dstW_512_neon: 1259.7 hscale_8_to_19__fs_8_dstW_512_c: 9306.0 hscale_8_to_19__fs_8_dstW_512_neon: 2020.2 hscale_8_to_19__fs_12_dstW_512_c: 12932.7 hscale_8_to_19__fs_12_dstW_512_neon: 2462.5 hscale_8_to_19__fs_16_dstW_512_c: 16844.2 hscale_8_to_19__fs_16_dstW_512_neon: 4671.2 hscale_8_to_19__fs_32_dstW_512_c: 32803.7 hscale_8_to_19__fs_32_dstW_512_neon: 5474.2 hscale_8_to_19__fs_40_dstW_512_c: 40948.0 hscale_8_to_19__fs_40_dstW_512_neon: 6669.7 Signed-off-by: Hubert Mazur <hum@semihalf.com> Signed-off-by: Martin Storsjö <martin@martin.st>		2022-11-01 15:24:43 +02:00
..
aarch64	sw_scale: Add specializations for hscale 8 to 19	2022-11-01 15:24:43 +02:00
arm	sws: rename SwsContext.swscale to convert_unscaled	2021-07-03 15:57:53 +02:00
loongarch	swscale/la: Add output_lasx.c file.	2022-09-10 22:56:39 +02:00
ppc	sws: rename SwsContext.swscale to convert_unscaled	2021-07-03 15:57:53 +02:00
riscv	sws/rgb2rgb: RISC-V 64-bit V packed YUYV/UYVY to planar 4:2:2	2022-09-30 07:25:44 +02:00
tests	swscale: introduce isSwappedChroma	2022-01-04 19:39:22 -06:00
x86	swscale/x86/rgb_2_rgb: Empty MMX state in ff_shuffle_bytes_2103_mmxext	2022-08-23 12:21:00 +02:00
alphablend.c	swscale/alphablend: Fix slice handling	2021-10-03 20:38:29 +02:00
bayer_template.c
gamma.c
half2float.c	swscale/input: add rgbaf16 input support	2022-08-19 22:09:36 +02:00
hscale_fast_bilinear.c
hscale.c	swscale: add opaque parameter to input functions	2022-08-19 22:09:36 +02:00
input.c	swscale/input: Avoid calls to av_pix_fmt_desc_get()	2022-09-19 23:40:41 +02:00
libswscale.v
log2_tab.c
Makefile	swscale/input: add rgbaf16 input support	2022-08-19 22:09:36 +02:00
options.c	Remove unnecessary libavutil/(avutil\|common\|internal).h inclusions	2022-02-24 12:56:49 +01:00
output.c	swscale/output: Don't call av_pix_fmt_desc_get() in a loop	2022-09-19 23:40:41 +02:00
rgb2rgb_template.c
rgb2rgb.c	sws/rgb2rgb: RISC-V V shuffle_bytes_xxxx functions	2022-09-30 07:24:09 +02:00
rgb2rgb.h	sws/rgb2rgb: RISC-V V shuffle_bytes_xxxx functions	2022-09-30 07:24:09 +02:00
slice.c	swscale/input: add rgbaf16 input support	2022-08-19 22:09:36 +02:00
swscale_internal.h	swscale/la: Optimize hscale functions with lasx.	2022-09-10 22:56:38 +02:00
swscale_unscaled.c	libswscale: force a minimum size of the slide for bayer sources	2022-10-14 12:19:13 +02:00
swscale.c	swscale/la: Optimize hscale functions with lasx.	2022-09-10 22:56:38 +02:00
swscale.h	swscale: document some missing arguments	2022-10-17 09:56:47 +02:00
swscaleres.rc
utils.c	swscale/la: Optimize hscale functions with lasx.	2022-09-10 22:56:38 +02:00
version_major.h	libswscale: Split version.h	2022-03-16 14:05:26 +02:00
version.c	lib*/version: Move library version functions into files of their own	2022-05-10 06:49:32 +02:00
version.h	swscale/output: add support for Y210LE and Y212LE	2022-09-10 12:29:12 -07:00
vscale.c	Replace all occurences of av_mallocz_array() by av_calloc()	2021-09-20 01:03:52 +02:00
yuv2rgb.c	swscale/la: Add yuv2rgb_lasx.c and rgb2rgb_lasx.c files	2022-09-10 22:56:38 +02:00