1
0
mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-01-13 21:28:01 +02:00
FFmpeg/libavcodec/riscv
Rémi Denis-Courmont c654e37254 lavc/h264dsp: R-V V high-depth h264_idct8_add
Unlike the 8-bit version, we need two iterations to process this within
128-bit vectors. This adds some extra complexity for pointer arithmetic
and counting down which is unnecessary in the 8-bit variant.

Accordingly the gain relative to C are just slight better than half as
good with 128-bit vectors as with 256-bit ones.

T-Head C908 (2 iterations):
h264_idct8_add_9bpp_c:       17.5
h264_idct8_add_9bpp_rvv_i32: 10.0
h264_idct8_add_10bpp_c:      17.5
h264_idct8_add_10bpp_rvv_i32: 9.7
h264_idct8_add_12bpp_c:      17.7
h264_idct8_add_12bpp_rvv_i32: 9.7
h264_idct8_add_14bpp_c:      17.7
h264_idct8_add_14bpp_rvv_i32: 9.7

SpacemiT X60 (single iteration):
h264_idct8_add_9bpp_c:       15.2
h264_idct8_add_9bpp_rvv_i32:  5.0
h264_idct8_add_10bpp_c:      15.2
h264_idct8_add_10bpp_rvv_i32: 5.0
h264_idct8_add_12bpp_c:      14.7
h264_idct8_add_12bpp_rvv_i32: 5.0
h264_idct8_add_14bpp_c:      14.7
h264_idct8_add_14bpp_rvv_i32: 4.7
2024-07-14 21:06:50 +03:00
..
aacencdsp_init.c lavc/aacencdsp: R-V V quant_bands 2024-06-03 22:43:37 +03:00
aacencdsp_rvv.S lavc/aacencdsp: fix rounding in R-V V quantize_bands 2024-06-08 18:30:43 +03:00
aacpsdsp_init.c
aacpsdsp_rvv.S
ac3dsp_init.c
ac3dsp_rvb.S
ac3dsp_rvv.S
ac3dsp_rvvb.S
alacdsp_init.c
alacdsp_rvv.S
audiodsp_init.c
audiodsp_rvf.S
audiodsp_rvv.S
blockdsp_init.c
blockdsp_rvv.S
bswapdsp_init.c
bswapdsp_rvb.S
bswapdsp_rvv.S
cpu_common.c riscv: probe for Zbb extension at load time 2024-06-11 20:12:37 +03:00
exrdsp_init.c
exrdsp_rvv.S
flacdsp_init.c
flacdsp_rvv.S lavc/flacdsp: fix sign extension in R-V V wasted33 2024-06-07 17:53:05 +03:00
fmtconvert_init.c
fmtconvert_rvv.S
g722dsp_init.c
g722dsp_rvv.S
h263dsp_init.c
h263dsp_rvv.S
h264_chroma_init_riscv.c
h264_mc_chroma.S
h264dsp_init.c lavc/h264dsp: R-V V high-depth h264_idct8_add 2024-07-14 21:06:50 +03:00
h264dsp_rvv.S lavc/h264dsp: R-V V 8-bit h264_biweight_pixels 2024-07-09 18:03:30 +03:00
h264idct_rvv.S lavc/h264dsp: R-V V high-depth h264_idct8_add 2024-07-14 21:06:50 +03:00
huffyuvdsp_init.c
huffyuvdsp_rvv.S
idctdsp_init.c
idctdsp_rvv.S
jpeg2000dsp_init.c
jpeg2000dsp_rvv.S
llauddsp_init.c
llauddsp_rvv.S
llviddsp_init.c
llviddsp_rvv.S
llvidencdsp_init.c
llvidencdsp_rvv.S
lpc_init.c lavc/lpc: optimise RVV vector type for compute_autocorr 2024-05-29 16:57:02 +03:00
lpc_rvv.S riscv: allow passing addend to vtype_vli macro 2024-05-30 18:30:52 +03:00
Makefile lavc/h264dsp: R-V V 8-bit h264_idct_add16 2024-07-05 18:56:02 +03:00
me_cmp_init.c
me_cmp_rvv.S
opusdsp_init.c
opusdsp_rvv.S
pixblockdsp_init.c
pixblockdsp_rvi.S
pixblockdsp_rvv.S
rv34dsp_init.c
rv34dsp_rvv.S
rv40dsp_init.c
rv40dsp_rvv.S
sbrdsp_init.c lavc/sbrdsp: add support for 256-bit vectors 2024-05-31 22:22:43 +03:00
sbrdsp_rvv.S lavc/sbrdsp: fold immediate offset into relocation 2024-05-28 19:44:11 +03:00
startcode_rvb.S
startcode_rvv.S lavc/startcode: fix RVV return value on no match 2024-05-28 19:43:40 +03:00
svqenc_init.c
svqenc_rvv.S
takdsp_init.c
takdsp_rvv.S
utvideodsp_init.c
utvideodsp_rvv.S
vc1dsp_init.c lavc/vc1dsp: R-V V vc1_inv_trans_4x4 2024-06-07 17:53:05 +03:00
vc1dsp_rvi.S
vc1dsp_rvv.S lavc/vc1dsp: fuse multiply-adds in R-V V inv_trans_8 2024-07-03 18:16:36 +03:00
vorbisdsp_init.c
vorbisdsp_rvv.S
vp7dsp_init.c lavc/vp7dsp: add R-V V vp7_idct_dc_add4uv 2024-06-04 17:42:07 +03:00
vp7dsp_rvv.S lavc/vp7dsp: add R-V V vp7_idct_dc_add4uv 2024-06-04 17:42:07 +03:00
vp8dsp_init.c lavc/vp8dsp: R-V V vp8_idct_add 2024-06-08 18:30:43 +03:00
vp8dsp_rvi.S
vp8dsp_rvv.S lavc/vp8dsp: R-V V bilin_load to bilin_load_h 2024-06-12 18:38:41 +03:00
vp8dsp.h
vp9_intra_rvi.S
vp9_intra_rvv.S
vp9_mc_rvi.S
vp9_mc_rvv.S lavc/vp9dsp: R-V V rename ff_avg to ff_vp9_avg 2024-05-30 18:30:52 +03:00
vp9dsp_init.c lavc/vp9dsp: R-V V rename ff_avg to ff_vp9_avg 2024-05-30 18:30:52 +03:00
vp9dsp.h lavc/vp9dsp: R-V V rename ff_avg to ff_vp9_avg 2024-05-30 18:30:52 +03:00