mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-01-19 05:49:09 +02:00
FFmpeg/libavcodec
Rémi Denis-Courmont 295092b46d lavc/flacdsp: R-V V LPC32
The entire set of 32 coefficients and the corresponding past 32 samples
can fit in a single vector (with LMUL=8) exactly, but since widening
doubles the needed vector size, we still end up too short with 128-bit
vectors. This adds a very simple version for future 256+-bit hardware
and for pred_order values up to 16, and a slightly more involved loop
for 128-bit hardware with pred_order values between 17 and 32.
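
The sizing argument above can be illustrated with a minimal scalar sketch. It is not the code from this commit: `group_capacity` and `lpc32_scalar` are hypothetical names, and the loop is only modeled on the usual shape of a 32-bit FLAC LPC restore, where each 32-bit product is accumulated in 64 bits. That widening is what halves the number of elements a vector group can hold.

```c
#include <stdint.h>

/* Hypothetical helper: number of elements of elem_bits bits that fit in a
 * register group of lmul vector registers, each vlen_bits bits wide. */
static unsigned group_capacity(unsigned vlen_bits, unsigned lmul,
                               unsigned elem_bits)
{
    return vlen_bits * lmul / elem_bits;
}

/* Scalar sketch (assumed shape, not the RVV implementation): predict each
 * sample from the pred_order previous ones, accumulating in 64 bits.
 * Note: right-shifting a negative value is implementation-defined in C;
 * mainstream compilers perform an arithmetic shift. */
static void lpc32_scalar(int32_t *samples, const int32_t *coeffs,
                         int pred_order, int qlevel, int len)
{
    for (int i = pred_order; i < len; i++) {
        int64_t sum = 0;

        for (int j = 0; j < pred_order; j++)
            sum += (int64_t)coeffs[j] * samples[i - pred_order + j];

        /* Add in unsigned arithmetic to avoid signed-overflow UB. */
        samples[i] = (int32_t)((uint32_t)samples[i] +
                               (uint32_t)(sum >> qlevel));
    }
}
```

With these assumptions, `group_capacity(128, 8, 32)` is 32 (all coefficients fit at 32 bits), `group_capacity(128, 8, 64)` is 16 (hence the limit of 16 for the simple version on 128-bit hardware), and `group_capacity(256, 8, 64)` is 32.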

With 128-bit hardware, the benchmarks look like this:
flac_lpc_32_13_c:       30152.0
flac_lpc_32_13_rvv_i32: 10244.7
flac_lpc_32_16_c:       37314.2
flac_lpc_32_16_rvv_i32: 10126.2
flac_lpc_32_29_c:       61910.0
flac_lpc_32_29_rvv_i32: 14495.2
flac_lpc_32_32_c:       68204.0
flac_lpc_32_32_rvv_i32: 13273.7
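
To read the figures above as ratios, here is a small sketch that divides each C timing by the matching RVV timing. It assumes lower numbers are better (as is usual for this kind of benchmark output); the `bench` struct and `speedup` helper are hypothetical names introduced only for illustration.

```c
#include <stddef.h>

/* One benchmark pair copied from the figures above. */
struct bench {
    int    pred_order;
    double c;    /* flac_lpc_32_<order>_c */
    double rvv;  /* flac_lpc_32_<order>_rvv_i32 */
};

static const struct bench results[] = {
    { 13, 30152.0, 10244.7 },
    { 16, 37314.2, 10126.2 },
    { 29, 61910.0, 14495.2 },
    { 32, 68204.0, 13273.7 },
};

static const size_t nb_results = sizeof(results) / sizeof(results[0]);

/* Ratio of the C timing to the RVV timing; above 1.0 means RVV is faster. */
static double speedup(const struct bench *b)
{
    return b->c / b->rvv;
}
```

Under that assumption, the ratios range from roughly 2.9 (order 13) to roughly 5.1 (order 32).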
2023-11-18 22:05:43 +02:00