Rémi Denis-Courmont
a14d21a446
lavu/riscv: add forward-edge CFI landing pads
2024-07-25 23:10:14 +03:00
Rémi Denis-Courmont
5f10173fa1
lavu/riscv: require B or zba explicitly
2024-07-25 18:55:48 +03:00
Rémi Denis-Courmont
e33ce0d9dd
lavu/fixed_dsp: R-V V fmul_window_scaled
...
vector_fmul_window_scaled_fixed_c: 4393.7
vector_fmul_window_scaled_fixed_rvv_i64: 1642.7
2023-11-23 18:57:18 +02:00
Rémi Denis-Courmont
3a134e8299
lavu/fixed_dsp: optimise R-V V fmul_reverse
...
Gathers are (unsurprisingly) a notable exception to the rule that R-V V
gets faster with larger group multipliers. So roll the function to speed
it up.
Before:
vector_fmul_reverse_fixed_c: 2840.7
vector_fmul_reverse_fixed_rvv_i32: 2430.2
After:
vector_fmul_reverse_fixed_c: 2841.0
vector_fmul_reverse_fixed_rvv_i32: 962.2
It might be possible to further optimise the function by moving the
reverse-subtract out of the loop and adding ad-hoc tail handling.
2023-11-23 18:57:18 +02:00
Rémi Denis-Courmont
f39a8790e1
lavu/fixed_dsp: R-V V vector_fmul_window
2023-10-09 19:52:28 +03:00
Rémi Denis-Courmont
10eb3b9c9f
lavu/fixed_dsp: R-V V vector_fmul
...
vector_fmul_fixed_c: 4.0
vector_fmul_fixed_rvv_i64: 0.5
2023-10-09 19:52:28 +03:00
Rémi Denis-Courmont
da7a77fb0a
lavu/fixed_dsp: R-V V vector_fmul_reverse
2023-10-09 19:52:28 +03:00
Rémi Denis-Courmont
bf911cc1bf
lavu/fixed_dsp: R-V V vector_fmul_add
...
vector_fmul_add_fixed_c: 2.2
vector_fmul_add_fixed_rvv_i64: 0.5
2023-10-09 19:52:28 +03:00
Rémi Denis-Courmont
eb73d178ea
lavu/fixed_dsp: R-V V scalarproduct
2023-10-07 17:45:39 +03:00
Rémi Denis-Courmont
3d79afbe70
lavu/fixed_dsp: unroll RISC-V V loop
...
Before:
butterflies_fixed_c: 804.7
butterflies_fixed_rvv_i32: 348.2
After:
butterflies_fixed_rvv_i32: 308.7
2023-07-17 18:48:42 +03:00
Rémi Denis-Courmont
3ba5579e55
riscv: remove unnecessary #include's
...
Pointed out by Andreas Rheinhardt.
2022-10-05 06:54:56 +02:00
Rémi Denis-Courmont
c1bb19e263
lavu/fixeddsp: RISC-V V butterflies_fixed
2022-09-27 13:19:52 +02:00