Martin Storsjö
|
e0604d508e
|
swscale: aarch64: Add a NEON implementation of interleaveBytes
This allows speeding up format conversions from yuv420 to nv12.
                                 Cortex A53      A72      A73
interleave_bytes_c:                 86077.5  51433.0  66972.0
interleave_bytes_neon:              19701.7  23019.2  15859.2
interleave_bytes_aligned_c:         86603.0  52017.2  67484.2
interleave_bytes_aligned_neon:       9061.0   7623.0   6309.0
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2020-05-15 23:38:17 +03:00 |
|
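For context on the interleaveBytes commit above: yuv420 stores U and V in separate planes, while nv12 stores them in a single plane as alternating U,V bytes, so the conversion reduces to a byte interleave of the two chroma planes. The following is a minimal scalar sketch of that operation, not the libswscale code; the function name `interleave_bytes_ref` and its signature are assumptions made for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical scalar reference (not the libswscale implementation):
 * interleave two byte planes into one, row by row.
 * dst receives src1[0], src2[0], src1[1], src2[1], ...
 * width is the number of bytes per row in each source plane;
 * strides are in bytes and may exceed the row width.
 */
static void interleave_bytes_ref(const uint8_t *src1, const uint8_t *src2,
                                 uint8_t *dst, int width, int height,
                                 ptrdiff_t src1_stride, ptrdiff_t src2_stride,
                                 ptrdiff_t dst_stride)
{
    for (int y = 0; y < height; y++) {
        for (int x = 0; x < width; x++) {
            dst[2 * x]     = src1[x];  /* e.g. U sample */
            dst[2 * x + 1] = src2[x];  /* e.g. V sample */
        }
        src1 += src1_stride;
        src2 += src2_stride;
        dst  += dst_stride;
    }
}
```

A SIMD version of this loop would typically load a vector from each source and issue one interleaving store per iteration, which is consistent with the larger gains reported for the aligned case.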
Clément Bœsch
|
c921f4f687
|
sws/aarch64: add ff_yuv2planeX_8_neon
|
2016-04-11 16:27:19 +02:00 |
|
Clément Bœsch
|
263eb76bdf
|
sws/aarch64: add ff_hscale_8_to_15_neon
./ffmpeg -nostats -f lavfi -i testsrc2=4k:d=2 -vf bench=start,scale=1024x1024,bench=stop -f null -
before: t:0.489726 avg:0.489883 max:0.491852 min:0.489482
after: t:0.256515 avg:0.256458 max:0.256999 min:0.253755
|
2016-03-31 10:12:55 +02:00 |
|
Clément Bœsch
|
f1148390d7
|
sws/aarch64: add {nv12,nv21,yuv420p,yuv422p}_to_{argb,rgba,abgr,bgra}_neon
|
2016-03-01 17:53:33 +01:00 |
|