Alexandra Hájková
|
ce080f47b8
|
hevc: Add NEON 32x32 IDCT
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-05-04 14:08:39 +02:00 |
|
Alexandra Hájková
|
118dd4a321
|
hevc: 16x16 NEON idct: Use the right element size for loads/stores
This doesn't change the actual behaviour of the code but improves
readability.
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-05-04 14:08:27 +02:00 |
|
Alexandra Hájková
|
edbf0fffb1
|
hevc: Add NEON add_residual for bitdepth 10
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-05-01 23:39:55 +03:00 |
|
Martin Storsjö
|
e1c2453a4f
|
arm: hevc_idct: Tune the add_res_8x8 and add_res_32x32 functions
Before: Cortex A7 A8 A9 A53
hevc_add_res_8x8_8_neon: 116.0 58.7 80.2 90.7
hevc_add_res_32x32_8_neon: 1230.0 737.5 1187.5 974.4
After:
hevc_add_res_8x8_8_neon: 97.7 57.0 73.7 80.0
hevc_add_res_32x32_8_neon: 1216.0 698.7 1127.5 827.1
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-04-28 12:02:14 +03:00 |
|
Seppo Tomperi
|
0d4d435137
|
hevc: Add NEON add_residual for bitdepth 8
Optimized by Alexandra Hájková.
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-04-27 23:05:27 +03:00 |
|
Alexandra Hájková
|
3d69dd65c6
|
hevc: Add support for bitdepth 10 for IDCT DC
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-04-25 22:48:45 +03:00 |
|
Seppo Tomperi
|
358adef030
|
hevc: Add NEON IDCT DC functions for bitdepth 8
Signed-off-by: Alexandra Hájková <alexandra@khirnov.net>
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-04-25 22:48:45 +03:00 |
|
Alexandra Hájková
|
89d9869d24
|
hevc: Add NEON 16x16 IDCT
The speedup vs C code is around 6-13x.
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-04-12 22:40:54 +03:00 |
|
Alexandra Hájková
|
0b9a237b23
|
hevc: Add NEON 4x4 and 8x8 IDCT
Optimized by Martin Storsjö <martin@martin.st>.
The speedup vs C code is around 3.2-4.4x.
Signed-off-by: Martin Storsjö <martin@martin.st>
|
2017-03-27 22:56:23 +03:00 |
|