James Darnley
13d71c28cc
avcodec/h264: sse2 and avx 4:2:2 idct add8 10-bit functions
Yorkfield:
- sse2:
- complex: 4.13x faster (1514 vs. 367 cycles)
- simple: 4.38x faster (1836 vs. 419 cycles)
Skylake:
- sse2:
- complex: 3.61x faster ( 936 vs. 260 cycles)
- simple: 3.97x faster (1126 vs. 284 cycles)
- avx (versus sse2):
- complex: 1.07x faster (260 vs. 244 cycles)
- simple: 1.03x faster (284 vs. 274 cycles)
2016-11-30 22:58:28 +01:00
..
2016-10-18 21:41:18 +01:00
2016-10-18 21:41:18 +01:00
2016-06-27 17:21:18 +02:00
2016-07-05 17:48:20 -03:00
2016-07-20 13:43:38 -03:00
2016-07-20 13:43:38 -03:00
2016-11-30 22:58:27 +01:00
2016-06-27 17:21:18 +02:00
2016-11-30 22:58:28 +01:00
2016-11-30 22:58:27 +01:00
2016-07-29 11:01:36 +02:00
2016-09-23 16:40:57 +02:00
2016-11-30 22:58:28 +01:00
2016-06-27 17:21:18 +02:00
2016-10-18 21:41:18 +01:00
2016-06-27 17:21:18 +02:00
2016-06-27 17:21:18 +02:00
2016-06-27 17:21:18 +02:00
2016-06-27 17:21:18 +02:00
2016-06-27 17:21:18 +02:00
2016-08-06 18:27:01 -03:00
2016-08-06 18:27:01 -03:00
2016-08-02 15:48:04 -03:00
2016-08-02 15:48:04 -03:00
2016-06-27 17:21:18 +02:00
2016-11-15 11:01:36 -05:00
2016-11-18 17:01:11 -03:00
2016-10-21 23:58:47 +02:00
2016-07-26 15:59:07 -04:00
2016-11-13 17:30:33 +01:00