1
0
mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-26 19:01:44 +02:00
FFmpeg/libavcodec/x86
Clément Bœsch 87434cf373 avcodec/vp9: add ff_vp9_idct_idct_{4x4,8x8}_ssse3().
1789 decicycles in idct_idct_4x4_add_c, 262136 runs, 8 skips
1839 decicycles in idct_idct_4x4_add_c, 524270 runs, 18 skips
1864 decicycles in idct_idct_4x4_add_c, 1048548 runs, 28 skips

529 decicycles in ff_vp9_idct_idct_4x4_add_ssse3, 262138 runs, 6 skips
516 decicycles in ff_vp9_idct_idct_4x4_add_ssse3, 524282 runs, 6 skips
474 decicycles in ff_vp9_idct_idct_4x4_add_ssse3, 1048565 runs, 11 skips

(~3.9x faster)

7726 decicycles in idct_idct_8x8_add_c, 1048433 runs, 143 skips
7732 decicycles in idct_idct_8x8_add_c, 2096882 runs, 270 skips
7731 decicycles in idct_idct_8x8_add_c, 4193772 runs, 532 skips

1145 decicycles in ff_vp9_idct_idct_8x8_add_ssse3, 1048549 runs, 27 skips
1137 decicycles in ff_vp9_idct_idct_8x8_add_ssse3, 2097097 runs, 55 skips
1086 decicycles in ff_vp9_idct_idct_8x8_add_ssse3, 4194188 runs, 116 skips

(~7.1x faster)

Overall decode time before commit:
  16.48s user 0.03s system 99% cpu 16.526 total
  16.54s user 0.01s system 99% cpu 16.566 total
  16.46s user 0.03s system 99% cpu 16.511 total

Overall decode time after commit:
  16.34s user 0.02s system 99% cpu 16.378 total
  16.28s user 0.02s system 99% cpu 16.315 total
  16.32s user 0.03s system 99% cpu 16.366 total

Tested on i7 920 with 40s 1080p footage.
2013-11-05 19:25:40 +01:00
..
ac3dsp_init.c
ac3dsp.asm
cabac.h avcodec/cabac: support UNCHECKED_BITSTREAM_READER = 0 2013-10-31 11:13:27 +01:00
cavsdsp.c
constants.c
constants.h
dct32.asm
dct_init.c
deinterlace.asm
dirac_dwt.c
dirac_dwt.h
diracdsp_mmx.c
diracdsp_mmx.h
diracdsp_yasm.asm
dnxhdenc.c
dsputil_init.c avcodec/x86/dsputil_init: move ff_idct_xvid_mmxext init 2013-10-15 02:06:12 +02:00
dsputil_mmx.c Merge commit '2ddb35b91131115c094d90e04031451023441b4d' 2013-10-06 11:50:01 +02:00
dsputil_qns_template.c
dsputil_x86.c Merge commit '2ddb35b91131115c094d90e04031451023441b4d' 2013-10-06 11:50:01 +02:00
dsputil_x86.h
dsputil.asm
dsputilenc_mmx.c Merge commit '258414d0771845d20f646ffe4d4e60f22fba217c' 2013-10-06 11:31:01 +02:00
dsputilenc.asm
dwt_yasm.asm
fdct.c Merge remote-tracking branch 'qatar/master' 2013-10-07 11:37:47 +02:00
fft_init.c
fft.asm x86: Fix compilation with nasm on PPC & OS/2 2013-10-08 12:36:19 +02:00
fft.h
fmtconvert_init.c
fmtconvert.asm
fpel_mmx.c
fpel.asm
h263_loopfilter.asm
h264_chromamc_10bit.asm
h264_chromamc.asm
h264_deblock_10bit.asm
h264_deblock.asm Merge commit 'bbe4a6db44f0b55b424a5cc9d3e89cd88e250450' 2013-10-08 11:23:00 +02:00
h264_i386.h
h264_idct_10bit.asm
h264_idct.asm Merge remote-tracking branch 'qatar/master' 2013-10-08 12:10:02 +02:00
h264_intrapred_10bit.asm
h264_intrapred_init.c
h264_intrapred.asm
h264_qpel_8bit.asm
h264_qpel_10bit.asm
h264_qpel.c
h264_weight_10bit.asm
h264_weight.asm
h264chroma_init.c
h264dsp_init.c
hpeldsp_init.c
hpeldsp_mmx.c
hpeldsp_rnd_template.c
hpeldsp.asm x86: hpeldsp: Use PAVGB instruction macro where necessary 2013-11-04 01:29:23 +01:00
idct_mmx_xvid.c Merge commit '0b8b2ae5e93d616c2ece59f7175f483154cff918' 2013-10-06 11:25:22 +02:00
idct_sse2_xvid.c Merge commit '0b8b2ae5e93d616c2ece59f7175f483154cff918' 2013-10-06 11:25:22 +02:00
idct_xvid.h
imdct36.asm
lpc.c
Makefile Merge commit '1700b4e678ed329611a16b20d11e64b7abda4839' 2013-11-02 10:13:14 +01:00
mathops.h
mlpdsp.c
motion_est.c
mpeg4qpel.asm
mpegaudiodsp.c
mpegvideo.c
mpegvideoenc_template.c
mpegvideoenc.c
pngdsp_init.c
pngdsp.asm
proresdsp_init.c
proresdsp.asm
qpel.asm
rnd_mmx.c
rnd_template.c
rv34dsp_init.c
rv34dsp.asm
rv40dsp_init.c
rv40dsp.asm Merge commit 'e2b5b097898c9155f4bdff4d83cdc54d5eef6930' 2013-11-05 10:26:07 +01:00
sbrdsp_init.c
sbrdsp.asm
simple_idct.c
snowdsp.c
v210-init.c
v210.asm
vc1dsp_init.c
vc1dsp_mmx.c
vc1dsp.asm
vc1dsp.h
videodsp_init.c avcodec/x86/videodsp: Properly mark sse2 instructions in emulated_edge_mc x86 simd as such. 2013-10-24 13:36:55 +02:00
videodsp.asm avcodec/x86/videodsp: Small speedups in ff_emulated_edge_mc x86 SIMD. 2013-10-27 15:02:48 +01:00
vorbisdsp_init.c
vorbisdsp.asm
vp3dsp_init.c
vp3dsp.asm
vp6dsp_init.c
vp6dsp.asm
vp8dsp_init.c
vp8dsp_loopfilter.asm Merge commit '1700b4e678ed329611a16b20d11e64b7abda4839' 2013-11-02 10:13:14 +01:00
vp8dsp.asm Merge commit '1700b4e678ed329611a16b20d11e64b7abda4839' 2013-11-02 10:13:14 +01:00
vp9dsp_init.c avcodec/vp9: add ff_vp9_idct_idct_{4x4,8x8}_ssse3(). 2013-11-05 19:25:40 +01:00
vp9dsp.asm avcodec/vp9: add ff_vp9_idct_idct_{4x4,8x8}_ssse3(). 2013-11-05 19:25:40 +01:00
vp56_arith.h
w64xmmtest.c