1
0
mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-21 10:55:51 +02:00
FFmpeg/libavcodec/x86
Christophe GISQUET 34454c761f SBR DSP x86: implement SSE sbr_sum_square_sse
The 32bits targets have been compiled with -mfpmath=sse for proper reference.
sbr_sum_square C  /32bits: 82c (unrolled)/102c
               C  /64bits: 69c (unrolled)/82c
               SSE/32bits: 42c
               SSE/64bits: 31c

Use of SSE4.1 dpps to perform the final sum is slower.
Not unrolling to perform 8 operations in a loop yields 10 more cycles.

Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
2012-02-23 15:50:06 -08:00
..
ac3dsp_mmx.c
ac3dsp.asm ac3dsp: do not use pshufb in ac3_extract_exponents_ssse3() 2012-02-09 21:04:44 -05:00
cabac.h
cavsdsp_mmx.c
dct32_sse.asm
deinterlace.asm
dnxhd_mmx.c
dsputil_mmx_avg_template.c
dsputil_mmx_qns_template.c
dsputil_mmx_rnd_template.c
dsputil_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputil_mmx.h dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputil_yasm.asm
dsputilenc_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
dsputilenc_yasm.asm
fdct_mmx.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft_3dn2.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft_3dn.c
fft_mmx.asm
fft_sse.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
fft.h
fmtconvert_mmx.c
fmtconvert.asm
h264_chromamc_10bit.asm
h264_chromamc.asm
h264_deblock_10bit.asm
h264_deblock.asm h264: don't use redzone in loopfilter on win64. 2012-02-19 15:31:03 -08:00
h264_i386.h
h264_idct_10bit.asm
h264_idct.asm h264: manually save/restore XMM registers for functions using INIT_MMX. 2012-02-08 10:31:14 -08:00
h264_intrapred_10bit.asm
h264_intrapred_init.c
h264_intrapred.asm h264: manually save/restore XMM registers for functions using INIT_MMX. 2012-02-08 10:31:14 -08:00
h264_qpel_10bit.asm
h264_qpel_mmx.c
h264_weight_10bit.asm
h264_weight.asm
h264dsp_mmx.c
idct_mmx_xvid.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
idct_mmx.c cosmetics: Delete empty lines at end of file. 2012-02-09 12:26:45 +01:00
idct_sse2_xvid.c
idct_xvid.h
imdct36_sse.asm
lpc_mmx.c
Makefile SBR DSP x86: implement SSE sbr_sum_square_sse 2012-02-23 15:50:06 -08:00
mathops.h
mlpdsp.c
motion_est_mmx.c dsputil: Add ff_ prefix to the dsputil*_init* functions 2012-02-15 22:06:34 +02:00
mpegaudiodec_mmx.c mpegaudio: replace memcpy by SIMD code 2012-02-15 20:11:54 -08:00
mpegvideo_mmx_template.c dsputil: Add ff_ prefix to inv_zigzag_direct16 2012-02-15 22:06:42 +02:00
mpegvideo_mmx.c mpegvideo: Add ff_ prefix to nonstatic functions 2012-02-15 22:07:23 +02:00
pngdsp-init.c
pngdsp.asm
proresdsp-init.c
proresdsp.asm
rv34dsp_init.c rv34: change most "int stride" into "ptrdiff_t stride". 2012-02-20 14:58:25 -08:00
rv34dsp.asm
rv40dsp_init.c rv34: change most "int stride" into "ptrdiff_t stride". 2012-02-20 14:58:25 -08:00
rv40dsp.asm
sbrdsp_init.c SBR DSP x86: implement SSE sbr_sum_square_sse 2012-02-23 15:50:06 -08:00
sbrdsp.asm SBR DSP x86: implement SSE sbr_sum_square_sse 2012-02-23 15:50:06 -08:00
simple_idct_mmx.c
snowdsp_mmx.c
vc1dsp_mmx.c
vc1dsp_yasm.asm
vp3dsp.asm
vp8dsp-init.c
vp8dsp.asm
vp56_arith.h
vp56dsp_init.c
vp56dsp.asm
w64xmmtest.c