FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-02-04 06:08:26 +02:00

History

Christophe GISQUET 34454c761f SBR DSP x86: implement SSE sbr_sum_square_sse

The 32bits targets have been compiled with -mfpmath=sse for proper reference.
sbr_sum_square C  /32bits: 82c (unrolled)/102c
               C  /64bits: 69c (unrolled)/82c
               SSE/32bits: 42c
               SSE/64bits: 31c

Use of SSE4.1 dpps to perform the final sum is slower.
Not unrolling to perform 8 operations in a loop yields 10 more cycles.

Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>

2012-02-23 15:50:06 -08:00

ac3dsp_mmx.c

…

ac3dsp.asm

ac3dsp: do not use pshufb in ac3_extract_exponents_ssse3()

2012-02-09 21:04:44 -05:00

cabac.h

…

cavsdsp_mmx.c

…

dct32_sse.asm

…

deinterlace.asm

…

dnxhd_mmx.c

…

dsputil_mmx_avg_template.c

…

dsputil_mmx_qns_template.c

…

dsputil_mmx_rnd_template.c

…

dsputil_mmx.c

dsputil: Add ff_ prefix to the dsputil*_init* functions

2012-02-15 22:06:34 +02:00

dsputil_mmx.h

dsputil: Add ff_ prefix to the dsputil*_init* functions

2012-02-15 22:06:34 +02:00

dsputil_yasm.asm

…

dsputilenc_mmx.c

dsputil: Add ff_ prefix to the dsputil*_init* functions

2012-02-15 22:06:34 +02:00

dsputilenc_yasm.asm

…

fdct_mmx.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

fft_3dn2.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

fft_3dn.c

…

fft_mmx.asm

…

fft_sse.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

fft.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

fft.h

…

fmtconvert_mmx.c

…

fmtconvert.asm

…

h264_chromamc_10bit.asm

…

h264_chromamc.asm

…

h264_deblock_10bit.asm

…

h264_deblock.asm

h264: don't use redzone in loopfilter on win64.

2012-02-19 15:31:03 -08:00

h264_i386.h

…

h264_idct_10bit.asm

…

h264_idct.asm

h264: manually save/restore XMM registers for functions using INIT_MMX.

2012-02-08 10:31:14 -08:00

h264_intrapred_10bit.asm

…

h264_intrapred_init.c

…

h264_intrapred.asm

h264: manually save/restore XMM registers for functions using INIT_MMX.

2012-02-08 10:31:14 -08:00

h264_qpel_10bit.asm

…

h264_qpel_mmx.c

…

h264_weight_10bit.asm

…

h264_weight.asm

…

h264dsp_mmx.c

…

idct_mmx_xvid.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

idct_mmx.c

cosmetics: Delete empty lines at end of file.

2012-02-09 12:26:45 +01:00

idct_sse2_xvid.c

…

idct_xvid.h

…

imdct36_sse.asm

…

lpc_mmx.c

…

Makefile

SBR DSP x86: implement SSE sbr_sum_square_sse

2012-02-23 15:50:06 -08:00

mathops.h

…

mlpdsp.c

…

motion_est_mmx.c

dsputil: Add ff_ prefix to the dsputil*_init* functions

2012-02-15 22:06:34 +02:00

mpegaudiodec_mmx.c

mpegaudio: replace memcpy by SIMD code

2012-02-15 20:11:54 -08:00

mpegvideo_mmx_template.c

dsputil: Add ff_ prefix to inv_zigzag_direct16

2012-02-15 22:06:42 +02:00

mpegvideo_mmx.c

mpegvideo: Add ff_ prefix to nonstatic functions

2012-02-15 22:07:23 +02:00

pngdsp-init.c

…

pngdsp.asm

pngdsp: swap argument inversion.

2012-02-07 14:32:26 -08:00

proresdsp-init.c

…

proresdsp.asm

…

rv34dsp_init.c

rv34: change most "int stride" into "ptrdiff_t stride".

2012-02-20 14:58:25 -08:00

rv34dsp.asm

…

rv40dsp_init.c

rv34: change most "int stride" into "ptrdiff_t stride".

2012-02-20 14:58:25 -08:00

rv40dsp.asm

…

sbrdsp_init.c

SBR DSP x86: implement SSE sbr_sum_square_sse

2012-02-23 15:50:06 -08:00

sbrdsp.asm

SBR DSP x86: implement SSE sbr_sum_square_sse

2012-02-23 15:50:06 -08:00

simple_idct_mmx.c

…

snowdsp_mmx.c

…

vc1dsp_mmx.c

…

vc1dsp_yasm.asm

…

vp3dsp.asm

…

vp8dsp-init.c

…

vp8dsp.asm

…

vp56_arith.h

…

vp56dsp_init.c

…

vp56dsp.asm

…

w64xmmtest.c

…