FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2025-11-23 21:54:53 +02:00

Files

Andreas Rheinhardt f4a87d8ca4 avcodec/x86/mpegvideoencdsp_init: Use xmm registers in SSSE3 functions

Improves performance and no longer breaks the ABI (by forgetting
to call emms).

Old benchmarks:
add_8x8basis_c:                                         43.6 ( 1.00x)
add_8x8basis_ssse3:                                     12.3 ( 3.55x)

New benchmarks:
add_8x8basis_c:                                         43.0 ( 1.00x)
add_8x8basis_ssse3:                                      6.3 ( 6.79x)

Notice that the output of try_8x8basis_ssse3 changes a bit:
Before this commit, it computes certain values and adds the values
for i,i+1,i+4 and i+5 before right shifting them; now it adds
the values for i,i+1,i+8,i+9. The second pair in these lists
could be avoided (by shifting xmm0 and xmm1 before adding both together
instead of only shifting xmm0 after adding them), but the former
i,i+1 is inherent in using pmaddwd. This is the reason that this
function is not bitexact.

Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>

2025-10-15 08:55:13 +02:00

h26x

…

hevc

…

vvc

avcodec/x86/vvc/sao_10bit: Remove unused functions

2025-09-26 06:21:26 +02:00

aacencdsp_init.c

…

aacencdsp.asm

…

aacpsdsp_init.c

…

aacpsdsp.asm

…

ac3dsp_downmix.asm

…

ac3dsp_init.c

…

ac3dsp.asm

…

alacdsp_init.c

…

alacdsp.asm

…

apv_dsp_init.c

…

apv_dsp.asm

avcodec/x86/apv_dsp: Don't export arrays unnecessarily

2025-09-24 01:21:32 +00:00

audiodsp_init.c

…

audiodsp.asm

…

blockdsp_init.c

…

blockdsp.asm

…

bswapdsp_init.c

…

bswapdsp.asm

…

cabac.h

…

cavs_qpel.asm

avcodec/x86/cavs_qpel: Add SSE2 vertical motion compensation

2025-10-08 20:40:08 +02:00

cavsdsp.c

avcodec/x86/fpel: Add blocksize x blocksize avg/put functions

2025-10-08 20:40:53 +02:00

cavsidct.asm

…

celt_pvq_init.c

…

celt_pvq_search.asm

…

cfhddsp_init.c

…

cfhddsp.asm

…

cfhdencdsp_init.c

…

cfhdencdsp.asm

…

constants.c

avcodec/x86/cavs_qpel: Add SSE2 vertical motion compensation

2025-10-08 20:40:08 +02:00

constants.h

avcodec/x86/cavs_qpel: Add SSE2 vertical motion compensation

2025-10-08 20:40:08 +02:00

dcadsp_init.c

avcodec/dcadsp: constify lfe_samples parameter

2025-10-04 14:18:30 -03:00

dcadsp.asm

…

dct32.asm

…

dirac_dwt_init.c

…

dirac_dwt.asm

…

diracdsp_init.c

…

diracdsp.asm

avcodec/x86/cavs_qpel: Add SSE2 vertical motion compensation

2025-10-08 20:40:08 +02:00

dnxhdenc_init.c

…

dnxhdenc.asm

…

exrdsp_init.c

…

exrdsp.asm

…

fdct.c

…

fdct.h

…

fdctdsp_init.c

…

flac_dsp_gpl.asm

…

flacdsp_init.c

…

flacdsp.asm

…

flacencdsp_init.c

…

fmtconvert_init.c

…

fmtconvert.asm

…

fpel.asm

avcodec/x86/fpel: Add blocksize x blocksize avg/put functions

2025-10-08 20:40:53 +02:00

fpel.h

avcodec/x86/fpel: Add blocksize x blocksize avg/put functions

2025-10-08 20:40:53 +02:00

g722dsp_init.c

…

g722dsp.asm

…

h263_loopfilter.asm

avcodec/x86/h263_loopfilter: Port loop filter to SSE2

2025-10-03 17:05:46 +00:00

h263dsp_init.c

avcodec/x86/h263_loopfilter: Port loop filter to SSE2

2025-10-03 17:05:46 +00:00

h264_cabac.c

…

h264_chromamc_10bit.asm

…

h264_chromamc.asm

…

h264_deblock_10bit.asm

…

h264_deblock.asm

…

h264_idct_10bit.asm

…

h264_idct.asm

…

h264_intrapred_10bit.asm

…

h264_intrapred_init.c

…

h264_intrapred.asm

…

h264_qpel_8bit.asm

avcodec/x86/h264_qpel: Split hv2_lowpass_sse2 into size 8,16 funcs

2025-10-07 18:06:40 +02:00

h264_qpel_10bit.asm

avcodec/x86/h264_qpel_10bit: Remove SSE2 "cache64" duplicates

2025-10-04 07:06:33 +02:00

h264_qpel.c

avcodec/x86/h264_qpel: Don't instantiate unused functions

2025-10-10 16:27:57 +02:00

h264_weight_10bit.asm

…

h264_weight.asm

…

h264chroma_init.c

…

h264dsp_init.c

…

hpeldsp_init.c

avcodec/x86/hpeldsp_init: Remove check for inline mmx

2025-10-14 12:31:15 +02:00

hpeldsp.asm

avcodec/x86/hpeldsp: Improve ff_{avg,put}_pixels8_xy2_ssse3()

2025-10-12 02:45:37 +02:00

hpeldsp.h

avcodec/x86/rv40dsp_init: Remove MMX(EXT) funcs overridden by SSSE3

2025-09-26 06:21:23 +02:00

huffyuvdsp_init.c

…

huffyuvdsp_template.asm

…

huffyuvdsp.asm

…

huffyuvencdsp_init.c

…

huffyuvencdsp.asm

…

idctdsp_init.c

…

idctdsp.asm

…

idctdsp.h

…

imdct36.asm

…

jpeg2000dsp_init.c

…

jpeg2000dsp.asm

…

lossless_audiodsp_init.c

…

lossless_audiodsp.asm

…

lossless_videodsp_init.c

…

lossless_videodsp.asm

…

lossless_videoencdsp_init.c

…

lossless_videoencdsp.asm

…

lpc_init.c

…

lpc.asm

…

Makefile

avcodec/x86/cavsdsp: Add SSE2 mc20 horizontal motion compensation

2025-10-08 20:40:08 +02:00

mathops.h

…

me_cmp_init.c

…

me_cmp.asm

…

mlpdsp_init.c

…

mlpdsp.asm

…

mpeg4videodsp.c

…

mpegaudiodsp.c

…

mpegvideo.c

…

mpegvideoenc_template.c

…

mpegvideoenc.c

…

mpegvideoencdsp_init.c

avcodec/x86/mpegvideoencdsp_init: Use xmm registers in SSSE3 functions

2025-10-15 08:55:13 +02:00

mpegvideoencdsp.asm

…

opusdsp_init.c

…

opusdsp.asm

…

pixblockdsp_init.c

…

pixblockdsp.asm

…

pngdsp_init.c

…

pngdsp.asm

…

proresdsp_init.c

…

proresdsp.asm

…

qpel.asm

avcodec/x86/qpel: Move ff_{put,avg}_pixels4_l2_mmxext to h264_qpel

2025-10-04 07:06:32 +02:00

qpeldsp_init.c

avcodec/x86/fpel: Add blocksize x blocksize avg/put functions

2025-10-08 20:40:53 +02:00

qpeldsp.asm

avcodec/x86/qpel{dsp,dsp_init}: Use ptrdiff_t for stride

2025-10-04 07:06:32 +02:00

rv34dsp_init.c

…

rv34dsp.asm

…

rv40dsp_init.c

avcodec/x86/rv40dsp_init: Remove MMX(EXT) funcs overridden by SSSE3

2025-09-26 06:21:23 +02:00

rv40dsp.asm

…

sbcdsp_init.c

…

sbcdsp.asm

…

sbrdsp_init.c

…

sbrdsp.asm

…

simple_idct10_template.asm

…

simple_idct10.asm

…

simple_idct.asm

…

simple_idct.h

…

snowdsp.c

…

svq1enc_init.c

…

svq1enc.asm

…

synth_filter_init.c

…

synth_filter.asm

…

takdsp_init.c

…

takdsp.asm

…

ttadsp_init.c

…

ttadsp.asm

…

ttaencdsp_init.c

…

ttaencdsp.asm

…

utvideodsp_init.c

…

utvideodsp.asm

…

v210-init.c

…

v210.asm

…

v210enc_init.c

…

v210enc.asm

…

vc1dsp_init.c

…

vc1dsp_loopfilter.asm

…

vc1dsp_mc.asm

…

vc1dsp_mmx.c

…

vc1dsp.h

…

videodsp_init.c

…

videodsp.asm

…

vorbisdsp_init.c

…

vorbisdsp.asm

…

vp3dsp_init.c

avcodec/vp3dsp: Remove unused flags parameter from ff_vp3dsp_init()

2025-10-13 18:59:24 +02:00

vp3dsp.asm

avcodec/x86/vp3dsp: Port loop filters to SSE2

2025-10-13 18:58:50 +02:00

vp6dsp_init.c

…

vp6dsp.asm

…

vp8dsp_init.c

…

vp8dsp_loopfilter.asm

…

vp8dsp.asm

…

vp9dsp_init_10bpp.c

…

vp9dsp_init_12bpp.c

…

vp9dsp_init_16bpp_template.c

…

vp9dsp_init_16bpp.c

…

vp9dsp_init.c

vp9: Remove 8bpc AVX asm for inverse transforms

2025-09-19 23:12:59 +00:00

vp9dsp_init.h

…

vp9intrapred_16bpp.asm

…

vp9intrapred.asm

…

vp9itxfm_16bpp_avx512.asm

…

vp9itxfm_16bpp.asm

…

vp9itxfm_avx2.asm

…

vp9itxfm_avx512.asm

…

vp9itxfm_template.asm

…

vp9itxfm.asm

vp9: Remove 8bpc AVX asm for inverse transforms

2025-09-19 23:12:59 +00:00

vp9lpf_16bpp.asm

…

vp9lpf.asm

…

vp9mc_16bpp.asm

…

vp9mc.asm

…

vpx_arith.h

…

w64xmmtest.c

…

xvididct_init.c

…

xvididct.asm

…

xvididct.h

…