FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-28 20:53:54 +02:00

Author	SHA1	Message	Date
Christophe Gisquet	996697e266	x86: float dsp: unroll SSE versions vector_fmul and vector_fmac_scalar are guaranteed that they can process in batch of 16 elements, but their SSE versions only does 8 at a time. Therefore, unroll them a bit. 299 to 261c for 256 elements in vector_fmac_scalar on Arrandale/Win64. Signed-off-by: Janne Grunau <janne-libav@jannau.net>	2014-02-20 14:18:05 +01:00
Christophe Gisquet	566b7a20fd	x86: float dsp: butterflies_float SSE 97c -> 49c Some codecs could benefit from more unrolling, but AAC doesn't.	2013-05-03 08:08:02 +02:00
Ronald S. Bultje	d56668bd80	floatdsp: move scalarproduct_float from dsputil to avfloatdsp. This makes the aac decoder and all voice codecs independent of dsputil.	2013-01-22 11:55:42 -08:00
Ronald S. Bultje	42d3246948	floatdsp: move vector_fmul_reverse from dsputil to avfloatdsp. Now, nellymoserenc and aacenc no longer depends on dsputil. Independent of this patch, wmaprodec also does not depend on dsputil, so I removed it from there also.	2013-01-22 11:55:42 -08:00
Ronald S. Bultje	55aa03b9f8	floatdsp: move vector_fmul_add from dsputil to avfloatdsp.	2013-01-22 11:55:42 -08:00
Justin Ruggles	1c012e6bfb	x86: float_dsp: fix loading of the len parameter on x86-32	2012-12-07 21:19:29 -05:00
Justin Ruggles	ecc8b02194	x86: float_dsp: fix compilation of ff_vector_dmul_scalar_avx() on x86-32 Signed-off-by: Janne Grunau <janne-libav@jannau.net>	2012-12-06 14:11:15 +01:00
Justin Ruggles	ac7eb4cb20	float_dsp: add vector_dmul_scalar() to multiply a vector of doubles Include x86-optimized versions for SSE2 and AVX.	2012-12-05 11:23:36 -05:00
Justin Ruggles	947f933687	x86: float_dsp: add SSE version of vector_fmul_scalar()	2012-11-26 11:30:19 -05:00
Diego Biurrun	2b479bcab0	build: Drop AVX assembly ifdefs An assembler able to cope with AVX instructions is now required.	2012-11-11 20:43:28 +01:00
Diego Biurrun	6860b4081d	x86: include x86inc.asm in x86util.asm This is necessary to allow refactoring some x86util macros with cpuflags.	2012-10-31 00:37:42 +01:00
Justin Ruggles	7327525997	x86: float_dsp: fix ff_vector_fmac_scalar_avx() on Win64 The SWAP macro does not work for explicit xmm/ymm usage, so instead just move the scalar value from xmm2 to xmm0.	2012-09-07 14:49:10 -04:00
Diego Biurrun	17337f54c0	x86: Split inline and external assembly #ifdefs	2012-08-31 01:53:25 +02:00
Mans Rullgard	a3df4781f4	x86: add colons after labels nasm prints a warning if the colon is missing. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-07 15:20:56 +01:00
Ronald S. Bultje	30b45d9c38	x86inc: automatically insert vzeroupper for YMM functions.	2012-07-26 13:43:16 -07:00
Justin Ruggles	82b2df9790	float_dsp: add x86-optimized functions for vector_fmac_scalar()	2012-06-18 18:01:14 -04:00
Justin Ruggles	d5a7229ba4	Add a float DSP framework to libavutil Move vector_fmul() from DSPContext to AVFloatDSPContext.	2012-06-08 13:14:38 -04:00

17 Commits