Christophe Gisquet
08e3ea60ff
x86: synth filter float: implement SSE2 version
...
Timings for Arrandale:
           C    SSE
win32:  2108    334
win64:  1152    322
Factorizing the inner loop with a call/jmp costs more than 15 cycles, even
with the jmp destination aligned.
Unrolling for ARCH_X86_64 gains 20 cycles.
Signed-off-by: Janne Grunau <janne-libav@jannau.net>
2014-02-28 13:00:48 +01:00
Mans Rullgard
2912e87a6c
Replace FFmpeg with Libav in licence headers
...
Signed-off-by: Mans Rullgard <mans@mansr.com>
2011-03-19 13:33:20 +00:00
Justin Ruggles
80ba1ddb58
Remove unneeded add bias from 3 functions.
...
DSPContext.vector_fmul_window()
DCADSPContext.lfe_fir()
SynthFilterContext.synth_filter_float()
Signed-off-by: Mans Rullgard <mans@mansr.com>
2011-01-31 20:28:42 +00:00
Måns Rullgård
e73d1a5efc
ARM: NEON optimised synth_filter_float
...
2.7x faster DCA decoding on Cortex-A8
Originally committed as revision 22828 to svn://svn.ffmpeg.org/ffmpeg/trunk
2010-04-10 16:27:56 +00:00
Måns Rullgård
f462ed1f82
Make synth_filter a function pointer
...
Originally committed as revision 22827 to svn://svn.ffmpeg.org/ffmpeg/trunk
2010-04-10 16:27:53 +00:00
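For readers unfamiliar with the pattern behind "Make synth_filter a function pointer", here is a minimal sketch of function-pointer dispatch in C. It is not the code from that commit: the struct name `SynthCtx`, the simplified four-argument signature, the `have_simd` flag, and both filter bodies are assumptions made for illustration (the real synth_filter_float() takes more arguments and performs an actual synthesis filter). It only shows why a pointer in a context struct lets a later commit plug in a NEON or SSE2 version without touching the callers.

```c
#include <stddef.h>

/* Hypothetical, simplified signature; not the real synth_filter_float(). */
typedef void (*synth_fn)(const float *in, float *out, size_t n, float scale);

/* Hypothetical context struct holding the selected implementation. */
typedef struct SynthCtx {
    synth_fn synth_filter_float;
} SynthCtx;

/* Portable reference version: scale every sample. */
static void synth_c(const float *in, float *out, size_t n, float scale)
{
    for (size_t i = 0; i < n; i++)
        out[i] = in[i] * scale;
}

/* Stand-in for an architecture-specific version: same result,
 * processed four samples per iteration with a scalar tail. */
static void synth_unrolled(const float *in, float *out, size_t n, float scale)
{
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        out[i]     = in[i]     * scale;
        out[i + 1] = in[i + 1] * scale;
        out[i + 2] = in[i + 2] * scale;
        out[i + 3] = in[i + 3] * scale;
    }
    for (; i < n; i++)
        out[i] = in[i] * scale;
}

/* Pick the implementation once, at init time. `have_simd` is a
 * hypothetical flag standing in for a runtime CPU feature check. */
static void synth_ctx_init(SynthCtx *c, int have_simd)
{
    c->synth_filter_float = synth_c;
    if (have_simd)
        c->synth_filter_float = synth_unrolled;
}
```

Callers then invoke `c->synth_filter_float(in, out, n, scale)` and never name a specific implementation, so adding an optimised variant only changes the init function.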
Måns Rullgård
9c9a0840d0
Add lots of missing includes
...
Originally committed as revision 22337 to svn://svn.ffmpeg.org/ffmpeg/trunk
2010-03-08 18:43:52 +00:00
Michael Niedermayer
f92e769666
Multiple inclusion guards for synth_filter.h.
...
Originally committed as revision 20415 to svn://svn.ffmpeg.org/ffmpeg/trunk
2009-10-29 10:46:29 +00:00
Michael Niedermayer
d886e4df10
Vertically align function arguments.
...
Originally committed as revision 20413 to svn://svn.ffmpeg.org/ffmpeg/trunk
2009-10-29 10:43:56 +00:00
Michael Niedermayer
4f99c31c39
Split synth filter out of dca.c.
...
Originally committed as revision 20396 to svn://svn.ffmpeg.org/ffmpeg/trunk
2009-10-28 10:51:51 +00:00