FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-26 19:01:44 +02:00

Author	SHA1	Message	Date
Michael Niedermayer	61ad2b42a1	Merge commit 'd961a79eb07a8911540a0bd356d68ae0cf93c6a1' * commit 'd961a79eb07a8911540a0bd356d68ae0cf93c6a1': sbrdsp: move #if to disable all educational code Merged-by: Michael Niedermayer <michaelni@gmx.at>	2014-03-18 19:59:00 +01:00
Janne Grunau	d961a79eb0	sbrdsp: move #if to disable all educational code Avoids a warning of the unused function 'autocorrelate'.	2014-03-18 13:08:00 +01:00
Thilo Borgmann	d814a839ac	Reinstate proper FFmpeg license for all files.	2013-08-30 15:47:38 +00:00
Michael Niedermayer	711c8ee71d	Merge commit '4a7af92cc80ced8498626401ed21f25ffe6740c8' * commit '4a7af92cc80ced8498626401ed21f25ffe6740c8': sbrdsp: Unroll and use integer operations sbrdsp: Unroll sbr_autocorrelate_c x86: sbrdsp: Implement SSE2 qmf_deint_bfly Conflicts: libavcodec/sbrdsp.c libavcodec/x86/sbrdsp.asm libavcodec/x86/sbrdsp_init.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-05-04 10:07:43 +02:00
Christophe Gisquet	4a7af92cc8	sbrdsp: Unroll and use integer operations This patch can be controversial, by assuming floats are IEEE-754 and particular behaviour of the FPU will get in the way. Timing on Arrandale and Win32 (thus, x87 FPU is used in the reference). sbr_qmf_pre_shuffle_c: 115 to 76 sbr_neg_odd_64_c: 84 to 55 sbr_qmf_post_shuffle_c: 112 to 83 Signed-off-by: Diego Biurrun <diego@biurrun.de>	2013-05-03 18:23:30 +02:00
Christophe Gisquet	8394d9a676	sbrdsp: Unroll sbr_autocorrelate_c 1410 cycles to 1148 on Arrandale/Win64 Signed-off-by: Diego Biurrun <diego@biurrun.de>	2013-05-03 18:23:30 +02:00
Christophe Gisquet	11774169ae	sbrdsp: unroll and use integer operations This patch can be controversial, by assuming floats are IEEE-754 and particular behaviour of the FPU will get in the way. Timing on Arrandale and Win32 (thus, x87 FPU is used in the reference). sbr_qmf_pre_shuffle_c: 115 to 76 sbr_neg_odd_64_c: 84 to 55 sbr_qmf_post_shuffle_c: 112 to 83 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-04-06 15:52:42 +02:00
Christophe Gisquet	f4ac80227b	sbrdsp: unroll sbr_autocorrelate_c 1410 cycles to 1148 on Arrandale/Win64. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-04-06 15:52:42 +02:00
Mirjana Vulin	8d2eb5fe58	mips: optimization for float aac decoder (sbr module) Signed-off-by: Mirjana Vulin <mvulin@mips.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-02-21 22:43:08 +01:00
Janne Grunau	f101eab1be	x86: call most of the x86 dsp init functions under if (ARCH_X86) Rename the called dsp init functions to *_init_x86.	2012-10-08 11:54:05 +02:00
Christophe GISQUET	dabf8dd34a	SBR DSP: unroll sum_square The length is even, so some unrolling can be performed. Timings are for x86: - 32bits: 102c -> 82c - 64bits: 82c -> 69c Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2012-03-07 10:29:52 -08:00
Christophe GISQUET	34454c761f	SBR DSP x86: implement SSE sbr_sum_square_sse The 32bits targets have been compiled with -mfpmath=sse for proper reference. sbr_sum_square C /32bits: 82c (unrolled)/102c C /64bits: 69c (unrolled)/82c SSE/32bits: 42c SSE/64bits: 31c Use of SSE4.1 dpps to perform the final sum is slower. Not unrolling to perform 8 operations in a loop yields 10 more cycles. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2012-02-23 15:50:06 -08:00
Christophe GISQUET	2e74a5abc2	SBR DSP: use intptr_t for the ixh parameter. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2012-02-23 15:48:40 -08:00
Mans Rullgard	be822d77b6	aacsbr: ARM NEON optimised sbrdsp functions Overall speedup of HE-AAC decoding 2.3x on Cortex-A8, 1.2x on A9. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-01-28 14:56:18 +00:00
Mans Rullgard	aac46e088d	aacsbr: move some simdable loops to function pointers This prepares for assembly optimisations by moving the most time-consuming loops to functions called through pointers in a new context. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-01-28 14:56:18 +00:00

15 Commits