Reimar Döffinger
eafa1c90e5
Replace complicated and currently broken manual alignment code by
DECLARE_ALIGNED_16. Fixes crash in ff_snow_horizontal_compose97i_sse2
Originally committed as revision 10261 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-30 11:41:58 +00:00
Michael Niedermayer
267b9479ac
typo
Originally committed as revision 10250 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-27 10:39:31 +00:00
Ramiro Polla
7bcc1d5b66
CONFIG_7REGS has been renamed to HAVE_7REGS
Originally committed as revision 10237 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 16:10:53 +00:00
Michael Niedermayer
90e9e94d0f
workaround gcc bug, untested as my gcc is not complaining
Originally committed as revision 10236 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 12:34:22 +00:00
Michael Niedermayer
cefa599953
optimize the first vertical lifting step, this also prevents another
overflow, the last known possible overflow
Originally committed as revision 10234 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 11:16:23 +00:00
Michael Niedermayer
c90762766a
optimize 1st horizontal lifting step
Originally committed as revision 10231 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 08:31:55 +00:00
Michael Niedermayer
1104bf2b20
typo
Originally committed as revision 10230 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 08:03:23 +00:00
Michael Niedermayer
8b5029295d
get rid of totally senseless "m" + read in register; we have enough
registers to keep everything in registers
Originally committed as revision 10229 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 06:51:51 +00:00
Michael Niedermayer
bc1e78d896
simplify senselessly complex addressing
Originally committed as revision 10228 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 02:02:14 +00:00
Michael Niedermayer
25bb359f73
cosmetics
remove brain amputated mmx wrappers around sse2 macros
fix name of ..._sub macro to match ..._add naming
Originally committed as revision 10227 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 01:20:09 +00:00
Michael Niedermayer
629750290f
avoid overflow in the 3rd lifting step, this now needs mmx2 at minimum
(patch for plain mmx support is welcome ...)
Originally committed as revision 10226 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-26 01:11:02 +00:00
Michael Niedermayer
b696a4c91d
avoid an overflow in the 1st horizontal lifting step
Originally committed as revision 10225 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 19:04:33 +00:00
Michael Niedermayer
9caa1cccb9
prevent one overflow in the first vertical lifting step
Originally committed as revision 10224 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 16:28:45 +00:00
Michael Niedermayer
3e0f7126b5
update mmx code to latest snow changes
note, the code likely can overflow and thus needs some more changes
sse2 updated too but disabled as it is untested
Originally committed as revision 10223 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-25 15:20:56 +00:00
Michael Niedermayer
ce611a27be
Change rounding of the horizontal DWT to match the vertical one.
This allows some simplifications and optimizations and should
not have any effect on quality.
Originally committed as revision 10172 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 16:29:40 +00:00
Michael Niedermayer
30cd3e66bd
remove code which became unused by the previous changes
Originally committed as revision 10166 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:05:30 +00:00
Michael Niedermayer
72dee89b5d
Simplify and optimize the 4th vertical lifting step of the SSE2 code (untested)
This also reduces the needed headroom in that step by 1 bit
Originally committed as revision 10165 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:03:18 +00:00
Michael Niedermayer
d0dae46a90
Simplify and optimize the 4th vertical lifting step of the MMX code
This also reduces the needed headroom in that step by 1 bit
Originally committed as revision 10164 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-21 00:02:19 +00:00
Michael Niedermayer
1ffbbef217
Simplify and speedup code, reduce needed headroom by 2 bits in the 3rd
vertical lifting step of the SSE2 code (untested)
Originally committed as revision 10163 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:59:20 +00:00
Michael Niedermayer
4bf1790421
simplify, speedup and reduce needed headroom by 2 bits in the 3rd
vertical lifting step
Originally committed as revision 10162 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:54:49 +00:00
Michael Niedermayer
dd30437bbe
replace <<1 by add for SSE2 (untested)
Originally committed as revision 10161 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:11:05 +00:00
Michael Niedermayer
7e665a3943
replace <<1 by add
Originally committed as revision 10160 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:09:39 +00:00
Michael Niedermayer
eee649d3b5
slightly change horizontal lift3 so it needs 1 bit less headroom
Originally committed as revision 10159 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 23:02:59 +00:00
Michael Niedermayer
be3b22f9ca
remove idiotic double subtraction from the sse2 code (untested, no sse2 here)
Originally committed as revision 10158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:41:47 +00:00
Michael Niedermayer
8510c519e7
fixme note
Originally committed as revision 10157 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:34:42 +00:00
Michael Niedermayer
3a9f44d5d5
and of course the unneeded double subtractions were blindly put in the
mmx code
this also makes the affected code 4% faster
Originally committed as revision 10156 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-08-20 22:29:21 +00:00
Ronald S. Bultje
b550bfaa61
Add libavcodec to compiler include flags in order to simplify header
include paths in the source files.
mostly from a patch by Ronald S. Bultje, rbultje ronald.bitfreak net
Originally committed as revision 9034 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-05-16 09:51:45 +00:00
Panagiotis Issaris
9dd6c80453
Add the const specifier as needed to reduce the number of warnings.
Originally committed as revision 7764 to svn://svn.ffmpeg.org/ffmpeg/trunk
2007-01-30 10:31:34 +00:00
Diego Biurrun
b78e7197a8
Change license headers to say 'FFmpeg' instead of 'this program/this library'
and fix GPL/LGPL version mismatches.
Originally committed as revision 6577 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-10-07 15:30:46 +00:00
Luca Barbato
ffad4ed154
Fix x86 SIMD asm and pic, patch from Martin von Gagern <Martin.vGagern@gmx.net>
Originally committed as revision 5973 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-08-10 16:05:29 +00:00
Luca Abeni
9c39071d6d
Move REG_* macros from libavcodec/i386/mmx.h to libavutil/x86_cpu.h
Originally committed as revision 5595 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-07-03 10:52:07 +00:00
Wolfram Gloger
f42635f558
gcc-2.95 compile fix, patch by Wolfram Gloger <wmglo A dent PIS med PIS uni-muenchen PIS de>
Originally committed as revision 5298 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-04-18 03:48:30 +00:00
Luca Barbato
22b48b85b6
altivec support for snow
Originally committed as revision 5228 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-03-27 12:51:19 +00:00
Robert Edele
e8600e5edc
add MMX and SSE versions of ff_snow_inner_add_yblock
Patch by Robert Edele < yartrebo AH earthlink POIS net >
Original Thread:
Date: Mar 22, 2006 3:24 AM
Subject: [Ffmpeg-devel] [PATCH] snow mmx + sse2 part 5
Originally committed as revision 5197 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-03-22 12:08:35 +00:00
Robert Edele
2c9a0285d4
snow mmx+sse2 optimizations, part 4
Patch by Robert Edele, yartrebo <<at>> earthlink <<dot>> net
Originally committed as revision 5191 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-03-21 21:51:07 +00:00
Robert Edele
4567b4bdab
Add the mmx and sse2 implementations of ff_snow_vertical_compose().
Patch by Robert Edele < yartrebo AH earthlink POIS net >
Original thread:
Date: Mar 20, 2006 5:54 PM
Subject: [Ffmpeg-devel] [PATCH] snow mmx + sse2 part 3
Originally committed as revision 5185 to svn://svn.ffmpeg.org/ffmpeg/trunk
2006-03-20 22:27:59 +00:00