Guillaume Poirier
f6d1338cb5
Add the rest of missing Reg_* macros to support both AMD-64 style regs and IA32 regs.
...
Not used yet, but should be once the SIMD code to accelerate Snow decoding is merged.
Originally committed as revision 4731 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-12-10 22:53:44 +00:00
Loren Merritt
ea15df8048
use sse16_sse2() in nsse
...
Originally committed as revision 4688 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-11-12 05:23:25 +00:00
Loren Merritt
a6624e21cb
faster h264_chroma_mc8_mmx, added h264_chroma_mc4_mmx.
...
2-4% overall speedup.
Originally committed as revision 4666 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-10-27 06:45:29 +00:00
Loren Merritt
b926572aa9
h264 mmx weighted prediction. up to 3% overall speedup.
...
Originally committed as revision 4630 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-10-09 23:38:52 +00:00
Loren Merritt
5693c08356
sse2 16x16 sum squared diff (306=>268 cycles on a K8)
...
faster 8x8 mmx ssd (77=>70 cycles)
Originally committed as revision 4623 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-30 02:31:47 +00:00
Michael Niedermayer
12e9668119
replace a few mov + psrlq with pshufw, there are more cases which could benefit from this but they would require us to duplicate some functions ...
...
the trick is from various places (my own code in libpostproc, a patch on the x264 list, ...)
Originally committed as revision 4608 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-21 21:17:09 +00:00
Reimar Döffinger
cd7af76d9e
Fix compile without CONFIG_GPL, misplaced #endif caused a missing }.
...
Originally committed as revision 4575 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 19:30:40 +00:00
Michael Niedermayer
9f211bc6d7
remove unused table entries
...
change non portable table access
Originally committed as revision 4574 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 19:03:37 +00:00
Michael Niedermayer
84740d5980
xvids mmx&mmx2 idcts
...
needed to decode xvid without some minor artefacts
under #ifdef CONFIG_GPL of course
Originally committed as revision 4572 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-10 17:01:30 +00:00
Måns Rullgård
79396ac685
Kill some compiler warnings. Compiled code verified identical after changes.
...
Originally committed as revision 4567 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-06 21:25:35 +00:00
Michael Niedermayer
d3a9f79871
simplify (d&a) and (d&~a) calculation, hint by skal
...
Originally committed as revision 4552 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-03 09:17:30 +00:00
Michael Niedermayer
b5b65df7a9
add consts (this was in my local tree, dunno where it came from, probably forgoten from some const patch)
...
Originally committed as revision 4551 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-09-02 21:13:19 +00:00
Måns Rullgård
bf4e3bd2d0
kill a bunch of compiler warnings
...
Originally committed as revision 4522 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-08-14 15:42:40 +00:00
Alexander Strasser
c11c2bc20b
libavutil: Utility code from libavcodec moved to a separate library.
...
Originally committed as revision 4489 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-08-01 20:07:05 +00:00
Loren Merritt
d2bb7db135
sort H.264 mmx dsp functions into their own file
...
Originally committed as revision 4338 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-02 20:45:35 +00:00
Michael Niedermayer
c26ae41db2
adding a few const
...
Originally committed as revision 4337 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 21:19:00 +00:00
Michael Niedermayer
435b0720a8
100l for myself (breaking amd64)
...
Originally committed as revision 4336 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 18:04:01 +00:00
Michael Niedermayer
6510f43cf3
merge a few asm blocks so gcc cant unoptimize it (658->631 dezicycles on duron)
...
Originally committed as revision 4334 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:56:58 +00:00
Michael Niedermayer
987ae784e6
get rid of 2 movq (680 -> 658 dezicycles on duron)
...
Originally committed as revision 4333 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 11:36:32 +00:00
Michael Niedermayer
e4b36d4434
avoid one transpose (730->680 dezicycles on duron)
...
Originally committed as revision 4332 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 08:43:40 +00:00
Loren Merritt
85bbfcd4ee
10l (symbol mangling)
...
Originally committed as revision 4331 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 04:51:46 +00:00
Michael Niedermayer
1f3dbc09b1
add rounding bias before the horizontal idct (765->730 dezicyles on duron)
...
Originally committed as revision 4330 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-06-01 01:18:41 +00:00
Loren Merritt
1d62fc8560
MMX for H.264 iDCT (adapted from x264)
...
Originally committed as revision 4329 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-31 22:48:33 +00:00
Zoltán Hidvégi
3072f0cb2e
MMX code for (put|avg)_h264_chroma_mc8
...
Originally committed as revision 4305 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-25 08:05:41 +00:00
Loren Merritt
5cf08f2393
H.264 deblocking optimizations (mmx for chroma_bS4 case, convert existing cases to 8-bit math)
...
Originally committed as revision 4271 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-18 09:17:22 +00:00
Michael Niedermayer
5773a74669
porting the mmx&sse2 (sse2 untested) vp3 idcts to the lavc idct API
...
Originally committed as revision 4260 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 18:28:40 +00:00
Michael Niedermayer
b178f758fa
disabling vp3 mmx&mmx2 idcts, they must be ported over to the lavc idct API, ill port the vp3 c idct
...
Originally committed as revision 4255 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-17 09:11:48 +00:00
Michael Niedermayer
c998bdd9a0
fix PIC
...
Originally committed as revision 4204 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-05-08 18:48:19 +00:00
Loren Merritt
42251a2a4f
MMX for H.264 deblocking filter
...
Originally committed as revision 4158 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-25 01:01:41 +00:00
Michael Niedermayer
4e492bf107
read 32bit instead of 64bit to avoid overreading and missalignments
...
Originally committed as revision 4133 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-04-17 13:30:45 +00:00
Martin Drab
4d9ae03b09
optimization and gcc 4.0 bug workaround patch by (Martin Drab >drab kepler.fjfi.cvut cz<)
...
Originally committed as revision 3945 to svn://svn.ffmpeg.org/ffmpeg/trunk
2005-02-07 17:09:48 +00:00
Michael Niedermayer
e42a152bbc
x86 cpu capabilities detection rewrite / cleanup
...
Originally committed as revision 3750 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-13 16:11:38 +00:00
Matthieu Castet
e7ddb0cf4c
detect sse on athlon-xp patch by (matthieu castet <castet >.< matthieu >at< free >.< fr>)
...
Originally committed as revision 3729 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-12-05 23:26:43 +00:00
Michael Niedermayer
1c3990dbba
H.261 encoder by (Maarten Daniels <maarten dot daniels at luc dot ac dot be>)
...
Originally committed as revision 3643 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-28 10:12:57 +00:00
Aurelien Jacobs
8b4d077fc9
gcc 3.4.3 preversions do not appreciate invalid instruction and operand combinations anymore patch by (Aurelien Jacobs <aurel at gnuage dot org>)
...
Originally committed as revision 3604 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-17 20:27:11 +00:00
Aurelien Jacobs
053dea12f2
adapting existing mmx/mmx2/sse/3dnow optimizations so they work on x86_64 patch by (Aurelien Jacobs <aurel at gnuage dot org>)
...
Originally committed as revision 3578 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-10-11 02:19:29 +00:00
Michael Niedermayer
178fcca848
1/2 resolution decoding
...
Originally committed as revision 3507 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-25 23:18:58 +00:00
Michael Niedermayer
e69538fa60
h264_qpel8_hv_lowpass_mmx2/3dnow
...
Originally committed as revision 3446 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:49:12 +00:00
Michael Niedermayer
e772bb8a82
h264_qpel4_hv_lowpass_mmx2/3dnow
...
alignment fixes
Originally committed as revision 3445 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 18:25:45 +00:00
Michael Niedermayer
56d8bd5659
optimization
...
Originally committed as revision 3440 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-08 00:52:49 +00:00
Michael Niedermayer
a6e39f45a2
optimization
...
Originally committed as revision 3439 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 22:56:50 +00:00
Michael Niedermayer
ed8ffdf46c
optimization
...
Originally committed as revision 3438 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 17:22:37 +00:00
Michael Niedermayer
437525c473
h264 luma motion compensation in mmx2/3dnow
...
Originally committed as revision 3437 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-07 01:48:45 +00:00
Michael Niedermayer
d6af6b0350
10000l fix and use more mmx2/3dnow code for mpeg4 qpel which has been written and commited long time ago but appearently never used, qpel motion compensation is 5% faster
...
now
Originally committed as revision 3435 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-09-06 03:17:31 +00:00
Michael Niedermayer
6206f8c920
10l and better MMX/SSE detection for VIA1000
...
Originally committed as revision 3323 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-16 01:54:07 +00:00
Michael Niedermayer
61b3b639e7
dc_precission=11 fix
...
Originally committed as revision 3291 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-07-06 12:27:36 +00:00
Michael Niedermayer
4c99f2cdbe
transmeta crusoe mmx detection by "Nobody/Anonymous - nobody" / sf feature request tracker
...
Originally committed as revision 3248 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-06-24 22:13:44 +00:00
Michael Niedermayer
1ec4df0fa8
sse8 and nsse in mmx
...
Originally committed as revision 3206 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-06-08 02:13:44 +00:00
Michael Niedermayer
e96682e6f4
some of the warning fixes by (Michael Roitzsch <mroi at users dot sourceforge dot net>)
...
Originally committed as revision 3140 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-05-18 17:09:46 +00:00
Michael Niedermayer
77d0e165b1
mmx dct optimization
...
Originally committed as revision 3100 to svn://svn.ffmpeg.org/ffmpeg/trunk
2004-04-30 19:53:54 +00:00