FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-23 12:43:46 +02:00

Author	SHA1	Message	Date
Mans Rullgard	ef15d71c1f	VP8: ARM NEON optimisations for dsp functions This adds NEON optimised versions of all functions in VP8DSPContext. Based on initial work by Rob Clark. Signed-off-by: Mans Rullgard <mans@mansr.com> (cherry picked from commit `a1c1d3c003`)	2011-02-09 03:31:21 +01:00
Ronald S. Bultje	e3c5395402	VP8: don't overread edges on fourtap MC. Fix C VP8 H+V MC functions which do two-dimensional 4/6-tap filters to not overread beyond their edges if the second filter is 4-tap, since the outer pixels aren't there anymore since `44002d8323`. (cherry picked from commit `22893e10ae`)	2011-01-28 03:15:34 +01:00
Jason Garrett-Glaser	f311208cf1	VP8: much faster DC transform handling A lot of the time the DC block is empty: don't do the WHT in this case. A lot of the rest of the time, there's only one coefficient: make a special DC-only transform for that case. When the block is empty, don't incorrectly mark luma DCT blocks as having DC coefficients. Originally committed as revision 24670 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-08-02 20:57:03 +00:00
Jason Garrett-Glaser	827d43bb9d	VP8: move zeroing of luma DC block into the WHT Lets us do the zeroing in asm instead of C. Also makes it consistent with the way the regular iDCT code does it. Originally committed as revision 24668 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-08-02 20:18:09 +00:00
Jason Garrett-Glaser	3ae079a3c8	VP8: optimize DC-only chroma case in the same way as luma. Add MMX idct_dc_add4uv function for this case. ~40% faster chroma idct. Originally committed as revision 24455 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 06:02:52 +00:00
Jason Garrett-Glaser	8a467b2d44	VP8: 30% faster idct_mb Take shortcuts based on statistically common situations. Add 4-at-a-time idct_dc function (mmx and sse2) since rows of 4 DC-only DCT blocks are common. TODO: tie this more directly into the MB mode, since the DC-level transform is only used for non-splitmv blocks? Originally committed as revision 24452 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 02:58:27 +00:00
Jason Garrett-Glaser	c25c776708	VP8: clear DCT blocks in iDCT instead of using clear_blocks. ~0.3% faster overall. Originally committed as revision 24448 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 00:07:16 +00:00
Ronald S. Bultje	3facfc99da	Change function prototypes for width=8 inner and mbedge loopfilter functions so that it does both U and V planes at the same time. This will have speed advantages when using SSE2 (or higher) optimizations, since we can do both the U and V rows together in a single xmm register. This also renames filter16 to filter16y and filter8 to filter8uv so that it's more obvious what each function is used for. Originally committed as revision 24337 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-19 21:18:04 +00:00
David Conrad	5245c04da3	VP8: Move calculation of outer filter limit out of dsp functions for normal filter to match the simple loop filter Originally committed as revision 24010 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-02 21:04:45 +00:00
David Conrad	982fac7357	Altivec VP8 MC functions Originally committed as revision 23884 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-29 06:42:17 +00:00
Jason Garrett-Glaser	95275094a5	Faster C VP8 normal inner loop filter Originally committed as revision 23881 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-29 04:34:04 +00:00
Jason Garrett-Glaser	9942b6a1b0	Use crop table in C implementations of VP8 DSP functions. Much faster VP8 C DSP functions; ~5-10% faster overall with asm off. Originally committed as revision 23880 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-29 03:34:24 +00:00
Stefano Sabatini	a64fadf62b	Fix linking if MMX is disabled. Originally committed as revision 23839 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-27 23:25:04 +00:00
Jason Garrett-Glaser	0178d14fe5	First shot at VP8 optimizations: - MMXEXT, SSE2 and SSSE3 MC functions - MMX and SSE4 IDCT dc_add functions Patch by Jason Garrett-Glaser <darkshikari gmail com> and myself. Originally committed as revision 23815 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-27 02:01:45 +00:00
David Conrad	0ef1dbedcb	VP8 bilinear filter Originally committed as revision 23813 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-27 01:46:29 +00:00
Jason Garrett-Glaser	15d31aa1e1	Really fix r23782 Originally committed as revision 23788 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-25 19:03:03 +00:00
Jason Garrett-Glaser	cd29c2b5a1	Fix c99ism in r23782 Originally committed as revision 23786 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-25 18:48:45 +00:00
Jason Garrett-Glaser	d6f8476be4	Make VP8 DSP functions take two strides This isn't useful for the C functions, but will allow re-using H and V functions for HV functions without adding separate H and V wrappers. Originally committed as revision 23782 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-25 18:14:07 +00:00
David Conrad	3b636f21da	Native VP8 decoder. Patch by David Conrad <lessen42 gmail com> and myself. Originally committed as revision 23719 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-22 19:24:09 +00:00

19 Commits