FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-18 03:19:31 +02:00

Author	SHA1	Message	Date
Jason Garrett-Glaser	3ae079a3c8	VP8: optimize DC-only chroma case in the same way as luma. Add MMX idct_dc_add4uv function for this case. ~40% faster chroma idct. Originally committed as revision 24455 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 06:02:52 +00:00
Jason Garrett-Glaser	3df56f4118	VP8: Clean up some variable shadowing. Originally committed as revision 24454 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 03:44:37 +00:00
Jason Garrett-Glaser	8a467b2d44	VP8: 30% faster idct_mb Take shortcuts based on statistically common situations. Add 4-at-a-time idct_dc function (mmx and sse2) since rows of 4 DC-only DCT blocks are common. TODO: tie this more directly into the MB mode, since the DC-level transform is only used for non-splitmv blocks? Originally committed as revision 24452 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 02:58:27 +00:00
Jason Garrett-Glaser	ef38842f0b	VP8: smarter prefetching Don't prefetch reference frames that were used less than 1/32th of the time so far in the frame. This helps speed up to ~2% on videos that, in many frames, make near-zero (but not entirely zero) use of golden and/or alt-refs. This is a very common property of videos encoded by libvpx. Originally committed as revision 24451 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 01:59:56 +00:00
Jason Garrett-Glaser	c25c776708	VP8: clear DCT blocks in iDCT instead of using clear_blocks. ~0.3% faster overall. Originally committed as revision 24448 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 00:07:16 +00:00
Jason Garrett-Glaser	b74f70d646	VP8: avoid a memset for non-i4x4 blocks with no coefficients Originally committed as revision 24447 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-23 00:05:44 +00:00
Jason Garrett-Glaser	145d31865d	Get rid of more unnecessary dereferences in VP8 deblocking Originally committed as revision 24446 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 23:11:40 +00:00
Jason Garrett-Glaser	867215336d	Shut up an uninitialized variable GCC warning in VP8. Originally committed as revision 24445 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 23:04:51 +00:00
Jason Garrett-Glaser	c4211046d2	Smarter VP8 prefetching Prefetch all refs (including altref), but only if they've been used so far this frame. ~2.5% faster overall. TODO: Do something even smarter, like using how often each ref has been used so far, so that a couple blocks of a rarely-used ref don't force us to prefetch it. Originally committed as revision 24444 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 23:03:08 +00:00
Jason Garrett-Glaser	8cfae560ad	Fix stupid bug in VP8 prefetching code Originally committed as revision 24443 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 22:15:43 +00:00
Jason Garrett-Glaser	2a38c2e99a	Eliminate a LUT in escape decoding in VP8 decode_block_coeffs Originally committed as revision 24441 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 22:08:09 +00:00
Jason Garrett-Glaser	d292c3455e	Eliminate some repeated dereferences in VP8 inter_predict Originally committed as revision 24438 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 21:05:30 +00:00
Jason Garrett-Glaser	b946111fde	Eliminate a pointless memset for intra blocks in P-frames in VP8 Originally committed as revision 24429 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 12:15:29 +00:00
Jason Garrett-Glaser	b9a7186bf4	VP8: Don't store segment in macroblock struct anymore. Not necessary with the previous patch. Originally committed as revision 24427 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 11:55:55 +00:00
Jason Garrett-Glaser	c55e0d34ba	Convert VP8 macroblock structures to a ring buffer. Uses a slightly nonintuitive ring buffer size of (width+height*2) to simplify addressing logic. Also split out the segmentation map to a separate structure, necessary to implement the ring buffer. Originally committed as revision 24426 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 11:45:18 +00:00
Jason Garrett-Glaser	968570d65f	Calculate deblock strength per-MB instead of per-row Gives better cache locality, since the VP8Macroblock structs are still in cache. Inspired by the way x264 does it. Originally committed as revision 24417 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 07:24:22 +00:00
Jason Garrett-Glaser	d1c58fce20	Avoid tracking i4x4 modes in P-frames in VP8 As in the previous commit, they aren't used for context selection, so it saves memory this way. Originally committed as revision 24416 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 07:04:45 +00:00
Jason Garrett-Glaser	158e062c95	Avoid useless fill_rectangle in P-frames in VP8 In VP8, i4x4 only uses contexts based on neighbors in I-frames. Originally committed as revision 24415 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 06:39:54 +00:00
Jason Garrett-Glaser	7bf254c41d	Optimize partition mv decoding in VP8 Originally committed as revision 24414 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 06:29:26 +00:00
Jason Garrett-Glaser	c0498b3031	Take shortcuts for mv0 case in VP8 MC Avoid edge emulation -- it isn't needed if there isn't any subpel. Originally committed as revision 24413 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 05:49:09 +00:00
Jason Garrett-Glaser	702e8d3376	Much faster VP8 mv and mode prediction Originally committed as revision 24412 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 04:26:41 +00:00
Jason Garrett-Glaser	d864dee8ab	Add prefetching to VP8 decoder ~5% faster overall, probably depends on CPU and resolution. Originally committed as revision 24410 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-22 03:09:10 +00:00
Måns Rullgård	096971e892	vp8: indent Originally committed as revision 24368 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-20 17:54:28 +00:00
Måns Rullgård	070ce7efad	vp8: add do { } while(0) around XCHG() macro to avoid confusing if/else This is the correct solution to the warning "fixed" in the previous commit. Originally committed as revision 24367 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-20 17:54:25 +00:00
Diego Biurrun	153da88dfb	Add some braces to silence the warning: libavcodec/vp8.c:892: warning: suggest explicit braces to avoid ambiguous `else' Originally committed as revision 24366 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-20 17:45:54 +00:00
Ronald S. Bultje	3facfc99da	Change function prototypes for width=8 inner and mbedge loopfilter functions so that it does both U and V planes at the same time. This will have speed advantages when using SSE2 (or higher) optimizations, since we can do both the U and V rows together in a single xmm register. This also renames filter16 to filter16y and filter8 to filter8uv so that it's more obvious what each function is used for. Originally committed as revision 24337 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-19 21:18:04 +00:00
David Conrad	9ac831c2c0	vp8: Save mb border needed for intra prediction so that loop filter can run immediately after a mb row is decoded Originally committed as revision 24252 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-16 07:20:35 +00:00
David Conrad	b6c420ce8f	vp8: Check for malloc failure Originally committed as revision 24251 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-16 07:20:31 +00:00
Ronald S. Bultje	e394953e62	Add missing doxy for function arguments. Originally committed as revision 24110 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-08 15:01:59 +00:00
David Conrad	5245c04da3	VP8: Move calculation of outer filter limit out of dsp functions for normal filter to match the simple loop filter Originally committed as revision 24010 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-02 21:04:45 +00:00
Diego Biurrun	3fa7626863	Avoid square brackets in Doxygen comments; Doxygen chokes on them. Originally committed as revision 23979 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-07-02 11:44:58 +00:00
Ronald S. Bultje	7ed06b2be8	Simplify MV parsing, removes laying out 2 or 4 (16x8/8x8/8x16) MVs over all 16 subblocks (since we no longer need that), which should also lead to a minor speedup. Originally committed as revision 23854 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-28 16:04:14 +00:00
Ronald S. Bultje	7c4dcf8165	Optimize split MC, so we don't always do 4x4 blocks of 4x4pixels each, but we apply them as 16x8/8x16/8x8 subblocks where possible. Since this allows us to use width=8/16 instead of width=4 MC functions, we can now take more advantage of SSE2/SSSE3 optimizations, leading to a total speedup for splitMV filter of about 10%. Originally committed as revision 23853 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-28 13:50:55 +00:00
David Conrad	0ef1dbedcb	VP8 bilinear filter Originally committed as revision 23813 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-27 01:46:29 +00:00
Måns Rullgård	92a544267b	vp8: warn and request sample if upscaling specified in header Originally committed as revision 23809 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-27 00:37:43 +00:00
Jason Garrett-Glaser	d6f8476be4	Make VP8 DSP functions take two strides This isn't useful for the C functions, but will allow re-using H and V functions for HV functions without adding separate H and V wrappers. Originally committed as revision 23782 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-25 18:14:07 +00:00
Jason Garrett-Glaser	03ac56e7f1	fix typo in vp8 decoder error message Originally committed as revision 23765 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-25 04:23:45 +00:00
Stefan Gehrer	8f910a5621	avoid conditional and division in chroma MV calculation Originally committed as revision 23745 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-23 21:45:26 +00:00
David Conrad	3b636f21da	Native VP8 decoder. Patch by David Conrad <lessen42 gmail com> and myself. Originally committed as revision 23719 to svn://svn.ffmpeg.org/ffmpeg/trunk	2010-06-22 19:24:09 +00:00

1 2 3

139 Commits