FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-28 20:53:54 +02:00

Author	SHA1	Message	Date
Speedy Gonzales	ffda8f0f0f	Proresenc: add multithreading support Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 17:16:47 +02:00
Mans Rullgard	90540c2d5a	x86: swscale: fix fragile memory accesses To access data at multiple fixed offsets from a base address, this code uses a single "m" operand and code of the form "32%0", relying on the memory operand instantiation having no displacement, giving a final result of the form "32(%rax)". If the compiler uses a register and displacement, e.g. "64(%rax)", the end result becomes "3264(%rax)", which obviously does not work. Replacing the "m" operands with "r" operands allows safe addition of a displacement. In theory, multiple memory operands could use a shared base register with different index registers, "(%rax,%rbx)", potentially making more efficient use of registers. In the cases at hand, no such sharing is possible since the addresses involved are entirely unrelated. After this change, the code somewhat rudely accesses memory without using a corresponding memory operand, which in some cases can lead to unwanted "optimisations" of surrounding code. However, the original code also accesses memory not covered by a memory operand, so this is not adding any defect not already present. It is also hightly unlikely that any such optimisations could be performed here since the memory locations in questions are not accessed elsewhere in the same functions. This fixes crashes with suncc. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Mans Rullgard	10b83cb653	x86: swscale: remove disabled code This code has been disabled since 2003. Nobody will ever look at it again. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Mans Rullgard	480178a295	x86: yadif: fix asm with suncc Under some circumstances, suncc will use a single register for the address of all memory operands, inserting lea instructions loading the correct address prior to each memory operand being used in the code. In the yadif code, the branch in the asm block bypasses such an lea instruction, causing an incorrect address to be used in the following load. This patch replaces the tmpX arrays with a single array and uses a register operand to hold its address. Although this prevents using offsets from the stack pointer to access these locations, the code still builds as 32-bit PIC even with old compilers. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Mans Rullgard	8ec0204ee4	x86: cabac: allow building with suncc This fixes two issues preventing suncc from building this code. The undocumented 'a' operand modifier, causing gcc to omit a $ in front of immediate operands (as required in addresses), is not supported by suncc. Luckily, the also undocumented 'c' modifer has the same effect and is supported. On some asm statements with a large number of operands, suncc for no obvious reason fails to correctly substitute some of the operands. Fortunately, some of the operands in these statements are plain numbers which can be inserted directly into the code block instead of passed as operands. With these changes, the code builds correctly with both gcc and suncc. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Mans Rullgard	c8252e80eb	x86: mlpdsp: avoid taking address of void This code contains a C array of addresses of labels defined in inline asm. To do this, the names must be declared as external in C. The declared type does not matter since only the address is used, and for some reason, the author of the code used the 'void' type despite taking the address of a void expression being invalid. Changing the type to char, a reasonable choice since the alignment of the code labels cannot be known or guaranteed, eliminates gcc warnings and allows building with suncc. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Mans Rullgard	87fa05a0da	ARM: intmath: use native-size return types for clipping functions This avoids having the compiler redundantly mask the values to the smaller size. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 14:51:52 +01:00
Michael Niedermayer	603221ebd0	g723_1dec: inline normalize_bits() in scale vector and optimize it. many branches and cases of scale_vector are irrelevant for the case here and by inlining they can be reliably removed. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 15:18:47 +02:00
Michael Niedermayer	20035fa241	g723_1dec: remove dead code that leaked in from libav It appears someone thinks this special case can be reached Well, it cannot, thus not only do we not need to optimize it we dont need it at all Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 15:02:21 +02:00
Michael Niedermayer	84d29df013	g723_1dec: remove unneeded cliping that leaked in from merge from libav Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 15:01:45 +02:00
Michael Niedermayer	a9040a1167	g723_1dec: avoid memcpy Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 14:53:29 +02:00
Michael Niedermayer	d8c3170c9f	Merge remote-tracking branch 'qatar/master' * qatar/master: (22 commits) g723.1: do not pass large structs by value g723.1: do not bounce intermediate values via memory g723.1: declare a variable in the block it is used g723.1: avoid saving/restoring excitation g723.1: avoid unnecessary memcpy() in residual_interp() g723.1: make postfilter write directly to output buffer g723.1: drop unnecessary variable buf_ptr in formant_postfilter() g723.1: make scale_vector() output to a separate buffer g723.1: make autocorr_max() work on an arbitrary buffer g723.1: do not needlessly use int64_t g723.1: use saturating addition functions g723.1: optimise scale_vector() g723.1: remove useless uses of MUL64() g723.1: remove unnecessary argument 'shift' from dot_product() g723.1: deobfuscate "(x << 4) - x" to "15 * x" celp: optimise ff_celp_lp_synthesis_filter() libavutil: add saturating addition functions cllc: Implement ARGB support cllc: Add support for QRGB cllc: Rename some funcs to represent what they actually do ... Conflicts: LICENSE libavcodec/g723_1.c libavcodec/x86/Makefile Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 14:38:43 +02:00
Paul B Mahol	bd70a52712	paf: prevent invalid write Closes #1631. Signed-off-by: Paul B Mahol <onemda@gmail.com>	2012-08-13 12:27:58 +00:00
Stefano Sabatini	c3da2c19e4	build: extend documentation building mechanism Allow to select specific documentation components, and reliably check for component dependencies. In particular, check for perl presence on the system.	2012-08-13 12:22:02 +02:00
Jérémy Tran	ae60d2c877	lavfi: add hue filter This is a port of the MPlayer hue filter (libmpcodecs/vf_hue.c) by Michael Niedermayer. Signed-off-by: Jérémy Tran <tran.jeremy.av@gmail.com> Signed-off-by: Stefano Sabatini <stefasab@gmail.com>	2012-08-13 12:00:54 +02:00
Nicolas George	03e8944fc1	lavc: add missing codec descriptors.	2012-08-13 10:45:04 +02:00
Nicolas George	f594dafc10	tools: add a script to find missing codec descriptors.	2012-08-13 10:44:59 +02:00
Michael Niedermayer	710600077d	h264_cavlc: switch forgotten assert to av_assert Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 05:59:44 +02:00
Michael Niedermayer	e9d0ab5717	h264: fix x264 build detection Fixes Ticket1503 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 04:58:15 +02:00
Mans Rullgard	69665bd6f4	g723.1: do not pass large structs by value Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	138914dcd8	g723.1: do not bounce intermediate values via memory Although a reasonable compiler will probably optimise out the actual store and load, this operation still implies a truncation to 16 bits which the compiler will probably not realise is not necessary here. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	cbcf1b411f	g723.1: declare a variable in the block it is used Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	35b533e4de	g723.1: avoid saving/restoring excitation Writing the scaled excitation to a scratch buffer (borrowing the 'audio' array) instead of modifying it in place avoids the need to save and restore the unscaled values. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	4b728b4712	g723.1: avoid unnecessary memcpy() in residual_interp() Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	f645710cf3	g723.1: make postfilter write directly to output buffer Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	1953264331	g723.1: drop unnecessary variable buf_ptr in formant_postfilter() Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	b2af2c4bee	g723.1: make scale_vector() output to a separate buffer Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	783da0d696	g723.1: make autocorr_max() work on an arbitrary buffer Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	3716105103	g723.1: do not needlessly use int64_t Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	47c73a73b0	g723.1: use saturating addition functions Use saturating addition functions instead of 64-bit intermediates and separate clipping. This is much faster when dedicated instructions are available. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	4aca716a53	g723.1: optimise scale_vector() Firstly, nothing in this function can overflow 32 bits so the use of a 64-bit type is completely unnecessary. Secondly, the scale is either a power of two or 0x7fff. Doing separate loops for these cases avoids using multiplications. Finally, since only the number of bits, not the actual value, of the maximum value is needed, the bitwise or of all the values serves the purpose while being faster. It is worth noting that even if overflow could happen, it was not handled correctly anyway. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	1eb1f6f281	g723.1: remove useless uses of MUL64() The operands in both cases are 16-bit so cannot overflow a 32-bit destination. In gain_scale() the inputs are reduced to 14-bit, so even the shift cannot overflow. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	5a43eba956	g723.1: remove unnecessary argument 'shift' from dot_product() The 'shift' argument is always 1 so there is no need to pass it explicitly in every call. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	8b0de73464	g723.1: deobfuscate "(x << 4) - x" to "15 * x" The compiler performs this optimisation. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	fddc5b9bea	celp: optimise ff_celp_lp_synthesis_filter() Adding instead of subtracting the products in the loop allows the compiler to generate more efficient multiply-accumulate instructions when 16-bit multiply-subtract is not available. ARM has only multiply-accumulate for 16-bit operands. In general, if only one variant exists, it is usually accumulate rather than subtract. In the same spirit, using the dedicated saturation function enables use of any special optimised versions of this. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:25 +01:00
Mans Rullgard	6c4975eaaf	libavutil: add saturating addition functions Fixed-point audio codecs often use saturating arithmetic, and special instructions for these operations are common. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-08-13 01:03:10 +01:00
Michael Niedermayer	ed8d827ad0	riffenc: fix aac Fixes Ticket1435 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-13 01:41:55 +02:00
Stefano Sabatini	5c0d8bc4ce	lavfi: add avfilter_get_class() and iteration callbacks Allow iteration over filter options.	2012-08-13 00:04:06 +02:00
Stefano Sabatini	a25346e65c	lavu/opt.h: add AV_OPT_FLAG_FILTERING_PARAM macro	2012-08-12 23:52:55 +02:00
Stefano Sabatini	3239382aef	doc/texi2pod: add "use warnings" directive The script was previously run with perl -w through the shebang command. Now that the script is executed through direct perl invocation the -w in the shebang command is ignored. This patch re-enables "use warnings" whatever way the script is invoked. Idea-By: jamal <jamrial@gmail.com>	2012-08-12 23:52:55 +02:00
Reimar Döffinger	118bd609f0	Optimized unscaled yuvp9/yuvp10 -> yuvp16 conversion. About 30% faster on 32 bit Atom, 120% faster on 64 bit Phenom2. This is interesting because supporting P16 is easier in e.g. OpenGL (can misuse support for any 2-component 8 bit format), whereas supporting p9/p10 without conversion needs a texture format with at least 14 bits actual precision. The shiftonly == 0 case is not optimized since the code is more complex and the speed gain less obvious. Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>	2012-08-12 23:23:19 +02:00
Michael Niedermayer	bb7073921c	oggparsetheora: fix metadata parsing Fixes Ticket1508 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-12 23:09:45 +02:00
Derek Buitenhuis	17c11cef9f	cllc: Implement ARGB support Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:21:15 -04:00
Derek Buitenhuis	ba752dc016	cllc: Implement ARGB support Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:13:47 -04:00
Derek Buitenhuis	7fda47d53b	cllc: Add support for QRGB Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:07:00 -04:00
Derek Buitenhuis	f4bb38cc26	cllc: Rename some funcs to represent what they actually do This is in preparation for adding support for other colorspaces and coding types. Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:07:00 -04:00
Derek Buitenhuis	21d62c4730	cllc: Add support for QRGB Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:01:19 -04:00
Derek Buitenhuis	4637009e59	cllc: Rename some funcs to represent what they actually do This is in preparation for adding support for other colorspaces and coding types. Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>	2012-08-12 15:01:19 -04:00
Michael Niedermayer	ab0ea7cb41	ffplay: avoid SDL_atoi() It appears this function is not available everywhere Should fix Ticket1525 Reviewed-by: Marton Balint <cus@passwd.hu> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-12 20:05:39 +02:00
Boris Maksalov	d70231f02d	Fix reading past the end of frame buffer. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-08-12 18:51:23 +02:00

1 2 3 4 5 ...

43632 Commits