FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-23 12:43:46 +02:00

Author	SHA1	Message	Date
Dale Curtis	a246701e9a	Parse and drop gain control data, so that SSR packets decode. This will result in poor quality audio for SSR streams, but they will at least demux and decode without error; partially fixing ticket #1693. This pulls in the decode_gain_control() function from the ffmpeg summer-of-code repo (original author Maxim Gavrilov) at svn://svn.mplayerhq.hu/soc/aac/aac.c with some minor modifications and adds AOT_AAC_SSR to decode_audio_specific_config_gb(). Signed-off-by: Dale Curtis <dalecurtis@chromium.org> Co-authored-by: Maxim Gavrilov <maxim.gavrilov@gmail.com>	2018-02-21 18:17:24 -08:00
Paul B Mahol	dbc9a8f21f	avcodec/aac: Add floating point 960/120 MDCT window Co-Authored-By: Alex Converse <alex.converse@gmail.com> Co-Authored-By: Rostislav Pehlivanov <atomnuker@gmail.com>	2017-08-24 17:42:07 -07:00
Rostislav Pehlivanov	d2119f624d	imdct15: rename to mdct15 and add a forward transform Handles strides (needed for Opus transients), does pre-reindexing and folding without needing a copy. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2017-02-14 06:15:36 +00:00
Ganesh Ajjanagadde	79abf2d0de	avcodec/aac_tablegen: get rid of hardcoded tables entirely Commit `96786a12f6` makes runtime initialization cheap. Tested with FATE, with/without --enable-hardcoded-tables. Reviewed-by: Rostislav Pehlivanov <atomnuker@gmail.com> Signed-off-by: Ganesh Ajjanagadde <gajjanagadde@gmail.com>	2015-11-28 09:41:15 -05:00
Rostislav Pehlivanov	27d23ae074	aacenc: add support for encoding files using Long Term Prediction Long Term Prediction allows for prediction of spectral coefficients via the previously decoded time-dependent samples. This feature works well with harmonic content 2 or more frames long, like speech, human or non-human, piano music or any constant tones at very low bitrates. It should be noted that the current coder is highly efficient and the rate control system is unable to encode files at extremely low bitrates (less than 14kbps seems to be impossible) so this extension isn't capable of optimum operation. Dramatic difference is observable with some types of audio and speech but for the most part the audiable differences are subtle. The spectrum looks better however so the encoder is able to harvest the additional bits that this feature provies, should the user choose to enable it. So it's best to enable this feature only if encoding at the absolutely lowest bitrate that the encoder is capable of.	2015-10-17 02:31:20 +01:00
Claudio Freire	01ecb7172b	AAC encoder: Extensive improvements This finalizes merging of the work in the patches in ticket #2686. Improvements to twoloop and RC logic are extensive. The non-exhaustive list of twoloop improvments includes: - Tweaks to distortion limits on the RD optimization phase of twoloop - Deeper search in twoloop - PNS information marking to let twoloop decide when to use it (turned out having the decision made separately wasn't working) - Tonal band detection and priorization - Better band energy conservation rules - Strict hole avoidance For rate control: - Use psymodel's bit allocation to allow proper use of the bit reservoir. Don't work against the bit reservoir by moving lambda in the opposite direction when psymodel decides to allocate more/less bits to a frame. - Retry the encode if the effective rate lies outside a reasonable margin of psymodel's allocation or the selected ABR. - Log average lambda at the end. Useful info for everyone, but especially for tuning of the various encoder constants that relate to lambda feedback. Psy: - Do not apply lowpass with a FIR filter, instead just let the coder zero bands above the cutoff. The FIR filter induces group delay, and while zeroing bands causes ripple, it's lost in the quantization noise. - Experimental VBR bit allocation code - Tweak automatic lowpass filter threshold to maximize audio bandwidth at all bitrates while still providing acceptable, stable quality. I/S: - Phase decision fixes. Unrelated to #2686, but the bugs only surfaced when the merge was finalized. Measure I/S band energy accounting for phase, and prevent I/S and M/S from being applied both. PNS: - Avoid marking short bands with PNS when they're part of a window group in which there's a large variation of energy from one window to the next. PNS can't preserve those and the effect is extremely noticeable. M/S: - Implement BMLD protection similar to the specified in ISO-IEC/13818:7-2003, Appendix C Section 6.1. Since M/S decision doesn't conform to section 6.1, a different method had to be implemented, but should provide equivalent protection. - Move the decision logic closer to the method specified in ISO-IEC/13818:7-2003, Appendix C Section 6.1. Specifically, make sure M/S needs less bits than dual stereo. - Don't apply M/S in bands that are using I/S Now, this of course needed adjustments in the compare targets and fuzz factors of the AAC encoder's fate tests, but if wondering why the targets go up (more distortion), consider the previous coder was using too many bits on LF content (far more than required by psy), and thus those signals will now be more distorted, not less. The extra distortion isn't audible though, I carried extensive ABX testing to make sure. A very similar patch was also extensively tested by Kamendo2 in the context of #2686.	2015-10-11 17:29:50 -03:00
Rostislav Pehlivanov	1cd5daee20	aac: remove now-unused redundant array This commit removes the array which was made redundant with the last commit. The current prediction system gets the quantization error directly (and without the single-frame delay) in the search_for_pred function. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2015-08-29 06:44:20 +01:00
Rostislav Pehlivanov	44ddee945a	aacenc_pred: rework the way prediction is done This commit completely alters the algorithm of prediction. The original commit which introduced prediction was completely incorrect to even remotely care about what the actual coefficients contain or whether any options were enabled. Not my actual fault. This commit treats prediction the way the decoder does and expects to do: like lossy encryption. Everything related to prediction now happens at the very end but just before quantization and encoding of coefficients. On the decoder side, prediction happens before anything has had a chance to even access the coefficients. Also the original implementation had problems because it actually touched the band_type of special bands which already had their scalefactor indices marked and it's a wonder the asserion wasn't triggered when transmitting those. Overall, this now drastically increases audio quality and you should think about enabling it if you don't plan on playing anything encoded on really old low power ultra-embedded devices since they might not support decoding of prediction or AAC-Main. Though the specifications were written ages ago and as times change so do the FLOPS. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2015-08-29 06:34:08 +01:00
Rostislav Pehlivanov	76b81b10d9	aacenc: implement the complete AAC-Main profile This commit finalizes AAC-Main profile encoding support by implementing all mandatory and optional tools available in the specifications and current decoders. The AAC-Main profile reqires that prediction support be present (although decoders don't require it to be enabled) for an encoder to be deemed capable of AAC-Main encoding, as well as TNS, PNS and IS, all of which were implemented with previous commits or earlier of this year. Users are encouraged to test the new functionality using either -profile:a aac_main or -aac_pred 1, the former of which will enable the prediction option by default and the latter will change the profile to AAC-Main. No other options shall be changed by enabling either, it's currently up to the users to decide what's best. The current implementation works best using M/S and/or IS, so users are also welcome to enable both options and any other options (TNS, PNS) for maximum quality. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2015-08-21 19:38:05 +01:00
Rostislav Pehlivanov	a1c487e921	aacenc_tns: implement temporal noise shaping This commit implements temporal noise shaping support in the encoder, along with an -aac_tns option to toggle it on or off (off by default for now). TNS will increase audio quality and reduce quantization noise by applying a multitap FIR filter across allowed coefficients and transmit side information to the decoder so it could create an inverse filter. Users are encouraged to test the new functionality by enabling -aac_tns 1 during encoding. No major bugs are observable at this time so after a while if no new problems appear and if the current implementation is deemed of high enough quality and stability it will be enabled by default, possibly at the same time the encoder has its experimental flag removed and becomes the standard aac encoder in ffmpeg. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2015-08-21 19:27:38 +01:00
Rostislav Pehlivanov	43b378a0d3	aaccoder: move the quantization functions to a separate file This commit moves the quantizer to a separate header file. This allows the quantizer to be used from a separate files outside of aaccoder without having to put another function pointer and will result in a slight speedup as the compiler can do more optimizations. This is required for commits following. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2015-08-21 18:53:14 +01:00
Claudio Freire	59216e0525	AAC Encoder: clipping avoidance Avoid clipping due to quantization noise to produce audible artifacts, by detecting near-clipping signals and both attenuating them a little and encoding escape-encoded bands (usually the loudest) rounding towards zero instead of nearest, which tends to decrease overall energy and thus clipping. Currently fate tests measure numerical error so this change makes tests using asynth (which are near clipping) report higher error not less, because of window attenuation. Yet, they sound better, not worse (albeit subtle, other samples aren't subtle at all). Only measuring psychoacoustically weighted error would make for a representative test, so that will be left for a future patch. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2015-07-27 19:13:48 +02:00
Djordje Pesut	f85bc147fb	avcodec: Implementation of AAC_fixed_decoder (SBR-module) Add fixed poind code. Signed-off-by: Nedeljko Babic <nedeljko.babic@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2015-07-20 17:20:16 +02:00
Djordje Pesut	b04f46cb4b	libavcodec: Implementation of AAC_fixed_decoder (LC-module) [3/4] Add fixed point implementation Signed-off-by: Nedeljko Babic <nedeljko.babic@imgtec.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-07-09 14:41:31 +02:00
Jovan Zelincevic	08be74ac81	libavcodec: Implementation of AAC_fixed_decoder (LC-module) [2/4] Add fixed point implementation of functions for generating tables Signed-off-by: Nedeljko Babic <nedeljko.babic@imgtec.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-07-09 14:41:19 +02:00
Rostislav Pehlivanov	d71935f883	aac: add additional fields needed by the encoder for intensity stereo This commit adds additional fields which are used by the native encoder to add intensity stereo support. It also adds some clarifying statements to the comments for the codebooks. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-06-28 00:15:21 +02:00
Rostislav Pehlivanov	013498ba15	aacenc: Adjust the initial offset for PNS values This commit adjusts the intial offset for PNS values, introduced with commit `f7f71b5795` earlier. This commit shifts the value in such a way that no further offsets are required in the aaccoder.c file. Earlier version of the PNS patch had 2 offsets in both the aaccoder and aacenc. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-04-14 03:42:57 +02:00
Rostislav Pehlivanov	f7f71b5795	aacenc: Add support for Perceptual Noise Substitution energy values This commit implements support for writing the noise energy values used in PNS. The difference between regular scalefactors and noise energy values is that the latter require a small preamble (NOISE_PRE + energy_value_diff) to be written as the first noise-containing band. Any following noise energy values use the previous one to base their "diff" on. Ordinary scalefactors remain unchanged other than that they ignore the noise values. This commit should not change anything by itself, the following commits will bring it in use. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-04-13 04:14:27 +02:00
Claudio Freire	6394acaf36	AAC: Fix M/S stereo encoding This patch fixes a pointer arithmetic bug in adjust_frame_information that resulted in heavily corrupted audio when using M/S encoding. Also, a backup copy of untransformed coefficients has to be kept around or attempts at re-processing the frame (which happens when hevavily overspending bits during transients) will result in re-encoding of the coefficients and subsequent corruption of the resulting stream. A/B testing shows the bug as corrected, but still cannot prove that M/S coding is a win at least in numbers. Limited listening tests do show improvement on M/S encoded samples in lower bitrates, but they're hidden among the other artifacts that remain to be corrected in the encoder. Some of the regressions flagged in the report do show poor stereo image (but not buggy), so M/S encoding is clearly not good enough yet to be defaulted to auto. In numbers, Patched against Unpatched, stereo_mode auto: Files: 114 Bitrates: 6 Tests: 683 Serious Regressions: 0 (0%) Regressions: 0 (0%) Improvements: 227 (33%) Big improvements: 92 (13%) Worst regression - mybloodrusts.wv - 256k - StdDev: 28.61 pSNR: -0.43 maxdiff: 1372.00 Best improvement - 60.wv - 384k - StdDev: -369.57 pSNR: 45.02 maxdiff: -13322.00 Average - StdDev: -80.56 pSNR: 2.49 maxdiff: -8858.00 Patched against Unpatched stereo_mode ms_off shows no difference. Patched stereo_mode auto vs Unpatched stereo_mode ms_off shows a small average improvement, just not too significant: Serious Regressions: 0 (0%) Regressions: 10 (1%) Improvements: 45 (6%) Big improvements: 2 (0%) Worst regression - Illinois.wv - 256k - StdDev: 33.20 pSNR: -2.03 maxdiff: 477.00 Best improvement - song_of_circomstances.flac - 384k - StdDev: -3.97 pSNR: 7.61 maxdiff: -826.00 Average - StdDev: -10.25 pSNR: 0.20 maxdiff: -281.00 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-03-03 13:57:42 +01:00
Michael Niedermayer	e82b0e6126	Merge commit 'ee964145b5d229571e00bf6883a44189d02babe2' * commit 'ee964145b5d229571e00bf6883a44189d02babe2': lavc: remove unused traces of fmtconvert usage Conflicts: libavcodec/aac.h libavcodec/aacdec.c libavcodec/atrac3.c libavcodec/vorbisdec.c libavcodec/wma.c libavcodec/wma.h Merged-by: Michael Niedermayer <michaelni@gmx.at>	2015-02-28 23:41:36 +01:00
Anton Khirnov	ee964145b5	lavc: remove unused traces of fmtconvert usage Those decoders have been switched to float output and so do not use fmtconvert anymore.	2015-02-28 21:51:24 +01:00
Michael Niedermayer	ba4fba8f48	Merge commit 'd615187f74ddf3413778a8b5b7ae17255b0df88e' * commit 'd615187f74ddf3413778a8b5b7ae17255b0df88e': aacdec: Support for ER AAC ELD 480. Conflicts: libavcodec/aacdec.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2015-02-04 13:49:17 +01:00
Alex Converse	d615187f74	aacdec: Support for ER AAC ELD 480. Based in part on work from Niel van der Westhuizen <espes@pequalsnp.com>.	2015-02-03 20:32:16 -08:00
Michael Niedermayer	68b8e21b8b	avcodec/aacdec: Use avpriv_float_dsp_alloc() Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-12-02 19:32:45 +01:00
Michael Niedermayer	55d592f7d9	avcodec/aacdec: Skip processing channel elements which have not been present Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-11-09 11:41:13 +01:00
Benoit Fouet	e56425d1a7	avcodec/aacdec: warn user when remapping streams. Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-10-24 18:54:06 +02:00
Michael Niedermayer	fa915d4193	avcodec/aac: fix () in IS_CODEBOOK_UNSIGNED macro Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2014-05-25 03:45:10 +02:00
Michael Niedermayer	fdb4822559	Merge remote-tracking branch 'qatar/master' * qatar/master: aac: Add support for Enhanced AAC Low Delay (ER AAC ELD). Conflicts: Changelog libavcodec/aacdec.c libavcodec/version.h Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-10-23 10:02:43 +02:00
Alex Converse	b3be41ca82	aac: Add support for Enhanced AAC Low Delay (ER AAC ELD). This does not include support for LD SBR, epTool, data resilience, nor the 960 transform family.	2013-10-23 00:08:29 -07:00
Michael Niedermayer	32ea39f56d	Merge commit '1914e6f010b3320025c7b692aaea51d9b9a992a8' * commit '1914e6f010b3320025c7b692aaea51d9b9a992a8': aacdec: Add support for LD (Low Delay) AAC Conflicts: Changelog libavcodec/aacdec.c libavcodec/version.h Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-09-19 12:55:26 +02:00
Alex Converse	1914e6f010	aacdec: Add support for LD (Low Delay) AAC	2013-09-18 12:01:53 -07:00
Michael Niedermayer	46ad2d9e44	Merge remote-tracking branch 'qatar/master' * qatar/master: miscellaneous typo fixes Conflicts: configure libavformat/avisynth.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-07-26 11:12:11 +02:00
Diego Biurrun	03039f4c8c	miscellaneous typo fixes	2013-07-25 19:43:32 +02:00
Michael Niedermayer	eab49f4fb5	Revert "aacdec: Reconfigure output as needed, disable pop_output_configuration()" This reverts commit `60dbf2eff9`. This is not needed anymore, Ticket 1694 has been fixed differently	2013-03-08 14:21:40 +01:00
Michael Niedermayer	60dbf2eff9	aacdec: Reconfigure output as needed, disable pop_output_configuration() Fixes Ticket1694 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-03-07 19:50:31 +01:00
Michael Niedermayer	a984efd104	Merge commit 'c242bbd8b6939507a1a6fb64101b0553d92d303f' * commit 'c242bbd8b6939507a1a6fb64101b0553d92d303f': Remove unnecessary dsputil.h #includes Conflicts: libavcodec/ffv1.c libavcodec/h261dec.c libavcodec/h261enc.c libavcodec/h264pred.c libavcodec/lpc.h libavcodec/mjpegdec.c libavcodec/rectangle.h libavcodec/x86/idct_sse2_xvid.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-02-26 13:05:10 +01:00
Diego Biurrun	c242bbd8b6	Remove unnecessary dsputil.h #includes	2013-02-26 00:51:34 +01:00
Mirjana Vulin	8d2eb5fe58	mips: optimization for float aac decoder (sbr module) Signed-off-by: Mirjana Vulin <mvulin@mips.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-02-21 22:43:08 +01:00
Michael Niedermayer	4789955ec4	Merge commit 'e57daa876bf0cf50782550e366e589441cd8c2bd' * commit 'e57daa876bf0cf50782550e366e589441cd8c2bd': adpcm: decode directly to the user-provided AVFrame ac3: decode directly to the user-provided AVFrame aac: decode directly to the user-provided AVFrame 8svx: decode directly to the user-provided AVFrame Conflicts: libavcodec/8svx.c libavcodec/ac3dec.c libavcodec/adpcm.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-02-13 11:27:54 +01:00
Justin Ruggles	ffd2123095	aac: decode directly to the user-provided AVFrame	2013-02-12 12:21:21 -05:00
Mirjana Vulin	2b6a8187a6	mips: optimization for float aac decoder (core module) Signed-off-by: Mirjana Vulin <mvulin@mips.com> Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2013-01-31 01:23:09 +01:00
Michael Niedermayer	8102f27b5b	Merge commit '73b704ac609d83e0be124589f24efd9b94947cf9' * commit '73b704ac609d83e0be124589f24efd9b94947cf9': arm: Add some missing header #includes floatdsp: move scalarproduct_float from dsputil to avfloatdsp. Conflicts: libavcodec/acelp_pitch_delay.c libavcodec/amrnbdec.c libavcodec/amrwbdec.c libavcodec/ra288.c libavcodec/x86/dsputil_mmx.c libavutil/x86/float_dsp.asm Merged-by: Michael Niedermayer <michaelni@gmx.at>	2013-01-23 14:31:55 +01:00
Ronald S. Bultje	d56668bd80	floatdsp: move scalarproduct_float from dsputil to avfloatdsp. This makes the aac decoder and all voice codecs independent of dsputil.	2013-01-22 11:55:42 -08:00
Michael Niedermayer	b113d4a83c	aacdec: make dual mono mode selectable through AVOptions too. Based on patch by Akihiro Tsukada Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-12-30 05:29:17 +01:00
Michael Niedermayer	644f021ccf	aacdec: simplify dmono Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2012-12-30 05:29:17 +01:00
Michael Niedermayer	59b68ee887	Merge commit '3d3cf6745e2a5dc9c377244454c3186d75b177fa' * commit '3d3cf6745e2a5dc9c377244454c3186d75b177fa': aacdec: use float planar sample format for output Conflicts: libavcodec/aacdec.c libavcodec/aacsbr.c Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-11-26 15:15:02 +01:00
Justin Ruggles	3d3cf6745e	aacdec: use float planar sample format for output	2012-11-25 19:06:36 -05:00
Michael Niedermayer	81ff0c24ef	Merge commit '1cd432e167b1a80853760c89a33606e2b5f229c2' * commit '1cd432e167b1a80853760c89a33606e2b5f229c2': configure: fix libcdio check rtsp: Allow setting the reordering buffer size via an AVOption rtsp: Vertically align a constant definition rtp: Update the check for distinguishing between RTP and RTCP aac: fix build with hardcoded tables fate: dependencies for screen codec tests riff: Move functions around to be covered by appropriate #ifdefs Conflicts: configure tests/fate/screen.mak Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-10-19 13:58:14 +02:00
Mans Rullgard	7a12d97eb1	aac: fix build with hardcoded tables aac_tablegen.h includes aac.h for the POW_SF2_ZERO definition, but this also pulls in a raft of other headers, some of which are not safe to use in code built with the host compiler. Moving POW_SF2_ZERO to aac_tablegen_decl.h, where the declaration of the array it relates to already resides, fixes the problems. Signed-off-by: Mans Rullgard <mans@mansr.com>	2012-10-18 19:59:43 +01:00
Michael Niedermayer	55c49afc42	Merge remote-tracking branch 'qatar/master' * qatar/master: yuv4mpeg: return proper error codes. Give all anonymously typedeffed structs in headers a name fate: Add parseutils test parseutils-test: Drop random colors from parsing test vf_pad/scale: use double precision for aspect ratios. build: error on variable-length arrays ppc: swscale: rework yuv2planeX_altivec() ppc: fmtconvert: kill VLA in float_to_int16_interleave_altivec() x86: dsputil: kill VLA in gmc_mmx() libspeexenc: Updated commentary to reflect recent changes libspeexenc: Add an option for enabling DTX doc/APIchanges: fill in missing dates and hashes. lavr: bump major to 1 and declare it stable. lavr: change the type of the data buffers to uint8_t**. lavc: deprecate the audio resampling API. Conflicts: cmdutils.h configure doc/APIchanges ffplay.c libavcodec/dwt.h libavcodec/libspeexenc.c libavfilter/vf_pad.c libavfilter/vf_scale.c libavformat/asf.h tests/fate/libavutil.mak tests/ref/fate/parseutils Merged-by: Michael Niedermayer <michaelni@gmx.at>	2012-10-06 13:45:08 +02:00

1 2 3

120 Commits