FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-12-23 12:43:46 +02:00

Author	SHA1	Message	Date
Miroslav Slugeň	c4aca65a42	avcodec/nvenc: maximum usable surfaces are limited to maximum registered frames Maximum usable surfaces is limited to MAX_REGISTERED_FRAMES constant in nvenc.h Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2016-11-22 10:34:27 +01:00
Timo Rothenpieler	a66835bcb1	avcodec/nvenc: use dynamically loaded CUDA	2016-11-22 10:34:27 +01:00
Timo Rothenpieler	d9ad18f3b4	avcodec/cuvid: use dynamically loaded CUDA/CUVID And remove the now obsolete compat headers.	2016-11-22 10:34:27 +01:00
Mark Thompson	f242e0a0ff	vaapi_encode: Fix format specifier for bitrate logging Same as `e0df56f25d`. This was accidentally reintroduced while merging `c8241e730f`.	2016-11-21 22:59:58 +00:00
Jun Zhao	e72662e131	lavc/vaapi_encode_h264: fix poc incorrect issue after meeting idr frame. when meeting IDR frame, vaapi_encode_h264 poc number don't reset, now fix this issue based on h264 spec. Some decoder don't care this case, but this fix will enhance the encoder action. Before this fix, poc number is negative in some case. Reviewed-by: Jun Zhao <jun.zhao@intel.com> Signed-off-by: Wang, Yi A <yi.a.wang@intel.com> Signed-off-by: Mark Thompson <sw@jkqxz.net>	2016-11-21 22:37:02 +00:00
Mark Thompson	30ebabca7c	vaapi_h265: Fix buffering parameters A decoder may need this to be set correctly to output frames in the right order. (cherry picked from commit `b8cac1e830`)	2016-11-21 22:13:41 +00:00
Mark Thompson	ae0230cc3e	vaapi_h265: Fix slice header writing This was not observed earlier because the only syntax element which it normally misses with the current setup is slice_qp_delta, but that is always going to be zero (in IDR frames QP isn't varied on the slice) which will always exp-golomb code as a single 1 bit. The immediately following part is the byte alignment, which is always a 1 bit followed by 0s which are ignored, so as long as the bitstream is never aligned at that point we will never notice because the only difference is that an ignored bit is a 1 instead of a 0. (cherry picked from commit `fc30a90898`)	2016-11-21 22:13:41 +00:00
Mark Thompson	6796e6ea84	vaapi_h264: Write bitstream restriction fields (cherry picked from commit `ec17ab381e`)	2016-11-21 22:13:41 +00:00
Mark Thompson	658c5afaa0	vaapi_h264: Fix CFR mode with frame_rate set in AVCodecContext (cherry picked from commit `17a0f9481c`)	2016-11-21 22:13:41 +00:00
Mark Thompson	ded1859df1	vaapi_encode: Decide on GOP setup before initialising sequence parameters This was always too late; several fields related to it have been incorrectly zero since the encoder was added. (cherry picked from commit `314b421dd8`)	2016-11-21 22:13:41 +00:00
Mark Thompson	ee1d04f970	vaapi_h264: Set max_num_ref_frames to 1 when not using B frames (cherry picked from commit `956a54129d`)	2016-11-21 22:13:41 +00:00
Mark Thompson	94f446c628	vaapi_encode: Sync to input surface rather than output While outwardly bizarre, this change makes the behaviour consistent with other VAAPI encoders which sync to the encode /input/ picture in order to wait for /output/ from the encoder. It is not harmful on i965 (because synchronisation already happens in vaRenderPicture(), so it has no effect there), and it allows the encoder to work on mesa/gallium which assumes this behaviour. (cherry picked from commit `086e4b58b5`)	2016-11-21 22:13:41 +00:00
Mark Thompson	478a4b7e6d	vaapi_encode: Check packed header capabilities This improves behaviour with drivers which do not support packed headers, such as AMD VCE on mesa/gallium. (cherry picked from commit `892bbbcdc1`)	2016-11-21 22:13:41 +00:00
Mark Thompson	c8241e730f	vaapi_encode: Refactor initialisation This allows better checking of capabilities and will make it easier to add more functionality later. It also commonises some duplicated code around rate control setup and adds more comments explaining the internals. (cherry picked from commit `80a5d05108`)	2016-11-21 22:13:41 +00:00
Mark Thompson	06d73d002e	vaapi_h264: Fix HRD bit_rate/cpb_size scaling There should be an extra offset of 6 on bit_rate_scale and of 4 on cpb_size_scale which were not accounted for here. (cherry picked from commit `3a9662af6c`)	2016-11-21 22:13:41 +00:00
Carl Eugen Hoyos	322568c079	lavc/ffv1: Support YUV4xxP12 and GRAY12.	2016-11-20 22:23:01 +01:00
James Almer	574929d8b6	avcodec/avpacket: fix leak on realloc in av_packet_add_side_data() If realloc fails, the pointer is overwritten and the previously allocated buffer is leaked, which goes against the expected behavior of keeping the packet unchanged in case of error. Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>	2016-11-19 20:23:25 -03:00
Andreas Cadhalpun	7289aa2d71	options_table: limit codec parameters to sane values Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-18 22:40:42 +01:00
James Almer	2de1c79b61	x86/vp9itxfm: add missing AVX2 guards Fixes compilation with Yasm 1.1.0 and older. Signed-off-by: James Almer <jamrial@gmail.com>	2016-11-18 17:01:11 -03:00
Michael Niedermayer	d1d18de6ad	avcodec/ffv1dec: Set packed_at_lsb for 16bit YUV This avoids unneeded computations Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-18 18:04:28 +01:00
Michael Niedermayer	d7a3bb2088	avcodec/ffv1dec: Support gray 10/12/16 explicitly avoid shifts Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-18 18:04:28 +01:00
James Almer	16c429166d	Revert "apngdec: use side data to pass extradata to the decoder" This reverts commit `e0c6b32046`. Said commit changed the behavior of the demuxer and decoder in a non backwards compatible way. Demuxers should make extradata available at init if possible, and send new extradata as side data within a packet if needed. A better fix for the remuxing crash will follow. Signed-off-by: James Almer <jamrial@gmail.com>	2016-11-18 12:24:28 -03:00
Hendrik Leppkes	07502e473f	Merge commit '7a76371437f9562c3414f985523f883489e3936a' * commit '7a76371437f9562c3414f985523f883489e3936a': libopenh264enc: Simplify init by setting FF_CODEC_CAP_INIT_CLEANUP Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-18 10:47:08 +01:00
Hendrik Leppkes	7e9474ca47	Merge commit '2d097c16b833c532ac974a7f1fd05c0a1f3b7675' * commit '2d097c16b833c532ac974a7f1fd05c0a1f3b7675': libopenh264enc: Return a more sensible error code in some init failure paths Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-18 10:46:02 +01:00
Hendrik Leppkes	0bd76401d1	Merge commit '36b380dcd52ef47d7ba0559ed51192c88d82a9bd' * commit '36b380dcd52ef47d7ba0559ed51192c88d82a9bd': libopenh264dec: Simplify the init thanks to FF_CODEC_CAP_INIT_CLEANUP being set Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-18 10:45:08 +01:00
Michael Niedermayer	ae514b1254	avcodec/ass_split: Change order of operations in ass_split_section() This matches the other branch Fixes out of array read Fixes: 4d142ca76d39fe685effcf5017098723/asan_heap-oob_31ae824_8611_348fdb64f9009b63c8a8eae9a0e497c5.mkv Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-17 18:05:18 +01:00
Hendrik Leppkes	2f1a539d4b	Merge commit '61bd0ed781b56eea1e8e851aab34a2ee3b59fbac' * commit '61bd0ed781b56eea1e8e851aab34a2ee3b59fbac': h264: Log more information about invalid NALu size Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:24:25 +01:00
Hendrik Leppkes	cca4fd4778	Merge commit 'a8cbe5a0ccebf60a8a8b0aba5d5716dd54c1595c' * commit 'a8cbe5a0ccebf60a8a8b0aba5d5716dd54c1595c': h264_ps: export actual height in MBs as SPS.mb_height Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:17:21 +01:00
Hendrik Leppkes	e999a4ed6c	Merge commit '2866d108c9e9da7baf53ff57a51d470691049a57' * commit '2866d108c9e9da7baf53ff57a51d470691049a57': vp8dsp: Remove the comment saying that the height is equal to the width Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:06:28 +01:00
Hendrik Leppkes	2818aaaba0	Merge commit '5f74bd31a9bd1ac7655103b11743c12d38e0419f' * commit '5f74bd31a9bd1ac7655103b11743c12d38e0419f': vp8/armv6: mc: avoid boolean expression in calculation Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-17 15:05:07 +01:00
Carl Eugen Hoyos	55a424c5a8	lavc/ffv1dec: Scale output for msb-packed compression to full 16bit. 2% slowdown for existing decode-line timer.	2016-11-17 13:00:47 +01:00
Carl Eugen Hoyos	f8247c0cce	lavc/ffv1enc: Support pix_fmt GRAY10.	2016-11-17 12:47:39 +01:00
Michael Niedermayer	2c9106257f	avcodec/mpeg4videodec: Workaround interlaced mpeg4 edge MC bug Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-17 12:21:48 +01:00
Michael Niedermayer	85407c7e63	avcodec/mpegvideo: Fix edge emu buffer overlap with interlaced mpeg4 Fixes Ticket5936 Regression since `c5fc8ae126` Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-17 12:21:48 +01:00
Martin Vignali	52da3f6f70	libavcodec/exr : fix channel size calculation for uint32 channel uint32 need 4 bytes not 1. Fix decoding when there is half/float and uint32 channel. This fixes crashes due to pointer corruption caused by invalid writes. The problem was introduced in commit `03152e74df`. Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 23:45:44 +01:00
Andreas Cadhalpun	ce3147eb19	exr: reindent after previous commit Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 22:37:24 +01:00
Andreas Cadhalpun	ffdc5d09e4	exr: fix out-of-bounds read channel_index can be -1. This problem was introduced in commit `2dd7b46132`. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 22:37:17 +01:00
Andreas Cadhalpun	3c0328d58d	libschroedingerdec: fix leaking of framewithpts Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 19:31:11 +01:00
Andreas Cadhalpun	a86ebbf7f6	libschroedingerdec: don't produce empty frames They are not valid and can cause problems/crashes for API users. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 19:30:49 +01:00
Andreas Cadhalpun	90ebf3c428	dds: limit 4 bpp handling to AV_PIX_FMT_PAL8 This fixes NULL pointer dereferencing for formats, where frame->data[1] is not allocated. The problem was introduced in commit `257fbc3af4`. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-16 19:29:45 +01:00
Thierry Foucu	c512546689	Fix -Werror=parentheses error Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-16 02:39:57 +01:00
Michael Niedermayer	1546d487cf	avcodec/rv40: Test remaining space in loop of get_dimension() Fixes infinite loop Fixes: 178/fuzz-3-ffmpeg_VIDEO_AV_CODEC_ID_RV40_fuzzer Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-15 23:08:43 +01:00
Andreas Cadhalpun	1abcd972c4	mlz: limit next_code to data buffer size This fixes a heap-buffer-overflow detected by AddressSanitizer. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-15 22:01:08 +01:00
Martin Storsjö	f1212e472b	aarch64: vp9: Implement NEON loop filters This work is sponsored by, and copyright, Google. These are ported from the ARM version; thanks to the larger amount of registers available, we can do the loop filters with 16 pixels at a time. The implementation is fully templated, with a single macro which can generate versions for both 8 and 16 pixels wide, for both 4, 8 and 16 pixels loop filters (and the 4/8 mixed versions as well). For the 8 pixel wide versions, it is pretty close in speed (the v_4_8 and v_8_8 filters are the best examples of this; the h_4_8 and h_8_8 filters seem to get some gain in the load/transpose/store part). For the 16 pixels wide ones, we get a speedup of around 1.2-1.4x compared to the 32 bit version. Examples of runtimes vs the 32 bit version, on a Cortex A53: ARM AArch64 vp9_loop_filter_h_4_8_neon: 144.0 127.2 vp9_loop_filter_h_8_8_neon: 207.0 182.5 vp9_loop_filter_h_16_8_neon: 415.0 328.7 vp9_loop_filter_h_16_16_neon: 672.0 558.6 vp9_loop_filter_mix2_h_44_16_neon: 302.0 203.5 vp9_loop_filter_mix2_h_48_16_neon: 365.0 305.2 vp9_loop_filter_mix2_h_84_16_neon: 365.0 305.2 vp9_loop_filter_mix2_h_88_16_neon: 376.0 305.2 vp9_loop_filter_mix2_v_44_16_neon: 193.2 128.2 vp9_loop_filter_mix2_v_48_16_neon: 246.7 218.4 vp9_loop_filter_mix2_v_84_16_neon: 248.0 218.5 vp9_loop_filter_mix2_v_88_16_neon: 302.0 218.2 vp9_loop_filter_v_4_8_neon: 89.0 88.7 vp9_loop_filter_v_8_8_neon: 141.0 137.7 vp9_loop_filter_v_16_8_neon: 295.0 272.7 vp9_loop_filter_v_16_16_neon: 546.0 453.7 The speedup vs C code in checkasm tests is around 2-7x, which is pretty much the same as for the 32 bit version. Even if these functions are faster than their 32 bit equivalent, the C version that we compare to also became around 1.3-1.7x faster than the C version in 32 bit. Based on START_TIMER/STOP_TIMER wrapping around a few individual functions, the speedup vs C code is around 4-5x. Examples of runtimes vs C on a Cortex A57 (for a slightly older version of the patch): A57 gcc-5.3 neon loop_filter_h_4_8_neon: 256.6 93.4 loop_filter_h_8_8_neon: 307.3 139.1 loop_filter_h_16_8_neon: 340.1 254.1 loop_filter_h_16_16_neon: 827.0 407.9 loop_filter_mix2_h_44_16_neon: 524.5 155.4 loop_filter_mix2_h_48_16_neon: 644.5 173.3 loop_filter_mix2_h_84_16_neon: 630.5 222.0 loop_filter_mix2_h_88_16_neon: 697.3 222.0 loop_filter_mix2_v_44_16_neon: 598.5 100.6 loop_filter_mix2_v_48_16_neon: 651.5 127.0 loop_filter_mix2_v_84_16_neon: 591.5 167.1 loop_filter_mix2_v_88_16_neon: 855.1 166.7 loop_filter_v_4_8_neon: 271.7 65.3 loop_filter_v_8_8_neon: 312.5 106.9 loop_filter_v_16_8_neon: 473.3 206.5 loop_filter_v_16_16_neon: 976.1 327.8 The speed-up compared to the C functions is 2.5 to 6 and the cortex-a57 is again 30-50% faster than the cortex-a53. This is an adapted cherry-pick from libav commits `9d2afd1eb8` and `31756abe29`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	f43079e11c	aarch64: vp9: Add NEON itxfm routines This work is sponsored by, and copyright, Google. These are ported from the ARM version; thanks to the larger amount of registers available, we can do the 16x16 and 32x32 transforms in slices 8 pixels wide instead of 4. This gives a speedup of around 1.4x compared to the 32 bit version. The fact that aarch64 doesn't have the same d/q register aliasing makes some of the macros quite a bit simpler as well. Examples of runtimes vs the 32 bit version, on a Cortex A53: ARM AArch64 vp9_inv_adst_adst_4x4_add_neon: 90.0 87.7 vp9_inv_adst_adst_8x8_add_neon: 400.0 354.7 vp9_inv_adst_adst_16x16_add_neon: 2526.5 1827.2 vp9_inv_dct_dct_4x4_add_neon: 74.0 72.7 vp9_inv_dct_dct_8x8_add_neon: 271.0 256.7 vp9_inv_dct_dct_16x16_add_neon: 1960.7 1372.7 vp9_inv_dct_dct_32x32_add_neon: 11988.9 8088.3 vp9_inv_wht_wht_4x4_add_neon: 63.0 57.7 The speedup vs C code (2-4x) is smaller than in the 32 bit case, mostly because the C code ends up significantly faster (around 1.6x faster, with GCC 5.4) when built for aarch64. Examples of runtimes vs C on a Cortex A57 (for a slightly older version of the patch): A57 gcc-5.3 neon vp9_inv_adst_adst_4x4_add_neon: 152.2 60.0 vp9_inv_adst_adst_8x8_add_neon: 948.2 288.0 vp9_inv_adst_adst_16x16_add_neon: 4830.4 1380.5 vp9_inv_dct_dct_4x4_add_neon: 153.0 58.6 vp9_inv_dct_dct_8x8_add_neon: 789.2 180.2 vp9_inv_dct_dct_16x16_add_neon: 3639.6 917.1 vp9_inv_dct_dct_32x32_add_neon: 20462.1 4985.0 vp9_inv_wht_wht_4x4_add_neon: 91.0 49.8 The asm is around factor 3-4 faster than C on the cortex-a57 and the asm is around 30-50% faster on the a57 compared to the a53. This is an adapted cherry-pick from libav commit `3c9546dfaf`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	1f7801c2bc	aarch64: vp9: Add NEON optimizations of VP9 MC functions This work is sponsored by, and copyright, Google. These are ported from the ARM version; it is essentially a 1:1 port with no extra added features, but with some hand tuning (especially for the plain copy/avg functions). The ARM version isn't very register starved to begin with, so there's not much to be gained from having more spare registers here - we only avoid having to clobber callee-saved registers. Examples of runtimes vs the 32 bit version, on a Cortex A53: ARM AArch64 vp9_avg4_neon: 27.2 23.7 vp9_avg8_neon: 56.5 54.7 vp9_avg16_neon: 169.9 167.4 vp9_avg32_neon: 585.8 585.2 vp9_avg64_neon: 2460.3 2294.7 vp9_avg_8tap_smooth_4h_neon: 132.7 125.2 vp9_avg_8tap_smooth_4hv_neon: 478.8 442.0 vp9_avg_8tap_smooth_4v_neon: 126.0 93.7 vp9_avg_8tap_smooth_8h_neon: 241.7 234.2 vp9_avg_8tap_smooth_8hv_neon: 690.9 646.5 vp9_avg_8tap_smooth_8v_neon: 245.0 205.5 vp9_avg_8tap_smooth_64h_neon: 11273.2 11280.1 vp9_avg_8tap_smooth_64hv_neon: 22980.6 22184.1 vp9_avg_8tap_smooth_64v_neon: 11549.7 10781.1 vp9_put4_neon: 18.0 17.2 vp9_put8_neon: 40.2 37.7 vp9_put16_neon: 97.4 99.5 vp9_put32_neon/armv8: 346.0 307.4 vp9_put64_neon/armv8: 1319.0 1107.5 vp9_put_8tap_smooth_4h_neon: 126.7 118.2 vp9_put_8tap_smooth_4hv_neon: 465.7 434.0 vp9_put_8tap_smooth_4v_neon: 113.0 86.5 vp9_put_8tap_smooth_8h_neon: 229.7 221.6 vp9_put_8tap_smooth_8hv_neon: 658.9 621.3 vp9_put_8tap_smooth_8v_neon: 215.0 187.5 vp9_put_8tap_smooth_64h_neon: 10636.7 10627.8 vp9_put_8tap_smooth_64hv_neon: 21076.8 21026.9 vp9_put_8tap_smooth_64v_neon: 9635.0 9632.4 These are generally about as fast as the corresponding ARM routines on the same CPU (at least on the A53), in most cases marginally faster. The speedup vs C code is pretty much the same as for the 32 bit case; on the A53 it's around 6-13x for ther larger 8tap filters. The exact speedup varies a little, since the C versions generally don't end up exactly as slow/fast as on 32 bit. This is an adapted cherry-pick from libav commit `383d96aa22`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	6bec60a683	arm: vp9: Add NEON loop filters This work is sponsored by, and copyright, Google. The implementation tries to have smart handling of cases where no pixels need the full filtering for the 8/16 width filters, skipping both calculation and writeback of the unmodified pixels in those cases. The actual effect of this is hard to test with checkasm though, since it tests the full filtering, and the benefit depends on how many filtered blocks use the shortcut. Examples of relative speedup compared to the C version, from checkasm: Cortex A7 A8 A9 A53 vp9_loop_filter_h_4_8_neon: 2.72 2.68 1.78 3.15 vp9_loop_filter_h_8_8_neon: 2.36 2.38 1.70 2.91 vp9_loop_filter_h_16_8_neon: 1.80 1.89 1.45 2.01 vp9_loop_filter_h_16_16_neon: 2.81 2.78 2.18 3.16 vp9_loop_filter_mix2_h_44_16_neon: 2.65 2.67 1.93 3.05 vp9_loop_filter_mix2_h_48_16_neon: 2.46 2.38 1.81 2.85 vp9_loop_filter_mix2_h_84_16_neon: 2.50 2.41 1.73 2.85 vp9_loop_filter_mix2_h_88_16_neon: 2.77 2.66 1.96 3.23 vp9_loop_filter_mix2_v_44_16_neon: 4.28 4.46 3.22 5.70 vp9_loop_filter_mix2_v_48_16_neon: 3.92 4.00 3.03 5.19 vp9_loop_filter_mix2_v_84_16_neon: 3.97 4.31 2.98 5.33 vp9_loop_filter_mix2_v_88_16_neon: 3.91 4.19 3.06 5.18 vp9_loop_filter_v_4_8_neon: 4.53 4.47 3.31 6.05 vp9_loop_filter_v_8_8_neon: 3.58 3.99 2.92 5.17 vp9_loop_filter_v_16_8_neon: 3.40 3.50 2.81 4.68 vp9_loop_filter_v_16_16_neon: 4.66 4.41 3.74 6.02 The speedup vs C code is around 2-6x. The numbers are quite inconclusive though, since the checkasm test runs multiple filterings on top of each other, so later rounds might end up with different codepaths (different decisions on which filter to apply, based on input pixel differences). Disabling the early-exit in the asm doesn't give a fair comparison either though, since the C code only does the necessary calcuations for each row. Based on START_TIMER/STOP_TIMER wrapping around a few individual functions, the speedup vs C code is around 4-9x. This is pretty similar in runtime to the corresponding routines in libvpx. (This is comparing vpx_lpf_vertical_16_neon, vpx_lpf_horizontal_edge_8_neon and vpx_lpf_horizontal_edge_16_neon to vp9_loop_filter_h_16_8_neon, vp9_loop_filter_v_16_8_neon and vp9_loop_filter_v_16_16_neon - note that the naming of horizonal and vertical is flipped between the libraries.) In order to have stable, comparable numbers, the early exits in both asm versions were disabled, forcing the full filtering codepath. Cortex A7 A8 A9 A53 vp9_loop_filter_h_16_8_neon: 597.2 472.0 482.4 415.0 libvpx vpx_lpf_vertical_16_neon: 626.0 464.5 470.7 445.0 vp9_loop_filter_v_16_8_neon: 500.2 422.5 429.7 295.0 libvpx vpx_lpf_horizontal_edge_8_neon: 586.5 414.5 415.6 383.2 vp9_loop_filter_v_16_16_neon: 905.0 784.7 791.5 546.0 libvpx vpx_lpf_horizontal_edge_16_neon: 1060.2 751.7 743.5 685.2 Our version is consistently faster on on A7 and A53, marginally slower on A8, and sometimes faster, sometimes slower on A9 (marginally slower in all three tests in this particular test run). This is an adapted cherry-pick from libav commit `dd299a2d6d`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	b4dc7c341e	arm: vp9: Add NEON itxfm routines This work is sponsored by, and copyright, Google. For the transforms up to 8x8, we can fit all the data (including temporaries) in registers and just do a straightforward transform of all the data. For 16x16, we do a transform of 4x16 pixels in 4 slices, using a temporary buffer. For 32x32, we transform 4x32 pixels at a time, in two steps of 4x16 pixels each. Examples of relative speedup compared to the C version, from checkasm: Cortex A7 A8 A9 A53 vp9_inv_adst_adst_4x4_add_neon: 3.39 5.83 4.17 4.01 vp9_inv_adst_adst_8x8_add_neon: 3.79 4.86 4.23 3.98 vp9_inv_adst_adst_16x16_add_neon: 3.33 4.36 4.11 4.16 vp9_inv_dct_dct_4x4_add_neon: 4.06 6.16 4.59 4.46 vp9_inv_dct_dct_8x8_add_neon: 4.61 6.01 4.98 4.86 vp9_inv_dct_dct_16x16_add_neon: 3.35 3.44 3.36 3.79 vp9_inv_dct_dct_32x32_add_neon: 3.89 3.50 3.79 4.42 vp9_inv_wht_wht_4x4_add_neon: 3.22 5.13 3.53 3.77 Thus, the speedup vs C code is around 3-6x. This is mostly marginally faster than the corresponding routines in libvpx on most cores, tested with their 32x32 idct (compared to vpx_idct32x32_1024_add_neon). These numbers are slightly in libvpx's favour since their version doesn't clear the input buffer like ours do (although the effect of that on the total runtime probably is negligible.) Cortex A7 A8 A9 A53 vp9_inv_dct_dct_32x32_add_neon: 18436.8 16874.1 14235.1 11988.9 libvpx vpx_idct32x32_1024_add_neon 20789.0 13344.3 15049.9 13030.5 Only on the Cortex A8, the libvpx function is faster. On the other cores, ours is slightly faster even though ours has got source block clearing integrated. This is an adapted cherry-pick from libav commits `a67ae67083` and `52d196fb30`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	68caef9d48	arm: vp9: Add NEON optimizations of VP9 MC functions This work is sponsored by, and copyright, Google. The filter coefficients are signed values, where the product of the multiplication with one individual filter coefficient doesn't overflow a 16 bit signed value (the largest filter coefficient is 127). But when the products are accumulated, the resulting sum can overflow the 16 bit signed range. Instead of accumulating in 32 bit, we accumulate the largest product (either index 3 or 4) last with a saturated addition. (The VP8 MC asm does something similar, but slightly simpler, by accumulating each half of the filter separately. In the VP9 MC filters, each half of the filter can also overflow though, so the largest component has to be handled individually.) Examples of relative speedup compared to the C version, from checkasm: Cortex A7 A8 A9 A53 vp9_avg4_neon: 1.71 1.15 1.42 1.49 vp9_avg8_neon: 2.51 3.63 3.14 2.58 vp9_avg16_neon: 2.95 6.76 3.01 2.84 vp9_avg32_neon: 3.29 6.64 2.85 3.00 vp9_avg64_neon: 3.47 6.67 3.14 2.80 vp9_avg_8tap_smooth_4h_neon: 3.22 4.73 2.76 4.67 vp9_avg_8tap_smooth_4hv_neon: 3.67 4.76 3.28 4.71 vp9_avg_8tap_smooth_4v_neon: 5.52 7.60 4.60 6.31 vp9_avg_8tap_smooth_8h_neon: 6.22 9.04 5.12 9.32 vp9_avg_8tap_smooth_8hv_neon: 6.38 8.21 5.72 8.17 vp9_avg_8tap_smooth_8v_neon: 9.22 12.66 8.15 11.10 vp9_avg_8tap_smooth_64h_neon: 7.02 10.23 5.54 11.58 vp9_avg_8tap_smooth_64hv_neon: 6.76 9.46 5.93 9.40 vp9_avg_8tap_smooth_64v_neon: 10.76 14.13 9.46 13.37 vp9_put4_neon: 1.11 1.47 1.00 1.21 vp9_put8_neon: 1.23 2.17 1.94 1.48 vp9_put16_neon: 1.63 4.02 1.73 1.97 vp9_put32_neon: 1.56 4.92 2.00 1.96 vp9_put64_neon: 2.10 5.28 2.03 2.35 vp9_put_8tap_smooth_4h_neon: 3.11 4.35 2.63 4.35 vp9_put_8tap_smooth_4hv_neon: 3.67 4.69 3.25 4.71 vp9_put_8tap_smooth_4v_neon: 5.45 7.27 4.49 6.52 vp9_put_8tap_smooth_8h_neon: 5.97 8.18 4.81 8.56 vp9_put_8tap_smooth_8hv_neon: 6.39 7.90 5.64 8.15 vp9_put_8tap_smooth_8v_neon: 9.03 11.84 8.07 11.51 vp9_put_8tap_smooth_64h_neon: 6.78 9.48 4.88 10.89 vp9_put_8tap_smooth_64hv_neon: 6.99 8.87 5.94 9.56 vp9_put_8tap_smooth_64v_neon: 10.69 13.30 9.43 14.34 For the larger 8tap filters, the speedup vs C code is around 5-14x. This is significantly faster than libvpx's implementation of the same functions, at least when comparing the put_8tap_smooth_64 functions (compared to vpx_convolve8_horiz_neon and vpx_convolve8_vert_neon from libvpx). Absolute runtimes from checkasm: Cortex A7 A8 A9 A53 vp9_put_8tap_smooth_64h_neon: 20150.3 14489.4 19733.6 10863.7 libvpx vpx_convolve8_horiz_neon: 52623.3 19736.4 21907.7 25027.7 vp9_put_8tap_smooth_64v_neon: 14455.0 12303.9 13746.4 9628.9 libvpx vpx_convolve8_vert_neon: 42090.0 17706.2 17659.9 16941.2 Thus, on the A9, the horizontal filter is only marginally faster than libvpx, while our version is significantly faster on the other cores, and the vertical filter is significantly faster on all cores. The difference is especially large on the A7. The libvpx implementation does the accumulation in 32 bit, which probably explains most of the differences. This is an adapted cherry-pick from libav commits `ffbd1d2b00`, `392caa65df`, `557c1675cf` and `11623217e3`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Martin Storsjö	6409e9b6cc	vp9dsp: Deduplicate the subpel filters Make them aligned, to allow efficient access to them from simd. This is an adapted cherry-pick from libav commit `a4cfcddcb0`. Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>	2016-11-15 15:10:03 -05:00
Michael Niedermayer	2baf36caed	avcodec/ituh263dec: Avoid spending a long time in slice sync Fixes: 177/fuzz-3-ffmpeg_VIDEO_AV_CODEC_ID_FLV1_fuzzer Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-15 18:27:31 +01:00
Ronald S. Bultje	83a139e3d8	vp9: add avx2 iadst16 implementations. Also a small cosmetic change to the avx2 idct16 version to make it explicit that one of the arguments to the write-out macros is unused for >=avx2 (it uses pmovzxbw instead of punpcklbw).	2016-11-15 11:01:36 -05:00
Michael Niedermayer	0eb3198005	avcodec/movtextdec: Add error message for tsmb_size check Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-15 15:08:20 +01:00
Michael Niedermayer	a609905723	avcodec/movtextdec: Fix tsmb_size check==0 check Fixes: 173/fuzz-3-ffmpeg_SUBTITLE_AV_CODEC_ID_MOV_TEXT_fuzzer Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-15 15:08:20 +01:00
Michael Niedermayer	6ea2715768	avcodec/movtextdec: Fix potential integer overflow Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-15 15:08:20 +01:00
Hendrik Leppkes	51f5542c77	Merge commit 'e8b96a77010dd62624c3c65c357d7ae3b397ceaa' * commit 'e8b96a77010dd62624c3c65c357d7ae3b397ceaa': arm: Fix a typo in a comment Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:21:49 +01:00
Hendrik Leppkes	5a447edd47	Merge commit 'dc08bbf63a217c839aa4c143f2a1d0b7e2e6d997' * commit 'dc08bbf63a217c839aa4c143f2a1d0b7e2e6d997': vp8dsp: Clarify the first dimension of the mc function tables Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:21:24 +01:00
Hendrik Leppkes	68b0d7e0be	Merge commit '924e2ecd2b7d51cca60c79351ef16b04dd4245c3' * commit '924e2ecd2b7d51cca60c79351ef16b04dd4245c3': qsvdec: when a frames ctx is supplied, use its frame dimensions Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:20:09 +01:00
Hendrik Leppkes	3c81fa9a9c	Merge commit '92736c74fb1633e36f7134a880422a9b7db14d3f' * commit '92736c74fb1633e36f7134a880422a9b7db14d3f': qsvdec: add support for P010 (10-bit 420) decoding Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:20:00 +01:00
Hendrik Leppkes	220e773915	Merge commit 'ce320cf1c4daab3e2e3726ed7d2e879d10f7b991' * commit 'ce320cf1c4daab3e2e3726ed7d2e879d10f7b991': qsvdec: use the same mfxFrameInfo for allocating frames that was passed to DECODE_Init Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:19:51 +01:00
Hendrik Leppkes	1bc6cdf2fc	Merge commit '536bb17e9659c5ed7576a218d4085cdd6d5742fa' * commit '536bb17e9659c5ed7576a218d4085cdd6d5742fa': qsvdec: make ff_qsv_map_pixfmt() return a MFX fourcc as well Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 15:19:43 +01:00
Hendrik Leppkes	985bc8b496	Merge commit '6c445990e64124ad64c79423dfd3764520648c89' * commit '6c445990e64124ad64c79423dfd3764520648c89': tiffenc: Check zlib support for deflate option during initialization Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 12:32:08 +01:00
Hendrik Leppkes	bebab21176	Merge commit '9f732e4c996243c1e57c2bbbec6c8b94c37a7a22' * commit '9f732e4c996243c1e57c2bbbec6c8b94c37a7a22': tiffenc: Check av_pix_fmt_desc_get() return value Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 12:30:35 +01:00
Hendrik Leppkes	bbd0ebfd83	Merge commit 'd8f3b0fb584677d4882e3a2d7c28f8b15c7319f5' * commit 'd8f3b0fb584677d4882e3a2d7c28f8b15c7319f5': targaenc: Move size check to initialization function Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 12:16:32 +01:00
Hendrik Leppkes	25004c7e6e	Merge commit 'eeb6849cedac099d41feb482da581f4059c63ca7' * commit 'eeb6849cedac099d41feb482da581f4059c63ca7': rle: K&R formatting cosmetics Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 12:03:00 +01:00
Hendrik Leppkes	444e65299b	Merge commit '326d9116936ab61d13ac4142b49c7337daf7c4c0' * commit '326d9116936ab61d13ac4142b49c7337daf7c4c0': build: Drop unnecessary libavcodec <-> libavformat object dependencies Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 12:01:17 +01:00
Hendrik Leppkes	a0bc6b51d4	Merge commit 'e72d6fa08a3c1876109149401753a8d2c736d418' * commit 'e72d6fa08a3c1876109149401753a8d2c736d418': build: Move MP2 muxer declaration away from MP3 muxer code Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 11:20:15 +01:00
Hendrik Leppkes	9b4cc0f35c	Merge commit 'fe27792fd779ac4cdd5e57be5f6f488483c307b2' * commit 'fe27792fd779ac4cdd5e57be5f6f488483c307b2': build: Move ff_mpeg12_frame_rate_tab to a separate file Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 10:42:36 +01:00
Carl Eugen Hoyos	0674d1938e	lavc/hevc_ps: Use correct pix_fmt for 10bit 4:0:0. Fixes the second sample from ticket #5544.	2016-11-14 10:36:25 +01:00
Hendrik Leppkes	575e8d11f1	Merge commit '8c929037ec75fbe9f367e0a31ee34839e92de481' * commit '8c929037ec75fbe9f367e0a31ee34839e92de481': build: Add a new component for H.264 parsing code Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-14 10:09:44 +01:00
Dmitry Kalinkin	dc23e359ef	lavc/audiotoolboxdec: fix OSX SDK detection __MAC_10_11 can be present in updated revision of an older SDK so it can't reliably detect availability of kAudioFormatEnhancedAC3 constant. Fixes: `b4daa2c40f` ('lavc/audiotoolboxdec: add eac3 decoder') Cc: Rodger Combs <rodger.combs@gmail.com> Signed-off-by: Dmitry Kalinkin <dmitry.kalinkin@gmail.com> Previous version reviewed by: Rodger Combs <rodger.combs@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-14 02:35:26 +01:00
Carl Eugen Hoyos	b1367f7e5e	lavc/dpx: Support GRAY12 colourspace.	2016-11-14 00:33:12 +01:00
Hendrik Leppkes	bd0db4a32d	Merge commit '7a745f014f528d1001394ae4d2f4ed1a20bf7fa2' * commit '7a745f014f528d1001394ae4d2f4ed1a20bf7fa2': options_table: Add aliases for color properties Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-13 22:29:04 +01:00
Mark Thompson	2dee500f4c	vaapi_encode: Respect driver quirks around buffer destruction No longer leaks memory when used with a driver with the "render does not destroy param buffers" quirk (i.e. Intel i965). (cherry picked from commit `221ffca631`) Fixes ticket #5871.	2016-11-13 20:39:48 +00:00
Hendrik Leppkes	2d7cf6f72b	Merge commit 'f172e22d6aed0bff36e975bafb0183b6779f9444' * commit 'f172e22d6aed0bff36e975bafb0183b6779f9444': pixdesc: Add aliases to SMPTE color properties Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-13 18:35:28 +01:00
Hendrik Leppkes	724a71dced	Merge commit '8a62d2c28fbacd1ae20c35887a1eecba2be14371' * commit '8a62d2c28fbacd1ae20c35887a1eecba2be14371': vaapi_encode: Maintain a pool of bitstream output buffers Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-13 17:38:40 +01:00
Hendrik Leppkes	db854c6c4a	Merge commit '4a081f224e12f4227ae966bcbdd5384f22121ecf' * commit '4a081f224e12f4227ae966bcbdd5384f22121ecf': libavcodec: fix constness in clobber test avcodec_open2() wrappers Merged-by: Hendrik Leppkes <h.leppkes@gmail.com>	2016-11-13 17:30:33 +01:00
Andreas Cadhalpun	7112b56a34	vp9_mc_template: limit assert to SCALED == 0 The handling of the other block sizes was limited to 'SCALED == 0' in commit `dc96c0f9fc`, so this assert should be disabled, too, as it can now be triggered. Reviewed-by: Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-13 12:38:15 +01:00
Michael Niedermayer	04bd1b38ee	avcodec/htmlsubtitles: Fix reading one byte beyond the array Fixes: fuzz-2-ffmpeg_SUBTITLE_AV_CODEC_ID_SUBRIP_fuzzer Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/targets/ffmpeg Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-12 03:23:03 +01:00
Andreas Cadhalpun	cdb5479c9d	pnmdec: make sure v is capped by maxval Otherwise put_bits can be called with a value that doesn't fit in the sample_len, causing an assertion failure. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-12 01:36:47 +01:00
Andreas Cadhalpun	484151df7c	pnm: limit maxval to UINT16_MAX From 'man ppm': The maximum color value (Maxval), again in ASCII decimal. Must be less than 65536. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-12 01:36:47 +01:00
Andreas Cadhalpun	360bc0d90a	smvjpegdec: make sure cur_frame is not negative This fixes a heap-buffer-overflow detected by AddressSanitizer. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-12 01:36:47 +01:00
Andreas Cadhalpun	c82b8ef0e4	dvbsubdec: fix division by zero in compute_default_clut This problem was introduced in commit `4b90dcb849`. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-10 21:01:59 +01:00
Andreas Cadhalpun	1e33035ee7	proresdec_lgpl: explicitly check coff[3] against slice_data_size The implicit checks via v_data_size and a_data_size don't work in the case '(hdr_size > 7) && !ctx->alpha_info'. This fixes segmentation faults due to invalid reads. This problem was introduced in commit `547c2f002a`. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-10 21:00:44 +01:00
Sasi Inguva	18108f3618	lavc/utils.c: Make sure skip_samples never goes negative. Signed-off-by: Sasi Inguva <isasi@google.com> Reviewed-by: Derek Buitenhuis <derek.buitenhuis@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-10 17:44:47 +01:00
Tom Butterworth	bd6fa80d56	avcodec/hap: add "compressor" option to Hap encoder to disable secondary compression The secondary compression in Hap is optional, this change exposes that option to the user as some use-cases favour higher bitrate files to reduce workload decoding. Adds "none" or "snappy" as options for "compressor". Selecting "none" disregards "chunks" option: chunking is only of benefit decompressing Snappy. Reviewed-by: Martin Vignali <martin.vignali@gmail.com> Signed-off-by: Tom Butterworth <bangnoise@gmail.com>	2016-11-10 14:27:38 +00:00
Carl Eugen Hoyos	08be65a075	lavc/hevc_ps: Fix an error message.	2016-11-10 08:22:26 +01:00
Carl Eugen Hoyos	edb8af6e92	lavc/hevc_ps: Use correct pix_fmt for 12bit 4:0:0. Fixes part of ticket #5544.	2016-11-10 08:11:12 +01:00
Michael Niedermayer	2bc66d9e43	nut: add gray12 support Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-10 01:18:43 +01:00
Andreas Cadhalpun	226d35c845	escape124: reject codebook size 0 It causes a cb_depth of 32, leading to assertion failures in get_bits. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-09 21:10:59 +01:00
Tom Butterworth	0a24587588	avcodec/hap: pass texture-compression destination as argument, not in context This allows a subsequent change to compress directly into the output packet when possible. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Tom Butterworth <bangnoise@gmail.com>	2016-11-08 17:05:27 +00:00
Rostislav Pehlivanov	317be31eaf	opus: move the entropy decoding functions to opus_rc.c The intention is to have both encoding and decoding functions in opus_rc.c. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2016-11-08 14:18:59 +00:00
Rostislav Pehlivanov	0660a09dd1	opus: move all tables to a separate file Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2016-11-08 14:18:59 +00:00
Rostislav Pehlivanov	0cf6853804	aacenc: quit when the audio queue reaches 0 rather than keeping track of empty frames The libopus encoder does the same thing and its better than keeping track of when the empty flush frames appear. Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com>	2016-11-08 00:50:51 +00:00
Andreas Cadhalpun	5249706e9d	mpegaudio_parser: don't return AVERROR_PATCHWELCOME The API does not allow returning AVERROR codes. It triggers an assert in av_parser_parse2. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-07 19:41:17 +01:00
Andreas Cadhalpun	0747754622	mpeg4audio: validate sample_rate A negative sample rate doesn't make sense and triggers assertions in av_rescale_rnd. Also check for errors from avpriv_mpeg4audio_get_config in ff_mp4_read_dec_config_descr. Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-07 00:51:49 +01:00
Andreas Cadhalpun	bb6a7b6f75	lzf: update pointer p after realloc This fixes heap-use-after-free detected by AddressSanitizer. Reviewed-by: Luca Barbato <lu_zero@gentoo.org> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-05 18:56:26 +01:00
Matt Oliver	6ead033bca	avcodec/nvenc.c: Use new safe dlopen code. Signed-off-by: Matt Oliver <protogonoi@gmail.com>	2016-11-05 18:09:03 +11:00
James Almer	51e329918d	avcodec/rawdec: check for side data before checking its size Fixes valgrind warnings about usage of uninitialized values. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>	2016-11-04 23:38:56 -03:00
Andreas Cadhalpun	db79dedb1a	diracdec: check return code of get_buffer_with_edge If it fails, buffers aren't allocated, causing NULL pointer dereferencing. Reviewed-by: Rostislav Pehlivanov <atomnuker@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-04 20:35:23 +01:00
Andreas Cadhalpun	24d20496d2	diracdec: clear slice_params_num_buf on allocation failure Otherwise it can be non-zero next time decode_lowdelay is called, causing slice_params_buf not to be allocated, leading to a NULL pointer dereference. The problem was introduced in commit `dcad4677d6`. Reviewed-by: Rostislav Pehlivanov <atomnuker@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-04 20:34:51 +01:00
Andreas Cadhalpun	8a4ea96448	diracdec: use correct buffer for slice_params_buf realloc This fixes a double-free detected by AddressSanitizer. The problem was introduced in commit `dcad4677d6`. Reviewed-by: Rostislav Pehlivanov <atomnuker@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-04 20:34:38 +01:00
Tom Butterworth	92280f86b4	avcodec/hap: consistent name for codec "Vidvox Hap", not "Vidvox Hap encoder" or "Vidvox Hap decoder". Fixes bad name in "ffmpeg -codecs", matches other codec naming. Signed-off-by: Paul B Mahol <onemda@gmail.com>	2016-11-04 11:19:47 -08:00
Anton Khirnov	fb240a6276	qsvenc: do not re-execute encoding on all positive status codes It should only be done for DEVICE_BUSY/IN_EXECUTION (cherry picked from commit `0956fd4606`) Fixes ticket #5924.	2016-11-04 18:56:01 +00:00
Derek Buitenhuis	8a8902f221	libx265: Add option to force IDR frames This is in the same the same vein as `c981b1145a`. Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-04 02:45:51 +01:00
Michael Niedermayer	cee1f4c069	avcodec/ac3dec: Check expacc this is somewhat a magic number, which can be understood from reading section "7.1.2 Exponent Strategy" of the ac3 specification, in short: Three exponents each represented as number 0-4 are grouped together and base-5 encoded, so the maximal correct value is 254 + 54 + 4 = 124. Reviewed-by: Andreas Cadhalpun <andreas.cadhalpun@googlemail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-03 22:05:46 +01:00
Vittorio Giovara	067910ed13	hevc: Move hevc_decode_extradata before frame decoding Avoids a forward-declaration in the following commit. Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-03 16:28:04 +01:00
Andreas Cadhalpun	3932ccc472	ppc: pixblockdsp: do unaligned block accesses correctly again This was broken by the following Libav commit: `4c387c7` ppc: dsputil: do unaligned block accesses correctly The following tests fail due to this: fate-checkasm fate-vsynth1-dnxhd-2k-hr-hq fate-vsynth1-dnxhd-edge1-hr fate-vsynth1-dnxhd-edge2-hr fate-vsynth1-dnxhd-edge3-hr fate-vsynth1-dnxhd-hr-sq-mov fate-vsynth1-dnxhd-hr-hq-mov fate-vsynth2-dnxhd-2k-hr-hq fate-vsynth2-dnxhd-edge1-hr fate-vsynth2-dnxhd-edge2-hr fate-vsynth2-dnxhd-edge3-hr fate-vsynth2-dnxhd-hr-sq-mov fate-vsynth2-dnxhd-hr-hq-mov fate-vsynth3-dnxhd-2k-hr-hq fate-vsynth3-dnxhd-edge1-hr fate-vsynth3-dnxhd-edge2-hr fate-vsynth3-dnxhd-edge3-hr fate-vsynth3-dnxhd-hr-sq-mov fate-vsynth3-dnxhd-hr-hq-mov Fixes trac ticket #5508. Reviewed-by: Carl Eugen Hoyos <ceffmpeg@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-03 01:23:36 +01:00
Philip Langdale	d0a9af851e	crystalhd: Update high level description We don't need to document the horrible hacks that we removed.	2016-11-02 13:47:57 -07:00
Philip Langdale	a07c07e7aa	crystalhd: Simplify output frame handling The old code had to retain a partial frame across two calls in the case of separate interlaced fields. Now, we know that we'll get both fields within the same receive_frame call, and so we don't need to manage the frame as private state any more.	2016-11-02 13:47:57 -07:00
Philip Langdale	3019b4f648	crystalhd: Loop for a frame internally where possible. It's not possible to return EAGAIN when we've passed input EOF and are in draining mode. If do return EAGAIN, we're saying there's no way to get any more output - which isn't true in many cases. So let's handled these cases in an internal loop as best we can.	2016-11-02 13:47:57 -07:00
Philip Langdale	0eb836942f	crystalhd: Keep NOPTS_VALUE so we know it's not there.	2016-11-02 13:47:57 -07:00
Philip Langdale	13dbf77b81	crystalhd: Remove h.264 parser Now that we don't need to do ridiculous things to work out if a frame is interlaced or not, we don't need an extra h.264 parser.	2016-11-02 13:47:57 -07:00
Philip Langdale	89ba55dc0d	crystalhd: We don't need the track the last picture number anymore This was needed to detect an interlaced failure case that doesn't happen with the new decode api.	2016-11-02 13:47:57 -07:00
Philip Langdale	badce88fdf	crystalhd: Remove trust_interlaced heuristic It seems that without all the other 1:1 heuristics, we don't have a fundamental problem trusting the interlaced flag on output pictures. That's a relief.	2016-11-02 13:47:57 -07:00
Philip Langdale	6cc390dd5a	crystalhd: Revert back to letting hardware handle packed b-frames I'm not sure why, but the mpeg4_unpack_bframes bsf is not interacting well with seeking. Looking at the code, it should be ok, with possibly one warning shown, but I see it getting stuck for an extended period of time after a seek where a packed frame is cached to be shown later. So, I gave up on that and went back to making the old hardware based path work. Turns out that it wasn't broken except that some samples have a 6 byte drop packet which I wasn't accounting for. Now it works again and seeks are good.	2016-11-02 13:47:57 -07:00
Philip Langdale	b5d714f493	crystalhd: Switch to new decode API and remove the insanity The new decode API allows for m:n decode patterns, which is what you need to use this hardware in a sane way. There are so many situations where 1:1 doesn't happen naturally that it's a miracle I got it working as well as I did. With this change, we can throw all of the crazy heuristics and sleeps(!) out, and things work correctly.	2016-11-02 13:47:30 -07:00
Philip Langdale	234d3cbf46	crystalhd: Fix up the missing first sample Why on earth the hardware returns garbage for the first sample of a decoded picture is anyone's guess. The simplest reasonable way to patch it up is to copy the first sample of the second line. This should result in the correct chroma values (because the data was original 4:2:0 upsampled to 4:2:2) even if the luma is isn't.	2016-11-02 13:43:59 -07:00
Vittorio Giovara	271afd632f	lavc: Add hevc main10 profile to ffmpeg cli Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-02 12:27:13 +01:00
Michael Niedermayer	37138338ff	avcodec/sunrast: Fix input buffer pointer check Fixes: out of array read Fixes: poc.dat Found-by: Bingchang, Liu @VARAS of IIE Tested-by: bc L <l.bing.chang.bc@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-02 12:06:22 +01:00
Carl Eugen Hoyos	5a51ca2da7	lavc/hapenc: Use the correct printf length modifier for size_t arguments. Fixes the following warning: libavcodec/hapenc.c:122:20: warning: format ‘%lu’ expects argument of type ‘long unsigned int’, but argument 4 has type ‘size_t’ [-Wformat] Based on a patch by Diego Biurrun.	2016-11-02 01:55:40 +01:00
Michael Niedermayer	7ddfa0be62	avcodec/dnxhdenc: Fix alignment of edge_buf* Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-01 21:02:26 +01:00
Andreas Cadhalpun	e0c6b32046	apngdec: use side data to pass extradata to the decoder Fixes remuxing apng streams coming from the apng demuxer. This is a regression since `940b8908b9`. Found-by: James Almer <jamrial@gmail.com> Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-01 18:49:28 +01:00
Adriano Pallavicino	6089c44a2a	Fix build warnings due to misleading indentation Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-01 17:44:10 +01:00
Andreas Cadhalpun	60178e78f2	interplayacm: increase bitstream buffer size by AV_INPUT_BUFFER_PADDING_SIZE This fixes out-of-bounds reads by the bitstream reader. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-11-01 00:39:06 +01:00
Michael Niedermayer	140f48b90f	avcodec/smc: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 23:20:47 +01:00
Michael Niedermayer	979bca5134	avcodec/tscc: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 23:20:47 +01:00
Michael Niedermayer	e167610794	avcodec/rscc: Fix constant Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 23:20:31 +01:00
Kyle Schwarz	5d54293668	avcodec/qsv: remove MFX_EXTBUFF_CODING_OPTION3 4th generation Intel CPUs don't support MFX_EXTBUFF_CODING_OPTION3. This patch fixes bug #5324.	2016-10-31 19:23:40 +00:00
Mark Thompson	4e7a7a96cf	qsvdec: Avoid probing with qsv decoders Set the AV_CODEC_CAP_AVOID_PROBING flag on all of the qsv decoders.	2016-10-31 19:23:40 +00:00
Mark Thompson	1f26a231bb	qsv: Merge libav implementation Merged as-at libav `398f015`, and therefore includes outstanding skipped merges `04b17ff` and `130e1f1`. All features not in libav are preserved, and no options change.	2016-10-31 19:23:40 +00:00
Mark Thompson	309fe16a12	mpegvideo: Return correct coded frame sizes from parser	2016-10-31 19:23:40 +00:00
Mark Thompson	0c559f7893	hevc: Return stream format information from parser	2016-10-31 19:23:40 +00:00
Mark Thompson	4df6605da7	vc1: Return stream format information from parser	2016-10-31 19:23:40 +00:00
Michael Niedermayer	5f0bc0215a	avcodec/rawdec: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 01:19:16 +01:00
Michael Niedermayer	0f64b6cd22	avcodec/rscc: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 01:11:03 +01:00
Michael Niedermayer	161ccdaa06	avcodec/msvideo1: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 01:08:45 +01:00
Michael Niedermayer	16793504df	avcodec/qpeg: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 00:36:12 +01:00
Michael Niedermayer	7d196f2a5a	avcodec/qtrle: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 00:27:45 +01:00
Michael Niedermayer	a6330119a0	avcodec/msrle: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-31 00:19:02 +01:00
Philip Langdale	21b68cdbae	avcodec/cuvid: Don't claim to decode h.263 (it doesn't) Turns out cuvid doesn't support h.263.	2016-10-30 15:47:37 -07:00
Andreas Cadhalpun	5540d6c134	interplayacm: validate number of channels The number of channels is used as divisor in decode_frame, so it must not be zero to avoid SIGFPE crashes. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-30 22:38:23 +01:00
Andreas Cadhalpun	14e4e26559	interplayacm: check for too large b This fixes out-of-bounds reads. Reviewed-by: Paul B Mahol <onemda@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-30 22:38:03 +01:00
Michael Niedermayer	2d99101d09	avcodec/kmvc: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-30 15:38:44 +01:00
Michael Niedermayer	a2b8dde659	avcodec/idcinvideo: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-30 15:38:44 +01:00
Michael Niedermayer	121be31060	avcodec/cinepak: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-30 14:15:00 +01:00
Michael Niedermayer	042faa847f	avcodec/8bps: Check side data size before use Fixes out of array read Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-30 14:15:00 +01:00
Andreas Cadhalpun	1e660fe88d	doc: fix spelling errors Reviewed-by: Lou Logan <lou@lrcd.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-29 20:43:15 +02:00
Philip Langdale	7c27da686c	crystalhd: Reorder mspeg4 decoder after software decoders This avoids it getting picked by default, which is generally undesirable and can break test runs.	2016-10-28 19:57:36 -07:00
Andreas Cadhalpun	940b8908b9	apng: use side data to pass extradata to muxer This fixes creating apng files, which is broken since commit `5ef1959080`. Reviewed-by: James Almer <jamrial@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-28 01:53:52 +02:00
James Almer	bf709098c9	avcodec: remove missing incompatible_libav_abi references Signed-off-by: James Almer <jamrial@gmail.com>	2016-10-26 17:36:12 -03:00
Michael Niedermayer	1609935b6c	Bump minor versions after 3.2 branchpoint to seperate release Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-26 20:52:42 +02:00
Michael Niedermayer	3f3025205f	Bump minor versions for 3.2 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-26 20:52:42 +02:00
Michael Niedermayer	c92f55847a	avcodec/dvdsubdec: Fix off by 1 error Fixes out of array read Found-by: Thomas Garnier using libFuzzer Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-26 19:50:53 +02:00
Michael Niedermayer	25ab1a65f3	avcodec/dvdsubdec: Fix buf_size check Fixes out of array access Found-by: Thomas Garnier using libFuzzer Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-26 18:46:10 +02:00
Carl Eugen Hoyos	134233972e	lavc/utvideoenc: Set bits_per_coded_sample for rgba. Allows to write correct value for biBitCount into BITMAPINFOHEADER. Before, ff_put_bmp_header() always wrote "24" as biBitCount for utvideo because bits_per_coded_sample was never set by the encoder.	2016-10-25 13:44:08 +02:00
Michael Niedermayer	85d23e5cbc	avcodec/interplayvideo: Check side data size before use Fixes out of array read Found-by: Thomas Garnier using libFuzzer Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-25 04:46:02 +02:00
Michael Niedermayer	c1173437fc	avcodec/ffv1enc: Fix storing RGB48 without explicitly set level the bps value is only stored with level >= 1, using rgb48 with level 0 requires the user app to keep track of the bps by external means, which does not always happen also we force level >= 1 for other 16bps formats, so this is consistent. Found-by: Jerome Martinez <jerome@mediaarea.net> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-25 02:51:34 +02:00
Ronald S. Bultje	be885da342	vp9: change order of operations in adapt_prob(). This is intended to workaround bug "665 Integer Divide Instruction May Cause Unpredictable Behavior" on some early AMD CPUs, which causes a div-by-zero in this codepath, such as reported in Mozilla bug #1293996. Note that this isn't guaranteed to fix the bug, since a compiler is free to reorder instructions that don't depend on each other. However, it appears to fix the bug in Firefox, and a similar patch was applied to libvpx also (see Chrome bug #599899).	2016-10-24 16:02:39 -04:00
Rodger Combs	ba53504e57	lavc/utils: avcodec_string: dump field order when known	2016-10-24 01:24:22 -05:00
Rodger Combs	f271a9bd99	lavc/h264_parser: export field order in more cases	2016-10-24 01:20:18 -05:00
Rodger Combs	d13740f3a2	lavc/parser: export field order if not already set Some codecs set this in the parser, but not the decoder	2016-10-24 01:20:18 -05:00
Zhou Xiaoyong	89ec4adad6	avcodec/mips: loongson optimize mmi load and store operators 1.MMI_ load/store macros are defined in libavutil/mips/mmiutils.h 2.Replace some unnecessary unaligned access with aligned operator 3.The MMI_ load/store is compatible with cpu loongson2e/2f which not support instructions start with gs Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-23 03:23:09 +02:00
Philip Langdale	ee7d6738ca	avcodec/cuvid: Allow reinitialization of decoder In practice, this works fine.	2016-10-22 14:57:00 -07:00
Michael Niedermayer	2c1d38d1e1	avcodec/snowenc: Clear MMX state after edge drawing and picture encode Fixes undefined behavior from calling libc allocation with unclean FPU state. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-22 13:46:58 +02:00
Michael Niedermayer	de0cd0ffc9	avcodec/mpegvideo_enc: Add missing emms_c() to clear MMX state after SIMD use Fixes undefined behavior due to calling libc allocation with unclean FPU state Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-22 13:46:46 +02:00
Michael Niedermayer	966c5c7bb8	avcodec/utils: Move emms_c() before memory allocation functions in avcodec_encode_video2() Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-22 13:46:10 +02:00
Michael Niedermayer	493ad519dd	avcodec/cavsdec: Clear MMX state after MB decode loop The MMX state must be cleared between using MMX and using memory allocation thats basically the only location between the 2 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-22 13:46:05 +02:00
Michael Niedermayer	70dc6bbf1b	avcodec/svq1enc: Clear MMX state after svq1_encode_plane() svq1_encode_plane() uses SIMD and we call libc memory allocation functions after it Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-22 13:45:59 +02:00
Kagami Hiiragi	41da4f8cb3	lavc/libvpxenc: fix -auto-alt-ref option type vp9_cx_iface actually allows values in range [0..2]. This fixes ticket #5894. Signed-off-by: Kagami Hiiragi <kagami@genshiken.org> Signed-off-by: James Zern <jzern@google.com>	2016-10-21 18:16:46 -07:00
Andreas Cadhalpun	c8a6eb58d7	doc: fix spelling errors Thanks to Mathieu Malaterre <malat@debian.org> for reporting the Que/Queue typo. (https://bugs.debian.org/839542) Reviewed-by: Lou Logan <lou@lrcd.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-21 23:58:47 +02:00
Carlos Fernandez	d53a120ad6	lavc: add SCTE-35 CUI codec ID Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Carlos Fernandez <carlos@ccextractor.org> Signed-off-by: Marton Balint <cus@passwd.hu>	2016-10-21 20:39:27 +02:00
Andreas Cadhalpun	a92f8edf0c	mpeg12dec: unref discarded picture from extradata Otherwise another frame gets referenced into picture, triggering an assert (from commit 13aae8) in av_frame_ref. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-21 19:41:29 +02:00
Andreas Cadhalpun	1966ea012f	cavsdec: unref frame before referencing again This fixes asserts (from commit 13aae8) in av_frame_ref and av_frame_move_ref. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-21 19:41:15 +02:00
Steven Liu	4d92bd3ca2	avcodec/vda: define av_vda_default_init2 when CONFIG_H264_VDA_HWACCEL equ 0 on OSX: ../configure --disable-everything --enable-demuxer=hls make error message: Undefined symbols for architecture x86_64: "_av_vda_default_init2", referenced from:_videotoolbox_init in ffmpeg_videotoolbox.o so add av_vda_default_init2 when CONFIG_H264_VDA_HWACCEL=0 Signed-off-by: Steven Liu <lq@chinaffmpeg.org> Reviewed-by: wm4 <nfxjfg@googlemail.com> Reviewed-by: Xidorn Quan <quanxunzhen@gmail.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-21 16:54:25 +02:00
Michael Niedermayer	03ec6b780c	avcodec/mpegvideo_enc: Clear mmx state in ff_mpv_reallocate_putbitbuffer() This function must be called from the mb or slice encoding loop and MMX state may not be clean there Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-21 14:17:50 +02:00
Michael Niedermayer	4f96f9d111	avcodec/utils: Clear MMX state before returning from avcodec_default_execute*() Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-21 14:17:50 +02:00
Michael Niedermayer	6c5b98d40b	avcodec/dnxhdenc: Move allocation out of radix_sort() Its slow, its not checked, FPU state is not clean either currently there Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-21 14:17:50 +02:00
Hendrik Leppkes	8bd38ec5bd	dxva2: fix surface selection when compiled with both d3d11va and dxva2 Fixes a regression introduced in `9b462a0b9`	2016-10-20 19:31:34 +02:00
Carl Eugen Hoyos	c0e2846dcd	lavc/sheervideo: Increase av_get_codec_tag_string() input buffer size. A size of 32 is typically used.	2016-10-20 09:55:52 +02:00
Sven C. Dack	aebbcb2706	avcodec/nvenc_hevc: Added missing option -temporal_aq The option is present in h264_nvenc, but was missing from hevc_nvenc. Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2016-10-19 12:45:52 +02:00
Sven C. Dack	da4d0fa86b	avcodec/nvenc: add test for Temporal AQ support Adds a check to see if the hardware supports temporal aq. Signed-off-by: Timo Rothenpieler <timo@rothenpieler.org>	2016-10-19 12:41:41 +02:00
Matthieu Bouron	d5082a2ce7	lavc/mediacodec: use more meaningful filenames Adds the following changes: * mediacodecdec.{c,h} -> mediacodecdec_common.{c,h} * mediacodecdec_h2645.c -> mediacodecdec.c	2016-10-19 10:50:56 +02:00
Matthieu Bouron	f62c54456d	lavc: add mpeg4 mediacodec decoder	2016-10-19 10:50:52 +02:00
Matthieu Bouron	0f7fce87ea	lavc: add vp8/vp9 mediacodec decoders	2016-10-19 10:50:12 +02:00
Matthieu Bouron	b8c158a4ed	lavc/mediacodec_wrapper: do not discard codecs reporting they do not support any profile Depending on the device, some (VP8/VP9/...) decoders report that they do not support any profiles.	2016-10-19 09:52:15 +02:00
Aman Gupta	f45d5e07dd	lavc/videotoolboxenc: skip SEI allocation when side data is not present Signed-off-by: Rick Kern <kernrj@gmail.com>	2016-10-18 19:51:42 -04:00
Rostislav Pehlivanov	d2ae5f77c6	aacenc: add SIMD optimizations for abs_pow34 and quantization Performance improvements: quant_bands: with: 681 decicycles in quant_bands, 8388453 runs, 155 skips without: 1190 decicycles in quant_bands, 8388386 runs, 222 skips Around 42% for the function Twoloop coder: abs_pow34: with/without: 7.82s/8.17s Around 4% for the entire encoder Both: with/without: 7.15s/8.17s Around 12% for the entire encoder Fast coder: abs_pow34: with/without: 3.40s/3.77s Around 10% for the entire encoder Both: with/without: 3.02s/3.77s Around 20% faster for the entire encoder Signed-off-by: Rostislav Pehlivanov <atomnuker@gmail.com> Tested-by: Michael Niedermayer <michael@niedermayer.cc> Reviewed-by: James Almer <jamrial@gmail.com>	2016-10-18 21:41:18 +01:00
Jon Toohill	81f4f789de	lavc/libmp3lame: send encoder delay/padding in packet side data Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-18 20:19:29 +02:00
Michael Niedermayer	9545ff3ec3	avcodec/mediacodec: Factor duplicate include Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-18 15:32:14 +02:00
Carl Eugen Hoyos	f04c27fe7c	lavc/videotoolboxenc: Enable a53cc by default.	2016-10-17 17:50:29 +02:00
Rick Kern	d3874b74f3	lavc/videotoolboxenc: Error log formatting. Signed-off-by: Rick Kern <kernrj@gmail.com>	2016-10-17 08:58:17 -04:00
Rick Kern	9875695e2c	lavc/videotoolboxenc: Update a53cc handling Handles insertion into existing SEI NAL unit, inserts emulation prevention bytes. Signed-off-by: Rick Kern <kernrj@gmail.com>	2016-10-17 08:58:17 -04:00
Rick Kern	aa413b810a	lavc/videotoolboxenc: flush/free frames on close Prevents encode callback from running after codec is closed. Fixes a crash when an error is returned. Signed-off-by: Rick Kern <kernrj@gmail.com>	2016-10-17 08:58:17 -04:00
Aman Gupta	9ea91e4114	lavc/videotoolboxenc: implement a53cc Signed-off-by: Rick Kern <kernrj@gmail.com>	2016-10-17 08:58:17 -04:00
James Almer	4b0f37dadb	avcodec/utils: print Chroma Location string in verbose log level It's container level information on some formats (Matroska, MXF, yuv4mpeg), so it should be printed at higher log levels than debug. Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>	2016-10-16 12:18:39 -03:00
Andreas Cadhalpun	56706ac0d5	libopenjpegenc: fix out-of-bounds reads when filling the edges The calculation of width/height should round up, not round down to prevent setting width or height to 0. Also image->comps[compno].w is unsigned (at least in openjpeg2), so the calculation could silently wrap around without the explicit cast to int. Reviewed-by: Michael Bradshaw <mjbshaw@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-14 16:56:14 +02:00
Andreas Cadhalpun	69c8505f3b	libopenjpegenc: stop reusing image data buffer for openjpeg 2 openjpeg 2 sets the data pointers of the image components to NULL, causing segfaults if the image is reused. Reviewed-by: Michael Bradshaw <mjbshaw@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-14 16:55:45 +02:00
Andreas Cadhalpun	7a65aef00d	configure: fix detection of libopenjpeg Use check_lib2 to test the header together with the function. This is necessary, because '-DOPJ_STATIC' changes what the included header does. Also add '-DOPJ_STATIC' to CPPFLAGS, so that it isn't necessary to hardcode this in libavcodec/libopenjpeg{dec,enc}.c. Finally, check for non-static openjpeg 2.1, too. Reviewed-by: Michael Bradshaw <mjbshaw@gmail.com> Signed-off-by: Andreas Cadhalpun <Andreas.Cadhalpun@googlemail.com>	2016-10-13 21:04:19 +02:00
Vicente Olivert Riera	04b0792e4a	libavcodec/mips/h264dsp_msa.c: fix type in some function parameters This fixes a build problem for MIPS architecture that looks like this: libavcodec/mips/h264dsp_msa.c:2498:6: error: conflicting types for ‘ff_weight_h264_pixels16_8_msa’ void ff_weight_h264_pixels16_8_msa(uint8_t *src, int stride, This bug was introduced by commit `bc26fe8927`: avcodec/h264: Use ptrdiff_t for (bi)weight functions That commit changed the data type of some function parameters in some function definitions. However, the implementation of those functions in libavcodec/mips/h264dsp_msa.c wasn't changed accordingly. Signed-off-by: Vicente Olivert Riera <Vincent.Riera@imgtec.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-10-13 19:15:48 +02:00

... 2 3 4 5 6 ...

36773 Commits