FFmpeg

mirror of https://github.com/FFmpeg/FFmpeg.git synced 2024-11-26 19:01:44 +02:00

Author	SHA1	Message	Date
Andreas Rheinhardt	4d7a1a4619	swscale/input: Avoid calls to av_pix_fmt_desc_get() Up until now, libswscale/input.c used a macro to read an input pixel which involved a call to av_pix_fmt_desc_get() to find out whether the input pixel format is BE or LE despite this being known at compile-time (there are templates per pixfmt). Even worse, these calls are made in a loop, so that e.g. there are six calls to av_pix_fmt_desc_get() for every pair of UV pixels processed in rgb64ToUV_half_c_template(). This commit modifies these macros to ensure that isBE() is evaluated at compile-time. This saved 9743B of .text for me (GCC 11.2, -O3). For a simple RGB64LE->YUV420P transformation like `ffmpeg -f lavfi -i haldclutsrc,format=rgba64le -pix_fmt yuv420p -threads 1 -t 1:00 -f null -`, the number of decicycles spent in rgb64LEToUV_half_c (which is created via the template mentioned above) decreases from 19751 to 5341; for RGBA64BE the number went down from 11945 to 5393. For shared builds (where the call to av_pix_fmt_desc_get() is indirect) the old numbers are 15230 for RGBA64BE and 27502 for RGBA64LE, whereas the numbers with this patch are indistinguishable from the numbers from a static build. Also make the macros that are touched conform to the usual convention of using uppercase names while at it. Reviewed-by: Anton Khirnov <anton@khirnov.net> Reviewed-by: Paul B Mahol <onemda@gmail.com> Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-19 23:40:41 +02:00
Andreas Rheinhardt	d2428d80ce	swscale/input: Remove spec-incompliant ';' These macros are definitions, not only declarations and therefore should not contain a semicolon. Such a semicolon is actually spec-incompliant, but compilers happen to accept them. Reviewed-by: Philip Langdale <philipl@overt.org> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-09-08 19:21:30 +02:00
Philip Langdale	4a59eba227	swscale/input: add support for Y212LE	2022-09-06 12:49:10 -07:00
Philip Langdale	198b5b90d5	swscale/input: add support for XV30LE	2022-09-06 12:49:10 -07:00
Philip Langdale	5bdd726115	swscale/input: add support for P012 As we now have three of these formats, I added macros to generate the conversion functions.	2022-09-06 12:49:10 -07:00
Philip Langdale	8d9462844a	swscale/input: add support for XV36LE	2022-09-06 12:49:10 -07:00
Philip Langdale	45726aa117	libswscale: add support for VUYX format As we already have support for VUYA, I figured I should do the small amount of work to support VUYX as well. That means a little refactoring to share code.	2022-08-25 19:03:49 -07:00
Timo Rothenpieler	aca569aad2	swscale/input: add rgbaf16 input support This is by no means perfect, since at least ddagrab will return scRGB data with values outside of 0.0f to 1.0f for HDR values. Its primary purpose is to be able to work with the format at all.	2022-08-19 22:09:36 +02:00
Timo Rothenpieler	f2de911818	swscale: add opaque parameter to input functions	2022-08-19 22:09:36 +02:00
James Almer	f0abd07996	swscale/input: add VUYA input support Reviewed-by: Philip Langdale <philipl@overt.org> Signed-off-by: James Almer <jamrial@gmail.com>	2022-08-05 09:39:21 -03:00
Matthieu Bouron	0a6bb7da55	swscale: add NV16 input/output Signed-off-by: Anton Khirnov <anton@khirnov.net>	2022-07-19 12:20:16 +02:00
Andreas Rheinhardt	636631d9db	Remove unnecessary libavutil/(avutil\|common\|internal).h inclusions Some of these were made possible by moving several common macros to libavutil/macros.h. While just at it, also improve the other headers a bit. Reviewed-by: Martin Storsjö <martin@martin.st> Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2022-02-24 12:56:49 +01:00
rcombs	f0204de47d	swscale: add P210/P410/P216/P416 input	2021-11-28 16:40:43 -06:00
Mark Reid	3f4ce004b8	swscale/input: clip rgbf32 values before lrintf If the float pixel * 65535.0f > 2147483647.0f, lrintf may overflow and return negative values, depending on implementation. The results for nan and +/-inf values may also be implementation defined. Clip the value first so lrintf always works: values < 0.0f, -inf, nan = 0.0f; values > 65535.0f, +inf = 65535.0f. Old timings: 195960 decicycles in planar_rgbf32le_to_uv, 1 runs, 0 skips 186120 decicycles in planar_rgbf32le_to_uv, 2 runs, 0 skips 188645 decicycles in planar_rgbf32le_to_uv, 4 runs, 0 skips 183625 decicycles in planar_rgbf32le_to_uv, 8 runs, 0 skips 181157 decicycles in planar_rgbf32le_to_uv, 16 runs, 0 skips 177533 decicycles in planar_rgbf32le_to_uv, 32 runs, 0 skips 175689 decicycles in planar_rgbf32le_to_uv, 64 runs, 0 skips 232960 decicycles in planar_rgbf32be_to_uv, 1 runs, 0 skips 221380 decicycles in planar_rgbf32be_to_uv, 2 runs, 0 skips 216640 decicycles in planar_rgbf32be_to_uv, 4 runs, 0 skips 213505 decicycles in planar_rgbf32be_to_uv, 8 runs, 0 skips 211558 decicycles in planar_rgbf32be_to_uv, 16 runs, 0 skips 210596 decicycles in planar_rgbf32be_to_uv, 32 runs, 0 skips 210202 decicycles in planar_rgbf32be_to_uv, 64 runs, 0 skips 161680 decicycles in planar_rgbf32le_to_y, 1 runs, 0 skips 153540 decicycles in planar_rgbf32le_to_y, 2 runs, 0 skips 148255 decicycles in planar_rgbf32le_to_y, 4 runs, 0 skips 140600 decicycles in planar_rgbf32le_to_y, 8 runs, 0 skips 132935 decicycles in planar_rgbf32le_to_y, 16 runs, 0 skips 128531 decicycles in planar_rgbf32le_to_y, 32 runs, 0 skips 140933 decicycles in planar_rgbf32le_to_y, 64 runs, 0 skips 190980 decicycles in planar_rgbf32be_to_y, 1 runs, 0 skips 176080 decicycles in planar_rgbf32be_to_y, 2 runs, 0 skips 167980 decicycles in planar_rgbf32be_to_y, 4 runs, 0 skips 164685 decicycles in planar_rgbf32be_to_y, 8 runs, 0 skips 162751 decicycles in planar_rgbf32be_to_y, 16 runs, 0 skips 162404 decicycles in planar_rgbf32be_to_y, 32 runs, 0 skips 167849 decicycles in planar_rgbf32be_to_y, 64 runs, 0 skips New timings: 183320 decicycles in planar_rgbf32le_to_uv, 1 runs, 0 skips 175700 decicycles in planar_rgbf32le_to_uv, 2 runs, 0 skips 179570 decicycles in planar_rgbf32le_to_uv, 4 runs, 0 skips 172932 decicycles in planar_rgbf32le_to_uv, 8 runs, 0 skips 168707 decicycles in planar_rgbf32le_to_uv, 16 runs, 0 skips 165224 decicycles in planar_rgbf32le_to_uv, 32 runs, 0 skips 163423 decicycles in planar_rgbf32le_to_uv, 64 runs, 0 skips 184940 decicycles in planar_rgbf32be_to_uv, 1 runs, 0 skips 185150 decicycles in planar_rgbf32be_to_uv, 2 runs, 0 skips 185790 decicycles in planar_rgbf32be_to_uv, 4 runs, 0 skips 185472 decicycles in planar_rgbf32be_to_uv, 8 runs, 0 skips 185277 decicycles in planar_rgbf32be_to_uv, 16 runs, 0 skips 185813 decicycles in planar_rgbf32be_to_uv, 32 runs, 0 skips 185332 decicycles in planar_rgbf32be_to_uv, 64 runs, 0 skips 145400 decicycles in planar_rgbf32le_to_y, 1 runs, 0 skips 145100 decicycles in planar_rgbf32le_to_y, 2 runs, 0 skips 143490 decicycles in planar_rgbf32le_to_y, 4 runs, 0 skips 136687 decicycles in planar_rgbf32le_to_y, 8 runs, 0 skips 131271 decicycles in planar_rgbf32le_to_y, 16 runs, 0 skips 128698 decicycles in planar_rgbf32le_to_y, 32 runs, 0 skips 127170 decicycles in planar_rgbf32le_to_y, 64 runs, 0 skips 156020 decicycles in planar_rgbf32be_to_y, 1 runs, 0 skips 146990 decicycles in planar_rgbf32be_to_y, 2 runs, 0 skips 142020 decicycles in planar_rgbf32be_to_y, 4 runs, 0 skips 141052 decicycles in planar_rgbf32be_to_y, 8 runs, 0 skips 138973 decicycles in planar_rgbf32be_to_y, 16 runs, 0 skips 138027 decicycles in planar_rgbf32be_to_y, 32 runs, 0 skips 143939 decicycles in planar_rgbf32be_to_y, 64 runs, 0 skips Reviewed-by: Michael Niedermayer <michael@niedermayer.cc> Signed-off-by: James Almer <jamrial@gmail.com>	2021-11-15 16:50:10 -03:00
Mark Reid	74e49cc583	swscale/input: unify grayf32 funcs with rgbf32 funcs This is meant to be a cosmetic change. Old timings: 42780 UNITS in grayf32le, 1 runs, 0 skips 56720 UNITS in grayf32le, 2 runs, 0 skips 67265 UNITS in grayf32le, 4 runs, 0 skips 58082 UNITS in grayf32le, 8 runs, 0 skips 63512 UNITS in grayf32le, 16 runs, 0 skips 52720 UNITS in grayf32le, 32 runs, 0 skips 46491 UNITS in grayf32le, 64 runs, 0 skips 68500 UNITS in grayf32be, 1 runs, 0 skips 66930 UNITS in grayf32be, 2 runs, 0 skips 62305 UNITS in grayf32be, 4 runs, 0 skips 55510 UNITS in grayf32be, 8 runs, 0 skips 50216 UNITS in grayf32be, 16 runs, 0 skips 44480 UNITS in grayf32be, 32 runs, 0 skips 42394 UNITS in grayf32be, 64 runs, 0 skips New timings: 46660 UNITS in grayf32le, 1 runs, 0 skips 51830 UNITS in grayf32le, 2 runs, 0 skips 53390 UNITS in grayf32le, 4 runs, 0 skips 50910 UNITS in grayf32le, 8 runs, 0 skips 44968 UNITS in grayf32le, 16 runs, 0 skips 40349 UNITS in grayf32le, 32 runs, 0 skips 38330 UNITS in grayf32le, 64 runs, 0 skips 39980 UNITS in grayf32be, 1 runs, 0 skips 49630 UNITS in grayf32be, 2 runs, 0 skips 53540 UNITS in grayf32be, 4 runs, 0 skips 59767 UNITS in grayf32be, 8 runs, 0 skips 51206 UNITS in grayf32be, 16 runs, 0 skips 44743 UNITS in grayf32be, 32 runs, 0 skips 41468 UNITS in grayf32be, 64 runs, 0 skips Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2021-11-14 17:12:13 +01:00
Mark Reid	d2379bd6a0	swscale/input: fix planar_rgb16_to_a for gbrap10be and gbrap12be formats Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2021-11-04 11:52:33 +01:00
Manuel Stoeckl	32329397e2	swscale: add input/output support for X2BGR10LE Signed-off-by: Manuel Stoeckl <code@mstoeckl.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2021-09-26 16:26:10 +02:00
Andreas Rheinhardt	2c05ee092b	avutil/internal, swresample/audioconvert: Remove cpu.h inclusions These inclusions are not necessary, as cpu.h is already included wherever it is needed (via direct inclusion or via the arch-specific headers). Also remove other unnecessary cpu.h inclusions from ordinary non-headers. Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@outlook.com>	2021-07-22 14:33:45 +02:00
Mark Reid	a48adcd136	libswcale/input: use more accurate planer rgb16 yuv conversions These conversions appear to exhibit the same rounding error as the rgbf32 formats did. I separated the rounding value from the 16 and 128 offsets, which I think makes it a little clearer. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-10-06 17:56:52 +02:00
Mark Reid	453004fde6	libswcale/input: use more accurate rgbf32 yuv conversions Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-10-02 14:59:52 +02:00
Fei Wang	8544783280	swscale/yuv2rgb: consider x2rgb10le on big endian hardware This fixes the FATE failures reported by filter-pixfmts* for x2rgb10le on big endian hardware. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-07-20 21:00:00 +02:00
Fei Wang	c721b45014	swscale: Add swscale input/output support for X2RGB10LE Signed-off-by: Fei Wang <fei.w.wang@intel.com>	2020-06-12 17:56:15 +01:00
Mark Reid	ba5d0515a6	libswscale: add input support AV_PIX_FMT_GBRAPF32 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-05-05 20:06:58 +02:00
Carl Eugen Hoyos	5f8c383452	lsws/input: Do not change transparency range. Fixes ticket #8509.	2020-03-11 22:55:49 +01:00
Linjie Fu	d2aa1fbfd4	swscale: Add swscale input support for Y210LE Add swscale input support for Y210LE. Output support and a FATE test could be added later if software CSC to this packed format is required. Signed-off-by: Linjie Fu <linjie.fu@intel.com>	2020-02-24 00:09:51 +00:00
Michael Niedermayer	d48e510124	swscale/input: Fix several invalid shifts related to rgb2yuv constants Fixes: Invalid shifts Fixes: #8140 Fixes: #8146 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2020-01-22 21:50:49 +01:00
Philip Langdale	cd48318035	swscale: Add support for NV24 and NV42 The implementation is pretty straight-forward. Most of the existing NV12 codepaths work regardless of subsampling and are re-used as is. Where necessary I wrote the slightly different NV24 versions. Finally, the one thing that confused me for a long time was the asm specific x86 path that did an explicit exclusion check for NV12. I replaced that with a semi-planar check and also updated the equivalent PPC code, which Lauri kindly checked.	2019-05-12 07:51:02 -07:00
Martin Vignali	86e6f0dbc7	swscale : add support for YUVA444P12 and YUVA422P12	2018-11-24 16:24:47 +01:00
Sergey Lavrushkin	582bc5a348	libswscale: Adds conversions from/to float gray format. Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2018-08-14 18:22:39 +02:00
Paul B Mahol	b9dd058f7a	swscale: add gray14 support Signed-off-by: Paul B Mahol <onemda@gmail.com>	2018-05-05 21:35:31 +02:00
Paul B Mahol	de48710c11	libswscale: add gray9 support	2017-08-07 13:09:41 +02:00
Clément Bœsch	fa8db3f597	Merge commit 'de8e096c7eda2bce76efd0a1c1c89d37348c2414' * commit 'de8e096c7eda2bce76efd0a1c1c89d37348c2414': swscale: Consistently order input YUV pixel formats Merged-by: Clément Bœsch <u@pkh.me>	2017-03-21 08:24:39 +01:00
Carl Eugen Hoyos	a9c20598b5	lsws/input: Do not define unused functions. Fixes warnings like the following: libswscale/input.c:951:13: warning: ‘planar_rgb14be_to_a’ defined but not used	2017-03-08 00:30:06 +01:00
Philip Langdale	4c2176d45b	swscale: add P016 input support	2017-02-01 14:29:11 -08:00
Carl Eugen Hoyos	b5177c7051	lsws: Add GRAY10 conversion. Based on `19be5fb7` by Luca Barbato.	2016-11-14 10:35:06 +01:00
Luca Barbato	19be5fb728	swscale: Add GRAY12 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-10 00:49:29 +01:00
Michael Niedermayer	a340cfef82	swscale/input: Fix alpha bswap for set readAlpPlanar Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-11-10 00:33:12 +01:00
Vittorio Giovara	de8e096c7e	swscale: Consistently order input YUV pixel formats Follow a 420, 422, 444 order instead of a random one. This simplifies double-checking additions of new formats. Signed-off-by: Diego Biurrun <diego@biurrun.de>	2016-09-27 08:33:09 +02:00
Hendrik Leppkes	e78e5b735f	swscale: add P010 input support Signed-off-by: Anton Khirnov <anton@khirnov.net>	2016-07-02 09:35:41 +02:00
Paul B Mahol	9d30690f20	swscale: add input support for gbrap10 pixel format Signed-off-by: Paul B Mahol <onemda@gmail.com>	2016-06-10 17:17:12 +02:00
Pedro Arthur	6de58b4903	swscale: cleanup unused code Removed previous swscale code under '#ifndef NEW_FILTER' and removed unused fields of SwsContext	2016-03-31 16:36:16 -03:00
Kieran Kunhya	4170a44bbc	Add GBRAP12 pixel format Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-02-24 03:22:31 +01:00
Michael Niedermayer	df36257a53	swscale/input: Fix GBRAP16 input Fixes part of Ticket5264 Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>	2016-02-24 00:02:44 +01:00
Hendrik Leppkes	2e31434d84	swscale: add P010 input support	2016-01-26 12:10:45 +01:00
Paul B Mahol	052f64ecb2	swscale: ayuv64le input support Signed-off-by: Paul B Mahol <onemda@gmail.com>	2015-07-24 03:44:58 +00:00
Ronald S. Bultje	711d8812ad	swscale: add yuv440p10/12 pixfmts.	2015-05-06 12:04:31 -04:00
Michael Niedermayer	f30798584f	swscale/input: fix rgba64 alpha non native Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-01-21 21:21:00 +01:00
Michael Niedermayer	95d04690aa	swscale/input: Fix alpha of YA16 input Fixes Ticket4278 Signed-off-by: Michael Niedermayer <michaelni@gmx.at>	2015-01-21 21:21:00 +01:00
Vittorio Giovara	2df7277711	swscale: fix gbrap to gbrap alpha scaling	2015-01-16 02:47:15 +01:00
Vittorio Giovara	89df3fd49e	swscale: fix warning about incompatible function pointer type	2015-01-16 02:47:10 +01:00
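
Commit 4d7a1a4619 above replaces a per-pixel runtime lookup of the input byte order with a check that the compiler can resolve while expanding the per-format templates. The following is a minimal sketch of that general idea, not the code from libswscale/input.c: the helpers `read_be16` and `read_le16`, the macros `READ_PIXEL` and `DEFINE_SUM_ROW`, and the generated functions `sum_row_le` and `sum_row_be` are all hypothetical names chosen for illustration.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical byte-order helpers, standing in for a library's own readers. */
static inline uint16_t read_be16(const uint8_t *p)
{
    return (uint16_t)((p[0] << 8) | p[1]);
}

static inline uint16_t read_le16(const uint8_t *p)
{
    return (uint16_t)(p[0] | (p[1] << 8));
}

/* IS_BE is a literal 0 or 1 at every expansion site, so the compiler can
 * fold the conditional away; no runtime format lookup happens in the loop. */
#define READ_PIXEL(p, IS_BE) ((IS_BE) ? read_be16(p) : read_le16(p))

/* Stamp out one specialised function per byte order. The macro expands to
 * a function definition, so its uses below take no trailing semicolon. */
#define DEFINE_SUM_ROW(name, IS_BE)                        \
static uint32_t name(const uint8_t *src, size_t width)     \
{                                                          \
    uint32_t sum = 0;                                      \
    for (size_t i = 0; i < width; i++)                     \
        sum += READ_PIXEL(src + 2 * i, IS_BE);             \
    return sum;                                            \
}

DEFINE_SUM_ROW(sum_row_le, 0)
DEFINE_SUM_ROW(sum_row_be, 1)
```

Calling `sum_row_le` and `sum_row_be` on the same buffer interprets each 16-bit sample in the respective byte order; the byte order is fixed when each function is generated, which is the property the commit message relies on.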

85 Commits