pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2025-07-05 00:28:52 +02:00

Author	SHA1	Message	Date
Thibault VINCENT	c8ccaaa755	Fix PostgreSQL query performance for large datasets. The asynchronous logic used to implement the query timeout was misusing PQisBusy(), which caused the wait handler to throttle the consumption of command results. It could introduce a large delay on a query up to `db-timeout` because of the back-off sequence. Following the recommendation of libpq, fix by polling the client socket for data availability and then continue consuming results and checking for command busyness.	2024-10-10 09:48:43 +03:00
David Steele	80c9b3001c	PostgreSQL 17beta3 support. This release changed the control and WAL format, which is very unusual for a beta. Update control and WAL versions/structs to match.	2024-08-13 11:53:12 +08:00
David Steele	df8cbc91c3	Protocol command multiplexing. Previously it was not possible to read or write two files at the same time on the same remote because the protocol was entirely taken over by the read or write command. Multiple reads are required to make restores efficient when a list of bundled files is being read but blocks need to be retrieved from a separate file or a different part of the same file. Improve that situation with sessions that allow related commands to be run with shared state. Also break read/write into separate requests (rather than pushing all data at once) so they can be multiplexed. The disadvantage for read/write is that they now require more back and forth to transfer a file. This is mitigated by sending asynchronous read/write requests to keep both server and client as busy as possible. Reads that can fit into a single buffer are optimized to transfer in a single command. Reads that transfer the entire file can also skip the close command since it is implicit on end-of-file. These changes allow the protocol to be simplified to provide one response per request, which makes the data end message obsolete. Any data sent for the request is now added to the parameters so no data needs to be sent separately to the server outside the request parameters. Also update the Db protocol to use the new sessions. Previously this code had tracked its own sessions.	2024-07-22 11:48:32 +07:00
Viktor Kurilko	4ac3b82c99	Allow alternative WAL segment sizes for PostgreSQL <= 10. Alternative WAL segment sizes can be configured in PostgreSQL <= 10 with compile-time options. We have not allowed these before since it was not a well-tested feature of PostgreSQL. However, forks such as Greenplum allow alternative WAL segment sizes at initdb time (which are presumably well-tested) so it makes sense to allow it. Since the PostgreSQL versions in question are all EOL it is not important to have this restriction in place anymore.	2024-06-11 12:08:52 +10:00
David Steele	899b892788	New CI container build for PostgreSQL 17 beta1. Update the catalog version for beta 1 so pgbackrest will not work with any prior development versions. Also improve the integration/all test so the catalog version does not need to be updated again during the beta period.	2024-05-24 12:24:11 +10:00
David Steele	c6fcc81db6	Update db/postgres modules to recent coding standards. Add const as appropriate and avoid initializing variables if the variable will definitely be set later on or is immediately returned.	2024-04-21 13:01:40 +10:00
David Steele	fb22f04555	PostgreSQL 17 Support. Add catalog version and WAL magic for PostgreSQL 17.	2024-04-18 10:56:24 +10:00
David Steele	014e24889c	Remove extra space before colons in meson.build files. The spacing was not consistent so use the style that best matches our general coding standards.	2024-03-27 09:53:49 +11:00
David Steele	e634fd85ce	Prevent invalid recovery when backup_label removed. If backup_label is removed from a restored backup then PostgreSQL will instead use checkpoint information from pg_control to attempt (what is thinks is) crash recovery. This will nearly always result in a corrupt cluster because the checkpoint will not be from the beginning of the backup, and even if it is, the end point will not be specified, which could lead to recovery stopping too early. To prevent this, invalidate the checkpoint LSN in pg_control on restore. If backup_label is removed then recovery will still fail because PostgreSQL will not be able to find the invalid checkpoint. The LSN of the checkpoint is not logged but it will be visible in pg_controldata output as 0/DEAD. This value is invalid because PostgreSQL always skips the first WAL segment when initializing a cluster.	2024-03-10 17:08:42 +13:00
David Steele	960b43589d	Add validation for WAL segment size in pg_control. This serves as an additional sanity check to be sure the pg_control format is as expected. The field is useful for being near the end and containing a limited number of discrete values.	2024-03-10 16:17:50 +13:00
David Steele	63541b2273	Add validation for page checksum version in pg_control. This serves as an additional sanity check to be sure the pg_control format is as expected. The field is useful for being all the way at the end and being four bytes that can only have one of two values. Something more distinctive than 0 and 1 would be better, but this is what we have to work with. Convert PgControl.pageChecksum to unsigned int and rename to PgControl.pageChecksumVersion and make all downstream changes required for the new datatype.	2024-03-10 15:50:10 +13:00
David Steele	3926dd346e	Update LICENSE.txt and PostgreSQL copyright for 2024.	2024-01-04 14:55:44 -03:00
Viktor Kurilko	89d5278b74	Add support for alternate compile-time page sizes. Alternate pages sizes can be selected at compile-time, .e.g. 4096. While compile-time settings are generally not well tested by core, some established forks such as Greenplum use them.	2023-12-14 13:28:52 -03:00
David Steele	dcf0781987	Remove support for PostgreSQL 9.3. Per our policy to support five EOL versions of PostgreSQL, 9.3 is no longer supported by pgBackRest. Remove all logic associated with 9.3 and update the tests.	2023-11-09 12:59:12 -03:00
David Steele	657c1a3e06	Finalize catalog number for PostgreSQL 16 release.	2023-09-14 18:41:36 -04:00
David Steele	f42d927d2d	Retry reads of pg_control until checksum is valid. On certain file systems (e.g. ext4) pg_control may appear torn if there is a concurrent write while reading the file. To prevent an invalid read, retry until the checksum matches the control data. Special handling is required for the pg-version-force feature since the offset of the checksum is not known. In this case, scan from the default position to the end of the data looking for a checksum match. This is a bit imprecise, but better than nothing, and the chance of a random collision in the control data seems very remote considering the ratio of data size (< 512 bytes) to checksum size (4 bytes). This was discovered and a possible solution proposed for PostgreSQL in [1]. The proposed solution may work for backup, but pgBackRest needs to be able to read pg_control reliably outside of backup. So no matter what fix is adopted for PostgreSQL, pgBackRest need retries. Further adjustment may be required as the PostgreSQL fix evolves. [1] https://www.postgresql.org/message-id/20221123014224.xisi44byq3cf5psi%40awork3.anarazel.de	2023-09-10 09:47:49 -04:00
David Steele	1bd5530a59	Remove double spaces from comments and documentation. Double spaces have fallen out of favor in recent years because they no longer contribute to readability. We have been using single spaces and editing related paragraphs for some time, but now it seems best to update the remaining instances to avoid churn in unrelated commits and to make it clearer what spacing contributors should use.	2023-05-02 12:57:12 +03:00
David Steele	3fc3690dd7	PostgreSQL 16 Support. Add catalog version and WAL magic for PostgreSQL 16. The GUC to force parallel mode has be renamed so update that in the tests.	2023-04-27 10:30:50 +03:00
David Steele	f5e6bc2698	Allow page header checks to be skipped. These checks cause false negatives for page checksum verification when the page is encrypted because pd_upper might end up as 0 in the encrypted data. This issue is rare but reproducible given a large enough cluster. Make these checks optional, but leave them enabled by default.	2023-04-20 13:24:12 +03:00
David Steele	8240eb5da5	Autogenerate PostgreSQL versions. This will make adding/removing versions of PostgreSQL more reliable.	2023-04-16 17:41:27 +03:00
David Steele	a05bf6bb15	Rename PG_VERSION__STR constants to PG_VERSION__Z. This is more consistent with other zero-terminated string constants and also has the benefit of being shorter.	2023-04-16 17:32:24 +03:00
David Steele	b111599bad	Simplify object creation with OBJ_NEW_BEGIN() macro. Eliminate the boilerplate of declaring this and assigning memory to it, which is the same for the vast majority of object creations. Keep the old version of the macro as OBJ_NEW_BASE_BEGIN() for a few exceptions in the core code and (mostly) in the tests.	2023-03-28 15:05:18 +06:00
Stefan Fercot	740c2258e3	Add pg-version-force option for fork integration. Forks may update pg_control version or WAL magic without affecting the structures that pgBackRest depends on. This option forces pgBackRest to treat a cluster as the specified version when it cannot be automatically identified.	2023-03-09 08:23:15 +07:00
Stefan Fercot	4394479776	Fix typo and remove extraneous linefeed.	2023-02-28 08:47:51 +07:00
David Steele	d4070c9064	Reformat code with uncrustify. uncrustify has been configured to be as close to the current format as possible but the following changes were required: * Break long struct initializiers out of function calls. * Bit fields get extra spacing. * Strings that continue from the previous line no longer indented. * Ternary operators that do not fit on a single line moved to the next line first. * Align under parens for multi-line if statements. * Macros in header #if blocks are no longer indented. * Purposeful lack of function indentation in tests has been removed. Currently uncrustify does not completely reflow the code so there are some edge cases that might not be caught. However, this still represents a huge improvement and the formatting can be refined going forward. Support code for uncrustify will be in a followup commit.	2023-01-30 11:55:54 +07:00
David Steele	b2202c36d9	Fix formatting errors. Errors in our current (manually-maintained) code format discovered by uncrustify.	2023-01-30 11:16:31 +07:00
David Steele	ccee5c0fb1	Reduce log level of pgVersionFromStr() and pgVersionToStr().	2023-01-14 17:12:15 +07:00
David Steele	596c62c54e	Simply return in pgVersionToStr().	2023-01-14 15:25:25 +07:00
David Steele	de1dfb66ca	Refactor logging functions to never allocate memory. Allocating memory made these functions simpler but it meant that memory was leaking into the calling context when logging was enabled. It is not clear that this was an issue but it seems that trace level logging could result it a lot of memory usage depending on the use case. This also makes it possible to audit allocations returned to the calling context, which will be done in a followup commit. Also rename objToLog() to objNameToLog() since it seemed logical to name the new function objToLog().	2023-01-12 17:14:36 +07:00
David Steele	877bb2ac9e	Update LICENSE.txt and PostgreSQL copyright for 2023.	2023-01-03 08:26:44 +07:00
David Steele	f018912908	Split VR_EXTERN/FN_EXTERN macros from FV_EXTERN. This should make it a little clearer what the variable (VR) macros are doing since the declaration/definition cannot both be set to extern (but functions can). Splitting the variable macros out also allows them to be changed in the future with little churn, while changing the function macro creates a large amount of churn.	2023-01-02 15:24:51 +07:00
David Steele	4fb8a0ecdd	Add meson unity build and tests. This is immediately useful because it will detect any extern'd functions or variables that are not being used. It also detects functions or variables that are declared but not defined. If a FV/VR_EXTERN macro is missing it will be detected either because of a mismatch in the declaration/definition or because a new defined symbol will appear in the nm test. Eventually the unity build will be used to create a more optimized pgbackrest binary but that will need to wait.	2022-12-31 17:13:41 +07:00
David Steele	cebbf0d012	Remove unused functions. These functions were either added with the intention that they would be used or they became obsolete over time.	2022-12-30 16:26:48 +07:00
David Steele	77c721eb63	Remove support for PostgreSQL 9.0/9.1/9.2. Our new policy is to support ten versions of PostgreSQL, the five supported releases and the last five EOL releases. As of PostgreSQL 15, that means 9.0/9.1/9.2 are no longer supported by pgBackRest. Remove all logic associated with 9.0/9.1/9.2 and update the tests. Document the new support policy. Update InfoPg to read/write control versions for the history in backup.info, since we can no longer rely on the mappings being available. In theory this could have been an issue after removing 8.3/8.4 if anybody was using a version that old.	2022-12-20 12:20:47 +07:00
David Steele	65be4c64a9	Finalize catalog number for PostgreSQL 15 release.	2022-10-16 09:58:35 +13:00
David Steele	8fb61a809d	Add FN_INLINE_ALWAYS macro. Eliminate a lot of useless repetition for a commonly used pattern.	2022-09-08 18:36:03 -06:00
David Steele	c625f05a13	Unify code builder binaries into a single binary. Creating new binaries was convenient at first but has now become a maintenance issue. Solve this by combining that into a single binary that takes an additional parameter to indicate which code should be built. Also clean up path handling to make it easier to build code from the command line.	2022-07-20 17:45:39 -04:00
David Steele	f92ce674f7	Automatically create PostgreSQL version interfaces. Maintaining the version interfaces was complicated by the fact that each interface needed to be in separate compilation unit to avoid type conflicts. This also meant that various build/test files needed to be updated to add the new interfaces. Solve these problems by auto-generating all the interfaces into a single file. This is made possible by parsing defines and types out of the header files and creating macros to rename the types. At the end of the version interface everything is undef'd. Another benefit is that the auto-generated interfaces can be static and included directly into postgres/interface.c. Since some code generation is now always required for tests, change --no-gen to --min-gen in test.pl. It would also make sense to auto-generate the version defines in postgres/version.h, but that will be left for a future commit.	2022-06-06 13:52:56 -04:00
David Steele	2feaaeaac8	Add .inc extension to C files included in other C files. These files were never intended to be compiled on their own so the .c extension was a bit misleading. In particular Meson does not like .c files that are not intended to be compiled independently. Leave header files as is since they are already protected against being included more than once and are never expected to be compiled.	2022-05-31 16:06:41 -04:00
David Steele	c7a66ac1af	Improve memory usage of mem contexts. Each mem context can track child contexts, allocations, and a callback. Before this change memory was allocated for tracking all three even if they were not used for a particular context. This made mem contexts unsuitable for String and Variant objects since they are plentiful and need to be as small as possible. This change allows mem contexts to be configured to track any combination of child contexts, allocations, and a callback. In addition, the mem context can be configured to track a single child context and/or allocation, which saves memory and is a common use case. Another benefit is that Variants can own objects (e.g. KeyValue) that they encapsulate. All of this makes memory accounting simpler because mem contexts have names while allocations do not. No more memory is used than before since Variants and Strings still had to store the memory context they were originally allocated in so they could be easily freed. Update the String and Variant objects to use this new functionality. The custom strFree() and varFree() functions are no longer required and can now be a wrapper around objFree(). Lastly, this will allow strMove() and varMove() to be implemented and used in cases where strDup() and varDup() are being used to move a String or Variant to a new context. Since this will be a bit noisy it is saved for a future commit.	2022-05-18 10:52:01 -04:00
David Steele	20782c88bc	PostgreSQL 15 support. PostgreSQL 15 drops support for exclusive backup and renames the start/stop backup commands. This is based on the pgdg-testing repo since beta1 has not been released yet, but it seems unlikely that breaking changes will be made at this point. beta1 should be tagged just before our next release so we'll retest before the release.	2022-05-04 11:55:59 -04:00
David Steele	692fe496bd	Remove dependency on pg_database.datlastsysoid. This column has been removed in PostgreSQL 15. Rather than add a lot of special handling, it seems better just to update all versions to not depend on this column. Add centralized functions to identify the type of database (i.e. system or user) by name and use FirstNormalObjectId when a name is not available. The new query in the db module will still return the prior result for PostgreSQL <= 15, which will be stored in the manifest. This is important to preserve behavior when downgrading pgBackRest. There are no concerns here for PostgreSQL 15 since older versions of pgBackRest won't be able to restore backups for PostgreSQL 15 anyway.	2022-05-04 08:22:45 -04:00
David Steele	302e0c0921	Remove extra linefeed.	2022-05-03 16:53:29 -04:00
David Steele	bc46d4e37b	Add cvtZSubNTo*() functions. These functions allow conversion from substrings without needing to create a String or a temporary buffer. httpDateToTime() no longer requires a temp mem context. Also improve handling of month search to avoid an allocation. httpUriDecode() no longer requires a temp mem context. jsonReadStr() no longer requires a temp mem context. pgLsnFromWalSegment() no longer requires a temp mem context. pgVersionFromStr() no longer requires a temp mem context. Also do a bit of refactoring. storageGcsCvtTime() no longer leaks six Strings per call. storageS3CvtTime() no longer leaks six Strings per call.	2022-04-28 09:50:23 -04:00
David Steele	cca6df872a	Refactor functions in postgres/interface module and fix leak. pgLsnFromWalSegment() leaked two Strings. Refactor pgLsnRangeToWalSegmentList() to create the StringList in the calling context rather than moving it later.	2022-04-26 12:09:44 -04:00
David Steele	7900660d3a	Add strLstNewFmt(). Simplifies adding a formatted string to a list and removes a common cause of leaks.	2022-04-25 11:47:43 -04:00
David Steele	1e2b545ba4	Require type for FUNCTION_TEST_RETURN*() macros. This allows code to run after the return type has been generated in the case where it is an expression. No new functionality here yet, but this will be used by a future commit that audits memory usage.	2022-04-24 19:19:46 -04:00
David Steele	c304fafd45	Refactor PgClient to return results in Pack format. Packs support stronger typing than JSON and are more efficient. For the small result sets that we deal with efficiency is probably not very important, but this removes another place where we are using JSON instead of Pack. Push checking for result struct (e.g. single row) down into PgClient since it has easy access to this information rather than needing to parse the result set to find out. Refactor all code downstream that depends on PgClient results.	2022-04-20 08:36:53 -04:00
David Steele	cfd6c7ceb4	Use specific integer types in postgres/client and db unit tests. This will work better once we are able to transmit the results with stronger typing. Also remove int2 which was not being used.	2022-04-18 12:14:22 -04:00
David Steele	571dceefec	Add LENGTH_OF() macro. Determining the length of arrays that could be calculated at compile time was a bit piecemeal, with special macros used sometimes and with the math done directly other times. This macro makes the task easier, uses less space, and automatically adjusts when the type changes.	2022-04-07 19:00:15 -04:00

1 2 3 4

158 Commits