pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	5a4b91f90a	v2.28: Azure Repository Storage Bug Fixes: * Fix restore --force acting like --force --delta. This caused restore to replace files based on timestamp and size rather than overwriting, which meant some files that should have been updated were left unchanged. Normal restore and restore --delta were not affected by this issue. (Reviewed by Cynthia Shang.) Features: * Azure support for repository storage. (Reviewed by Cynthia Shang, Don Seiler.) * Add expire-auto option. This allows automatic expiration after a successful backup to be disabled. (Contributed by Stefan Fercot. Reviewed by Cynthia Shang, David Steele.) Improvements: * Asynchronous S3 multipart upload. (Reviewed by Stephen Frost.) * Automatic retry for backup, restore, archive-get, and archive-push. (Reviewed by Cynthia Shang.) * Disable query parallelism in PostgreSQL sessions used for backup control. (Reviewed by Stefan Fercot.) * PostgreSQL 13 beta2 support. Changes to the control/catalog/WAL versions in subsequent betas may break compatibility but pgBackRest will be updated with each release to keep pace. * Improve handling of invalid HTTP response status. (Reviewed by Cynthia Shang.) * Improve error when pg1-path option missing for archive-get command. (Reviewed by Cynthia Shang.) * Add hint when checksum delta is enabled after a timeline switch. (Reviewed by Matt Bunter, Cynthia Shang.) * Use PostgreSQL instead of postmaster where appropriate. (Reviewed by Cynthia Shang.) Documentation Bug Fixes: * Fix incorrect example for repo-retention-full-type option. (Reported by Höseyin Sönmez.) * Remove internal commands from HTML and man command references. (Reported by Cynthia Shang.) Documentation Improvements: * Update PostgreSQL versions used to build user guides. Also add version ranges to indicate that a user guide is accurate for a range of PostgreSQL versions even if it was built for a specific version. (Reviewed by Stephen Frost.) * Update FAQ for expiring a specific backup set. (Contributed by Cynthia Shang. Reviewed by David Steele.) * Update FAQ to clarify default PITR behavior. (Contributed by Cynthia Shang. Reviewed by David Steele.)	2020-07-20 08:57:22 -04:00
David Steele	67d9d3350c	Update recovery.conf in user guide for PostgreSQL >= 12. The postgresql.auto.conf file was being used instead of recovery.conf, but there were still instances in the text that used recovery.conf. Update to postgresql.auto.conf for PostgreSQL >= 10 and change wording where needed.	2020-07-17 13:27:14 -04:00
David Steele	24d2c5b277	Remove real/all integration tests now covered by unit tests. Remove all check and stanza-* tests except for the ones that are intended to succeed. The successful tests show that the queries run with expected results against each version of PG which should also validate queries for the failure tests in the unit tests. Also remove the tests for --no-online backups since they don't require a database and are well tested in the unit tests.	2020-07-16 13:57:14 -04:00
David Steele	332f2fb7f5	Update PostgreSQL versions used to build user guides. Also add version ranges to indicate that a user guide is accurate for a range of PostgreSQL versions even if it was built for a specific version.	2020-07-16 12:54:52 -04:00
David Steele	232d14993d	Fix CentOS path for PostgreSQL >= 10 logs in user guide. Debian worked since logs default to another location, but CentOS/RHEL on PostgreSQL >= 10 requires the new location.	2020-07-16 12:45:24 -04:00
Stefan Fercot	047d85c263	Automatically determine cipher passphrase in repo-get command. The prior code was only able to use the main passphrase automatically and expected sub passphrases to be specified for each operation. This was fine for testing but hardly sufficient for a user-facing feature. Update the code to determine which passphrase to use for any file in the repository and error when an invalid file or location is selected. The repo-get command is still internal for now, but with this improvement it should be ready to be made public.	2020-07-16 12:24:03 -04:00
David Steele	aa4e13b665	Move encrypted files as raw in integration tests. The encryption key should not be changed when moving a file so no need to decrypt/encrypt.	2020-07-16 11:27:14 -04:00
David Steele	50ff5d905e	Update comment and parameter in HttpRequest.	2020-07-15 13:54:01 -04:00
David Steele	88b0f6245d	Run non version specific real/tests on the expect version. There are a few non version specific tests that need to be run in integration because we can't get coverage in the unit tests. To save some time we'll only run those tests against the same version we use for expect testing.	2020-07-15 13:19:16 -04:00
David Steele	574f36c9d2	Rename httpRequest() to httpRequestResponse() and fix comment.	2020-07-14 15:14:41 -04:00
David Steele	620a8d17cf	Automatic retry for backup, restore, archive-get, and archive-push. If a local command, e.g. backupFile(), fails it will stop the entire process. Instead, retry local commands to deal with transient errors. Remove special logic in the S3 storage driver to retry RequestTimeTooSkewed errors since this is now handled by the general retry mechanism in the places where it is most likely to happen, i.e. file read/write. Also, this error should have been entirely eliminated by the asynchronous TLS implementation.	2020-07-14 15:05:31 -04:00
David Steele	91c7adc834	Allow redactions for HTTP queries. The Azure storage driver exposes secrets in the query when using SAS authorization. These secrets can show up during logging or when an error occurs. Allow redaction of queries to prevent secrets from being exposed in logs and errors.	2020-07-14 13:09:48 -04:00
Stefan Fercot	d3dd32a031	Add expire-auto option. This allows automatic expiration after a successful backup to be disabled.	2020-07-14 08:12:25 -04:00
David Steele	083350aeda	Inline Buffer functions when possible. It makes sense to inline these functions for the same reasons String functions were inlined in `fbff2995` and `f1edf0ad`.	2020-07-10 08:18:15 -04:00
David Steele	f1edf0ad10	Inline strSize(). Inlining strPtr() in `fbff2995` does not seem to have caused any problems so do the same with strSize().	2020-07-10 08:00:18 -04:00
David Steele	d5df3974b5	Read segment size from WAL headers. This allows validation of the WAL segment size for PostgreSQL versions <= 10.	2020-07-09 17:32:36 -04:00
David Steele	2f7823c627	Add shared access signature (SAS) authorization for Azure. A shared access signature (SAS) provides granular, delegated access to resources in a storage account. This is often preferable to using a shared key which provides more access and is a greater security risk if compromised.	2020-07-09 14:46:48 -04:00
David Steele	511e5db5bf	Improve buffer size limit implementation. Rework size limits so that this->size is always the current size no matter how much is allocated. Most importantly, this removes the conditional in bufSize(), which makes it a better candidate for inlining.	2020-07-09 11:16:45 -04:00
David Steele	15502f5b4b	Remove bufNewUseC(). This was used in the Perl LibC interface to wrap Perl-allocated buffers but is no longer needed since LibC was removed.	2020-07-09 07:16:15 -04:00
David Steele	18f36752ae	Add ASSERT_INLINE() macro. When coverage testing ASSERT() macros in inline functions will be expanded and won't be recognized in our coverage rules that ignore ASSERT(). Since there are then uncovered conditions the coverage is incomplete. The prior method required copying several lines of code and an explanatory comment into each inline function. Instead create a special macro for inclusion in inline functions. Another possibility would be to automatically identify inline functions and add them to the coverage exclusions but that's an idea for another day.	2020-07-08 17:31:48 -04:00
David Steele	eaa05fdc49	Write HTTP request as a buffer to hide secrets. The prior method of writing headers as strings could expose secrets in trace level logs. Instead write the entire request as a buffer to prevent secrets from being logged and also reduce the amount of logging.	2020-07-08 15:07:29 -04:00
David Steele	dd9e14b628	Add pgLsnFromWalSegment(). Provides the reverse operation for pgLsnToWalSegment().	2020-07-08 12:25:39 -04:00
David Steele	a27ff7c335	Remove dead test code that should have been removed in `3f4371d7`.	2020-07-07 08:24:08 -04:00
David Steele	57ddd38c51	S3 storage driver cleanup inspired by Azure review. These improvements were suggested during the review of `3f4371d7` and it seemed a good idea to apply them to the S3 driver as well.	2020-07-06 16:57:12 -04:00
David Steele	682ac656f5	Fix restore --force acting like --force --delta. This caused restore to replace files based on timestamp and size rather than overwriting, which meant some files that should have been updated were left unchanged. Normal restore and restore --delta were not affected by this issue.	2020-07-06 15:03:24 -04:00
Cynthia Shang	8409ab7b2b	Fix typo.	2020-07-06 12:04:35 -04:00
David Steele	cf284fbe8a	Add httpUriDecode(), httpQueryNewStr(), and httpQueryMerge(). httpUriDecode() reverses the encoding in httpUriEncode(). httpQueryNewStr() creates a new HttpQuery by parsing a query string. httpQueryMerge() merges the contents of one query into another query.	2020-07-06 07:48:12 -04:00
David Steele	3f4371d7a2	Azure support for repository storage. Azure and Azure-compatible object stores can now be used for repository storage. Currently only shared key authentication is supported but SAS will be added soon.	2020-07-02 16:24:34 -04:00
Cynthia Shang	3e2c8874f7	Fix typo.	2020-07-01 07:39:29 -04:00
David Steele	be16bf69a8	Remove internal commands from HTML and man command references. Some of these commands will be made public in the future but for now their interfaces are not stable so they remain internal.	2020-06-29 15:07:17 -04:00
David Steele	c2dea180fb	Remove redundant storage type constants. These constants predate the C storage drivers which now provide their own constants.	2020-06-26 16:50:29 -04:00
David Steele	1416dc3225	Add missing end braces in debug log functions. These had no effect on functionality but made debug messages harder to read.	2020-06-26 16:44:10 -04:00
David Steele	96adf8e513	PostgreSQL 13 beta2 support. There don't appear to be any behavioral changes since PostgreSQL 12 and all the tests pass. Changes to the control/catalog/WAL versions in subsequent betas may break compatibility but pgBackRest will be updated with each release to keep pace.	2020-06-26 07:44:56 -04:00
David Steele	e53de3ce46	Add/update comments in storage/s3 module.	2020-06-26 06:50:20 -04:00
David Steele	974cc10b90	Minor improvements to storage/s3 unit test.	2020-06-26 06:46:25 -04:00
David Steele	e46eeefada	Add review for `ea04ec7b`.	2020-06-26 06:34:21 -04:00
David Steele	ea04ec7b3f	Disable query parallelism in PostgreSQL sessions used for backup control. There is no need to have parallelism enabled in a backup control session. In particular, 9.6 marks pg_stop_backup() as parallel-safe but an error will be thrown if pg_stop_backup() is run in a worker.	2020-06-25 08:02:48 -04:00
David Steele	c5a507b9a6	Add comment about setting application_name.	2020-06-24 19:33:56 -04:00
David Steele	ce98e326e1	Replace HRNPQ_MACRO_OPEN_92() test macro with HRNPQ_MACRO_OPEN_GE_92().	2020-06-24 18:40:19 -04:00
David Steele	f55cb386d4	Fix versions passed to HRNPQ_MACRO_OPEN_GE_92() test macro. These were not noticed because currently 9.3 and 9.6 behave the same on open.	2020-06-24 18:33:20 -04:00
David Steele	c5892d1291	Asynchronous S3 multipart upload. When uploading large files the upload is split into multiple parts which are assembled at the end to create the final file. Previously we waited until each part was acknowledged before starting on the processing (i.e. compression, etc.) of the next part. Now, the request for each part is sent while processing continues and the response is read just before sending the request for the next part. This asynchronous method allows us to continue processing while the S3 server formulates a response. Testing from outside AWS in a high-bandwidth, low-latency environment showed a 35% improvement in the upload time of 1GB files. The time spent waiting for multipart notifications was reduced by ~300% (this measurement included the final part which is not uploaded asynchronously). There are still some possible improvements: 1) the creation of the multipart id could be made asynchronous when it looks like the upload will need to be multipart (this may incur cost if the upload turns out not to be multipart). 2) allow more than one async request (this will use more memory). A fair amount of refactoring was required to make the HTTP responses asynchronous. This may seem like overkill but having well-defined request, response, and session objects will also be advantageous for the upcoming HTTP server functionality. Another advantage is that the lifecycle of an HttpSession is better defined. We only want to reuse sessions that complete the request/response cycle successfully, otherwise we consider the session to be in a bad state and would prefer to start clean with a new one. Previously, this required complex notifications to mark a session as "successfully done". Now, ownership of the session is passed to the request and then the response and only returned to the client after a successful response. If an error occurs anywhere along the way the session will be automatically closed by the object destructor when the request/response object is freed (depending on which one currently owns the session).	2020-06-24 13:44:00 -04:00
David Steele	45d9b03136	Add strCatZ(). strCat() did not follow our convention of appending Z to functions that accept zero-terminated strings rather than String objects. Add strCatZ() to accept zero-terminated strings and update strCat() to accept String objects. Use LF_STR where appropriate but don't use other String constants because they do not improve readability.	2020-06-24 12:09:24 -04:00
David Steele	dab00e2010	Remove expect logs obsoleted in `a3e5e66f`. These expect logs are no longer used but are not automatically removed by test.pl.	2020-06-24 07:45:00 -04:00
David Steele	a3e5e66f05	Simplify test matrix for real/all tests. Test matrices were previously simplified for the mock/* tests (e.g. `d4410611`, `d489eb87`) but not for real/all since the rules for which tests would run with which options was extremely complex. This only got more complex when new compression formats were added. Because the loop-generated matrix was so large, mosts tests were skipped for most option combinations following arcane logic which was nearly impossible to decipher even when reading the code, and completely impossible from the test.pl interface. As a consequence, important tests got excluded. For example, backup from standby was excluded for most versions of PostgreSQL because it was only run once per distro, against the latest version to be included in that distro. Simplify the tests by having a single run per PostgreSQL version and vary test parameters according to the capabilities of each version and the underlying distro. So, ZST testing is based on whether the distro supports ZST. Every test is run for each set of parameters based on the capabilities of the PostgreSQL version, e.g. backup from standby is not attempted on versions that don't support it. Note that since more tests are running the overall time to run the mock/all tests has increased by about 20-25%. Some time may be saved my removing tests that are adequately covered by unit tests but that should the subject of another commit. Another option would be to limit some non version-specific tests to a single, well defined version of PostgreSQL, .e.g the version that is run by expect tests, currently 9.6. The motivation for this refactor is that new storage drivers are coming and the loop-generated test matrix simply was not up to the task of adding them. The following is an example of the new test log (note longer runtime of each test): module=real, test=all, run=1, pg-version=10 (106.91s) module=real, test=all, run=1, pg-version=9.5 (151.09s) module=real, test=all, run=1, pg-version=9.2 (123.11s) module=real, test=all, run=1, pg-version=9.1 (129s) vs. the old test log (sub-second tests were skipped entirely): module=real, test=all, run=2, pg-version=10 (0.31s) module=real, test=all, run=3, pg-version=10 (0.26s) module=real, test=all, run=4, pg-version=10 (60.39s) module=real, test=all, run=1, pg-version=10 (69.12s) module=real, test=all, run=6, pg-version=10 (34s) module=real, test=all, run=5, pg-version=10 (42.75s) module=real, test=all, run=2, pg-version=9.5 (0.21s) module=real, test=all, run=3, pg-version=9.5 (0.21s) module=real, test=all, run=4, pg-version=9.5 (0.21s) module=real, test=all, run=5, pg-version=9.5 (0.26s) module=real, test=all, run=6, pg-version=9.5 (0.21s) module=real, test=all, run=1, pg-version=9.2 (72.78s) module=real, test=all, run=2, pg-version=9.2 (0.26s) module=real, test=all, run=3, pg-version=9.2 (0.31s) module=real, test=all, run=4, pg-version=9.2 (0.21s) module=real, test=all, run=5, pg-version=9.2 (0.21s) module=real, test=all, run=6, pg-version=9.2 (0.21s) module=real, test=all, run=1, pg-version=9.5 (88.41s) module=real, test=all, run=2, pg-version=9.1 (0.21s) module=real, test=all, run=3, pg-version=9.1 (0.26s) module=real, test=all, run=4, pg-version=9.1 (0.21s) module=real, test=all, run=5, pg-version=9.1 (0.31s) module=real, test=all, run=6, pg-version=9.1 (0.26s) module=real, test=all, run=1, pg-version=9.1 (72.4s)	2020-06-23 13:44:29 -04:00
David Steele	d560c1bf19	Ignore "unsupported frontend protocol" error on Centos/RHEL 6. The unsupported version error is showing up on older versions of PostgreSQL (e.g. 9.1, 9.2) on RHEL6 when setting up a standby with streaming replication. The error occurs when a client does not properly send a version number and it's not clear why it is happening here, but it does not appear to have anything to do with pgBackRest and only affects RHEL6, i.e. 9.1 and 9.2 do not show this error on other distros. For now ignore the error since RHEL6 is nearly EOL.	2020-06-23 12:42:46 -04:00
David Steele	04b2e4a831	Increase log level of checkManifest() to debug. This function is only called once and is very likely throw errors so debug level is more appropriate.	2020-06-23 09:24:18 -04:00
David Steele	1aedc75b03	Rename http/Http to HTTP in comments and messages. HTTP is an acronym so it should be capitalized. Coding conventions dictate otherwise for function and type names but that should not have been propagated to comments and messages.	2020-06-21 11:47:41 -04:00
David Steele	911384d9b9	Add httpDateFromTime(). Also rename httpLastModifiedToTime() to httpDateToTime() since the RFC-2822 date format used by HTTP is used in all Date headers.	2020-06-21 11:07:18 -04:00
David Steele	fbff29957c	Inline strPtr() to increase profiling accuracy. strPtr() is called more than any other function and during profiling (with or without optimization) it can end up using a disproportionate amount of the total runtime. Even though it is fast, the profiler has a minimum resolution for each function call so strPtr() will often end up towards the top of the list even though the real runtime is quite small. Instead, inline strPtr() and indicate to gcc that it should be inlined even for non-optimized builds, since that's how profiles are usually generated. To make strPtr() smaller require "this" to be non-NULL and add another function, strPtrNull(), to deal with the few cases where we need NULL handling. As a bonus this makes the executable about 1% smaller even when compared to a prior optimized build which would inline some percentage of strPtr() calls.	2020-06-18 13:13:55 -04:00
David Steele	3d74ec1190	Use PostgreSQL instead of postmaster where appropriate. Using postmaster in messages was not very helpful since users rarely interact directly with the postmaster. Using PostgreSQL instead seems clearer.	2020-06-17 15:14:59 -04:00

... 3 4 5 6 7 ...

3069 Commits