pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-12 10:04:14 +02:00

Author	SHA1	Message	Date
David Steele	01b8e2258f	Improve archive-push command fault tolerance. `3b8f0ef` missed some cases that could cause archive-push to fail: * Checking archive info. * Checking to see if a WAL segment already exists. These cases are now handled so archive-push can succeed on any valid repos.	2021-03-25 12:54:49 -04:00
Cynthia Shang	31c7824a4d	Allow stanza-* commands to be run remotely. The stanza-create, stanza-upgrade and stanza-delete were required to be run on the repository host. When there was only one repository allowed this was not a problem. However, with the introduction of multiple repository support, this becomes more of a burden to the user, therefore the stanza-create, stanza-upgrade and stanza-delete commands have been improved to allow for them to be run remotely.	2021-03-10 08:10:46 -05:00
David Steele	fe4ba455ed	Move configuration definition to src/build/config/config.yaml. Moving to YAML allows the configuration data to be read by C programs. Also go back to using YAML::XS since it is the only implementation that has proper boolean support.	2021-03-08 16:01:05 -05:00
David Steele	1dbb3bf50b	Multiple repository support. Up to four repositories may be configured. A potential benefit is the ability to have a local repository for fast restores and a remote repository for redundancy. Some commands, e.g. stanza-create/stanza-update, will automatically work with all configured repositories while others, e.g. stanza-delete, will require a repository to be specified using the repo option. See the command reference for details on which commands require the repository to be specified. Note that the repo option is not required when only repo1 is configured in order to maintain backward compatibility. However, the repo option is required when a single repo is configured as, e.g. repo2. This is to prevent command breakage if a new repository is added later. The archive-push command will always push WAL to the archive in all configured repositories but backups will need to be scheduled individually for each repository. In many cases this is desirable since backup types and retention will vary by repository. Likewise, restores must specify a repository. It is generally better to specify a repository for restores that has low latency/cost even if that means more recovery time. Only restore testing can determine which repository will be most efficient. For single repository configurations there should be no change in behavior.	2021-03-08 13:31:13 -05:00
David Steele	088662d986	GCS support for repository storage. GCS and GCS-compatible object stores can now be used for repository storage.	2021-03-05 12:13:51 -05:00
David Steele	d1aa765a9d	Consolidate less commonly used repository storage options. The following options are renamed as specified: repo1-azure-ca-file -> repo1-storage-ca-file repo1-azure-ca-path -> repo1-storage-ca-path repo1-azure-host -> repo1-storage-host repo1-azure-port -> repo1-storage-port repo1-azure-verify-tls -> repo1-storage-verify-tls repo1-s3-ca-file -> repo1-storage-ca-file repo1-s3-ca-path -> repo1-storage-ca-path repo1-s3-host -> repo1-storage-host repo1-s3-port -> repo1-storage-port repo1-s3-verify-tls -> repo1-storage-verify-tls The old option names (e.g. repo1-s3-port) will continue to work for repo1, but repo2, etc. will require the new names.	2021-03-02 13:51:40 -05:00
Cynthia Shang	13dc8e68d7	Make --repo optional for backup command. If there are multiple repos and the --repo option is not specified then backup will automatically select the highest priority repo.	2021-02-26 14:49:50 -05:00
Cynthia Shang	0ddc0380ff	Remove restore default repo from integration tests. The default is now to scan all repos so update the integration tests to reflect that.	2021-02-24 11:32:13 -05:00
David Steele	bec3e20b2c	Add archive-get command multi-repo support. Repositories will be searched in order for the requested archive file. Errors will be reported as warnings as long as a valid copy of the archive file is found.	2021-02-23 15:34:28 -05:00
David Steele	9154d73030	Add -g accidentally removed in `4e8d469f`. The tests all run fine without debug info but gdb and valgrind are a lot less useful without it.	2021-02-02 17:05:55 -05:00
David Steele	8e9f04cc32	Add HRN_INTEST_* define to indicate when a test is being run. This is useful for initialization that needs to be done for the test and all subsequent tests. Use the new defines to implement initialization for sockets and statistics.	2021-01-27 16:54:41 -05:00
David Steele	87eb081a8f	Make unit test builds incremental based on coverage in prior tests. When building tests only include files covered by the current test or by prior tests. This increases performance (less compilation and linking) and also helps detect cross-dependencies in the code. Since there are currently cross-dependencies the depend option is used to document them and allow compilation. The idea is to resolve them incrementally over time. Add the harness option to include harness modules when the minimum requirements for compilation are met. Add the feature option to indicate which features are now available in the harness (based on source modules already tested). This allows conditional compilation in harness modules when some features are not yet available.	2021-01-27 10:57:42 -05:00
David Steele	f669da7dcc	Use minio latest in documentation and integration tests. At one time Minio had stability problems with latest but that appears to be resolved for the last year or so. Use latest so we'll know if something breaks since Minio is frequently used in production.	2021-01-26 11:25:29 -05:00
David Steele	4e8d469f4d	Use configure to generate Makefile variables for unit tests. The unit test Makefile generation was a hodge-podge of constants and rules based on distros/versions that easily got out of date and did not work on an unknown system. All of this dates from the mixed Perl/C unit test implementation. Instead use configure to generate most of the important Makefile variables, which allows the unit tests to run on multiple platforms, e.g. MacOS and FreeBSD. There is plenty of work to be done here and not all the unit tests work on MacOS and FreeBSD for various reasons. As a POC update the MacOS and FreeBSD tests on Cirrus-CI to run a few command unit tests.	2021-01-24 16:24:14 -05:00
David Steele	ef2dc6d3f4	Add chmod to make file removal after tests more reliable. MacOS does not allow files to be removed recursively unless the owner has write and execute permissions on all the directories. Some tests leave the permissions in a bad state so fix them up before trying to delete.	2021-01-24 15:48:32 -05:00
David Steele	04e84da0ef	Allow the make command to be configured for test.pl.	2021-01-24 15:35:40 -05:00
David Steele	d2057c53bd	Use YAML::Any module instead of YAML::XS in Perl. YAML::XS requires libyaml so it not as portable as pure Perl versions of YAML. Instead of using YAML:PP just use the general YAML::Any module which uses whatever is installed. We are not concerned about performance for YAML so whatever works is fine.	2021-01-24 15:06:38 -05:00
David Steele	5cb9f166ec	Add stderr to unit test error messages. Messages on stderr were being lost due to the error suppression used to customize the error message. Also update the formatting to be more informative and concise.	2021-01-24 08:23:59 -05:00
Cynthia Shang	f32eb9b94e	Partial multi-repository implementation. Multi-repository implementations for the archive-push, check, info, stanza-create, stanza-upgrade, and stanza-delete commands. Multi-repo configuration is disabled so there should be no behavioral changes between these commands and their current single-repo implementations. Multi-repo documentation and integration tests are still in the multi-repo development branch. All unit tests work as multi-repo since they are able to bypass the configuration restrictions.	2021-01-21 15:21:50 -05:00
David Steele	4e56948128	Compensate for numeric auto conversion in newer Perls.	2021-01-19 12:07:05 -05:00
David Steele	065b5f93ae	Improve test coverage list handling. All unit tests now require full coverage so the "full" keyword is obsolete and has been removed. The covered code modules are simply listed, with only "no code" modules annotated.	2021-01-15 10:56:51 -05:00
David Steele	c2c702c09d	Add co7 package to support llvm. This is required for new package versions. Also remove the obsolete 9.2 package and update the supported versions list.	2021-01-13 17:32:42 -05:00
David Steele	96fd678662	Add job-retry and job-retry-interval options. These options specify the number of local worker job retries and the retry interval after one immediate retry. There is some value in allowing retries to be specified by the user but for the most part these options are for suppressing retries during testing, which can save a lot of time. The bug introduced in `d1d25c7` and fixed in `8b86d5e` also suggests it is better not to use retries in tests. Remove the default delayed retries for archive-get/archive-push, leaving only the immediate retry. These commands are retried by PostgreSQL so it doesn't make sense to do too many retries internally. These options are currently internal.	2021-01-11 15:15:25 -05:00
David Steele	6e7a3eb383	Remove archive-timeout from test in mock/archive. No timeout is expected here but the small timeout prevents errors from being thrown. This is not a bug since the error would be thrown on the next archive-get call but it does make the tests harder to debug when there is an error. It is not clear why there was a timeout here at all. It is likely cruft from a prior test or a copy/paste error.	2021-01-05 18:11:28 -05:00
David Steele	d01669aa58	Move most tests to Github Actions. Testing on Travis-CI has been getting slower (from ~18 minutes to 3-6 hours) and the travis-ci.org service will be terminated at the end of the year. Moving to travis-ci.com is an option but the quotas are too low for our purposes. Instead use Github Actions, which does not currently have quotas, and runs our current tests with just a few tweaks. This still leaves multi-architecture tests on Travis-CI but we may be able to run those and stay within the new quotas. Also fix a minor bug in restoreTest.c exposed by Github Actions using a different name for the user and group.	2020-12-09 15:19:01 -05:00
David Steele	e116b535e6	v2.31: Minor Bug Fixes and Improvements Bug Fixes: * Allow [, #, and space as the first character in database names. (Reviewed by Stefan Fercot, Cynthia Shang. Reported by Jefferson Alexandre.) * Create standby.signal only on PostgreSQL 12 when restore type is standby. (Fixed by Stefan Fercot. Reviewed by David Steele. Reported by Keith Fiske.) Features: * Expire history files. (Contributed by Stefan Fercot. Reviewed by David Steele.) * Report page checksum errors in info command text output. (Contributed by Stefan Fercot. Reviewed by Cynthia Shang.) * Add repo-azure-endpoint option. (Reviewed by Cynthia Shang, Brian Peterson. Suggested by Brian Peterson.) * Add pg-database option. (Reviewed by Cynthia Shang.) Improvements: * Improve info command output when a stanza is specified but missing. (Contributed by Stefan Fercot. Reviewed by Cynthia Shang, David Steele. Suggested by uspen.) * Improve performance of large file lists in backup/restore commands. (Reviewed by Cynthia Shang, Oscar.) * Add retries to PostgreSQL sleep when starting a backup. (Reviewed by Cynthia Shang. Suggested by Vitaliy Kukharik.) Documentation Improvements: * Replace RHEL/CentOS 6 documentation with RHEL/CentOS 8.	2020-12-07 09:55:00 -05:00
David Steele	31becf05b7	Add RHEL/CentOS 8 documentation. Update RHEL/CentOS 7 to cover the versions that were previously covered by RHEL/CentOS 6. Since RHEL/CentOS 7/8 work the same update the documentation logic and labels to reflect this compatibility.	2020-12-04 10:59:57 -05:00
David Steele	ec9f23d31f	Remove CentOS 6 from tests and documentation. CentOS6 EOL'd and the mirrors were swiftly deleted, leading to failures in tests and documentation. Remove CentOS 6 for now to get builds going again with the intention to replace it in the near future with CentOS 8.	2020-12-02 16:23:05 -05:00
David Steele	7fda83b31e	Allow multiple remote locks from the same main process. Improve locking on remote processes by introducing an exec-id that is unique to the main process and passed to all remote processes. This allows the remote processes to determine if a lock is held by a remote from the same main process. If so, the lock is allowed. The exec-id is also useful for associating remote logs with main logs for debugging purposes.	2020-11-23 12:41:54 -05:00
David Steele	2d38d2fc82	Reset additional options in real/all integration test. Currently indexes above 1 do not have dependencies checked, so this doesn't error. In a future commit we will enable those checks and this will error if it is not fixed.	2020-10-19 17:06:52 -04:00
David Steele	b096a25b49	Update test containers for PostgreSQL 13. Add older PostgreSQL versions to the u18 container that were not available before. This also updates all minor versions for prior versions of PostgreSQL.	2020-09-24 11:19:51 -04:00
David Steele	4cd61152f5	Update PostgreSQL 13 test catalog versions missed in `6bb111c1`. These values are not used by the Perl integration tests so maybe it would be better to remove them, but for now just update since they should not be changing again for PG13.	2020-09-17 12:39:30 -04:00
David Steele	959f77cd6a	Add general-purpose statistics collector. Currently each module that needs to collect statistics implements custom code to do so. This is cumbersome. Create a general purpose module for collecting and reporting statistics. Statistics are output in the log at detail level, but there are other uses they could be put to eventually. No new functionality is added. This is just a drop-in replacement for the current statistics, with the advantage of being more flexible. The new stats are slower because they involve a list lookup, but performance testing shows stats can be updated at about 40,000/ms which seems fast enough for our purposes.	2020-08-20 14:04:26 -04:00
David Steele	e81533bbab	Improve memory usage of unlogged relation detection in manifest build. This loop was using a lot of memory without freeing it at intervals. Rewrite to use char arrays when possible to reduce memory that needs to be allocated and freed.	2020-08-04 10:16:51 -04:00
David Steele	a260d4a53b	Add zstd to CentOS/RHEL 6 test container. Zstd is now required by the upstream yum package.	2020-07-28 08:09:10 -04:00
David Steele	24d2c5b277	Remove real/all integration tests now covered by unit tests. Remove all check and stanza-* tests except for the ones that are intended to succeed. The successful tests show that the queries run with expected results against each version of PG which should also validate queries for the failure tests in the unit tests. Also remove the tests for --no-online backups since they don't require a database and are well tested in the unit tests.	2020-07-16 13:57:14 -04:00
David Steele	aa4e13b665	Move encrypted files as raw in integration tests. The encryption key should not be changed when moving a file so no need to decrypt/encrypt.	2020-07-16 11:27:14 -04:00
David Steele	88b0f6245d	Run non version specific real/tests on the expect version. There are a few non version specific tests that need to be run in integration because we can't get coverage in the unit tests. To save some time we'll only run those tests against the same version we use for expect testing.	2020-07-15 13:19:16 -04:00
Stefan Fercot	d3dd32a031	Add expire-auto option. This allows automatic expiration after a successful backup to be disabled.	2020-07-14 08:12:25 -04:00
David Steele	d5df3974b5	Read segment size from WAL headers. This allows validation of the WAL segment size for PostgreSQL versions <= 10.	2020-07-09 17:32:36 -04:00
David Steele	682ac656f5	Fix restore --force acting like --force --delta. This caused restore to replace files based on timestamp and size rather than overwriting, which meant some files that should have been updated were left unchanged. Normal restore and restore --delta were not affected by this issue.	2020-07-06 15:03:24 -04:00
David Steele	3f4371d7a2	Azure support for repository storage. Azure and Azure-compatible object stores can now be used for repository storage. Currently only shared key authentication is supported but SAS will be added soon.	2020-07-02 16:24:34 -04:00
David Steele	ea04ec7b3f	Disable query parallelism in PostgreSQL sessions used for backup control. There is no need to have parallelism enabled in a backup control session. In particular, 9.6 marks pg_stop_backup() as parallel-safe but an error will be thrown if pg_stop_backup() is run in a worker.	2020-06-25 08:02:48 -04:00
David Steele	a3e5e66f05	Simplify test matrix for real/all tests. Test matrices were previously simplified for the mock/* tests (e.g. `d4410611`, `d489eb87`) but not for real/all since the rules for which tests would run with which options was extremely complex. This only got more complex when new compression formats were added. Because the loop-generated matrix was so large, mosts tests were skipped for most option combinations following arcane logic which was nearly impossible to decipher even when reading the code, and completely impossible from the test.pl interface. As a consequence, important tests got excluded. For example, backup from standby was excluded for most versions of PostgreSQL because it was only run once per distro, against the latest version to be included in that distro. Simplify the tests by having a single run per PostgreSQL version and vary test parameters according to the capabilities of each version and the underlying distro. So, ZST testing is based on whether the distro supports ZST. Every test is run for each set of parameters based on the capabilities of the PostgreSQL version, e.g. backup from standby is not attempted on versions that don't support it. Note that since more tests are running the overall time to run the mock/all tests has increased by about 20-25%. Some time may be saved my removing tests that are adequately covered by unit tests but that should the subject of another commit. Another option would be to limit some non version-specific tests to a single, well defined version of PostgreSQL, .e.g the version that is run by expect tests, currently 9.6. The motivation for this refactor is that new storage drivers are coming and the loop-generated test matrix simply was not up to the task of adding them. The following is an example of the new test log (note longer runtime of each test): module=real, test=all, run=1, pg-version=10 (106.91s) module=real, test=all, run=1, pg-version=9.5 (151.09s) module=real, test=all, run=1, pg-version=9.2 (123.11s) module=real, test=all, run=1, pg-version=9.1 (129s) vs. the old test log (sub-second tests were skipped entirely): module=real, test=all, run=2, pg-version=10 (0.31s) module=real, test=all, run=3, pg-version=10 (0.26s) module=real, test=all, run=4, pg-version=10 (60.39s) module=real, test=all, run=1, pg-version=10 (69.12s) module=real, test=all, run=6, pg-version=10 (34s) module=real, test=all, run=5, pg-version=10 (42.75s) module=real, test=all, run=2, pg-version=9.5 (0.21s) module=real, test=all, run=3, pg-version=9.5 (0.21s) module=real, test=all, run=4, pg-version=9.5 (0.21s) module=real, test=all, run=5, pg-version=9.5 (0.26s) module=real, test=all, run=6, pg-version=9.5 (0.21s) module=real, test=all, run=1, pg-version=9.2 (72.78s) module=real, test=all, run=2, pg-version=9.2 (0.26s) module=real, test=all, run=3, pg-version=9.2 (0.31s) module=real, test=all, run=4, pg-version=9.2 (0.21s) module=real, test=all, run=5, pg-version=9.2 (0.21s) module=real, test=all, run=6, pg-version=9.2 (0.21s) module=real, test=all, run=1, pg-version=9.5 (88.41s) module=real, test=all, run=2, pg-version=9.1 (0.21s) module=real, test=all, run=3, pg-version=9.1 (0.26s) module=real, test=all, run=4, pg-version=9.1 (0.21s) module=real, test=all, run=5, pg-version=9.1 (0.31s) module=real, test=all, run=6, pg-version=9.1 (0.26s) module=real, test=all, run=1, pg-version=9.1 (72.4s)	2020-06-23 13:44:29 -04:00
David Steele	d560c1bf19	Ignore "unsupported frontend protocol" error on Centos/RHEL 6. The unsupported version error is showing up on older versions of PostgreSQL (e.g. 9.1, 9.2) on RHEL6 when setting up a standby with streaming replication. The error occurs when a client does not properly send a version number and it's not clear why it is happening here, but it does not appear to have anything to do with pgBackRest and only affects RHEL6, i.e. 9.1 and 9.2 do not show this error on other distros. For now ignore the error since RHEL6 is nearly EOL.	2020-06-23 12:42:46 -04:00
David Steele	3d74ec1190	Use PostgreSQL instead of postmaster where appropriate. Using postmaster in messages was not very helpful since users rarely interact directly with the postmaster. Using PostgreSQL instead seems clearer.	2020-06-17 15:14:59 -04:00
David Steele	417818dcca	Add --no-coverage-report to test.pl to disable report generation. There is no sense in generating detailed coverage reports in CI environments where they will never be seen. It takes time and format differences in some older versions can cause problems in the report generation code. Note that missing coverage will still be reported on stdout and the test will fail.	2020-06-17 15:07:30 -04:00
David Steele	0680cfc8dc	Rename most instances of master to primary in tests. This aligns better with general PostgreSQL usage and our own documentation (updated in `4bcef702`). Usage in the backup.manifest tests has not been updated since it might break the file format.	2020-06-16 14:06:38 -04:00
David Steele	ec7b7c5a3e	PostgreSQL 13 beta1 support. There don't appear to be any behavioral changes since PostgreSQL 12 and all the tests pass. Changes to the control/catalog/WAL versions in subsequent betas may break compatibility but pgBackRest will be updated with each release to keep pace.	2020-05-21 13:46:16 -04:00
David Steele	688ec2a8f5	Use an extension to denote vendorized code. Vendorized code is copied from another project when a library is not available and a git subproject won't work. Currently all the vendorized code is copied from PostgreSQL but it makes sense to have a more general mechanism for indicating vendorized code. The .vendor extension will be used to denote vendorized code in the same way that .auto is used to denote auto-generated code.	2020-05-18 19:11:26 -04:00

1 2 3 4 5 ...

649 Commits