pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	4a73a02863	Simplify manifest defaults. Manifest defaults for user, group, and mode were previously generated by scanning the data to find the most common values. This was very accurate but slow and complicated. It could also lead to surprising changes in the manifest when a default value suddenly changed. Instead, use the $PGDATA path to generate defaults. In the vast majority of cases the same user/group should own all the path/files and the default file mode is easily derived from the path mode. There may be some edge cases where this generates larger manifests, but in general it reduces time and complexity when saving the manifest. Remove the MCV code since it is longer longer used.	2022-01-21 15:22:48 -05:00
David Steele	b0db4b8ff0	Simplify base path mode in mock/all integration tests. Change the mode back to 0700 earlier to reduce churn in the expect logs. This will be especially important in a future commit that gets the defaults exclusively from the base path.	2022-01-21 08:52:51 -05:00
David Steele	8c062e1af8	Remove primary flag from manifest. This flag was only being used by the backup command after manifestNewBuild() and had no other uses. There was a time when it was important for integration testing but the unit tests now fulfill this role. Since backup is the only code concerned with the primary flag, move the code into the backup module. We don't have any cross-version testing but this change was tested manually with the most recent version of pgBackRest to make sure it was tolerant of the missing primary info. When an older version of pgBackRest loads a newer manifest the primary flag will always be set to false, which is fine since it is not used.	2022-01-20 14:01:10 -05:00
Reid Thompson	6e635764a6	Match backup log size with size reported by info command. Properly log the size of files copied during the backup, matching the backup size returned from the info command. In the reference issue, the incremental backup after switchover logs the size of all files evaluated rather than only the size of the files copied in the backup.	2021-11-09 13:24:56 -05:00
David Steele	ccc255d3e0	Add TLS Server. The TLS server is an alternative to using SSH for protocol connections to remote hosts. This command is currently experimental and intended only for trial and testing. As such, the new commands and options will not show up in the command-line help unless directly requested.	2021-10-18 14:32:41 -04:00
David Steele	5701620408	Rename manifest file primary flag in tests.	2021-10-13 19:02:58 -04:00
Stefan Fercot	34f7873432	Report backup file validation errors in backup.info. Currently errors found during the backup are only available in text output when specifying --set. Add a flag to backup.info that is available in both the text and json output when --set is not specified. This at least provides the basic info that an error was found in the cluster during the backup, though details are still only available as described above.	2021-10-04 13:45:53 -04:00
David Steele	a0bdfa436c	Log backup file total and restore size/file total. The backup size was a bit off because it did not include any files (e.g. backup_label, WAL files) that were added to the manifest after the main copy. To fix this move the log message to the very end of the backup. Add size/file total log message to restore since it did not exist before.	2021-08-11 13:39:36 -04:00
David Steele	849ab343aa	Change level of backup/restore copied file logging to detail. The log level for copied files in the backup/restore commands has been changed to detail. This makes the info log level less noisy but if these messages are required then set the log level for the backup/restore commands to detail.	2021-07-09 13:50:35 -04:00
David Steele	320c6e1aad	Remove stanza archive spool path on restore. Remove stanza archive spool path so existing files do not interfere with the new cluster. For instance, old archive-push acknowledgements could cause a new cluster to skip archiving. This should not happen if a new timeline is selected but better to be safe. Missing stanza spool paths are ignored. Also add new path expression STORAGE_SPOOL_ARCHIVE to easily access this path.	2021-05-18 15:49:22 -04:00
Stefan Fercot	6942ff569d	Include recreated system databases during selective restore. Some standard system databases (e.g. postgres) may be recreated by the user and have an OID that makes them look like user databases. Identify the standard three system databases (template0, template1, postgres) and restore them non-zeroed no matter what OID they have.	2021-03-15 12:54:14 -04:00
Cynthia Shang	13dc8e68d7	Make --repo optional for backup command. If there are multiple repos and the --repo option is not specified then backup will automatically select the highest priority repo.	2021-02-26 14:49:50 -05:00
Cynthia Shang	0ddc0380ff	Remove restore default repo from integration tests. The default is now to scan all repos so update the integration tests to reflect that.	2021-02-24 11:32:13 -05:00
Cynthia Shang	118d9e64fe	Enhance restore command multi-repo support. The restore command automatically defaults to selecting the latest backup from a single repository. With multiple repositories configured, the restore command will now default to selecting the latest backup from the first repository where backups exist. The order in which the repositories are checked is dictated by the pgbackrest.conf order. To select from a specific repository, the --repo option can be passed (e.g. --repo=1). The --set option can be passed if a backup other than the latest is desired.	2021-02-23 16:17:27 -05:00
Cynthia Shang	e28f6f11e9	Expire continues if an error occurs processing a repository. Errors are logged to the log file rather than thrown. If, after processing all repos, one or more errors occurred, then a single error error will be thrown to indicate there were errors and the log file should be inspected. Also update log messages to be more consistent with new patterns.	2021-02-23 12:20:02 -05:00
Cynthia Shang	d5b919e657	Update expire command log messages with repo prefix. In preparation for multi-repo support, a repo tag is added in this commit to the expire command log and error messages. This change also affects the expect logs and the user-guide. The format of the tag is "repoX:" where X is the repo key used in the configuration. Until multi-repo support has been completed, this tag will always be "repo1:".	2021-01-27 16:33:01 -05:00
Cynthia Shang	f32eb9b94e	Partial multi-repository implementation. Multi-repository implementations for the archive-push, check, info, stanza-create, stanza-upgrade, and stanza-delete commands. Multi-repo configuration is disabled so there should be no behavioral changes between these commands and their current single-repo implementations. Multi-repo documentation and integration tests are still in the multi-repo development branch. All unit tests work as multi-repo since they are able to bypass the configuration restrictions.	2021-01-21 15:21:50 -05:00
David Steele	96fd678662	Add job-retry and job-retry-interval options. These options specify the number of local worker job retries and the retry interval after one immediate retry. There is some value in allowing retries to be specified by the user but for the most part these options are for suppressing retries during testing, which can save a lot of time. The bug introduced in `d1d25c7` and fixed in `8b86d5e` also suggests it is better not to use retries in tests. Remove the default delayed retries for archive-get/archive-push, leaving only the immediate retry. These commands are retried by PostgreSQL so it doesn't make sense to do too many retries internally. These options are currently internal.	2021-01-11 15:15:25 -05:00
David Steele	108038292c	Audit options valid for expire command.	2020-12-31 12:13:20 -05:00
David Steele	0acfcb669e	Audit options valid for start/stop commands.	2020-12-31 11:10:48 -05:00
David Steele	7fda83b31e	Allow multiple remote locks from the same main process. Improve locking on remote processes by introducing an exec-id that is unique to the main process and passed to all remote processes. This allows the remote processes to determine if a lock is held by a remote from the same main process. If so, the lock is allowed. The exec-id is also useful for associating remote logs with main logs for debugging purposes.	2020-11-23 12:41:54 -05:00
David Steele	3d74ec1190	Use PostgreSQL instead of postmaster where appropriate. Using postmaster in messages was not very helpful since users rarely interact directly with the postmaster. Using PostgreSQL instead seems clearer.	2020-06-17 15:14:59 -04:00
David Steele	0680cfc8dc	Rename most instances of master to primary in tests. This aligns better with general PostgreSQL usage and our own documentation (updated in `4bcef702`). Usage in the backup.manifest tests has not been updated since it might break the file format.	2020-06-16 14:06:38 -04:00
David Steele	b5dd14e6f3	Make storage type more generic in the integration tests. Rather than bS3 use strStorage which can indicate more than two storage types. For the moment there are still only two storage types but this change is required before more can be added.	2020-05-12 18:55:20 -04:00
Cynthia Shang	cdebfb09e0	Add time-based retention for full backups. The --repo-retention-full-type option allows retention of full backups based on a time period, specified in days. The new option will default to 'count' and therefore will not affect current installations. Setting repo-retention-full-type to 'time' will allow the user to use a time period, in days, to indicate full backup retention. Using this method, a full backup can be expired only if the time the backup completed is older than the number of days set with repo-retention-full (calculated from the moment the 'expire' command is run) and at least one full backup meets the retention period. If archive retention has not been configured, then the default settings will expire archives that are prior to the oldest retained full backup. For example, if there are three full backups ending in times that are 25 days old (F1), 20 days old (F2) and 10 days old (F3), then if the full retention period is 15 days, then only F1 will be expired; F2 will be retained because F1 is not at least 15 days old.	2020-05-08 15:25:03 -04:00
David Steele	47aa765375	Add Zstandard compression support. Zstandard is a fast lossless compression algorithm targeting real-time compression scenarios at zlib-level and better compression ratios. It's backed by a very fast entropy stage, provided by Huff0 and FSE library. Zstandard version >= 1.0 is required, which is generally only available on newer distributions.	2020-05-04 15:25:27 -04:00
David Steele	438b957f9c	Add infrastructure for multiple compression type support. Add compress-type option and deprecate compress option. Since the compress option is boolean it won't work with multiple compression types. Add logic to cfgLoadUpdateOption() to update compress-type if it is not set directly. The compress option should no longer be referenced outside the cfgLoadUpdateOption() function. Add common/compress/helper module to contain interface functions that work with multiple compression types. Code outside this module should no longer call specific compression drivers, though it may be OK to reference a specific compression type using the new interface (e.g., saving backup history files in gz format). Unit tests only test compression using the gz format because other formats may not be available in all builds. It is the job of integration tests to exercise all compression types. Additional compression types will be added in future commits.	2020-03-06 14:41:03 -05:00
David Steele	dbf6255ab8	Remove compress/compress-level options from commands where unused. These commands (e.g. restore, archive-get) never used the compress options but allowed them to be passed on the command line. Now they will error when these options are passed on the command line. If these errors occur then remove the unused options.	2020-02-27 12:25:32 -05:00
David Steele	620386f034	Remove integration tests that are now covered in the unit tests. Most of these tests are just checking that errors are thrown when required. These are well covered in various unit tests. The "cannot resume" tests are also well covered in the backup unit tests. Finally, config warnings are well covered in the config unit tests. There is more to be done here, but this accounts for the low-hanging fruit.	2019-12-17 20:14:45 -05:00
David Steele	977ec2e307	Integration test improvements for disk and memory efficiency. Set log-level-file=off when more that one test will run. In this case is it impossible to see the logs anyway since they will be automatically cleaned up after the test. This improves performance pretty dramatically since trace-level logging is expensive. If a singe integration test is run then log-level-file is trace by default but can be changed with the --log-level-test-file option. Reduce buffer-size to 64k to save memory during testing and allow more processes to run in parallel. Update log replacement rules so that these options can change without affecting expect logs.	2019-12-17 15:23:07 -05:00
David Steele	1f2ce45e6b	The backup command is implemented entirely in C. For the most part this is a direct migration of the Perl code into C except as noted below. A backup can now be initiated from a linked directory. The link will not be stored in the manifest or recreated on restore. If a link or directory does not already exist in the restore location then a directory will be created. The logic for creating backup labels has been improved and it should no longer be possible to get a backup label earlier than the latest backup even with timezone changes or clock skew. This has never been an issue in the field that we know of, but we found it in testing. For online backups all times are fetched from the PostgreSQL primary host (before only copy start was). This doesn't affect backup integrity but it does prevent clock skew between hosts affecting backup duration reporting. Archive copy now works as expected when the archive and backup have different compression settings, i.e. when one is compressed and the other is not. This was a long-standing bug in the Perl code. Resume will now work even if hardlink settings have been changed. Reviewed by Cynthia Shang.	2019-12-13 17:14:26 -05:00
David Steele	d0ba8ff58c	Remove test point infrastructure. `82df7e6f` and `9856fef5` updated tests that used test points in preparation for the feature not being available in the C code. Since tests points are no longer used remove the infrastructure. Also remove one stray --test option in mock/all that was essentially a noop but no longer works now that the option has been removed.	2019-12-10 13:16:47 -05:00
David Steele	8dfe0e48e2	Use more general error code when tablespace linked into PGDATA. The specific error code was not that useful since we also test the error message which contains details of the link error.	2019-12-02 10:49:25 -05:00
David Steele	fc291b6f28	Reduce the scope of mock/all exclusion tests. Run exclusions only on the tests where they will have an effect to reduce churn in the expect logs when they change.	2019-12-01 17:47:47 -05:00
David Steele	686b6f91da	Set archive-check option in manifest correctly when offline. Archive check does not run when in offline backup mode but the option was set to true in the manifest. It's harmless since these options are informational only but it could cause confusion when debugging.	2019-11-28 08:27:21 -05:00
David Steele	b145c72b5c	Update missing manifest warning in BackupInfo. This brings the Perl message in line with C to reduce expect log churn.	2019-11-25 08:51:28 -05:00
David Steele	8800f32ad9	Remove exclusions once they have been tested in mock/all. The exclusions no longer have any effect after a restore and just add noise to the expect log.	2019-11-25 08:35:26 -05:00
David Steele	9856fef586	Update integration tests in mock/all that use test points. Test points will not be available in the C code so update these tests as best as possible without using them. This represents a loss of coverage for the Perl code (soon to be removed) which will be made up in the C code with unit tests.	2019-11-25 07:48:52 -05:00
David Steele	3cd45a7411	Remove start/stop --force integration tests in mock/all. These tests require test points which are not being implemented in the C code. This functionality is fully tested in the command/control unit tests so integration tests are no longer required.	2019-11-25 07:45:58 -05:00
David Steele	01aefc563d	Update Perl page checksum expression. This expression determines which files contain page checksums but it was also including the directory above the relation directories. In a real PostgreSQL installation this not a problem because these directories don't contain any files. However, our tests place a file in `base` which the Perl code thought should have page checksums while the new C code says no. Update the expression to document the change and avoid churn in the expect logs later.	2019-11-25 07:37:09 -05:00
David Steele	c524ec4f95	Remove obsolete integration tests from mock/all. The protocol timeout tests have been superceded by unit tests. The TEST_BACKUP_RESUME test point was incorrectly included into a number of tests, probably a copy pasto. It didn't hurt anything but it did add 200ms to each test where it appeared. Catalog and control version tests were redundant. The database version and system id tests covered the important code paths and the C code gets these values from a lookup table. Finally, fix an incomplete update to the backup.info file while munging for tests.	2019-11-21 16:06:27 -05:00
David Steele	3b879c2cb3	Filter logged command options based on the command definition. Previously, options were being filtered based on what was currently valid. For chained commands (e.g. backup then expire) some options may be valid for the first command but not the second. Filter based on the command definition rather than what is currently valid to avoid logging options that are not valid for subsequent commands. This reduces the number of options logged and will hopefully help avoid confusion and expect log churn.	2019-11-14 16:48:41 -05:00
Cynthia Shang	2972580566	Remove info expect tests from mock/all and mock/stanza. These tests are redundant now that we have full coverage in the unit tests are are not worth maintaining anymore.	2019-10-11 12:38:03 -04:00
David Steele	29e132f5e9	PostgreSQL 12 support. Recovery settings are now written into postgresql.auto.conf instead of recovery.conf. Existing recovery_target* settings will be commented out to help avoid conflicts. A comment is added before recovery settings to identify them as written by pgBackRest since it is unclear how, in general, old settings will be removed. recovery.signal and standby.signal are automatically created based on the recovery settings.	2019-10-01 13:20:43 -04:00
David Steele	451ae397be	The restore command is implemented entirely in C. For the most part this is a direct migration of the Perl code into C. There is one important behavioral change with regard to how file permissions are handled. The Perl code tried to set ownership as it was in the manifest even when running as an unprivileged user. This usually just led to errors and frustration. The C code works like this: If a restore is run as a non-root user (the typical scenario) then all files restored will belong to the user/group executing pgBackRest. If existing files are not owned by the executing user/group then an error will result if the ownership cannot be updated to the executing user/group. In that case the file ownership will need to be updated by a privileged user before the restore can be retried. If a restore is run as the root user then pgBackRest will attempt to recreate the ownership recorded in the manifest when the backup was made. Only user/group names are stored in the manifest so the same names must exist on the restore host for this to work. If the user/group name cannot be found locally then the user/group of the PostgreSQL data directory will be used and finally root if the data directory user/group cannot be mapped to a name. Reviewed by Cynthia Shang.	2019-09-26 07:52:02 -04:00
David Steele	4d84820021	Improve performance of info file load/save. Info files required three copies in memory to be loaded (the original string, an ini representation, and the final info object). Not only was this memory inefficient but the Ini object does sequential scans when searching for keys making large files very slow to load. This has not been an issue since archive.info and backup.info are very small, but it becomes a big deal when loading manifests with hundreds of thousands of files. Instead of holding copies of the data in memory, use a callback to deliver the ini data directly to the object when loading. Use a similar method for save to avoid having an intermediate copy. Save is a bit complex because sections/keys must be written in alpha order or older versions of pgBackRest will not calculate the correct checksum. Also move the load retry logic to helper functions rather than embedding it in the Info object. This allows for more flexibility in loading and ensures that stack traces will be available when developing unit tests. Reviewed by Cynthia Shang.	2019-09-06 13:48:28 -04:00
David Steele	01c2669b97	Fix exclusions for special files. Prior to 2.16 the Perl manifest code would skip any file that began with a dot. This was not intentional but it allowed PostgreSQL socket files to be located in the data directory. The new C code in 2.16 did not have this unintentional exclusion so socket files in the data directory caused errors. Worse, the file type error was being thrown before the exclusion check so there was really no way around the issue except to move the socket files out of the data directory. Special file types (e.g. socket, pipe) will now be automatically skipped and a warning logged to notify the user of the exclusion. The warning can be suppressed with an explicit --exclude. Reported by CluelessTechnologist, Janis Puris, Rachid Broum.	2019-08-23 07:47:54 -04:00
David Steele	c002a2ce2f	Move info file checksum to the end of the file. Putting the checksum at the beginning of the file made it impossible to stream the file out when saving. The entire file had to be held in memory while it was checksummed so the checksum could be written at the beginning. Instead place the checksum at the end. This does not break the existing Perl or C code since the read is not order dependent. There are no plans to improve the Perl code to take advantage of this change, but it will make the C implementation more efficient. Reviewed by Cynthia Shang.	2019-08-21 19:45:48 -04:00
Cynthia Shang	c733319063	The stanza-create/update/delete commands are implemented entirely in C. Contributed by Cynthia Shang.	2019-08-21 16:26:28 -04:00
David Steele	8fc1d3883b	Fix expire not immediately writing into separate file after backup. Logging stayed in the backup log until the Perl code started. Fix this so it logs to the correct file and will still work after the Perl code is removed.	2019-08-17 17:43:56 -04:00

1 2 3

113 Commits