pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	1f2ce45e6b	The backup command is implemented entirely in C. For the most part this is a direct migration of the Perl code into C except as noted below. A backup can now be initiated from a linked directory. The link will not be stored in the manifest or recreated on restore. If a link or directory does not already exist in the restore location then a directory will be created. The logic for creating backup labels has been improved and it should no longer be possible to get a backup label earlier than the latest backup even with timezone changes or clock skew. This has never been an issue in the field that we know of, but we found it in testing. For online backups all times are fetched from the PostgreSQL primary host (before only copy start was). This doesn't affect backup integrity but it does prevent clock skew between hosts affecting backup duration reporting. Archive copy now works as expected when the archive and backup have different compression settings, i.e. when one is compressed and the other is not. This was a long-standing bug in the Perl code. Resume will now work even if hardlink settings have been changed. Reviewed by Cynthia Shang.	2019-12-13 17:14:26 -05:00
David Steele	d0ba8ff58c	Remove test point infrastructure. `82df7e6f` and `9856fef5` updated tests that used test points in preparation for the feature not being available in the C code. Since tests points are no longer used remove the infrastructure. Also remove one stray --test option in mock/all that was essentially a noop but no longer works now that the option has been removed.	2019-12-10 13:16:47 -05:00
David Steele	686b6f91da	Set archive-check option in manifest correctly when offline. Archive check does not run when in offline backup mode but the option was set to true in the manifest. It's harmless since these options are informational only but it could cause confusion when debugging.	2019-11-28 08:27:21 -05:00
David Steele	b145c72b5c	Update missing manifest warning in BackupInfo. This brings the Perl message in line with C to reduce expect log churn.	2019-11-25 08:51:28 -05:00
Cynthia Shang	56bf9d0566	Update HINT messages to conform to new standard detailed in CODING.md.	2019-09-14 12:21:08 -04:00
Josh Soref	c2771e5469	Fix comment typos. This includes some variable names in tests which don't seem important enough for their own commits. Contributed by Josh Soref.	2019-08-26 12:05:36 -04:00
David Steele	59f135340d	The local command for backup is implemented entirely in C. The local process is now entirely migrated to C. Since all major I/O operations are performed in the local process, the vast majority of I/O is now performed in C. Contributed by David Steele, Cynthia Shang.	2019-07-25 14:34:16 -04:00
David Steele	e10577d0b0	Fix incorrect offline upper bound for ignoring page checksum errors. For offline backups the upper bound was being set to 0x0000FFFF0000FFFF rather than UINT64_MAX. This meant that page checksum errors might be ignored for databases with a lot of past WAL in offline mode. Online mode is not affected since the upper bound is retrieved from pg_start_backup().	2019-07-11 09:13:56 -04:00
David Steele	488fb67294	Force PostgreSQL versions to string for newer versions of JSON:PP. Since 2.91 JSON::PP has a bias for saving variables that look like numbers as numbers even if they were declared as strings. Force versions to strings where needed by appending ''. Update the json-pp-perl package on Ubuntu 18.04 to 2.97 to provide test coverage.	2019-07-05 17:25:01 -04:00
David Steele	4815752ccc	Add Perl interface to C storage layer. Maintaining the storage layer/drivers in two languages is burdensome. Since the integration tests require the Perl storage layer/drivers we'll need them even after the core code is migrated to C. Create an interface layer so the Perl code can be removed and new storage drivers/features introduced without adding Perl equivalents. The goal is to move the integration tests to C so this interface will eventually be removed. That being the case, the interface was designed for maximum compatibility to ease the transition. The result looks a bit hacky but we'll improve it as needed until it can be retired.	2019-06-26 08:24:58 -04:00
blogh	e4e2606fce	Add additional options to backup.manifest for debugging purposes. Add the buffer-size, compress-level, compress-level-network, and process-max options to the backup:option section in backup.manifest to aid in debugging. It may also make sense to propagate these options up to backup.info so they can be displayed in the info command, but for now this is deemed sufficient. Contributed by blogh.	2019-03-10 11:03:52 +02:00
David Steele	db4b447be8	The archive-get command is implemented entirely in C. This new implementation should behave exactly like the old Perl code with the exception of a few updated log messages. Remove as much of the Perl code as possible without breaking other commands.	2019-02-27 23:03:02 +02:00
David Steele	1ad67644da	Remove request for S3 object info directly after putting it. After a file is copied during backup the size is requested from the storage in case it differs from what was written so that repo-size can be reported accurately. This is useful for situations where compression is being done by the filesystem (e.g. ZFS) and what is stored can differ in size from what was written. In S3 the reported size will always be exactly what was written so there is no need to check the size and doing so immediately can cause problems because the new file might not appear in list commands. This has not been observed on S3 (though it seems to be possible) but it has been reported on the Swift S3 gateway. Add a driver capability to determine if size needs to be called after a file is written and if not then simply use the number of bytes written for repo-size. Reported by Matt Kunkel.	2018-11-30 10:38:02 -05:00
David Steele	801e2a5a2c	Rename PGBACKREST/BACKREST constants to PROJECT. This brings consistency between the C and Perl constants and allows for easier code reuse.	2018-11-24 19:05:03 -05:00
David Steele	8efa5e6a6a	Rename CipherError to CryptoError. This aligns with the general renaming from cipher to crypto.	2018-11-06 19:38:38 -05:00
Cynthia Shang	34c63276cd	Automatically enable backup checksum delta when anomalies (e.g. timeline switch) are detected. There are a number of cases where a checksum delta is more appropriate than the default time-based delta: * Timeline has switched since the prior backup * File timestamp is older than recorded in the prior backup * File size changed but timestamp did not * File timestamp is in the future compared to the start of the backup * Online option has changed since the prior backup A practical example is that checksum delta will be enabled after a failover to standby due to the timeline switch. In this case, timestamps can't be trusted and our recommendation has been to run a full backup, which can impact the retention schedule and requires manual intervention. Now, a checksum delta will be performed if the backup type is incr/diff. This means more CPU will be used during the backup but the backup size will be smaller and the retention schedule will not be impacted. Contributed by Cynthia Shang.	2018-11-01 11:31:25 -04:00
David Steele	d038b9a029	Support configurable WAL segment size. PostgreSQL 11 introduces configurable WAL segment sizes, from 1MB to 1GB. There are two areas that needed to be updated to support this: building the archive-get queue and checking that WAL has been archived after a backup. Both operations require the WAL segment size to properly build a list. Checking the archive after a backup is still implemented in Perl and has an active database connection, so just get the WAL segment size from the database. The archive-get command does not have a connection to the database, so get the WAL segment size from pg_control instead. This requires a deeper inspection of pg_control than has been done in the past, so it seemed best to copy the relevant data structures from each version of PostgreSQL and build a generic interface layer to address them. While this approach is a bit verbose, it has the advantage of being relatively simple, and can easily be updated for new versions of PostgreSQL. Since the integration tests generate pg_control files for testing, teach Perl how to generate files with the correct offsets for both 32-bit and 64-bit architectures.	2018-09-25 10:24:42 +01:00
Cynthia Shang	880fbb5e57	Add checksum delta for incremental backups. Use checksums rather than timestamps to determine if files have changed. This is useful in cases where the timestamps may not be trustworthy, e.g. when performing an incremental after failing over to a standby. If checksum delta is enabled then checksums will be used for verification of resumed backups, even if they are full. Resumes have always used checksums to verify the files in the repository, enabling delta performs checksums on the database files as well. Note that the user must manually enable this feature in cases were it would be useful or just keep in enabled all the time. A future commit will address automatically enabling the feature in cases where it seems likely to be useful. Contributed by Cynthia Shang.	2018-09-19 11:12:45 -04:00
Cynthia Shang	052e483057	Restore bIgnoreMissing flag in backupFile() lost in storage refactor. The test to make sure that some files (e.g. pg_control) do not get removed during the backup was lost during the storage refactor committed at `de7fc37f`. This did not impact the integrity of the backups, but bring it back since it is a nice sanity check. Contributed by Cynthia Shang.	2018-09-18 10:18:39 -04:00
David Steele	c688bc8627	Improve support for special characters in filenames. % characters caused issues in backup/restore due to filenames being appended directly into a format string. Reserved XML characters (<>&') caused issues in the S3 driver due to improper escaping. Add a file with all common special characters to regression testing.	2018-09-10 10:54:34 -04:00
David Steele	bef58a7974	Allow arbitrary directories and/or files to be excluded from a backup. Misuse of this feature can lead to inconsistent backups so read the --exclude documentation carefully before using.	2018-08-27 15:51:05 -04:00
Cynthia Shang	0acf705416	Require PostgreSQL catalog version when instantiating a Manifest object (and not loading it from disk). Contributed by Cynthia Shang.	2018-07-16 17:25:15 -04:00
David Steele	db17973cd0	Fix critical bug in resume that resulted in inconsistent backups. A regression in v0.82 removed the timestamp comparison when deciding which files from the aborted backup to keep on resume. All resumed backups should be considered inconsistent. A resumed backup can be identified by checking the log for the message "aborted backup of same type exists, will be cleaned to remove invalid files and resumed". Reported by David Youatt, Yogesh Sharma, Stephen Frost.	2018-07-03 14:01:57 -04:00
David Steele	8dbbf2de92	Fix archive-copy throwing "path not found" error for incr/diff backups. Reported by yummyliu, Vitaliy Kukharik.	2018-05-01 12:37:06 -04:00
David Steele	71ba08f579	Use path list in the backup manifest to do restore path syncs. Remove recursive path sync functionality since it is no longer used.	2018-05-01 11:05:37 -04:00
David Steele	f0250dab4b	Move async forking and more error handling to C. The Perl process was exiting directly when called but that interfered with proper locking for the forked async process. Now Perl returns results to the C process which handles all errors, including signals.	2018-04-12 20:42:26 -04:00
David Steele	348278bb68	Make backup directory sync more efficient. Scanning the entire backup directory can be very expensive if there are a lot of small tables. The backup manifest contains the backup directory list so use it to perform syncs instead of scanning the backup directory.	2018-04-03 21:30:15 -04:00
David Steele	0dce8ba2fa	Make backup.history sync more efficient. Only the backup.history/[year] directory was being synced, so check if the backup.history is newly created and sync it as well.	2018-04-03 20:54:18 -04:00
David Steele	be90028100	Rename db-* options to pg-* and backup-* options to repo-* to improve consistency. * repo-* options are now indexed although only one is allowed. * List deprecated option names in documentation and command-line help.	2018-02-03 18:27:38 -05:00
Cynthia Shang	bacc08792d	Ensure latest db-id is selected on when matching archive.info to backup.info. This provides correct matching in the event there are system-id and db-version duplicates (e.g. after reverting a pg_upgrade). Fixed by Cynthia Shang. Reported by Adam K. Sumner.	2018-01-30 16:49:47 -05:00
David Steele	7cf955425e	The C library is now required. This eliminates conditional loading and eases development of new library features.	2017-11-26 17:45:00 -05:00
Cynthia Shang	b03c26968a	Repository encryption support. Contributed by Cynthia Shang.	2017-11-06 12:51:12 -05:00
David Steele	ac542788da	Convert configuration definitions from auto-generated functions to auto-generated data structures.	2017-11-02 08:14:13 -04:00
David Steele	d989cf8ac2	Replace dynamically built class hierarchies in I/O layer with fixed parent() calls.	2017-10-22 19:07:17 -04:00
David Steele	d82fe88031	Fix custom settings for compress-level option being ignored. Reported by Jens Wilke.	2017-10-22 18:05:46 -04:00
David Steele	c94e52f697	Remove error when overlapping timelines are detected. Overlapping timelines are valid in many Point-in-Time-Recovery (PITR) scenarios. Reported by blogh.	2017-10-16 14:34:44 -04:00
David Steele	904b83747c	Perl error handler recognizes errors thrown from the C library.	2017-10-16 10:47:31 -04:00
David Steele	fcb7c6fd1d	PostgreSQL 10 support.	2017-09-01 12:29:34 -04:00
Cynthia Shang	36e576b483	Up to seven standbys can be configured for backup from standby.` Contributed by Cynthia Shang.	2017-08-31 19:15:44 -04:00
David Steele	1e0ed07455	Configuration rules are now pulled from the C library when present.	2017-08-25 16:47:47 -04:00
Cynthia Shang	f9bdc092ba	Fixed an issue that could cause the empty archive directory for an old PostgreSQL version to be left behind after a stanza-upgrade. Fixed by Cynthia Shang.	2017-08-04 16:03:55 -04:00
David Steele	11ee528e0e	Rename Archive modules to remove redundancy.	2017-06-21 08:02:21 -04:00
David Steele	f6d4457d58	Full/Synthetic test refactor. * Combine hardlink and non/compressed in synthetic tests to reduce test time and improve coverage. * Change log level of hardlink logging to detail. * Cast size in S3 manifest to integer.	2017-06-15 15:32:10 -04:00
David Steele	de7fc37f88	Storage and IO layer refactor: Refactor storage layer to allow for new repository filesystems using drivers. (Reviewed by Cynthia Shang.) Refactor IO layer to allow for new compression formats, checksum types, and other capabilities using filters. (Reviewed by Cynthia Shang.)	2017-06-09 17:51:41 -04:00
David Steele	5296747271	Move backup modules into Backup directory.	2017-05-15 16:01:00 -04:00

45 Commits