pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	aaa15b9709	Add help for all internal options valid for default roles. Fix the segfault when getting help for an internal option is requested by adding help for all internal options that are valid for a default command role. Also print warnings about internal options in code rather than putting in each command/option description.	2021-04-23 11:46:03 -04:00
Stefan Fercot	292f836f12	Add db-exclude option. Restore excluding the specified databases. Databases excluded will be restored as sparse, zeroed files to save space but still allow PostgreSQL to perform recovery. After recovery, those databases will not be accessible but can be removed with the drop database command. The --db-exclude option can be passed multiple times to specify more than one database to exclude. When used in combination with the --db-include option, --db-exclude will only apply to standard system databases (template0, template1, and postgres).	2021-04-19 15:01:00 -04:00
Isaacwhyuenac	5bf160643b	Clarify that repo-s3-role is not an ARN.	2021-04-13 14:02:20 -04:00
Cynthia Shang	d372dd652c	Update reference to include links to user guide examples. The command-example and command-example-list elements were removed from the documentation rendering some time ago so these tags were dead code. The tags, however, contained some examples and information that were pertinent to the command, so where possible, the information was included in the description of the command and/or the user-guide and links to the relevant user guide sections were added. Note that some commands could not be updated with user guide references since doing so would cause a cyclical reference in the user guide. These commands have an internal comment to indicate this. In addition, some clarifications were added (e.g. expire --set option) where information was lacking.	2021-03-31 09:36:56 -04:00
Cynthia Shang	75987621fa	Add note about required NFS settings being the same as PostgreSQL.	2021-03-26 10:11:06 -04:00
Cynthia Shang	3e206088e7	Add compress-level defaults per compress-type value. Document these defaults until they can be added to the config parser and automated.	2021-03-26 09:25:31 -04:00
David Steele	b6106f3c1f	Add archive-header-check option. Enabled by default, this option checks the WAL header against the PostgreSQL version and system identifier to ensure that the WAL is being copied to the correct stanza. This is in addition to checking pg_control against the stanza and verifying that WAL is being copied from the same PostgreSQL data directory where pg_control is located. Therefore, disabling this check is fairly safe but should only be done when required, e.g. if the WAL is encrypted.	2021-03-25 15:33:50 -04:00
Cynthia Shang	2789d3b620	Improve info command fault tolerance. This improvement reduces the number of errors thrown; these errors will now be reported as a status for the stanza or repo as appropriate. Invalid option configurations are still thrown but all other errors are caught, formatted and reported. This was necessary for multiple repositories so that the command can complete gathering information from each repository and report the results rather than immediately aborting when an error occurs. Two new error codes were introduced: 6 = requested backup not found 99 = other, which is used to indicate an error has occurred that requires more details to be provided A new stanza name of "[invalid]" was created for instances where a stanza was not specified and no stanza can be found. If there is only one repository configured the error will move up to the stanza level with the standard error formatting of 'error (message)' where the message will be "other" and the details of the error will be listed on the next line(s): stanza: stanza1 status: error (other) [CryptoError] unable to load info file '/var/lib/pgbackrest/repo/backup/stanza1/backup.info' or '/var/lib/pgbackrest/repo/backup/stanza1/backup.info.copy': CryptoError: cipher header invalid HINT: is or was the repo encrypted? FileMissingError: unable to open missing file '/var/lib/pgbackrest/repo/backup/stanza1/backup.info.copy' for read HINT: backup.info cannot be opened and is required to perform a backup. HINT: has a stanza-create been performed? HINT: use option --stanza if encryption settings are different for the stanza than the global cipher: aes-256-cbc If a backup set is requested but is not found on any repo, a stanza-level status error of 'requested backup not found' is reported when there are no other errors: pgbackrest info --stanza=demo --set=bogus stanza: demo status: error (requested backup not found) cipher: mixed repo1: aes-256-cbc repo2: none If there are multiple repositories configured and a single repo is in error but the other repos are ok or have a different error: pgbackrest info --stanza=demo --set=20210322-171211F stanza: demo status: mixed repo1: error [CryptoError] unable to load info file '/var/lib/pgbackrest/repo/backup/stanza1/backup.info' or '/var/lib/pgbackrest/repo/backup/stanza1/backup.info.copy': CryptoError: cipher header invalid HINT: is or was the repo encrypted? FileMissingError: unable to open missing file '/var/lib/pgbackrest/repo/backup/stanza1/backup.info.copy' for read HINT: backup.info cannot be opened and is required to perform a backup. HINT: has a stanza-create been performed? HINT: use option --stanza if encryption settings are different for the stanza than the global repo2: ok cipher: mixed repo1: aes-256-cbc repo2: none db (current) wal archive min/max (12): 000000010000000000000001/000000010000000000000003 full backup: 20210322-171211F timestamp start/stop: 2021-03-22 17:12:11 / 2021-03-22 17:12:28 wal start/stop: 000000010000000000000002 / 000000010000000000000002 database size: 23.4MB, database backup size: 23.4MB repo2: backup set size: 2.8MB, backup size: 2.8MB database list: postgres (13359) Json output will include the repository information and any error information. If no stanzas are found, then [invalid] will be set as the name: [ { "archive":[], "backup":[], "cipher":"none", "db":[], "name":"[invalid]", "repo":[ { "cipher":"none", "key":1, "status":{ "code":99, "message":"[PathOpenError] unable to list file info for path '/var/lib/pgbackrest/repo2/backup': [13] Permission denied" } } ], "status":{ "code":99, "lock":{"backup":{"held":false}}, "message":"other" } } ]	2021-03-25 12:29:36 -04:00
David Steele	92d12ccb9b	Update selective restore documentation with caveats. Recovery may error unless --type=immediate is specified. This is because after consistency is reached PostgreSQL will flag zeroed pages as errors even for a full-page write. For PostgreSQL ≥ 13 the ignore_invalid_pages setting may be used to ignore invalid pages. In this case it is important to check the logs after recovery to ensure that no invalid pages were reported in the selected databases.	2021-03-11 10:19:50 -05:00
David Steele	9506ffae39	Add compress-type clarification to archive-copy documentation. It is best if the archive-push and backup commands have the same compress-type (e.g. lz4) when using archive-copy. Otherwise, the WAL segments will need to be recompressed with the compress-type used by the backup, which can be fairly expensive depending on how much WAL was generated during the backup.	2021-03-11 07:53:10 -05:00
Cynthia Shang	31c7824a4d	Allow stanza-* commands to be run remotely. The stanza-create, stanza-upgrade and stanza-delete were required to be run on the repository host. When there was only one repository allowed this was not a problem. However, with the introduction of multiple repository support, this becomes more of a burden to the user, therefore the stanza-create, stanza-upgrade and stanza-delete commands have been improved to allow for them to be run remotely.	2021-03-10 08:10:46 -05:00
David Steele	1dbb3bf50b	Multiple repository support. Up to four repositories may be configured. A potential benefit is the ability to have a local repository for fast restores and a remote repository for redundancy. Some commands, e.g. stanza-create/stanza-update, will automatically work with all configured repositories while others, e.g. stanza-delete, will require a repository to be specified using the repo option. See the command reference for details on which commands require the repository to be specified. Note that the repo option is not required when only repo1 is configured in order to maintain backward compatibility. However, the repo option is required when a single repo is configured as, e.g. repo2. This is to prevent command breakage if a new repository is added later. The archive-push command will always push WAL to the archive in all configured repositories but backups will need to be scheduled individually for each repository. In many cases this is desirable since backup types and retention will vary by repository. Likewise, restores must specify a repository. It is generally better to specify a repository for restores that has low latency/cost even if that means more recovery time. Only restore testing can determine which repository will be most efficient. For single repository configurations there should be no change in behavior.	2021-03-08 13:31:13 -05:00
David Steele	088662d986	GCS support for repository storage. GCS and GCS-compatible object stores can now be used for repository storage.	2021-03-05 12:13:51 -05:00
David Steele	d1aa765a9d	Consolidate less commonly used repository storage options. The following options are renamed as specified: repo1-azure-ca-file -> repo1-storage-ca-file repo1-azure-ca-path -> repo1-storage-ca-path repo1-azure-host -> repo1-storage-host repo1-azure-port -> repo1-storage-port repo1-azure-verify-tls -> repo1-storage-verify-tls repo1-s3-ca-file -> repo1-storage-ca-file repo1-s3-ca-path -> repo1-storage-ca-path repo1-s3-host -> repo1-storage-host repo1-s3-port -> repo1-storage-port repo1-s3-verify-tls -> repo1-storage-verify-tls The old option names (e.g. repo1-s3-port) will continue to work for repo1, but repo2, etc. will require the new names.	2021-03-02 13:51:40 -05:00
David Steele	bec3e20b2c	Add archive-get command multi-repo support. Repositories will be searched in order for the requested archive file. Errors will be reported as warnings as long as a valid copy of the archive file is found.	2021-02-23 15:34:28 -05:00
Cynthia Shang	d350d1cc21	Improve expire command documentation.	2021-02-05 11:48:07 -05:00
David Steele	b65c370346	Add repo-get command.	2021-02-05 10:39:03 -05:00
David Steele	218cd078a6	Add repo-ls command.	2021-02-05 10:07:43 -05:00
Stefan Fercot	4b46115345	Add archive-mode-check option. This option disallows the PostgreSQL archive_mode=always setting and disabling it allows the setting.	2021-02-02 13:43:14 -05:00
Cynthia Shang	e251ec574a	Add note about removing configuration to stanza-delete documentation.	2021-01-25 11:14:28 -05:00
Cynthia Shang	00fac1c0d1	Improve info command text output and --set handling. The info command provides total sizes for files in the backup on the database as well as the repository. The text output and associated user documentation has been updated to provide more clarity regarding the sizes being displayed. In addition, the info command is updated to allow a user to optionally specify the repository when requesting a specific backup set. In this case, the text output will reflect the status of the stanza, the cipher types and archive min/max over all the repositories instead of a single repository when the repo option is specified.	2021-01-25 09:19:05 -05:00
David Steele	09fdde359c	Limit pg option validity and make it command-line only. The pg option only has one current usage, to let the backup local know which pg index it should copy files from. There are other possible uses for this option, but they need thought, tests, and documentation.	2020-12-31 10:08:58 -05:00
David Steele	951cfa9e90	Remove repo option. This option was added in advance of the multi-repo functionality but it has no purpose and it is not clear what the validity rules should be. The option will be added back when multi-repo functionality is committed.	2020-12-31 08:12:35 -05:00
David Steele	b0ea337965	Add pg-database option. In some rare cases there is no postgres database so this option may be used to specify an alternate database.	2020-12-02 22:42:50 -05:00
David Steele	117f03eba1	Prepare configuration module for multi-repository support. Refactor the code to allow a dynamic number of indexes for indexed options, e.g. pg-path. Our reliance on getopt_long() still limits the number of indexes we can have per group, but once this limitation is removed the rest of the code should be happy with dynamic numbers of indexes (with a reasonable maximum). Add an option to set a default in each group. This was previously handled by the host-id option but now there is a specific option for each group, pg and repo. These remain internal until they can be fully tested with multi-repo support. They are fully tested for internal usage. Remove the ConfigDefineOption enum and use the ConfigOption enum instead. They are now equal since the indexed options (e.g. cfgOptRepoHost2) have been removed from ConfigOption. Remove the config/config test module and add required tests to the config/parse test module. Parsing is now the only way to load a config so this removes some redundancy. Split new internal config structures and functions into a new header file, config.intern.h. More functions will need to be moved over from config.h but that will need to be done in a future commit to reduce churn. Add repoIdx to repoIsLocal() and storageRepo*(). Multi-repository support requires that repo locality and storage be accessible by index. This allows, for example, multiple repos to be iterated in a loop. This could be done in a separate commit but doesn't seem worth it since the code is related. Remove the type parameter from storageRepoGet(). This parameter existed solely to provide coverage for the case where the storage type was invalid. A better pattern is to check that the type is S3 once all other types have been ruled out.	2020-11-23 15:55:46 -05:00
David Steele	9377d05072	Add repo-azure-endpoint option. This option allows alternate endpoints (e.g. Azure Government) to be configured.	2020-10-06 17:15:48 -04:00
David Steele	597739fafe	Move info command text to the reference and link to user guide. This means the same text will appear in both places, which should make it easier to find. Also update the link code to allow both page and section to be specified rather than only one or the other.	2020-09-25 11:26:27 -04:00
Cynthia Shang	ad79932ba5	Add internal verify command. Scan the WAL archive for missing or invalid files and build up ranges of WAL that will be used to verify backup integrity. A number of errors and warnings are currently emitted but they should not be considered authoritative (yet). The command is incomplete so is marked internal.	2020-09-22 11:57:38 -04:00
David Steele	14e1fd10ca	Add none to compress-type option reference and fix example.	2020-08-27 10:59:04 -04:00
David Steele	8c2960fab3	Add archive-mode option to disable archiving on restore. When restoring a cluster that will be promoted but is not intended to be the new primary, it is important to disable archiving to avoid polluting the repository with useless WAL. This option makes disabling archiving a bit easier.	2020-08-25 15:05:41 -04:00
David Steele	851f2e814e	Automatically retrieve temporary S3 credentials on AWS instances. Automatically retrieve the role and temporary credentials for S3 when the AWS instance is associated with an IAM role. Credentials are automatically updated when they are <= 5 minutes from expiring. Basic configuration is to set repo1-s3-key-type=auto. repo1-s3-role can be used to set a specific role, otherwise it will be retrieved automatically.	2020-08-25 10:38:49 -04:00
Don Seiler	afcc4d193d	Add missing azure type in repo-type option reference.	2020-08-11 14:38:38 -04:00
Don Seiler	f40c7b65fa	Fix typo in repo-cipher-type option reference.	2020-08-11 10:41:06 -04:00
David Steele	ed88293861	Clarify that expire must be run regularly when expire-auto is disabled.	2020-07-21 10:57:47 -04:00
Stefan Fercot	d3dd32a031	Add expire-auto option. This allows automatic expiration after a successful backup to be disabled.	2020-07-14 08:12:25 -04:00
David Steele	2f7823c627	Add shared access signature (SAS) authorization for Azure. A shared access signature (SAS) provides granular, delegated access to resources in a storage account. This is often preferable to using a shared key which provides more access and is a greater security risk if compromised.	2020-07-09 14:46:48 -04:00
David Steele	3f4371d7a2	Azure support for repository storage. Azure and Azure-compatible object stores can now be used for repository storage. Currently only shared key authentication is supported but SAS will be added soon.	2020-07-02 16:24:34 -04:00
David Steele	9efbafc84c	Fix incorrect example for repo-retention-full-type option.	2020-06-01 13:19:47 -04:00
Magnus Hagander	b8a5c3ac6f	Fix incorrect command in reference documentation. Also update process to command to be more consistent with the surrounding text.	2020-05-12 13:13:04 -04:00
Cynthia Shang	cdebfb09e0	Add time-based retention for full backups. The --repo-retention-full-type option allows retention of full backups based on a time period, specified in days. The new option will default to 'count' and therefore will not affect current installations. Setting repo-retention-full-type to 'time' will allow the user to use a time period, in days, to indicate full backup retention. Using this method, a full backup can be expired only if the time the backup completed is older than the number of days set with repo-retention-full (calculated from the moment the 'expire' command is run) and at least one full backup meets the retention period. If archive retention has not been configured, then the default settings will expire archives that are prior to the oldest retained full backup. For example, if there are three full backups ending in times that are 25 days old (F1), 20 days old (F2) and 10 days old (F3), then if the full retention period is 15 days, then only F1 will be expired; F2 will be retained because F1 is not at least 15 days old.	2020-05-08 15:25:03 -04:00
Stephen Frost	a021c9fe05	Add bzip2 compression support. bzip2 is a widely available, high-quality data compressor. It typically compresses files to within 10% to 15% of the best available techniques (the PPM family of statistical compressors), while being around twice as fast at compression and six times faster at decompression. bzip2 is currently available on all supported platforms.	2020-05-05 16:49:01 -04:00
David Steele	47aa765375	Add Zstandard compression support. Zstandard is a fast lossless compression algorithm targeting real-time compression scenarios at zlib-level and better compression ratios. It's backed by a very fast entropy stage, provided by Huff0 and FSE library. Zstandard version >= 1.0 is required, which is generally only available on newer distributions.	2020-05-04 15:25:27 -04:00
Cynthia Shang	1c1a710460	Add --set option to the expire command. The specified backup set (i.e. the backup label provided and all of its dependent backups, if any) will be expired regardless of backup retention rules except that at least one full backup must remain in the repository.	2020-04-27 14:00:36 -04:00
Cynthia Shang	93e4fe0199	Specify that the io-timeout option is measured in seconds.	2020-04-20 13:11:34 -04:00
David Steele	5d25e508ae	Add io-timeout option. Timeout used for connections and read/write operations. Note that the entire read/write operation does not need to complete within this timeout but some progress must be made, even if it is only a single byte.	2020-04-17 09:18:52 -04:00
David Steele	789e364e6b	Rename tcp-keep-alive option to sck-keep-alive. This is really a socket option so the new name is clearer. Since common/io/socket/tcp will contains a mix of options it makes sense to rename it to socket and cascade name changes as needed.	2020-04-01 15:44:51 -04:00
David Steele	5c6fb88bef	TCP keep-alive options are configurable. Prior to 2.25 the individual TCP keep-alive options were not being configured due to a missing header. In 2.25 they were being configured incorrectly due to a disconnect between the timeout specified in ms and what was expected by the TCP options, i.e. seconds. Instead make the TCP keep-alive options directly configurable, with correct units and better testing. Keep-alive is enabled by default (though it can be defaulted to the system setting instead) and the rest of the options are not set by default. This is in line with what PostgreSQL does, though PostgreSQL does not allow keep-alive to be defaulted. Also move configuration of TCP options before connect() as PostgreSQL does.	2020-03-31 18:13:11 -04:00
Adrian Vondendriesch	e1c72f6f97	Fix typos.	2020-03-28 17:48:57 -04:00
Cynthia Shang	2fa69af8da	Add --dry-run option to the expire command. Use dry-run to see which backups/archive would be removed by the expire command without actually removing anything.	2020-03-16 13:56:52 -04:00
David Steele	c279a00279	Add lz4 compression support. LZ4 compresses data faster than gzip but at a lower ratio. This can be a good tradeoff in certain scenarios. Note that setting compress-type=lz4 will make new backups and archive incompatible (unrestorable) with prior versions of pgBackRest.	2020-03-10 14:45:27 -04:00

1 2 3

146 Commits