Introduction

Features

More information using --output=text for info command

Github Issue

The text (default) info output should include everything (or nearly everything) that is in the JSON output nicely formatted for human consumption.

Delete backup_label as soon as possible

Github Issue

If crashes during a backup it may not be able to recover if the backup label is present. Copy and delete right after start_backup(). Stop backup will want to delete it so it might be necessary to copy it back or at least touch a file that can delete. Check after the backup is complete to make sure it's really gone.

Ability to test that WAL archiving is working

Github Issue

Add a new command, archive-test, that will execute pg_switch_xlog and check that the xlog makes it to the archive.

Abandon threads and go to processes

Github Issue

Even with the thread refactor they are not reliable on all platforms. Processes would be more compatible across platforms and basic testing has shown there are no significant performance tradeoffs.

Time-based retention

Github Issue

Create a setting that allows a time period to be set for retention. Right now only a certain number of backups can be set. If retention-full=2 and two backups are done back to back, then the time period of protection will be very short.

The new option retention-period will be expressed in hours, days, weeks, months, years and will work with current retention like this: The time period will be honored but middle backups will be pruned to match retention-full. For example, if retention-period=2 weeks full-rentention=2 and a third full backup was taken, the first one would be pruned if it were older than two weeks, else the middle one will be pruned. This gives two weeks of backup coverage but keeps the number of backups down.

As part of this feature, WAL should be expired as part of retention-period if nothing else is set. This will support installations that are using backrest only for archiving.

Only use fully-qualified paths remotely when used locally

Github Issue

If is run without being fully-qualified locally it should also be done when running remotely. They might be in different paths but still on the search path. The remote-cmd option can still be used to set it explicitly.

Implement --link-only option

This option would allow the user to restore the link but none of the data at the destination. This would be useful for linked configuration files which are good to backup but not appropriate to restore in every situation, e.g. a standby that has its own custom config. Another example would be a link to the log directory from within $PGDATA.

Stanza add/update/remove

Github Issue

Formalize this process and allow upgrades to new database versions.

Make sure that restore and expiration still work - perhaps new unit tests.
Add db-id param to restore and archive-get to specify a database.

This should also make it easy to throw a stanza does not exist error.

Add running state to info

Github Issue

Add running (true/false) to info based on the lock file.

Improve restore permissions

Github Issue

Two improvements:

Make sure recovery.conf has same ownership/mode as base path
Make sure base path has original ownership/permissions.

Pull options from remote when required

Github Issue

Certain parameters like db-path and repo-path must be configured on both sides when the backup server is remote. It would be better if these parameters were pulled from the remote side so they aren't repeated.

Config file validation

Github Issue

It's possible to have bogus settings in the configuration file since it is only checked for certain values.

Detect options that are placed in the wrong sections.
Detect options that do not exist.

Add repo-owner option

Suggested by Michael Renner Github Issue

Allow (or perhaps require) a repo-owner to allow to check permissions on the repository.

Automatically pick a backup for time PITR

Github Issue

If time PITR is being done it should be possible to automatically pick the best backup to use.

Async archiving with pg_receivexlog

Github Issue

Make async archiving work when pg_receivexlog is writing directly to the out spool directory on the backup server. Setting this up with a replication slot will make it more reliable.

BackRest will not directly call pg_receivexlog but it can work in concert for more reliable logging.

Support two archivers on same repository

Github Issue

The archive_mode=always setting in 9.5 allows both the master and the standby to log to the same archive for redundancy. Currently will error if this mode is set because it is not supported.

Async archive-get with prefetch

Github Issue

Getting one archive file at a time can be tedious if the cluster is very far behind. An async get with some sort of prefetch would speed the process a lot.

Should be able to specify how may archive logs to prefetch.

Multi-processing for archive-get and archive-push

Github Issue

Multi-processing would improved performance for these operations, especially archive-push. However, even very large systems have been working well with asynchronous archiving so this is not a big priority.

Add checksum-delta option

Github Issue

Checksums are calculated during the backup process, but the delta is still done during diff/incr backups. Add a new option checksum_delta (default n) that does the delta using checksums. Of course, if the timestamp or size has changed the checksum does not need to be calculated.

Set --log-level-console=info for interactive sessions

Suggested by Michael Renner Github Issue

This would prevent surprises for new users who expect to see more output and think the process has locked up if it is not logging.

Return hard error for holes in the the archive log

Github Issue

If an archive log is missing in the middle of an archive stream will return a soft error (1), even though there is probably no chance of that archive log showing up.

If an archive log is missing then check to see if the next one is present - if so return a hard error. This is tricky because there is a question of how long to wait. With parallel async push it's very possible that the WALs could arrive out of order.

Here's a possible solution: on the database server knows the oldest WAL segment that is currently on the db server and not pushed. If this is reported to the backup server, then it can determine if a hole in the archive stream may be filled, or if it is a permanent condition.

Add configurable sleep to archiver process to reduce ssh connections

Github Issue

The async archiver exits as soon as there are no files left to transfer. A configurable sleep would be good because it would reduce the number of SSH connections made to the remote.

Preserve WAL timestamps

Github Issue

Preserve exact WAL timestamps to make measurement of WAL rates more accurate in monitoring. Timestamp should be taken from the file before copying so delay in archiving can also be measured.

Pin/lock a backup

Github Issue

Allow user to indicate that a backup is locked and should be preserved until unlocked. This could be handy for the last backup of a previous PG version or just to save data that is known to be important for any reason.

Write stanza name into archive.info and check it

Github Issue

This would allow to detect when paths inside the repo have been renamed.

Allow pgbackrest.d directory to hold config snippets

Suggested by Michael Renner Github Issue

This would allow easier configuration for CM software.

Throttling

Github Issue

Add a throttling feature to limit the amount of disk i/o and or network bandwidth being used by the backup.

Make this a per thread limitation to start. That simplifies the problem quite a bit and most users who are throttling will probably be single threaded.

Hook scripts

Github Issue

Allow user-defined hook scripts to be run before and after backups.

Notify user earlier during backup if WAL is not being archived

Github Issue

It is possible to notify users earlier if archiving is not working during a backup. Check in the main backup loop to see if archiving is proceeding - if not then fail after a configurable amount of time.

Allow sudo user to be specified when calling remote via ssh

Github Issue

Add new options db-sudo-user and backup-sudo-user to allow the backrest command to be run through sudo for security. This is especially important on the db side.

Allow db user to be different than OS user for backup

Github Issue

Although the file system backup needs to run as , it can be advantageous to have the start/stop backup run as a less privileged database user with the REPLICATION role. This will need to be tested to see if it works.

Ideally the file system backup could be run as a user in the group rather than itself but does not grant group permissions so sadly it is not possible to backup as a user other than the database owner at the file-system level. This limitation will need to be addressed in core .

Allow path options to be relative

Github Issue

There are some use cases where would be valuable to use relative paths. However to avoid confusion it might be best to implement it like so:

relative paths would need to being with ./
--config would always be allowed to be relative
In the config file a new option 'relative-path' would enable relative paths for all other path options.

This would avoid strange effects for users who do not need this feature.

Verify <postgres/> data file checksums during backup

Github Issue

>= 9.3 has the option to enable checksums on data files. should be able to test these checksums and report if it finds any issues. This could also be a stand-alone function.

For extra credit, test the checksums in WAL.

Database logging

Github Issue

Add the ability to log into a database - especially a centralized database. Include all backup metadata plus the logs.

This would make for easy querying and monitoring.

Encryption

Github Issue

Backup encryption is a good thing ans might be required before users would be comfortable pushing to services such as S3.

Alternative storage methods (S3)

Github Issue

Consider alternative storage methods like S3. Ideally there would be an option to store a certain number of backups (at least the last one) locally for fast restores, while using S3 for long-term storage.