pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-12 10:04:14 +02:00

Author	SHA1	Message	Date
David Steele	6ad79d16ca	v2.45: Block Incremental Backup (BETA) Bug Fixes: * Skip writing recovery.signal by default for restores of offline backups. (Reviewed by Stefan Fercot. Reported by Marcel Borger.) Features: * Block incremental backup (BETA). (Reviewed by John Morris, Stephen Frost, Stefan Fercot.) Improvements: * Keep only one all-default group index. (Reviewed by Stefan Fercot.) Documentation Improvements: * Add explicit instructions for upgrading between 2.x versions. (Contributed by Christophe Courtois. Reviewed by David Steele.) * Remove references to SSH made obsolete when TLS was introduced.	2023-03-20 09:37:23 +08:00
David Steele	04d224b88e	Add beta feature and documentation for block incremental backup.	2023-03-18 20:47:03 +08:00
David Steele	78c036efb7	Updates required for new manifest command to work remotely. Additional options are required and the BlockChecksum must work remotely.	2023-03-16 17:46:32 +07:00
David Steele	505a639f1e	Prototype manifest command. Output a manifest in text or JSON format. Neither format is complete but they cover the basics. In particular the manifest command outputs the complete block restore when the filter option is specified and the block delta when the pg option is also specified. This is non-destructive so it is safe to execute against a running cluster.	2023-03-14 21:56:05 +07:00
David Steele	c30d3e439b	Block incremental map fixes and improvements. Bug Fixes: * Remove the distinction between maps where super block size is equal to block size and maps where they are not. In practice, maps with equal blocks are now rare and most of the optimizations can be applied directly to super blocks where the blocks are equal. This fixes a bug where a map that was created with equal size blocks and then converted to differing block sizes would generate an invalid map. * Free reads during restore to avoid running out of file handles. Improvements: * Store super block sizes in the block map. This allows the final block size to be removed from the block list and provides a more optimal restore and better potential for analysis. * Always round the super block size up to the next block size. This makes the number of blocks per super block more predictable. * Allow super block sizes to be changed at will in the map. The first case for this is to store the reduced super block size required when the last super block is short but it could be used to dynamically change the super block size to optimize compression. * Store a block count rather than a list of blocks in a super block. Blocks must always be sequential, though there may be an offset to the first block in a super block. This saves 11-14% on space for checksum sizes 6-7. * In the case that all the blocks for a super block are present, and there is no offset, the block size is omitted.	2023-03-14 17:48:25 +07:00
David Steele	5c1f78d4dd	Fix typo in blockIncrProcess().	2023-03-12 22:38:38 +07:00
David Steele	1281a6eaf8	Ensure no continuations when block size equals super block size. In this case each super block contains a single block so continuations are not possible.	2023-03-12 16:21:43 +07:00
David Steele	d7704a8bc8	Move backup pq test script generation to backup harness. This allows scripted online backups to be used by other test modules.	2023-03-11 14:04:43 +07:00
David Steele	2fffd64213	Fix error handling in build-code binary. `120a49b6` improved the error handling but due to a copy/pasto errors were being handled as asserts, which tripped an assertion in the log module.	2023-03-10 16:11:30 +07:00
David Steele	19f3a1d304	Add missing header.	2023-03-10 16:08:30 +07:00
David Steele	24f725212d	Add beta feature infrastructure. This allows options to be marked as beta, which will require that the --beta option be supplied to prevent accidental usage of a beta feature. The online and command-line documentation also show warnings when options are beta.	2023-03-10 15:30:27 +07:00
David Steele	6b409d049e	Update default block size and super block values based on testing. Block sizes are incremented when the size of the map becomes as large as a single block. This is arbitrary but it appears to give a good balance of block size vs map size. The full backup super block size is set to minimize loss of compression efficiency since most blocks in the database will likely never be modified. For diff/incr backup super blocks, a smaller size is allowable since only modified blocks are stored. The overall savings of not storing unmodified blocks offsets the small loss in compression efficiency due to the smaller super block and allows more granular fetches during restore.	2023-03-10 14:01:38 +07:00
David Steele	1119a53539	Rename BlockHash to BlockChecksum. Checksum is the generally used terminology in the code base, even when a hash is being used as a checksum.	2023-03-09 11:04:03 +07:00
David Steele	6252c0e448	Exclude backup set size from info for block incremental backups. As calculated this size is not correct since it does not include the parts of prior block incrementals that are required to make the current block incremental valid. At best this could be approximated and the resulting values might be very confusing. For now, at least, exclude this metric for block incremental backups.	2023-03-09 10:30:57 +07:00
David Steele	210bed4511	Use xxHash instead of SHA-1 for block incremental checksums. xxHash is significantly faster than SHA-1 so this helps reduce the overhead of the feature. A variable number of bytes are used from the xxHash depending on the block size with a minimum of six bytes for the smallest block size. This keeps the maps smaller while still providing enough bits to detect block changes.	2023-03-09 10:02:04 +07:00
David Steele	8b5153ad21	Block-level incremental backup super blocks. Small block sizes can lead to reduced compression efficiency, so allow multiple blocks to be compressed together in a super block. The disadvantage is that the super block must be read sequentially to retrieve blocks. However, different super block sizes can be used for different backup types, so the full backup super block sizes are large for compression efficiency and diff/incr are smaller for retrieval efficiency.	2023-03-09 09:39:54 +07:00
Stefan Fercot	740c2258e3	Add pg-version-force option for fork integration. Forks may update pg_control version or WAL magic without affecting the structures that pgBackRest depends on. This option forces pgBackRest to treat a cluster as the specified version when it cannot be automatically identified.	2023-03-09 08:23:15 +07:00
David Steele	2fa7e53c5d	Skip writing recovery.signal by default for restores of offline backups. When restoring an offline backup on PostgreSQL >= 12, skip writing recovery.signal by default since this will error if the backup was made with wal_level=minimal. If the user explicitly sets the type option to something other than none, then write recovery.signal as usual since it is possible to do Point-In-Time-Recovery from an offline backup as long as wal_level was not minimal.	2023-03-08 19:05:23 +07:00
David Steele	7e5adc0359	Use raw compression/encryption for bundling and block incremental backup. Raw encryption was already being used for block incremental. This commit adds raw compression to block incremental where possible (see `da918587`). Raw compression/encryption is also added to bundling for a backup set when block incremental is enabled on the full backup. This prevents a break in backward compatibility since block incremental is not backward compatible.	2023-03-07 18:46:24 +07:00
David Steele	da91858702	Add optional raw format for compression types. Raw format saves 12 bytes of header for gzip and 4 bytes of checksum for lz4 (plus CPU overhead). This may not seem like much, but over millions of small files or incremental blocks can really add up. Even though it may be a relatively small percentage of the overall backup size it is still objectively a large amount of data. Use raw format for protocol compression to exercise the feature. Raw compression format will be added to bundling and block incremental in a followup commit.	2023-03-07 18:31:17 +07:00
David Steele	f6e307365f	Improve interface handling in storage module. Make the interface object the parent of the driver object rather than the interface being allocated directly in the driver object. The prior method was more efficient when mem contexts had a much higher cost. Now mem contexts are cheap so it makes more sense to structure the objects in a way that works better with mem context auditing. This also means the mem context does not need to be stored separately since it can be extracted directly from the interface object. There are other areas that need to get the same improvement before the specialized objMoveContext() and objFreeContext() functions can be removed.	2023-03-07 11:27:41 +07:00
David Steele	120a49b659	Improved error handling in build-code binary. Show a full stack trace instead of just the error.	2023-03-06 21:07:08 +07:00
David Steele	0818601c05	Set online flag in manifest in command/restore unit tests. This flag does not currently affect restore behavior but it will in an upcoming commit. Set the flag here to simplify the test diff in the upcoming commit.	2023-03-06 09:49:33 +07:00
David Steele	c656669ac2	Allow control version and WAL magic to be overridden in test harness. This makes it easier to write tests for invalid control version/WAL magic. Also add HRN_PG_WAL_TO_BUFFER() to simplify generation of WAL headers.	2023-03-04 12:50:02 +07:00
David Steele	1648c133d6	Keep only one all-default group index. It is possible for a group index to be created for an option that is later found to not meet dependencies. In this case all values would be default leading to a phantom group, which can be quite confusing. Remove group indexes that are all default (except the final one) and make sure the key for the final all default group index is 1.	2023-03-04 12:45:08 +07:00
Stefan Fercot	4394479776	Fix typo and remove extraneous linefeed.	2023-02-28 08:47:51 +07:00
David Steele	16ac5ee8d3	Rename block incremental manifest keys. Since the keys need to be read/written in order, these keys make the logic a bit simpler.	2023-02-26 16:13:44 +07:00
David Steele	a9867cb0b8	Add repo-block-age-map and repo-block-size-map options. Make these options configurable. This is primarily for testing purposes so the new options will be kept internal.	2023-02-26 14:49:34 +07:00
Christophe Courtois	15d5dcdd3b	Add explicit instructions for upgrading between 2.x versions. Add an explicit statement that there is nothing special to do when upgrading between 2.x versions. Leave the previous paragraph about the default location that changed between 2.00 and 2.02, as it is more a matter of transitioning from 1.x to 2.x.	2023-02-26 14:41:32 +07:00
David Steele	dffc933384	Rename DeltaMap to BlockHash. This more accurately describes what the object does.	2023-02-13 09:17:30 +07:00
David Steele	779efe0d7a	Consistently declare block incremental size as size_t. The block is often allocated in memory so size_t makes more sense than uint64_t.	2023-02-09 13:01:56 +07:00
David Steele	d520816acf	Remove parameter list from deltaMapNew(). Since this filter cannot be used remotely (yet) there is no reason to create a parameter list.	2023-02-09 08:11:05 +07:00
David Steele	3feed389a2	Improve IoChunkedRead end-of-file handling. Determine end-of-file earlier to improve throughput. Also clean up some comments and formatting.	2023-02-08 22:34:23 +07:00
David Steele	089fae035b	Add block incremental to real/all test output.	2023-02-07 14:09:50 +07:00
David Steele	31cad5e09e	Check for stray execute permissions in test.pl --code-format. Sometimes these can get set while updating permissions from within a VM or container. Also fix a few permissions that were not correct.	2023-02-01 12:57:04 +07:00
David Steele	8e7e9d36a1	Fix contributors in release notes.	2023-01-31 21:28:28 +07:00
David Steele	c5907a2e71	Remove references to SSH made obsolete when TLS was introduced. Also remove details about SSH compression that are not helpful.	2023-01-31 08:28:32 +07:00
David Steele	9ae6b6ef0f	Update comments missed in copy/paste.	2023-01-31 08:04:09 +07:00
David Steele	ce0ea2cfab	Use uncrustify for code formatting. The code is not completely reflowed yet so there are some cases that uncrustify will not catch. The formatting will be improved over time. Some blocks of code require special formatting so have been surrounded with the {uncrustify-off}/{uncrustify-on} markers. These exceptions should be kept to a minimum. Add --code-format (to reformat code) and --code-format-check (to check formatting) to test.pl. Add a CI test that will check code formatting. Code must be correctly formatted before it can be merged to integration. Add documentation to the coding standards for code formatting.	2023-01-30 12:22:41 +07:00
David Steele	d4070c9064	Reformat code with uncrustify. uncrustify has been configured to be as close to the current format as possible but the following changes were required: * Break long struct initializers out of function calls. * Bit fields get extra spacing. * Strings that continue from the previous line no longer indented. * Ternary operators that do not fit on a single line moved to the next line first. * Align under parens for multi-line if statements. * Macros in header #if blocks are no longer indented. * Purposeful lack of function indentation in tests has been removed. Currently uncrustify does not completely reflow the code so there are some edge cases that might not be caught. However, this still represents a huge improvement and the formatting can be refined going forward. Support code for uncrustify will be in a followup commit.	2023-01-30 11:55:54 +07:00
David Steele	b2202c36d9	Fix formatting errors. Errors in our current (manually-maintained) code format discovered by uncrustify.	2023-01-30 11:16:31 +07:00
David Steele	a96837d2ed	Refactor loop limit in removeExpiredArchive(). This simplifies the formatting, which was a bit tortured. Also, there is no need to check the limit on each iteration so this is a bit more efficient.	2023-01-30 10:55:14 +07:00
David Steele	f91af30584	Add FN_PRINTF() macro. Bring the format(printf) attribute in line with the FN_NO_RETURN and FN_INLINE_ALWAYS macros. This is simpler to read and can be customized for different compilers.	2023-01-30 10:51:48 +07:00
David Steele	f8ea51bf8f	Refactor stackTraceToZ() to avoid multiple versions of the function. stackTraceToZ() was split this way in `c8264291` to allow complete coverage. `0becb6da` added a shim to improve coverage but missed simplifying the function.	2023-01-30 10:27:17 +07:00
David Steele	0d8e08f69c	Update config.guess and config.sub to latest versions.	2023-01-30 09:29:07 +07:00
David Steele	240312110c	Begin v2.45 development.	2023-01-30 09:27:04 +07:00
David Steele	053468bfb1	v2.44: Remove PostgreSQL 9.0/9.1/9.2 Support Improvements: * Remove support for PostgreSQL 9.0/9.1/9.2. (Reviewed by Stefan Fercot.) * Restore errors when no backup matches the current version of PostgreSQL. (Contributed by Stefan Fercot. Reviewed by David Steele. Suggested by Soulou.) * Add compress-level range checking for each compress-type. (Reviewed by Stefan Fercot. Suggested by gkleen, ViperRu.) Documentation Improvements: * Add warning about enabling "hierarchical namespace" on Azure storage. (Reviewed by Stefan Fercot. Suggested by Vojtech Galda, Pluggi, asjonos.) * Add replacement for linefeeds in monitoring example. (Reviewed by Stefan Fercot. Suggested by rudonx, gmustdie, Ivan Shelestov.) * Clarify target-action behavior on various PostgreSQL versions. (Contributed by Chris Bandy. Reviewed by David Steele, Anton Kurochkin, Stefan Fercot. Suggested by Anton Kurochkin, Chris Bandy.) * Updates and clarifications to index page. (Reviewed by Stefan Fercot.) * Add dark mode to the website. (Suggested by Stephen Frost.)	2023-01-30 09:15:44 +07:00
David Steele	a28f3d49c2	Add --no-back-trace option to test.pl. Running valgrind and backtrace together has been causing tests to timeout in CI, most likely due to limited resources. This has not been a problem in normal development environments. Since it is still important to run backtraces for debugging, split the u22 test that was doing all this work to run coverage and backtrace together and valgrind-only as a separate test. As a bonus these tests run faster separately and since they run in parallel the total execution time is faster.	2023-01-28 18:55:53 +07:00
David Steele	16c625353d	Skip test modules in coverage report. Coverage of the test code is useful during development but it does not seem worth adding it to the core code coverage report.	2023-01-28 16:22:04 +07:00
David Steele	d5ef4ce747	Fix permissions on source files.	2023-01-28 11:01:29 +07:00
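
Several of the commits above (notably 210bed4511 and 8b5153ad21) describe detecting changed blocks by comparing short per-block checksums. The following is a rough sketch of that general idea only, not pgBackRest's implementation: the block size, the function names, and the use of BLAKE2b as a stand-in for xxHash (which is not in the Python standard library) are all assumptions made for illustration. Only the six-byte minimum checksum length comes from the commit message.

```python
import hashlib

BLOCK_SIZE = 8192    # hypothetical block size in bytes (assumption)
CHECKSUM_BYTES = 6   # truncated checksum length; six bytes is the stated minimum


def block_checksums(data: bytes, block_size: int = BLOCK_SIZE) -> list[bytes]:
    """Return one truncated checksum per fixed-size block of data."""
    return [
        # BLAKE2b is a stand-in here; the commit log says xxHash is used
        hashlib.blake2b(
            data[offset:offset + block_size], digest_size=CHECKSUM_BYTES
        ).digest()
        for offset in range(0, len(data), block_size)
    ]


def changed_blocks(prior: list[bytes], current: list[bytes]) -> list[int]:
    """Return indexes of blocks that are new or whose checksum differs."""
    return [
        index
        for index, checksum in enumerate(current)
        if index >= len(prior) or prior[index] != checksum
    ]


# Three zero-filled blocks, then one byte modified inside the second block
old = bytes(BLOCK_SIZE * 3)
new = bytearray(old)
new[BLOCK_SIZE + 10] = 1

print(changed_blocks(block_checksums(old), block_checksums(bytes(new))))  # [1]
```

Only the block whose checksum differs would need to be stored in an incremental backup under this scheme. Shorter checksums keep the per-file map small at the cost of a slightly higher chance of missing a change, which is the trade-off the xxHash commit refers to.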

4174 Commits