pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	a1d7d50223	Fix macro spacing.	2020-08-05 17:30:38 -04:00
David Steele	d5edc3fe37	Remove unused port struct member. This does not appear to have been used in quite some time and the tests are equally useless because they don't prove the correct port was passed to httpClientNew().	2020-08-05 15:24:13 -04:00
David Steele	d74c8f4b58	Add memContextFreeing(). This can be used to determine if functions are running in a callback from memContextFree(). In this case, data in child mem contexts is not valid.	2020-08-05 14:07:04 -04:00
David Steele	847e61ce21	Improve TLS error reporting. Before `9f2d647` TLS errors included additional details in at least some cases. After `9f2d647` a connection to an HTTP server threw `TLS error [1]` instead of `unable to negotiate TLS connection: [336031996] unknown protocol`. Bring back the detailed messages to make debugging TLS errors easier. Since the error routine is now generic the `unable to negotiate TLS connection context` is not available so the error looks like `TLS error [1:336031996] unknown protocol`.	2020-08-04 15:15:24 -04:00
David Steele	e81533bbab	Improve memory usage of unlogged relation detection in manifest build. This loop was using a lot of memory without freeing it at intervals. Rewrite to use char arrays when possible to reduce memory that needs to be allocated and freed.	2020-08-04 10:16:51 -04:00
David Steele	5a9856c2f9	Add functions for Zigzag encoding/decoding. Zigzag encoding places the sign bit in the least significant bit so that -1 is encoded as 1, 1 as 2, etc. This moves as many bits as possible into the low order bits which is good for other types of encoding, e.g. base-128. See https://en.wikipedia.org/wiki/Variable-length_quantity#Zigzag_encoding.	2020-08-01 09:42:03 -04:00
David Steele	caef2ba482	Accept NULL filenames in hrnStorageInfoListCallback(). This allows it to be used with StorageInfo objects returned from storageInfo(), which do not contain a filename.	2020-07-31 16:35:58 -04:00
David Steele	a8e47c38c6	Add lstGetLast() and lstRemoveLast(). When a list is being treated as a stack it is useful to get/remove the last (top) item.	2020-07-31 16:27:57 -04:00
David Steele	bfb489a82d	Add file name to make performance/storage test more realistic. Also add timing information.	2020-07-31 16:18:56 -04:00
David Steele	9f1e7d88da	Remove special protocol encoding for StorageType. It seems like overkill to encode this when other enums (e.g. StorageInfoLevel) are passed as integers. Instead note that StorageType values should not be changed and remove the special encoding.	2020-07-31 16:13:05 -04:00
David Steele	ac72e1f193	Preserve Variant parsed from JSON in iniLoad(). The fix for = characters in info files (`039d314`) added JSON validation but discarded the resulting Variant which means the JSON is being parsed twice. This nearly doubles the time to load a manifest since a lot of complex JSON is involved. Time to load a million file manifest: Before `039d314`: 7.8s `039d314`: 15.5s This patch: 7.5s To fix this regression return the Variant in the callback so the caller does not have to parse it again. The new code appears slightly more efficient overall, probably because there are fewer operations against Strings.	2020-07-30 10:59:50 -04:00
David Steele	3e9dce0d76	Rename strPtr()/strPtrNull() to strZ()/strZNull(). We use the Z suffix in many functions to indicate that we are expecting a zero-terminated string so make this function conform to the pattern. As a bonus the new name is a bit shorter, which is a good quality in a commonly-used function.	2020-07-30 07:49:06 -04:00
David Steele	039d314438	Fix issue with = character in file or database names. The manifest uses the = character as the key/value separator so = characters in the key cause parsing errors and lead to an error or segfault. Since the value must be valid JSON we can keep checking the value on the right side of the = and stop building the key when the value is valid. It's a bit hackish but it does seem to do the job without breaking the manifest format. Unsurprisingly this makes parsing about 50% slower but it's still more than fast enough. Parsing 10 million key/values takes about 6.5s for the old code and 10s for the new code. Since the value is used as JSON downstream we can reclaim most of this time by just passing the JSON value rather than making the callback reparse it. We'll save that for another commit, though.	2020-07-28 14:00:23 -04:00
David Steele	63a93db6fd	Suppress errors when closing local/remote processes. Since the command has completed it is counterproductive to throw an error but still warn to indicate that something unusual happened. Also fix the related issue that the local processes were not being shut down when they completed, which meant that they might timeout before being closed when pgbackrest terminated.	2020-07-28 12:15:33 -04:00
David Steele	a260d4a53b	Add zstd to CentOS/RHEL 6 test container. Zstd is now required by the upstream yum package.	2020-07-28 08:09:10 -04:00
David Steele	8e776cc537	Add manifestNewBuild() to performance/type test. Use a test storage driver to allow manifestNewBuild() to be run against a test cluster at any scale without having to write files to disk. Simplify the test by using the output of manifestNewBuild() to feed manifestSave() and manifestNewLoad(). Also add manifest size to the output.	2020-07-25 11:26:14 -04:00
David Steele	f9d923ca3b	Add memContextSize(). Calculates the memory used by the context and all child contexts. This is primarily useful for debugging but it is not conditional on DEBUG because it is useful for profile/performance tests.	2020-07-25 11:06:25 -04:00
David Steele	216a61d936	Move dummy storage driver to test harness. The dummy driver is the basis for creating test storage drivers so it makes sense to locate it in the harness where all tests can access it.	2020-07-25 08:44:41 -04:00
David Steele	cbf5dd19c2	Fix non-JSON ini values in tests. A number of tests used invalid JSON values where an error was expected or the value would be ignored. Update these tests to use valid JSON values so all values in the file can be validated even if they are not used.	2020-07-24 11:05:09 -04:00
David Steele	9d8f6d9fc2	Error in jsonToVar() when input not entirely consumed. Something like 3="string" would return an Int64 variant and ignore the invalid portion after the integer. Other JSON interface functions have this check but it was forgotten here. There are no current issues because of this but we want to be able to validate arbitrary JSON strings and this function was not working correctly for that usage.	2020-07-24 10:47:51 -04:00
Cynthia Shang	78ef442a18	Add storage parameter to pgWalFromFile().	2020-07-21 16:28:05 -04:00
David Steele	90ffde8881	Remove unused constructor strLstNewParam(). This function is not used in the core code so remove it and update the test where it was used. There may eventually be a need for a strLstNewP() function but it doesn't seem worth the code churn until there is an actual requirement.	2020-07-20 18:09:32 -04:00
David Steele	1783e0490a	Remove lstNew() constructor in favor of lstNewP(). The old constructor was left around to reduce code churn during the migration but it just makes the code harder to read and search. Remove the old constructor and rename all remaining instances to lstNewP(), which by default has the same semantics.	2020-07-20 15:22:33 -04:00
David Steele	3f0b41eb9c	Add support for testing on 64-bit big-endian architectures. In particular add support for s390x but we hope this will work for other 64-bit big-endian architectures. Run basic unit tests on Travis CI for 390x.	2020-07-20 09:59:16 -04:00
David Steele	55277357b8	Reduce reliance on static checksums in unit tests. Testing against static checksums is valuable but it can be become burdensome when supporting multiple architectures. Reduce the number of tests we are doing against static checksums when the architecture can cause the checksum to vary.	2020-07-20 09:47:43 -04:00
David Steele	5a4b91f90a	v2.28: Azure Repository Storage Bug Fixes: * Fix restore --force acting like --force --delta. This caused restore to replace files based on timestamp and size rather than overwriting, which meant some files that should have been updated were left unchanged. Normal restore and restore --delta were not affected by this issue. (Reviewed by Cynthia Shang.) Features: * Azure support for repository storage. (Reviewed by Cynthia Shang, Don Seiler.) * Add expire-auto option. This allows automatic expiration after a successful backup to be disabled. (Contributed by Stefan Fercot. Reviewed by Cynthia Shang, David Steele.) Improvements: * Asynchronous S3 multipart upload. (Reviewed by Stephen Frost.) * Automatic retry for backup, restore, archive-get, and archive-push. (Reviewed by Cynthia Shang.) * Disable query parallelism in PostgreSQL sessions used for backup control. (Reviewed by Stefan Fercot.) * PostgreSQL 13 beta2 support. Changes to the control/catalog/WAL versions in subsequent betas may break compatibility but pgBackRest will be updated with each release to keep pace. * Improve handling of invalid HTTP response status. (Reviewed by Cynthia Shang.) * Improve error when pg1-path option missing for archive-get command. (Reviewed by Cynthia Shang.) * Add hint when checksum delta is enabled after a timeline switch. (Reviewed by Matt Bunter, Cynthia Shang.) * Use PostgreSQL instead of postmaster where appropriate. (Reviewed by Cynthia Shang.) Documentation Bug Fixes: * Fix incorrect example for repo-retention-full-type option. (Reported by Höseyin Sönmez.) * Remove internal commands from HTML and man command references. (Reported by Cynthia Shang.) Documentation Improvements: * Update PostgreSQL versions used to build user guides. Also add version ranges to indicate that a user guide is accurate for a range of PostgreSQL versions even if it was built for a specific version. (Reviewed by Stephen Frost.) * Update FAQ for expiring a specific backup set. (Contributed by Cynthia Shang. Reviewed by David Steele.) * Update FAQ to clarify default PITR behavior. (Contributed by Cynthia Shang. Reviewed by David Steele.)	2020-07-20 08:57:22 -04:00
David Steele	24d2c5b277	Remove real/all integration tests now covered by unit tests. Remove all check and stanza-* tests except for the ones that are intended to succeed. The successful tests show that the queries run with expected results against each version of PG which should also validate queries for the failure tests in the unit tests. Also remove the tests for --no-online backups since they don't require a database and are well tested in the unit tests.	2020-07-16 13:57:14 -04:00
Stefan Fercot	047d85c263	Automatically determine cipher passphrase in repo-get command. The prior code was only able to use the main passphrase automatically and expected sub passphrases to be specified for each operation. This was fine for testing but hardly sufficient for a user-facing feature. Update the code to determine which passphrase to use for any file in the repository and error when an invalid file or location is selected. The repo-get command is still internal for now, but with this improvement it should be ready to be made public.	2020-07-16 12:24:03 -04:00
David Steele	aa4e13b665	Move encrypted files as raw in integration tests. The encryption key should not be changed when moving a file so no need to decrypt/encrypt.	2020-07-16 11:27:14 -04:00
David Steele	88b0f6245d	Run non version specific real/tests on the expect version. There are a few non version specific tests that need to be run in integration because we can't get coverage in the unit tests. To save some time we'll only run those tests against the same version we use for expect testing.	2020-07-15 13:19:16 -04:00
David Steele	574f36c9d2	Rename httpRequest() to httpRequestResponse() and fix comment.	2020-07-14 15:14:41 -04:00
David Steele	620a8d17cf	Automatic retry for backup, restore, archive-get, and archive-push. If a local command, e.g. backupFile(), fails it will stop the entire process. Instead, retry local commands to deal with transient errors. Remove special logic in the S3 storage driver to retry RequestTimeTooSkewed errors since this is now handled by the general retry mechanism in the places where it is most likely to happen, i.e. file read/write. Also, this error should have been entirely eliminated by the asynchronous TLS implementation.	2020-07-14 15:05:31 -04:00
David Steele	91c7adc834	Allow redactions for HTTP queries. The Azure storage driver exposes secrets in the query when using SAS authorization. These secrets can show up during logging or when an error occurs. Allow redaction of queries to prevent secrets from being exposed in logs and errors.	2020-07-14 13:09:48 -04:00
Stefan Fercot	d3dd32a031	Add expire-auto option. This allows automatic expiration after a successful backup to be disabled.	2020-07-14 08:12:25 -04:00
David Steele	d5df3974b5	Read segment size from WAL headers. This allows validation of the WAL segment size for PostgreSQL versions <= 10.	2020-07-09 17:32:36 -04:00
David Steele	2f7823c627	Add shared access signature (SAS) authorization for Azure. A shared access signature (SAS) provides granular, delegated access to resources in a storage account. This is often preferable to using a shared key which provides more access and is a greater security risk if compromised.	2020-07-09 14:46:48 -04:00
David Steele	511e5db5bf	Improve buffer size limit implementation. Rework size limits so that this->size is always the current size no matter how much is allocated. Most importantly, this removes the conditional in bufSize(), which makes it a better candidate for inlining.	2020-07-09 11:16:45 -04:00
David Steele	15502f5b4b	Remove bufNewUseC(). This was used in the Perl LibC interface to wrap Perl-allocated buffers but is no longer needed since LibC was removed.	2020-07-09 07:16:15 -04:00
David Steele	dd9e14b628	Add pgLsnFromWalSegment(). Provides the reverse operation for pgLsnToWalSegment().	2020-07-08 12:25:39 -04:00
David Steele	a27ff7c335	Remove dead test code that should have been removed in `3f4371d7`.	2020-07-07 08:24:08 -04:00
David Steele	682ac656f5	Fix restore --force acting like --force --delta. This caused restore to replace files based on timestamp and size rather than overwriting, which meant some files that should have been updated were left unchanged. Normal restore and restore --delta were not affected by this issue.	2020-07-06 15:03:24 -04:00
David Steele	cf284fbe8a	Add httpUriDecode(), httpQueryNewStr(), and httpQueryMerge(). httpUriDecode() reverses the encoding in httpUriEncode(). httpQueryNewStr() creates a new HttpQuery by parsing a query string. httpQueryMerge() merges the contents of one query into another query.	2020-07-06 07:48:12 -04:00
David Steele	3f4371d7a2	Azure support for repository storage. Azure and Azure-compatible object stores can now be used for repository storage. Currently only shared key authentication is supported but SAS will be added soon.	2020-07-02 16:24:34 -04:00
David Steele	c2dea180fb	Remove redundant storage type constants. These constants predate the C storage drivers which now provide their own constants.	2020-06-26 16:50:29 -04:00
David Steele	96adf8e513	PostgreSQL 13 beta2 support. There don't appear to be any behavioral changes since PostgreSQL 12 and all the tests pass. Changes to the control/catalog/WAL versions in subsequent betas may break compatibility but pgBackRest will be updated with each release to keep pace.	2020-06-26 07:44:56 -04:00
David Steele	974cc10b90	Minor improvements to storage/s3 unit test.	2020-06-26 06:46:25 -04:00
David Steele	ea04ec7b3f	Disable query parallelism in PostgreSQL sessions used for backup control. There is no need to have parallelism enabled in a backup control session. In particular, 9.6 marks pg_stop_backup() as parallel-safe but an error will be thrown if pg_stop_backup() is run in a worker.	2020-06-25 08:02:48 -04:00
David Steele	ce98e326e1	Replace HRNPQ_MACRO_OPEN_92() test macro with HRNPQ_MACRO_OPEN_GE_92().	2020-06-24 18:40:19 -04:00
David Steele	f55cb386d4	Fix versions passed to HRNPQ_MACRO_OPEN_GE_92() test macro. These were not noticed because currently 9.3 and 9.6 behave the same on open.	2020-06-24 18:33:20 -04:00
David Steele	c5892d1291	Asynchronous S3 multipart upload. When uploading large files the upload is split into multiple parts which are assembled at the end to create the final file. Previously we waited until each part was acknowledged before starting on the processing (i.e. compression, etc.) of the next part. Now, the request for each part is sent while processing continues and the response is read just before sending the request for the next part. This asynchronous method allows us to continue processing while the S3 server formulates a response. Testing from outside AWS in a high-bandwidth, low-latency environment showed a 35% improvement in the upload time of 1GB files. The time spent waiting for multipart notifications was reduced by ~300% (this measurement included the final part which is not uploaded asynchronously). There are still some possible improvements: 1) the creation of the multipart id could be made asynchronous when it looks like the upload will need to be multipart (this may incur cost if the upload turns out not to be multipart). 2) allow more than one async request (this will use more memory). A fair amount of refactoring was required to make the HTTP responses asynchronous. This may seem like overkill but having well-defined request, response, and session objects will also be advantageous for the upcoming HTTP server functionality. Another advantage is that the lifecycle of an HttpSession is better defined. We only want to reuse sessions that complete the request/response cycle successfully, otherwise we consider the session to be in a bad state and would prefer to start clean with a new one. Previously, this required complex notifications to mark a session as "successfully done". Now, ownership of the session is passed to the request and then the response and only returned to the client after a successful response. If an error occurs anywhere along the way the session will be automatically closed by the object destructor when the request/response object is freed (depending on which one currently owns the session).	2020-06-24 13:44:00 -04:00

1 2 3 4 5 ...

1564 Commits