pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-12 10:04:14 +02:00

Author	SHA1	Message	Date
David Steele	9382283586	Fix issues when a path option is / terminated. This condition was not being properly checked for in the C code and it caused problems in the info command, at the very least. Instead of applying a local fix, introduce a new path option type that will rigorously check the format of any incoming paths. Reported by Marc Cousin.	2019-03-14 13:48:33 +04:00
David Steele	b8ebea6b1c	Add separate archive-push-async command. This command was previously forked off from the archive-push command which required a bit of artificial option and log manipulation. A separate command is easier to test and will work on platforms that don't have fork(), e.g. Windows.	2019-03-14 13:38:55 +04:00
blogh	e4e2606fce	Add additional options to backup.manifest for debugging purposes. Add the buffer-size, compress-level, compress-level-network, and process-max options to the backup:option section in backup.manifest to aid in debugging. It may also make sense to propagate these options up to backup.info so they can be displayed in the info command, but for now this is deemed sufficient. Contributed by blogh.	2019-03-10 11:03:52 +02:00
David Steele	21f56f64eb	Add hints when unable to find a WAL segment in the archive. When this error happens in the context of a backup it can be a bit mystifying as to why the backup is failing. Add some hints to get the user started. These hints will appear any time a WAL segment can't be found, which makes the hint about the check command redundant when the user is actually running the check command, but it doesn't seem worth trying to exclude the hint in that case. Suggested by Hans-Jürgen Schönig.	2019-03-10 10:38:12 +02:00
David Steele	d441061168	Create test matrix for mock/all to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-02 15:01:02 +02:00
David Steele	f7d1d4400f	Create test matrix for mock/expire to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests.	2019-03-01 19:04:26 +02:00
David Steele	91622942c2	Create test matrix for mock/archive-stop to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-01 17:12:41 +02:00
Marc Cousin	cb3b4fa24b	Enable socket keep-alive on older Perl versions. The prior method depended on IO:Socket:SSL to push the keep-alive options down to the socket but it only worked for recent versions of the module. Instead, create the socket directly using IO::Socket::IP if available or IO:Socket:INET as a fallback. The keep-alive option is set directly on the socket before it is passed to IO:Socket:SSL. Contributed by Marc Cousin.	2019-02-28 14:33:29 +02:00
David Steele	db4b447be8	The archive-get command is implemented entirely in C. This new implementation should behave exactly like the old Perl code with the exception of a few updated log messages. Remove as much of the Perl code as possible without breaking other commands.	2019-02-27 23:03:02 +02:00
David Steele	9367cc461c	Migrate local command to C. The C local is only used for C commands in the main process. Some tweaking of the existing protocolGet() command was required. Originally the idea was to share the function for local and remote requests but the differences (as in Perl) were too great to make that practical.	2019-02-27 22:34:21 +02:00
David Steele	18b62a4220	Only run test-level stack trace by default for unit-tested modules. This amends `70c30dfb` which disabled test tracing in general. Instead, only enable test tracing by default for modules that are being unit tested. This saves lots of time but still ensures that test tracing is working and helps with debugging in unit tests. Also rename the option to --debug-test-trace for a clarity.	2019-02-27 17:09:19 +02:00
David Steele	3a05359087	Create test matrix for mock/stanza to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-24 07:42:41 +02:00
David Steele	6d3e18b181	Reduce expect log level in mock/stanza tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the stanza code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-24 06:55:59 +02:00
David Steele	2f081f3ec7	Rename test modules for consistency. The conventions for command and info tests have shifted in the C modules, though not even all the C modules got the message.	2019-02-23 18:51:52 +02:00
David Steele	d489eb87f7	Create test matrix for mock/archive to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-23 15:59:39 +02:00
David Steele	4a7588e604	Create aliases for test VMs ordered by age. This will allow for smarter allocation of tests in the next commit.	2019-02-23 15:13:23 +02:00
David Steele	59d7958914	Reduce expect log level in mock/archive tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the archive code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-23 15:05:06 +02:00
David Steele	70c30dfb61	Disable test-level stack trace by default. Detailed stack traces for low-level functions (e.g. strCat, bufMove) can be very useful for debugging but leaving them on for all tests has become quite burdensome in terms of time. Complex operations like generating JSON on a large KevValue can lead to timeouts even with generous values. Add a new param, --debug-trace, to enable test-level stack trace, but leave it off by default.	2019-02-22 11:40:30 +02:00
David Steele	d211c2b8b5	Fix possible truncated WAL segments when an error occurs mid-write. The file write object destructors called close() and finalized the file even if it was not completely written. This was an issue in both the C and Perl code. Rewrite the destructors to simply free resources (like file handles) rather than calling the close() method. This leaves the temp file in place for filesystems that use temp files. Add unit tests to prevent regression. Reported by blogh.	2019-02-15 11:52:39 +02:00
David Steele	057e2e2782	Add unimplemented S3 driver method required for archive-get. This was not being caught because the integration tests for S3 were running remotely and going through the Perl code rather than the new C code. Implement the exists method for the S3 driver and add tests to prevent a regression. Reported by mibiio.	2019-02-09 18:57:30 +02:00
David Steele	aa3e5b8c72	Allow primary gid for the test user to be different from uid. Apparently up until now they have always been the same, which is pretty typical. However, if they were not then ContainerTest.pm was not happy.	2019-01-30 17:03:17 +02:00
David Steele	8f6d324b2c	Fix issue with multiple async status files causing a hard error. Multiple status files were being created by asynchronous archiving if a high-level error occurred after one or more WAL segments had already been transferred successfully. Error files were being written for every file in the queue regardless of whether it had already succeeded. To fix this, add an option to skip writing error files when an ok file already exists. There are other situations where both files might exist (various fsync and filesystem error scenarios) so it seems best to retry in the case that multiple status files are found rather than throwing a hard error (which then means that archiving is completely stuck). In the case of multiple status files, a warning will be logged to alert the user that something unusual is happening and the command will be retried. Reported by fpa-postgres, Joe Ayers, Douglas J Hunley.	2019-01-26 16:59:54 +02:00
David Steele	d245f8eb42	The info command is implemented entirely in C. The C info code has already been committed but this commit wires it into main. Also remove the info Perl code and tests since they are no longer called.	2019-01-21 13:51:45 +02:00
David Steele	db24ff8df4	v2.08: Minor Improvements and Bug Fixes Bug Fixes: * Remove request for S3 object info directly after putting it. (Reported by Matt Kunkel.) * Correct archive-get-queue-max to be size type. (Reported by Ronan Dunklau.) * Add error message when current user uid/gid does not map to a name. (Reported by Camilo Aguilar.) * Error when --target-action=shutdown specified for PostgreSQL < 9.5. Improvements: * Set TCP keepalives on S3 connections. (Suggested by Ronan Dunklau.) * Reorder info command text output so most recent backup is output last. (Contributed by Cynthia Shang. Suggested by Ryan Lambert.) * Change file ownership only when required. * Redact authentication header when throwing S3 errors. (Suggested by Brad Nicholson.)	2019-01-02 22:04:47 +02:00
Cynthia Shang	35bbb5bd68	Reorder info command text output so most recent backup is output last. After a stanza-upgrade backups for the old cluster are displayed until they expire. Cluster info was output newest to oldest which meant after an upgrade the most recent backup would no longer be output last. Update the text output ordering so the most recent backup is always output last. Contributed by Cynthia Shang. Suggested by Ryan Lambert.	2018-12-14 18:25:31 -05:00
Cynthia Shang	cbf514e191	Improve info error messages introduced in `74b72df9`. - Add detail to errors when info files are loaded with incorrect encryption settings. - Throw FileMissingError rather than FileOpenError when both copies of the info file are missing. - If one file is present (but errors) and the other is missing, then return the error for the file that was present. Contributed by Cynthia Shang.	2018-12-10 16:32:41 -05:00
David Steele	e73416e9e3	Change file ownership only when required. Previously chown() would be called even when no ownership changes were required. In most cases changes are not required and it seems better to perform an extra stat() rather than an extra chown(). Also add unit tests for owner() since there weren't any.	2018-12-05 17:56:47 -05:00
David Steele	cc6447356e	Fix test binary name for gprof. This got missed in `1f8931f7` when the test binary was renamed. Also output call graph along with the flat report. The flat report is generally most useful but it doesn't hurt to have both.	2018-12-05 09:15:45 -05:00
David Steele	74b72df9db	Improve error message when info files are missing/corrupt. The previous error message only showed the last error. In addition, some errors were missed (such as directory permission errors) that could prevent the copy from being checked. Show both errors below a generic "unable to load" error. Details are now given explaining exactly why the primary and copy failed. Previously if one file could not be loaded a warning would be output. This has been removed because it is not clear what the user should do in this case. Should they do a stanza-create --force? Maybe the best idea is to automatically repair the corrupt file, but on the other hand that might just spread corruption if pgBackRest makes the wrong choice.	2018-11-28 18:41:21 -05:00
David Steele	7c2fcb63e4	Enable encryption for archive-get command in C. The decryption filter was added in archiveGetFile() and archiveGetCheck() was modified to return the WAL decryption key stored in archive.info. The rest was plumbing. The mock/archive/1 integration test added encryption to provide coverage for the new code paths while mock/archive/2 dropped encryption to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-28 14:56:26 -05:00
David Steele	56ce98b2f0	Explicitly compile with Posix 2001 standard. This standard was being selectively applied in modules that needed it. Instead, apply the standard to all compilation for consistency.	2018-11-25 10:06:31 -05:00
David Steele	315aa2c451	Conditional compilation of Perl logic in exit.c. This file is the only one to contain Perl logic outside of the perl module. Make the Perl logic conditional to improve reusability.	2018-11-25 08:39:41 -05:00
David Steele	78fe642eae	Remove extraneous use/include statements. Use conditional loading to make docs work in the absence of LibC. Somehow this also required a use statement to be added. Perl, go figure.	2018-11-24 20:31:35 -05:00
David Steele	801e2a5a2c	Rename PGBACKREST/BACKREST constants to PROJECT. This brings consistency between the C and Perl constants and allows for easier code reuse.	2018-11-24 19:05:03 -05:00
David Steele	beae375330	Enable S3 storage for archive-get command in C. The only change required was to remove the filter that prevented S3 storage from being used. The archive-get command did not require any modification which demonstrates that the storage interface is working as intended. The mock/archive/3 integration test was modified to run S3 storage locally to provide coverage for the new code paths while mock/stanza/3 was modified to run S3 storage remotely to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-23 12:18:07 -05:00
David Steele	ac426bc456	New test containers with static test certificates. Test certificates were generated dynamically but there are advantages to using static certificates. For example, it possible to use the same certificate between container versions. Mostly, it is easier to document the certificates if they are not buried deep in the container code. The new test certificates are initially intended to be used with the C unit tests but they will eventually be used for integration tests as well. Two new certificates have been defined. See test/certificate/README.md for details. The old dynamic certificates will be retained until they are replaced.	2018-11-21 18:13:37 -05:00
David Steele	bc25db5667	Add interface objects for libxml2. Add XmlDocument, XmlNode, and XmlNodeList objects as a thin interface layer on libxml2. This interface is not intended to be comprehensive. Only a few libxml2 capabilities are exposed but more can be added as needed.	2018-11-20 20:40:11 -05:00
David Steele	f743d4e924	Add testRepoPath() to let C unit tests know where the code repository is located. This allows a C unit test to access data in the code repository that might be useful for testing. Add testRepoPathSet() to set the repository path. In passing remove extra whitespace in the TEST_RESULT_VOID() macro.	2018-11-20 15:48:56 -05:00
David Steele	8c7e97a369	Clarify comment about main.c being excluded from unit testing. Also remove !!! which by convention we use as a marker for code that needs attention before it can be committed to master.	2018-11-14 08:08:26 -05:00
David Steele	acb579c469	Tighten limits on code coverage context selection. If the last } of a function was marked as uncovered then the context selection would overrun into the next function. Start checking context on the current line to prevent this. Make the same change for start context even though it doesn't seem to have an issue.	2018-11-13 10:37:58 -05:00
David Steele	7107cc68d2	Expand context shown in coverage and update colors. Too few lines were shown for coverage context so show the entire function if it has any missing coverage. Update colors to work with light and dark browser modes.	2018-11-12 18:11:16 -05:00
David Steele	22ecbc153a	New, concise coverage report for C. The report HTML generated by lcov is overly verbose and cumbersome to navigate. Since we maintain 100% coverage it's far more interesting to look at what is not covered than what is. The new report presents all missing coverage on a single page and excludes code that is covered for brevity.	2018-11-11 17:32:42 -05:00
David Steele	3e695af961	New test containers. * Add libxml2 library needed for S3 development. * Minor version updates for PostgreSQL. * Remove PostgreSQL 11 beta/rc repository.	2018-11-08 21:41:41 -05:00
David Steele	8efa5e6a6a	Rename CipherError to CryptoError. This aligns with the general renaming from cipher to crypto.	2018-11-06 19:38:38 -05:00
David Steele	57d7809297	Improve efficiency of code generation. Code generation saved files even when they had not changed, which often caused code generation cascades. So, don't save files unless they have changed. Use rsync to determine which files have changed since the last test run. The manifest of changed files is saved and not removed until all code generation and builds have completed. If an error occurs the work will be redone on the next run. The eventual goal is to do all the builds from the test/repo directory created by rsync but for now it is only used to track changes.	2018-11-03 19:52:46 -04:00
David Steele	1f8931f732	Improve single test run performance. Improve on `7794ab50` by including the build flag files directly into the Makefile as dependencies (even though they are not includes). This simplifies some of the rsync logic and allows make to do what it does best. Also split build flag files into test, harness, and build to reduce rebuilds. Test flags are used to build test.c, harness flags are used to build the rest of the files in the test harness, and build flags are used for the files that are not directly involved in testing.	2018-11-03 16:34:04 -04:00
David Steele	7794ab50dc	Preserve contents of C unit test build directory between test.pl executions. The contents were already preserved between tests in a single test.pl run but for a separate execution the entire project had to be built from scratch, which was getting slower as we added code. Save the important build flags in a file so the new execution knows whether the build contents can be reused.	2018-11-02 11:56:13 -04:00
Cynthia Shang	34c63276cd	Automatically enable backup checksum delta when anomalies (e.g. timeline switch) are detected. There are a number of cases where a checksum delta is more appropriate than the default time-based delta: * Timeline has switched since the prior backup * File timestamp is older than recorded in the prior backup * File size changed but timestamp did not * File timestamp is in the future compared to the start of the backup * Online option has changed since the prior backup A practical example is that checksum delta will be enabled after a failover to standby due to the timeline switch. In this case, timestamps can't be trusted and our recommendation has been to run a full backup, which can impact the retention schedule and requires manual intervention. Now, a checksum delta will be performed if the backup type is incr/diff. This means more CPU will be used during the backup but the backup size will be smaller and the retention schedule will not be impacted. Contributed by Cynthia Shang.	2018-11-01 11:31:25 -04:00
David Steele	cca7a4ffd4	Retry all S3 5xx errors rather than just 500 internal errors. We were already retrying 500 errors but 503 (rate-limiting) errors were not being retried and would cause an instant failure which aborted the command. There are only two 5xx errors currently implemented by S3 but instead of adding 503 simply retry all 5xx errors. This is consistent with the http definition of this error class, "the server failed to fulfill an apparently valid request." Suggested by Craig A. James.	2018-10-30 16:45:42 -04:00
David Steele	286f7e5011	Fix static WAL segment size used to determine if archive-push-queue-max has been exceeded. This calculation was missed when the WAL segment size was made dynamic in preparation for PostgreSQL 11. Fix the calculation by checking the actual WAL file sizes instead of using an estimate based on WAL segment size. This is more accurate because it takes into account .history and .backup files, which are smaller. Since the calculation is done in the async process the additional processing time should not adversely affect performance. Remove the PG_WAL_SIZE constant and instead use local constants where the old value is still required. This is only the case for some tests and PostgreSQL 8.3 which does not provide a way to get the WAL segment size from pg_control.	2018-10-27 20:00:00 +01:00

1 2 3 4 5 ...

430 Commits