pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	02aa03d1a2	Remove obsolete methods in pgBackRest::Storage::Storage module. All the methods in this module will need to be implemented via the command-line in order to get rid of LibC, so the first step is to reduce the code in the module as much as possible. First remove storageDb() and use storageTest() instead. Then create storageTest() using pgBackRestTest::Common::Storage which has no dependencies on LibC. Now the only storage using the LibC interface is storageRepo(). Remove all link functions since those operations cannot be performed on a repo unless it is Posix, in which case the LibC interface is not needed. Same for owner(). Remove pathSync() because syncs are not required in the tests. No test data is reused after a crash. Path create/exists functions should never be explicitly performed on a repo so remove those. File exists can be implemented by calling info() instead. Remove encryption detection functions which were only used by Backup/Archive::Info reconstruct() which are now obsolete. Remove all filters except pgBackRest::Storage::Filter::CipherBlock since they are not being used. That also means there are no filters returning results so remove all the result code. Move hashSize() and pathAbsolute() into pgBackRest::Storage::Base where they can be shared between pgBackRest::Storage::Storage and pgBackRestTest::Common::Storage.	2020-03-06 14:10:09 -05:00
David Steele	00647c7109	Remove Perl Db module and LibC dependencies. This was mostly dead code except the DB_BACKUP_ADVISORY_LOCK constant, moved to the real/all test module, and the function that pulls info from pg_control, moved to ExpireEnvTest.pm.	2020-03-06 07:21:17 -05:00
David Steele	2e0fe25650	Remove dependency on LibC hash filter. Perl provides Digest::SHA for hashing so there is no need to expose this via LibC anymore.	2020-03-05 18:34:59 -05:00
David Steele	eb4347f20b	Use static checksums in mock/all integration tests. Using static values serves as a better cross-check against the page checksum code. The downside is that these checksums may not work with some big endian systems but in that case neither will the unit tests. We can also remove the page checksum interface from LibC which brings us one step closer to eliminating it.	2020-03-05 13:56:20 -05:00
David Steele	4ab8943ca8	Use PG_PAGE_SIZE_DEFAULT constant instead of pageSize variable. Page size is passed around a lot but in fact it can only have one value, PG_PAGE_SIZE_DEFAULT, which is checked when pg_control is loaded. There may be an argument for supporting multiple page sizes in the future but for now just use the constant to simplify the code. There is also a significant performance benefit. Because pageSize was being used in pageChecksumBlock() the main loop was neither unrolled nor vectorized (-funroll-loops -ftree-vectorize) as it is now with a constant loop boundary.	2020-03-05 09:14:27 -05:00
David Steele	91f321fb86	Rename old page*() functions to conform to new conventions. The general convention now is to prefix PostgreSQL functions with "pg".	2020-03-04 14:24:40 -05:00
Cynthia Shang	ceb050e950	Fix flapping test in real/all module. The restore test function was passing strBackup to the restoreCompare function but when the restore is expected to pick a backup based on a timestamp, then strBackup may not be the one chosen. Modified the code so that strBackupExpected is set based on the parameters passed to the function and this is then passed to restoreCompare.	2020-02-28 14:50:50 -05:00
David Steele	e2c304d473	Prevent defunct processes in asynchronous archive commands. The main improvement is a double-fork to prevent zombie processes if the parent process exits after the (child) async process. This is a real possibility since the parent process sticks around to monitor the results of the async process. In the first fork, ignore SIGCHLD in the very unlikely case that the async process exits before the first fork. This is probably only possible if the async process exits immediately, perhaps due to a chdir() failure. Set SIGCHLD back to default in the async process so waitpid() will work as expected. Also update the comment on chdir() to more accurately reflect what is happening. Finally, add a test in certain debug builds to ensure the first fork exits very quickly. This only works when valgrind is not in use because valgrind makes forking so slow that it is hard to tell if the async process performed work or not (in the case that the second fork goes missing and the async process is a direct child).	2020-02-12 12:17:23 -07:00
Cynthia Shang	856980ae99	Auto-select backup set on restore when time target is specified. Auto-selection is performed only when --set is not specified. If a backup set for the given target time cannot be found, the latest (default) backup set will be used. Currently a limited number of date formats are recognized and timezone names are not allowed, only timezone offsets.	2020-01-30 14:38:05 -07:00
David Steele	90abc3cf17	Use pkg-config instead of xml2-config for libxml2 build options. pkg-config is a generic way to get build options rather than relying on a package-specific utility. XML2_CONFIG can be used to override this utility for systems that do not ship pkg-config.	2020-01-24 10:08:05 -07:00
David Steele	94842ccece	Fix comment.	2020-01-21 11:59:25 -07:00
David Steele	03d434c7e1	Remove RHEL package patch now that it has been merged upstream. Also revert `731ffcfb` and update ContainerTest.pm for upstream changes.	2020-01-21 11:57:59 -07:00
David Steele	e81629b442	Reclassify Perl and LibC code as test/harness. These were still being included in the core totals but they are no longer used by core.	2020-01-15 13:53:30 -07:00
David Steele	7a1871c341	Fix test log message to match pg-version parameter name. It was confusing that this part of the log message did not match the parameter name, which made reproducing test failures from CI a little harder.	2020-01-08 09:54:44 -07:00
David Steele	33e328abbf	Remove unused LibC code. The code was made obsolete by the migration to C.	2019-12-28 18:30:32 -07:00
David Steele	74c3842595	Remove errant tabs and fix spacing.	2019-12-19 16:25:46 -05:00
David Steele	620386f034	Remove integration tests that are now covered in the unit tests. Most of these tests are just checking that errors are thrown when required. These are well covered in various unit tests. The "cannot resume" tests are also well covered in the backup unit tests. Finally, config warnings are well covered in the config unit tests. There is more to be done here, but this accounts for the low-hanging fruit.	2019-12-17 20:14:45 -05:00
David Steele	977ec2e307	Integration test improvements for disk and memory efficiency. Set log-level-file=off when more than one test will run. In this case it is impossible to see the logs anyway since they will be automatically cleaned up after the test. This improves performance pretty dramatically since trace-level logging is expensive. If a single integration test is run then log-level-file is trace by default but can be changed with the --log-level-test-file option. Reduce buffer-size to 64k to save memory during testing and allow more processes to run in parallel. Update log replacement rules so that these options can change without affecting expect logs.	2019-12-17 15:23:07 -05:00
David Steele	f0ef73db70	pgBackRest is now pure C. Remove embedded Perl from the distributed binary. This includes code, configure, Makefile, and packages. The distributed binary is now pure C. Remove storagePathEnforceSet() from the C Storage object which allowed Perl to write outside of the storage base directory. Update mock/all and real/all integration tests to use storageLocal() where they were violating this rule. Remove "c" option that allowed the remote to tell if it was being called from C or Perl. Code to convert options to JSON for passing to Perl (perl/config.c) has been moved to LibC since it is still required for Perl integration tests. Update build and installation instructions in the user guide. Remove all Perl unit tests. Remove obsolete Perl code. In particular this included all the Perl protocol code which required modifications to the Perl storage, manifest, and db objects that are still required for integration testing but only run locally. Any remaining Perl code is required for testing, documentation, or code generation. Rename perlReq to binReq in define.yaml to indicate that the binary is required for a test. This had been the actual meaning for quite some time but the key was never renamed.	2019-12-13 17:55:41 -05:00
David Steele	1f2ce45e6b	The backup command is implemented entirely in C. For the most part this is a direct migration of the Perl code into C except as noted below. A backup can now be initiated from a linked directory. The link will not be stored in the manifest or recreated on restore. If a link or directory does not already exist in the restore location then a directory will be created. The logic for creating backup labels has been improved and it should no longer be possible to get a backup label earlier than the latest backup even with timezone changes or clock skew. This has never been an issue in the field that we know of, but we found it in testing. For online backups all times are fetched from the PostgreSQL primary host (before only copy start was). This doesn't affect backup integrity but it does prevent clock skew between hosts affecting backup duration reporting. Archive copy now works as expected when the archive and backup have different compression settings, i.e. when one is compressed and the other is not. This was a long-standing bug in the Perl code. Resume will now work even if hardlink settings have been changed. Reviewed by Cynthia Shang.	2019-12-13 17:14:26 -05:00
David Steele	b031dbbcf8	Allow timezones to be explicitly set for testing. The TZ environment variable was not reliably pushed down to the test processes. Instead pass TZ via a command line parameter and set explicitly in the test process.	2019-12-11 22:11:04 -05:00
David Steele	d0ba8ff58c	Remove test point infrastructure. `82df7e6f` and `9856fef5` updated tests that used test points in preparation for the feature not being available in the C code. Since test points are no longer used, remove the infrastructure. Also remove one stray --test option in mock/all that was essentially a noop but no longer works now that the option has been removed.	2019-12-10 13:16:47 -05:00
David Steele	d7d663c2b9	Make buildPutDiffers() work with empty files. If the file was empty the timestamp was updated. If the file is empty and there is no content then the file should not be saved.	2019-12-10 13:02:36 -05:00
David Steele	e632c60525	Fix backup labels in mock/all resume integration tests. These were not getting updated to match the directory name when the manifests were copied. The Perl code didn't care but the C code expects labels to be set correctly.	2019-12-06 11:48:41 -05:00
David Steele	8dfe0e48e2	Use more general error code when tablespace linked into PGDATA. The specific error code was not that useful since we also test the error message which contains details of the link error.	2019-12-02 10:49:25 -05:00
David Steele	fc291b6f28	Reduce the scope of mock/all exclusion tests. Run exclusions only on the tests where they will have an effect to reduce churn in the expect logs when they change.	2019-12-01 17:47:47 -05:00
David Steele	686b6f91da	Set archive-check option in manifest correctly when offline. Archive check does not run when in offline backup mode but the option was set to true in the manifest. It's harmless since these options are informational only but it could cause confusion when debugging.	2019-11-28 08:27:21 -05:00
David Steele	158e439689	Remove obsolete Perl archive code. This should have been removed in `a1c13a50` but was missed.	2019-11-26 17:16:45 -05:00
David Steele	82df7e6f3b	Update integration tests in real/all that use test points. Test points are not supported by the new C code so these will be replaced with unit tests. The fact that the tests still pass even when the changes aren't made mid-backup (except application_name) shows how weak they were in the first place. Even so, this does represent a regression in (soon to be removed) Perl coverage.	2019-11-26 11:32:12 -05:00
David Steele	8800f32ad9	Remove exclusions once they have been tested in mock/all. The exclusions no longer have any effect after a restore and just add noise to the expect log.	2019-11-25 08:35:26 -05:00
David Steele	9856fef586	Update integration tests in mock/all that use test points. Test points will not be available in the C code so update these tests as best as possible without using them. This represents a loss of coverage for the Perl code (soon to be removed) which will be made up in the C code with unit tests.	2019-11-25 07:48:52 -05:00
David Steele	3cd45a7411	Remove start/stop --force integration tests in mock/all. These tests require test points which are not being implemented in the C code. This functionality is fully tested in the command/control unit tests so integration tests are no longer required.	2019-11-25 07:45:58 -05:00
David Steele	01aefc563d	Update Perl page checksum expression. This expression determines which files contain page checksums but it was also including the directory above the relation directories. In a real PostgreSQL installation this is not a problem because these directories don't contain any files. However, our tests place a file in `base` which the Perl code thought should have page checksums while the new C code says no. Update the expression to document the change and avoid churn in the expect logs later.	2019-11-25 07:37:09 -05:00
David Steele	a4b9440d35	Only install specific lcov version when required. Installing lcov 1.14 everywhere turned out to be a problem just as using 1.13 on Ubuntu 19.04 was. Since we primarily use Ubuntu 18.04 for coverage testing and reporting, we definitely want to make sure that works. So, revert to using the default packaged lcov except when specified otherwise in VmTest.pm. PostgreSQL minor version releases are also included since all containers have been rebuilt.	2019-11-22 19:25:49 -05:00
David Steele	c524ec4f95	Remove obsolete integration tests from mock/all. The protocol timeout tests have been superseded by unit tests. The TEST_BACKUP_RESUME test point was incorrectly included into a number of tests, probably a copy pasto. It didn't hurt anything but it did add 200ms to each test where it appeared. Catalog and control version tests were redundant. The database version and system id tests covered the important code paths and the C code gets these values from a lookup table. Finally, fix an incomplete update to the backup.info file while munging for tests.	2019-11-21 16:06:27 -05:00
David Steele	2d10293d04	v2.19: C Migrations and Bug Fixes Bug Fixes: * Fix remote timeout in delta restore. When performing a delta restore on a largely unchanged cluster the remote could timeout if no files were fetched from the repository within protocol-timeout. Add keep-alives to prevent remote timeout. (Reported by James Sewell, Jens Wilke.) * Fix handling of repeated HTTP headers. When HTTP headers are repeated they should be considered equivalent to a single comma-separated header rather than generating an error, which was the prior behavior. (Reported by donicrosby.) Improvements: * JSON output from the info command is no longer pretty-printed. Monitoring systems can more easily ingest the JSON without linefeeds. External tools such as jq can be used to pretty-print if desired. (Contributed by Cynthia Shang.) * The check command is implemented entirely in C. (Contributed by Cynthia Shang.) Documentation Improvements: * Document how to contribute to pgBackRest. (Contributed by Cynthia Shang.) * Document maximum version for auto-stop option. (Contributed by Brad Nicholson.) Test Suite Improvements: * Fix container test path being used when --vm=none. (Suggested by Stephen Frost.) * Fix mismatched timezone in expect test. (Suggested by Stephen Frost.) * Don't autogenerate embedded libc code by default. (Suggested by Stephen Frost.)	2019-11-12 15:51:28 -05:00
David Steele	4317178633	Update MinIO to newest release. We had some problems with newer versions so had held off on updating. Those problems appear to have been resolved. In addition, the --compat flag is no longer required. Prior versions of MinIO required all parts of a multi-part upload (except the last) to be of equal size. The --compat flag was introduced to restore the default S3 behavior. Now --compat is only required when ETag is being used for MD5 verification, which we don't do.	2019-11-08 17:56:34 -05:00
David Steele	8b682b75d2	Allow mock integration tests for all VM types. Previously the mock integration tests would be skipped for VMs other than the standard four used in CI. Now VMs outside the standard four will run the same tests as VM4 (currently U18).	2019-11-02 10:35:48 +01:00
David Steele	e06db21e35	Error when specified vm is invalid.	2019-10-17 14:00:18 +02:00
David Steele	fa6a54bb45	Update last tests that required sudo. All tests should now run in a sudo-less environment.	2019-10-16 17:05:24 +02:00
David Steele	48bd9e22f1	C test harness refactor. Consolidate setting configuration into hrnInit() and rename other functions for consistency. Split out internal functions into a new header.	2019-10-16 15:48:33 +02:00
David Steele	a2fa1d04b0	Update container images to PostgreSQL 12 GA.	2019-10-12 11:26:13 -04:00
David Steele	397a41e0f9	Add Ubuntu 19.04 container definition.	2019-10-12 11:24:55 -04:00
David Steele	93656db186	Update lcov to 1.14. 1.13 is not compatible with gcc 8 which is what ships with newer distributions. Build from source to get a more recent version. 1.14 is not compatible with gcc 9 so we'll need to address that at a later date.	2019-10-12 11:24:21 -04:00
David Steele	11c7c8fabb	Remove pgbackrest test user. This user was created before we tested in containers to ensure isolation between the pg and repo hosts which were then just directories. The downside is that this resulted in a lot of sudos to set the pgbackrest user and to remove files which did not belong to the main test user. Containers provide isolation without needing separate users so we can now safely remove the pgbackrest user. This allows us to remove most sudos, except where they are explicitly needed in tests. While we're at it, remove the code that installed the Perl C library (which also required sudo) and simply add the build path to @INC instead.	2019-10-12 09:45:18 -04:00
David Steele	6f0e7f00af	Fix recovery test failing in PostgreSQL 12.0. This test was not creating recovery.signal when testing with --type=preserve. The preserve recovery type only keeps existing files and does not create any. RC1 was just ignoring recovery.signal and going right into recovery. Weirdly, 12.0 used restore_command to do crash recovery which made the problem harder to diagnose, but this has now been fixed in PostgreSQL and should be released in 12.1.	2019-10-12 09:26:19 -04:00
Cynthia Shang	2972580566	Remove info expect tests from mock/all and mock/stanza. These tests are redundant now that we have full coverage in the unit tests and are not worth maintaining anymore.	2019-10-11 12:38:03 -04:00
David Steele	6db4e59a66	Allow tests that use ports to run in parallel. Set the test index in the C unit test code so it can assign port numbers that won't conflict between tests.	2019-10-10 16:13:43 -04:00
David Steele	13fcbb24e9	Fix container test path being used when --vm=none. Suggested by Stephen Frost.	2019-10-10 15:09:11 -04:00
David Steele	9a3ba649e1	Remove code to generate .travis.yml. Most of the logic has been moved to test/travis.pl so there wasn't much purpose to this code anymore.	2019-10-10 11:25:59 -04:00
David Steele	7f369006b5	Add gcc 9 support. A number of tests have been updated and Fedora 30 has been added to the test suite so the unit tests can run on gcc 9. Stop running unit tests on co6/7 since we appear to have ample unit test coverage.	2019-10-09 15:03:03 -04:00
David Steele	528f4c4347	Remove dependency on aws cli for testing. This tool was only being used in a few places but was a pretty large dependency. Rework the forceStorageMove() code using our storage layer and replace one aws cli cp with a storage put. Also, remove the Dockerfile that was once used to build the Scality S3 test container.	2019-10-09 14:38:24 -04:00
David Steele	61c4f64895	Be smarter about which packages are loaded for testing. Now that our tests are more diversified it makes sense to load only the packages that are needed for each test. Move the package loads from .travis.yaml to test/travis.pl where we have more control over what is loaded.	2019-10-08 18:56:55 -04:00
Cynthia Shang	a1c13a50dd	The check command is implemented entirely in C. Note that building the manifest on each host has been temporarily removed. This feature will likely be brought back as a non-default option (after the manifest code has been fully migrated to C) since it can be fairly expensive.	2019-10-08 18:04:09 -04:00
David Steele	45881c74ae	Allow most unit tests to run outside of a container. Three major changes were required to get this working: 1) Provide the path to pgbackrest in the build directory when running outside a container. Tests in a container will continue to install and run against /usr/bin/pgbackrest. 2) Set a per-test lock path so tests don't conflict on the default /tmp/pgbackrest path. Also set a per-test log-path while we are at it. 3) Use localhost instead of a custom host for TLS test connections. Tests in containers will continue to update /etc/hosts and use the custom host. Add infrastructure and update harnessCfgLoad*() to get the correct exe and paths loaded for testing. Since new tests are required to verify that running outside a container works, also rework the tests in Travis CI to provide coverage within a reasonable amount of time. Mainly, break up the doc tests by VM and run an abbreviated unit test suite on co6 and co7.	2019-10-08 12:06:30 -04:00
David Steele	29e132f5e9	PostgreSQL 12 support. Recovery settings are now written into postgresql.auto.conf instead of recovery.conf. Existing recovery_target* settings will be commented out to help avoid conflicts. A comment is added before recovery settings to identify them as written by pgBackRest since it is unclear how, in general, old settings will be removed. recovery.signal and standby.signal are automatically created based on the recovery settings.	2019-10-01 13:20:43 -04:00
David Steele	f1ba428fb0	Add performance test capability in C with scaling. Scaling allows the starting values to be increased from the command-line without code changes. Also suppress valgrind and assertions when running performance testing. Optimization is left at -O0 because we should not be depending on compiler optimizations to make our code performant, and it makes profiling more informative.	2019-09-28 14:02:12 -04:00
David Steele	004ff99a2d	Identify Perl performance test by appending -perl. This is intended to differentiate the upcoming C performance tests from the Perl performance tests that will eventually be migrated.	2019-09-28 13:17:21 -04:00
David Steele	afc483ef86	Clarify which timeline should be used for timeline integration test.	2019-09-27 13:37:59 -04:00
David Steele	d82102d6ef	Add explicit promotes to recovery integration tests. PostgreSQL 12 will shutdown in these cases which seems to be the correct action (according to the documentation) when hot_standby = off, but older versions are promoting instead. Set target_action explicitly so all versions will behave the same way. This does beg the question of whether the PostgreSQL 12 behavior is wrong (though it matches the docs) or the previous versions are.	2019-09-27 13:04:36 -04:00
David Steele	833d0da0d9	Store recovery file name in integration when testing preserve recovery. This makes the test a little more maintainable and is friendly with the changes needed for PostgreSQL 12.	2019-09-27 12:29:33 -04:00
David Steele	80eb561caf	Add missing PostgreSQL 11 control/WAL versions in Perl tests. These values don't seem to be used for testing but better to be tidy.	2019-09-27 09:45:11 -04:00
David Steele	d6a6d93a04	Add PostgreSQL 12 to u18 container. This does not add PostgreSQL 12 support; it simply adds PostgreSQL 12 to the u18 container for development and testing.	2019-09-27 09:35:59 -04:00
David Steele	c41fb575fb	Add standby restore type. This restore type automatically adds standby_mode=on to recovery.conf. This could be accomplished previously by setting --recovery-option=standby_mode=on but PostgreSQL 12 requires standby mode to be enabled by a special file named standby.signal. The new restore type allows us to maintain a common interface between PostgreSQL versions.	2019-09-26 17:39:45 -04:00
David Steele	451ae397be	The restore command is implemented entirely in C. For the most part this is a direct migration of the Perl code into C. There is one important behavioral change with regard to how file permissions are handled. The Perl code tried to set ownership as it was in the manifest even when running as an unprivileged user. This usually just led to errors and frustration. The C code works like this: If a restore is run as a non-root user (the typical scenario) then all files restored will belong to the user/group executing pgBackRest. If existing files are not owned by the executing user/group then an error will result if the ownership cannot be updated to the executing user/group. In that case the file ownership will need to be updated by a privileged user before the restore can be retried. If a restore is run as the root user then pgBackRest will attempt to recreate the ownership recorded in the manifest when the backup was made. Only user/group names are stored in the manifest so the same names must exist on the restore host for this to work. If the user/group name cannot be found locally then the user/group of the PostgreSQL data directory will be used and finally root if the data directory user/group cannot be mapped to a name. Reviewed by Cynthia Shang.	2019-09-26 07:52:02 -04:00
David Steele	e968acbdd7	Fix outdated comment. This was probably missed when a new test was added and the timeline was updated.	2019-09-24 16:55:11 -04:00
David Steele	d3a7055ee5	Only enable test.pl --debug-test-trace option when --debug also enabled. The other way makes no sense and leads to compile errors since --debug-test-trace requires some code that is only enabled by --debug.	2019-09-23 15:15:04 -04:00
Cynthia Shang	56bf9d0566	Update HINT messages to conform to new standard detailed in CODING.md.	2019-09-14 12:21:08 -04:00
David Steele	92365fb801	Disable missing-field-initializers warnings in unit testing. This warning gives very unpredictable results between compiler versions and seems unrealistic since most of our structs are zeroed for initialization. This warning has been disabled in the Makefile for a long time.	2019-09-12 15:55:18 -04:00
David Steele	f809d2f008	Ignore apt-get update errors in Travis CI. Broken vendor packages have been causing builds to break due to an error on apt-get update. Ignore errors and proceed directly to apt-get install. It's possible that we'll try to reference an expired package version and get an error anyway, but that seems better than a guaranteed hard error.	2019-09-12 15:16:42 -04:00
David Steele	dca5b63f97	Move documentation job first for Travis CI. Since this job has been running long recently this should improve overall performance when multiple commits are queued up.	2019-09-10 13:06:44 -04:00
David Steele	f8d0574759	Increase process timeout and emit occasional warnings. Travis will timeout after 10 minutes with no output. Emit a warning every 5 minutes to keep Travis alive and increase the total timeout to 20 minutes. Documentation builds have been timing out a lot recently so hopefully this will help.	2019-09-10 12:29:36 -04:00
David Steele	ce2bf29998	v2.17: C Migrations and Bug Fixes Bug Fixes: * Improve slow manifest build for very large quantities of tables/segments. (Reported by Jens Wilke.) * Fix exclusions for special files. (Reported by CluelessTechnologist, Janis Puris, Rachid Broum.) Improvements: * The stanza-create/update/delete commands are implemented entirely in C. (Contributed by Cynthia Shang.) * The start/stop commands are implemented entirely in C. (Contributed by Cynthia Shang.) * Create log directories/files with 0750/0640 mode. (Suggested by Damiano Albani.) Documentation Bug Fixes: * Fix yum.p.o package being installed when custom package specified. (Reported by Joe Ayers, John Harvey.) Documentation Improvements: * Build pgBackRest as an unprivileged user. (Suggested by Laurenz Albe.)	2019-09-03 16:39:32 -04:00
David Steele	3a28b68b8b	Disable S3 and encryption on u18 integration tests for mock/all/1. This test is commonly used for sanity checking but the combination of S3 and encryption makes it hard to use and encourages temporary changes to make it usable. Acknowledge this and disable S3 and encryption for this test and move them to mock/all/2.	2019-09-02 19:06:12 -04:00
Josh Soref	c2771e5469	Fix comment typos. This includes some variable names in tests which don't seem important enough for their own commits. Contributed by Josh Soref.	2019-08-26 12:05:36 -04:00
David Steele	22aa532be1	Add storage tests for files beginning with dots. Prevent a regression of the issue fixed in `f88012ce` by adding some tests.	2019-08-26 11:37:21 -04:00
David Steele	01c2669b97	Fix exclusions for special files. Prior to 2.16 the Perl manifest code would skip any file that began with a dot. This was not intentional but it allowed PostgreSQL socket files to be located in the data directory. The new C code in 2.16 did not have this unintentional exclusion so socket files in the data directory caused errors. Worse, the file type error was being thrown before the exclusion check so there was really no way around the issue except to move the socket files out of the data directory. Special file types (e.g. socket, pipe) will now be automatically skipped and a warning logged to notify the user of the exclusion. The warning can be suppressed with an explicit --exclude. Reported by CluelessTechnologist, Janis Puris, Rachid Broum.	2019-08-23 07:47:54 -04:00
David Steele	2862f480cd	Add special file type to storageInfo(). There's not much we can do with special files, but higher level logic can at least exclude them gracefully rather than throwing a hard error.	2019-08-23 07:24:25 -04:00
David Steele	f88012cef3	Fix regexp to ignore ./.. directories in the Posix driver. In versions <= 2.15 the old regexp caused any file or directory beginning with . to be ignored during a backup. This has caused behavioral differences in 2.16 because the new C code correctly excludes ./.. directories. This Perl code is only used for testing now, but it should still match the output of the C functions.	2019-08-22 10:18:34 -04:00
David Steele	c002a2ce2f	Move info file checksum to the end of the file. Putting the checksum at the beginning of the file made it impossible to stream the file out when saving. The entire file had to be held in memory while it was checksummed so the checksum could be written at the beginning. Instead place the checksum at the end. This does not break the existing Perl or C code since the read is not order dependent. There are no plans to improve the Perl code to take advantage of this change, but it will make the C implementation more efficient. Reviewed by Cynthia Shang.	2019-08-21 19:45:48 -04:00
Cynthia Shang	c733319063	The stanza-create/update/delete commands are implemented entirely in C. Contributed by Cynthia Shang.	2019-08-21 16:26:28 -04:00
David Steele	3df075bf40	Fix test writing "null" into manifest files. "null" is not allowed in the manifest format (null values should be missing instead) but Perl was treating the invalid values written by this test as if they were missing. Update the test code to remove the values rather than setting them to "null".	2019-08-18 15:29:18 -04:00
David Steele	f8b0676fd6	Allow modules to be included for testing without requiring coverage. Sometimes it is useful to get at the internals of a module that is not being tested for coverage in order to provide coverage for another module that is being tested. The include directive allows this. Update modules that had previously been added to coverage that only need to be included.	2019-07-25 20:15:06 -04:00
David Steele	d8ca0e5c5b	Add Perl interface to C PgQuery object. This validates that all current queries work with the new interface and removes the dependency on DBD::Pg.	2019-07-25 17:05:39 -04:00
David Steele	415542b4a3	Add PostgreSQL query client. This direct interface to libpq allows simple queries to be run against PostgreSQL and supports timeouts. Testing is performed using a shim that can use scripted responses to test all aspects of the client code. The shim will be very useful for testing backup scenarios on complex topologies. Reviewed by Cynthia Shang.	2019-07-25 14:50:02 -04:00
David Steele	59f135340d	The local command for backup is implemented entirely in C. The local process is now entirely migrated to C. Since all major I/O operations are performed in the local process, the vast majority of I/O is now performed in C. Contributed by David Steele, Cynthia Shang.	2019-07-25 14:34:16 -04:00
David Steele	e10577d0b0	Fix incorrect offline upper bound for ignoring page checksum errors. For offline backups the upper bound was being set to 0x0000FFFF0000FFFF rather than UINT64_MAX. This meant that page checksum errors might be ignored for databases with a lot of past WAL in offline mode. Online mode is not affected since the upper bound is retrieved from pg_start_backup().	2019-07-11 09:13:56 -04:00
David Steele	27b3246e85	Exclude more build files from rsync between tests. Files (especially build.auto.h) were being removed and forcing a full build between separate invocations of test.pl. This affected ad-hoc testing at the command-line, not a full test run in CI.	2019-07-08 08:29:25 -04:00
David Steele	5e1ed2e8a5	Remove clang static analysis. This analysis never produced anything but false positives (var might be NULL) but took over a minute per test run and added 600MB to the test container.	2019-07-05 18:34:15 -04:00
David Steele	488fb67294	Force PostgreSQL versions to string for newer versions of JSON::PP. Since 2.91 JSON::PP has a bias for saving variables that look like numbers as numbers even if they were declared as strings. Force versions to strings where needed by appending ''. Update the json-pp-perl package on Ubuntu 18.04 to 2.97 to provide test coverage.	2019-07-05 17:25:01 -04:00
David Steele	9836578520	Remove perl critic and coverage. No new Perl code is being developed, so these tools are just taking up time and making migrations to newer platforms harder. There are only a few Perl tests remaining with full coverage so the coverage tool does not warn of loss of coverage in most cases. Remove both tools and associated libraries.	2019-07-05 16:55:17 -04:00
David Steele	1708f1d151	Use minio for integration testing. ScalityS3 has not received any maintenance in years and is slow to start which is bad for testing. Replace it with minio which starts quickly and ships as a single executable or a tiny container. Minio has stricter limits on allowable characters but should still provide enough coverage to show that our encoding is working correctly. This commit also includes the upgrade to openssl 1.1.1 in the Ubuntu 18.04 container.	2019-07-02 22:20:35 -04:00
David Steele	4bffa0c5bb	Add test function to create the S3 bucket instead of using aws cli. Eventually the idea is to remove the dependency on aws cli since Python is a big install.	2019-06-26 15:02:30 -04:00
David Steele	4815752ccc	Add Perl interface to C storage layer. Maintaining the storage layer/drivers in two languages is burdensome. Since the integration tests require the Perl storage layer/drivers we'll need them even after the core code is migrated to C. Create an interface layer so the Perl code can be removed and new storage drivers/features introduced without adding Perl equivalents. The goal is to move the integration tests to C so this interface will eventually be removed. That being the case, the interface was designed for maximum compatibility to ease the transition. The result looks a bit hacky but we'll improve it as needed until it can be retired.	2019-06-26 08:24:58 -04:00
Cynthia Shang	b498188f01	Error on db history mismatch when expiring. Amend commit `434cd832` to error when the db history in archive.info and backup.info do not match. The Perl code would attempt to reconcile the history by matching on system id and version but we are not planning to migrate that code to C. It's possible that there are users with mismatches but if so they should have been getting errors from info for the last six months. It's easy enough to manually fix these files if there are any mismatches in the field. Contributed by Cynthia Shang.	2019-06-24 11:59:44 -04:00
David Steele	434cd83285	The expire command is implemented entirely in C. This implementation duplicates the functionality of the Perl code but does so with different logic and includes full unit tests. Along the way at least one bug was fixed, see issue #748. Contributed by Cynthia Shang.	2019-06-18 15:19:20 -04:00
David Steele	f88bee7b33	TLS/HTTP statistics log replacements. These statistics can change with any code update so they cause a lot of churn in the expect logs.	2019-06-18 10:13:28 -04:00
David Steele	0a96a2895d	Add storage layer for tests and documentation. The tests and documentation have been using the core storage layer but soon that will depend entirely on the C library, creating a bootstrap problem (i.e. the storage layer will be needed to build the C library). Create a simplified Posix storage layer to be used by documentation and the parts of the test code that build and execute the actual tests. The actual tests will still use the core storage driver so they can interact with any type of storage.	2019-06-17 09:16:44 -04:00
David Steele	9ba95e993b	Use retries to wait for test S3 server to start. The prior method of tailing the docker log no longer seems reliable. Instead, keep retrying the make bucket command until it works and show the error if it times out.	2019-06-13 17:58:33 -04:00
David Steele	6ff3325c77	Enforce requiring repo-cipher-pass at config parse time. This was not enforced at parse time because repo1-cipher-type could be passed on the command-line even in cases where encryption was not needed by the subprocess. Filter repo-cipher-type so it is never passed on the command line. If the subprocess does not have access to the passphrase then knowing the encryption type is useless anyway.	2019-06-05 11:43:17 -04:00
David Steele	92e04ea9f4	Remove per-stanza repo cache clear during testing. This was not being used and is not supported by the equivalent C code.	2019-06-04 10:34:19 -04:00
David Steele	12bca3c43e	Add CPPFLAGS to compile rules. This should silence the last of the Debian package warnings.	2019-06-01 09:28:31 -04:00
David Steele	a2ec1253e9	Add code classification exclusion missed in `3e1b06ac`.	2019-05-30 10:44:35 -04:00
David Steele	3e1b06acaa	Use minio as local S3 emulator in documentation. The documentation was relying on a ScalityS3 container built for testing which wasn't very transparent. Instead, use the stock minio container and configure it in the documentation. Also, install certificates and CA so that TLS verification can be enabled.	2019-05-27 07:37:20 -04:00
David Steele	86482c7db9	Reduce log level for all expect tests to detail. The C code is designed to be efficient rather than deterministic at the debug log level. As we move more testing from integration to unit tests it makes less sense to try and maintain the expect logs at this log level. Most of the expect logs have already been moved to detail level but mock/all still had tests at debug level. Change the logging defaults in the config file and remove as many references to log-level-console as possible.	2019-05-22 18:23:44 -04:00
David Steele	e3fe3434b4	Rename repo-s3-verify-ssl option to repo-s3-verify-tls. The new name is preferred because pgBackRest does not support any SSL protocol versions (they are all considered to be insecure). The old name will continue to be accepted.	2019-05-21 10:14:41 -04:00
Cynthia Shang	19d8358cba	Update mock/expire module test matrix so expect tests output. Also add an error message to prevent regression. Contributed by Cynthia Shang.	2019-05-16 09:53:55 -04:00
Cynthia Shang	18d4cb5741	Bypass database checks when stanza-delete issued with force. Previously it was not possible to delete a stanza if the PostgreSQL server could not be contacted. Contributed by Cynthia Shang. Suggested by Roman.	2019-05-15 13:14:58 -04:00
David Steele	5c1d4bcd0d	Automate coverage summary report generation. This report replaces the lcov report that was generated manually for each release. The lcov report was overly verbose just to say that we have virtually 100% coverage.	2019-05-15 13:04:56 -04:00
David Steele	5bba72b874	Remove -Wswitch-enum compiler option. The -Wswitch option included in -Wall provides the same level of coverage and allows enum options to be grouped into default.	2019-05-15 12:55:08 -04:00
David Steele	87f36e814e	Improve macros and coverage rules that were hiding missing coverage. The branch coverage exclusion rules were overly broad and included functions that ended in a capital letter, which disabled all coverage for the statement. Improve matching so that all characters in the name must be upper-case for a match. Some macros with internal branches accepted parameters that might contain conditionals. This made it impossible to tell which branches belonged to which, and in any case an overzealous exclusion rule was ignoring all branches in such cases. Add the DEBUG_COVERAGE flag to build a modified version of the macros without any internal branches to be used for coverage testing. In most cases, the branches were optimizations (like checking logWill()) that improve production performance but are not needed for testing. In other cases, a parameter needed to be added to the underlying function to handle the branch during coverage testing. Also tweak the coverage rules so that macros without conditionals are automatically excluded from branch coverage as long as they are not themselves a parameter. Finally, update tests and code where missing coverage was exposed by these changes. Some code was updated to remove existing coverage exclusions when it was a simple change.	2019-05-11 14:51:51 -04:00
David Steele	cb00030ee3	Remove dead code missed in `1b486847`. This commit removed all Perl references to spool storage but some stuff was left behind.	2019-05-08 18:58:07 -04:00
David Steele	8c712d89eb	Improve type safety of interfaces and drivers. The function pointer casting used when creating drivers made changing interfaces difficult and led to slightly divergent driver implementations. Unit testing caught production-level errors but there were a lot of small issues and the process was harder than it should have been. Use void pointers instead so that no casts are required. Introduce the THIS_VOID and THIS() macros to make dealing with void pointers a little safer. Since we don't want to expose void pointers in header files, driver functions have been removed from the headers and the various driver objects return their interface type. This cuts down on accessor methods and the vast majority of those functions were not being used. Move functions that are still required to .intern.h. Remove the special "C" crypto functions that were used in libc and instead use the standard interface.	2019-05-02 17:52:24 -04:00
David Steele	28359eea83	Update code count rules missed in `027c2638`.	2019-05-02 16:33:23 -04:00
David Steele	027c263871	Add configure script for improved multi-platform support. Use autoconf to provide a basic configure script. WITH_BACKTRACE is yet to be migrated to configure and the unit tests still use a custom Makefile. Each C file must include "build.auto.conf" before all other includes and defines. This is enforced by test.pl for includes, but it won't detect incorrect define ordering. Update packages to call configure and use standard flags to pass options.	2019-04-26 08:08:23 -04:00
David Steele	3505559a80	Update test containers with PostgreSQL minor releases and liblz4. Update RHEL repos that have changed upstream. Remove PostgreSQL 9.3 since the RHEL6/7 packages have disappeared. Remove PostgreSQL versions from U12 that are still getting minor updates so the container does not need to be rebuilt. LZ4 is included for future development, but this seems like a good time to add it to the containers.	2019-04-24 13:23:32 -04:00
David Steele	1ae8a6a716	Add build-max option to set max build processes. Currently this controls make processes via -j.	2019-04-23 20:52:03 -04:00
David Steele	c11c936366	Reduce ScalityS3 processes since only two are needed.	2019-04-23 20:19:31 -04:00
David Steele	41f3874822	v2.13: Bug Fixes Bug Fixes: * Fix zero-length reads causing problems for IO filters that did not expect them. (Reported by brunre01, jwpit, Tomasz Kontusz, guruguruguru.) * Fix reliability of error reporting from local/remote processes. * Fix Posix/CIFS error messages reporting the wrong filename on write/sync/close.	2019-04-18 21:26:02 -04:00
David Steele	3aa521fed0	Fix compile flag accidentally removed in `5ee8388f`.	2019-04-10 13:37:24 -04:00
David Steele	1b48684713	The archive-push command is implemented entirely in C. This new implementation should behave exactly like the old Perl code with the exception of updated log messages. Remove as much of the Perl code as possible without breaking other commands.	2019-03-29 13:26:33 +00:00
David Steele	5ee8388f48	Build test harness with the same warnings as code being tested. The test harness was not being built with warnings which caused some wackiness with an improperly structured switch. Just use the same warnings as the code being tested. Also enable warnings on code that is not directly being tested since other code modules are frequently modified during testing.	2019-03-26 08:20:55 +02:00
David Steele	e26d510d0c	Use restore command for remote performance tests. Since archive-push is being moved to C, the Perl remote will no longer work with that command. Eventually this module will need to be rewritten in C, but for now just use the restore command which is planned to be migrated last.	2019-03-17 22:11:35 +04:00
David Steele	9382283586	Fix issues when a path option is / terminated. This condition was not being properly checked for in the C code and it caused problems in the info command, at the very least. Instead of applying a local fix, introduce a new path option type that will rigorously check the format of any incoming paths. Reported by Marc Cousin.	2019-03-14 13:48:33 +04:00
David Steele	b8ebea6b1c	Add separate archive-push-async command. This command was previously forked off from the archive-push command which required a bit of artificial option and log manipulation. A separate command is easier to test and will work on platforms that don't have fork(), e.g. Windows.	2019-03-14 13:38:55 +04:00
blogh	e4e2606fce	Add additional options to backup.manifest for debugging purposes. Add the buffer-size, compress-level, compress-level-network, and process-max options to the backup:option section in backup.manifest to aid in debugging. It may also make sense to propagate these options up to backup.info so they can be displayed in the info command, but for now this is deemed sufficient. Contributed by blogh.	2019-03-10 11:03:52 +02:00
David Steele	21f56f64eb	Add hints when unable to find a WAL segment in the archive. When this error happens in the context of a backup it can be a bit mystifying as to why the backup is failing. Add some hints to get the user started. These hints will appear any time a WAL segment can't be found, which makes the hint about the check command redundant when the user is actually running the check command, but it doesn't seem worth trying to exclude the hint in that case. Suggested by Hans-Jürgen Schönig.	2019-03-10 10:38:12 +02:00
David Steele	d441061168	Create test matrix for mock/all to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-02 15:01:02 +02:00
David Steele	f7d1d4400f	Create test matrix for mock/expire to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests.	2019-03-01 19:04:26 +02:00
David Steele	91622942c2	Create test matrix for mock/archive-stop to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-01 17:12:41 +02:00
Marc Cousin	cb3b4fa24b	Enable socket keep-alive on older Perl versions. The prior method depended on IO::Socket::SSL to push the keep-alive options down to the socket but it only worked for recent versions of the module. Instead, create the socket directly using IO::Socket::IP if available or IO::Socket::INET as a fallback. The keep-alive option is set directly on the socket before it is passed to IO::Socket::SSL. Contributed by Marc Cousin.	2019-02-28 14:33:29 +02:00
David Steele	db4b447be8	The archive-get command is implemented entirely in C. This new implementation should behave exactly like the old Perl code with the exception of a few updated log messages. Remove as much of the Perl code as possible without breaking other commands.	2019-02-27 23:03:02 +02:00
David Steele	9367cc461c	Migrate local command to C. The C local is only used for C commands in the main process. Some tweaking of the existing protocolGet() command was required. Originally the idea was to share the function for local and remote requests but the differences (as in Perl) were too great to make that practical.	2019-02-27 22:34:21 +02:00
David Steele	18b62a4220	Only run test-level stack trace by default for unit-tested modules. This amends `70c30dfb` which disabled test tracing in general. Instead, only enable test tracing by default for modules that are being unit tested. This saves lots of time but still ensures that test tracing is working and helps with debugging in unit tests. Also rename the option to --debug-test-trace for clarity.	2019-02-27 17:09:19 +02:00
David Steele	3a05359087	Create test matrix for mock/stanza to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-24 07:42:41 +02:00
David Steele	6d3e18b181	Reduce expect log level in mock/stanza tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the stanza code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-24 06:55:59 +02:00
David Steele	2f081f3ec7	Rename test modules for consistency. The conventions for command and info tests have shifted in the C modules, though not even all the C modules got the message.	2019-02-23 18:51:52 +02:00
David Steele	d489eb87f7	Create test matrix for mock/archive to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-23 15:59:39 +02:00
David Steele	4a7588e604	Create aliases for test VMs ordered by age. This will allow for smarter allocation of tests in the next commit.	2019-02-23 15:13:23 +02:00
David Steele	59d7958914	Reduce expect log level in mock/archive tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the archive code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-23 15:05:06 +02:00
David Steele	70c30dfb61	Disable test-level stack trace by default. Detailed stack traces for low-level functions (e.g. strCat, bufMove) can be very useful for debugging but leaving them on for all tests has become quite burdensome in terms of time. Complex operations like generating JSON on a large KeyValue can lead to timeouts even with generous values. Add a new param, --debug-trace, to enable test-level stack trace, but leave it off by default.	2019-02-22 11:40:30 +02:00
David Steele	d211c2b8b5	Fix possible truncated WAL segments when an error occurs mid-write. The file write object destructors called close() and finalized the file even if it was not completely written. This was an issue in both the C and Perl code. Rewrite the destructors to simply free resources (like file handles) rather than calling the close() method. This leaves the temp file in place for filesystems that use temp files. Add unit tests to prevent regression. Reported by blogh.	2019-02-15 11:52:39 +02:00
David Steele	057e2e2782	Add unimplemented S3 driver method required for archive-get. This was not being caught because the integration tests for S3 were running remotely and going through the Perl code rather than the new C code. Implement the exists method for the S3 driver and add tests to prevent a regression. Reported by mibiio.	2019-02-09 18:57:30 +02:00
David Steele	aa3e5b8c72	Allow primary gid for the test user to be different from uid. Apparently up until now they have always been the same, which is pretty typical. However, if they were not then ContainerTest.pm was not happy.	2019-01-30 17:03:17 +02:00
David Steele	8f6d324b2c	Fix issue with multiple async status files causing a hard error. Multiple status files were being created by asynchronous archiving if a high-level error occurred after one or more WAL segments had already been transferred successfully. Error files were being written for every file in the queue regardless of whether it had already succeeded. To fix this, add an option to skip writing error files when an ok file already exists. There are other situations where both files might exist (various fsync and filesystem error scenarios) so it seems best to retry in the case that multiple status files are found rather than throwing a hard error (which then means that archiving is completely stuck). In the case of multiple status files, a warning will be logged to alert the user that something unusual is happening and the command will be retried. Reported by fpa-postgres, Joe Ayers, Douglas J Hunley.	2019-01-26 16:59:54 +02:00
David Steele	d245f8eb42	The info command is implemented entirely in C. The C info code has already been committed but this commit wires it into main. Also remove the info Perl code and tests since they are no longer called.	2019-01-21 13:51:45 +02:00
David Steele	db24ff8df4	v2.08: Minor Improvements and Bug Fixes Bug Fixes: * Remove request for S3 object info directly after putting it. (Reported by Matt Kunkel.) * Correct archive-get-queue-max to be size type. (Reported by Ronan Dunklau.) * Add error message when current user uid/gid does not map to a name. (Reported by Camilo Aguilar.) * Error when --target-action=shutdown specified for PostgreSQL < 9.5. Improvements: * Set TCP keepalives on S3 connections. (Suggested by Ronan Dunklau.) * Reorder info command text output so most recent backup is output last. (Contributed by Cynthia Shang. Suggested by Ryan Lambert.) * Change file ownership only when required. * Redact authentication header when throwing S3 errors. (Suggested by Brad Nicholson.)	2019-01-02 22:04:47 +02:00
Cynthia Shang	35bbb5bd68	Reorder info command text output so most recent backup is output last. After a stanza-upgrade backups for the old cluster are displayed until they expire. Cluster info was output newest to oldest which meant after an upgrade the most recent backup would no longer be output last. Update the text output ordering so the most recent backup is always output last. Contributed by Cynthia Shang. Suggested by Ryan Lambert.	2018-12-14 18:25:31 -05:00
Cynthia Shang	cbf514e191	Improve info error messages introduced in `74b72df9`. - Add detail to errors when info files are loaded with incorrect encryption settings. - Throw FileMissingError rather than FileOpenError when both copies of the info file are missing. - If one file is present (but errors) and the other is missing, then return the error for the file that was present. Contributed by Cynthia Shang.	2018-12-10 16:32:41 -05:00
David Steele	e73416e9e3	Change file ownership only when required. Previously chown() would be called even when no ownership changes were required. In most cases changes are not required and it seems better to perform an extra stat() rather than an extra chown(). Also add unit tests for owner() since there weren't any.	2018-12-05 17:56:47 -05:00
David Steele	cc6447356e	Fix test binary name for gprof. This got missed in `1f8931f7` when the test binary was renamed. Also output call graph along with the flat report. The flat report is generally most useful but it doesn't hurt to have both.	2018-12-05 09:15:45 -05:00
David Steele	74b72df9db	Improve error message when info files are missing/corrupt. The previous error message only showed the last error. In addition, some errors were missed (such as directory permission errors) that could prevent the copy from being checked. Show both errors below a generic "unable to load" error. Details are now given explaining exactly why the primary and copy failed. Previously if one file could not be loaded a warning would be output. This has been removed because it is not clear what the user should do in this case. Should they do a stanza-create --force? Maybe the best idea is to automatically repair the corrupt file, but on the other hand that might just spread corruption if pgBackRest makes the wrong choice.	2018-11-28 18:41:21 -05:00
David Steele	7c2fcb63e4	Enable encryption for archive-get command in C. The decryption filter was added in archiveGetFile() and archiveGetCheck() was modified to return the WAL decryption key stored in archive.info. The rest was plumbing. The mock/archive/1 integration test added encryption to provide coverage for the new code paths while mock/archive/2 dropped encryption to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-28 14:56:26 -05:00
David Steele	56ce98b2f0	Explicitly compile with Posix 2001 standard. This standard was being selectively applied in modules that needed it. Instead, apply the standard to all compilation for consistency.	2018-11-25 10:06:31 -05:00
David Steele	315aa2c451	Conditional compilation of Perl logic in exit.c. This file is the only one to contain Perl logic outside of the perl module. Make the Perl logic conditional to improve reusability.	2018-11-25 08:39:41 -05:00
David Steele	78fe642eae	Remove extraneous use/include statements. Use conditional loading to make docs work in the absence of LibC. Somehow this also required a use statement to be added. Perl, go figure.	2018-11-24 20:31:35 -05:00
David Steele	801e2a5a2c	Rename PGBACKREST/BACKREST constants to PROJECT. This brings consistency between the C and Perl constants and allows for easier code reuse.	2018-11-24 19:05:03 -05:00
David Steele	beae375330	Enable S3 storage for archive-get command in C. The only change required was to remove the filter that prevented S3 storage from being used. The archive-get command did not require any modification which demonstrates that the storage interface is working as intended. The mock/archive/3 integration test was modified to run S3 storage locally to provide coverage for the new code paths while mock/stanza/3 was modified to run S3 storage remotely to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-23 12:18:07 -05:00
David Steele	ac426bc456	New test containers with static test certificates. Test certificates were generated dynamically but there are advantages to using static certificates. For example, it is possible to use the same certificate between container versions. Mostly, it is easier to document the certificates if they are not buried deep in the container code. The new test certificates are initially intended to be used with the C unit tests but they will eventually be used for integration tests as well. Two new certificates have been defined. See test/certificate/README.md for details. The old dynamic certificates will be retained until they are replaced.	2018-11-21 18:13:37 -05:00
David Steele	bc25db5667	Add interface objects for libxml2. Add XmlDocument, XmlNode, and XmlNodeList objects as a thin interface layer on libxml2. This interface is not intended to be comprehensive. Only a few libxml2 capabilities are exposed but more can be added as needed.	2018-11-20 20:40:11 -05:00
David Steele	f743d4e924	Add testRepoPath() to let C unit tests know where the code repository is located. This allows a C unit test to access data in the code repository that might be useful for testing. Add testRepoPathSet() to set the repository path. In passing remove extra whitespace in the TEST_RESULT_VOID() macro.	2018-11-20 15:48:56 -05:00
David Steele	8c7e97a369	Clarify comment about main.c being excluded from unit testing. Also remove !!! which by convention we use as a marker for code that needs attention before it can be committed to master.	2018-11-14 08:08:26 -05:00
David Steele	acb579c469	Tighten limits on code coverage context selection. If the last } of a function was marked as uncovered then the context selection would overrun into the next function. Start checking context on the current line to prevent this. Make the same change for start context even though it doesn't seem to have an issue.	2018-11-13 10:37:58 -05:00
David Steele	7107cc68d2	Expand context shown in coverage and update colors. Too few lines were shown for coverage context so show the entire function if it has any missing coverage. Update colors to work with light and dark browser modes.	2018-11-12 18:11:16 -05:00
David Steele	22ecbc153a	New, concise coverage report for C. The report HTML generated by lcov is overly verbose and cumbersome to navigate. Since we maintain 100% coverage it's far more interesting to look at what is not covered than what is. The new report presents all missing coverage on a single page and excludes code that is covered for brevity.	2018-11-11 17:32:42 -05:00
David Steele	3e695af961	New test containers. * Add libxml2 library needed for S3 development. * Minor version updates for PostgreSQL. * Remove PostgreSQL 11 beta/rc repository.	2018-11-08 21:41:41 -05:00
David Steele	8efa5e6a6a	Rename CipherError to CryptoError. This aligns with the general renaming from cipher to crypto.	2018-11-06 19:38:38 -05:00
David Steele	57d7809297	Improve efficiency of code generation. Code generation saved files even when they had not changed, which often caused code generation cascades. So, don't save files unless they have changed. Use rsync to determine which files have changed since the last test run. The manifest of changed files is saved and not removed until all code generation and builds have completed. If an error occurs the work will be redone on the next run. The eventual goal is to do all the builds from the test/repo directory created by rsync but for now it is only used to track changes.	2018-11-03 19:52:46 -04:00
David Steele	1f8931f732	Improve single test run performance. Improve on `7794ab50` by including the build flag files directly into the Makefile as dependencies (even though they are not includes). This simplifies some of the rsync logic and allows make to do what it does best. Also split build flag files into test, harness, and build to reduce rebuilds. Test flags are used to build test.c, harness flags are used to build the rest of the files in the test harness, and build flags are used for the files that are not directly involved in testing.	2018-11-03 16:34:04 -04:00
David Steele	7794ab50dc	Preserve contents of C unit test build directory between test.pl executions. The contents were already preserved between tests in a single test.pl run but for a separate execution the entire project had to be built from scratch, which was getting slower as we added code. Save the important build flags in a file so the new execution knows whether the build contents can be reused.	2018-11-02 11:56:13 -04:00
Cynthia Shang	34c63276cd	Automatically enable backup checksum delta when anomalies (e.g. timeline switch) are detected. There are a number of cases where a checksum delta is more appropriate than the default time-based delta: * Timeline has switched since the prior backup * File timestamp is older than recorded in the prior backup * File size changed but timestamp did not * File timestamp is in the future compared to the start of the backup * Online option has changed since the prior backup A practical example is that checksum delta will be enabled after a failover to standby due to the timeline switch. In this case, timestamps can't be trusted and our recommendation has been to run a full backup, which can impact the retention schedule and requires manual intervention. Now, a checksum delta will be performed if the backup type is incr/diff. This means more CPU will be used during the backup but the backup size will be smaller and the retention schedule will not be impacted. Contributed by Cynthia Shang.	2018-11-01 11:31:25 -04:00
David Steele	cca7a4ffd4	Retry all S3 5xx errors rather than just 500 internal errors. We were already retrying 500 errors but 503 (rate-limiting) errors were not being retried and would cause an instant failure which aborted the command. There are only two 5xx errors currently implemented by S3 but instead of adding 503 simply retry all 5xx errors. This is consistent with the http definition of this error class, "the server failed to fulfill an apparently valid request." Suggested by Craig A. James.	2018-10-30 16:45:42 -04:00
David Steele	286f7e5011	Fix static WAL segment size used to determine if archive-push-queue-max has been exceeded. This calculation was missed when the WAL segment size was made dynamic in preparation for PostgreSQL 11. Fix the calculation by checking the actual WAL file sizes instead of using an estimate based on WAL segment size. This is more accurate because it takes into account .history and .backup files, which are smaller. Since the calculation is done in the async process the additional processing time should not adversely affect performance. Remove the PG_WAL_SIZE constant and instead use local constants where the old value is still required. This is only the case for some tests and PostgreSQL 8.3 which does not provide a way to get the WAL segment size from pg_control.	2018-10-27 20:00:00 +01:00
David Steele	41b00dc204	Fix issue with archive-push-queue-max not being honored on connection error. If an error occurred while acquiring a lock on a remote server the error would be reported correctly, but the queue max detection code was not reached. The tests failed to detect this because they fixed the connection before queue max, allowing the code to be reached. Move the queue max code before the lock so it will run even when remote connections are not working. This means that no attempt will be made to transfer WAL once queue max has been exceeded, but it makes it much more likely that the code will be reached without error. Update tests to continue errors up to the point where queue max is exceeded. Reported by Lardière Sébastien.	2018-10-27 16:57:57 +01:00
David Steele	9ae3d8c46a	Install nodejs from deb.nodesource.com. The standard npm packages on Ubuntu 18.04 suddenly required libssl1.0 which broke the pgbackrest package builds. Installing nodejs from deb.nodesource.com seems to work fine with standard libssl. This package is required by ScalityS3 which is used for local S3 testing.	2018-10-15 23:13:08 +01:00
David Steele	d038b9a029	Support configurable WAL segment size. PostgreSQL 11 introduces configurable WAL segment sizes, from 1MB to 1GB. There are two areas that needed to be updated to support this: building the archive-get queue and checking that WAL has been archived after a backup. Both operations require the WAL segment size to properly build a list. Checking the archive after a backup is still implemented in Perl and has an active database connection, so just get the WAL segment size from the database. The archive-get command does not have a connection to the database, so get the WAL segment size from pg_control instead. This requires a deeper inspection of pg_control than has been done in the past, so it seemed best to copy the relevant data structures from each version of PostgreSQL and build a generic interface layer to address them. While this approach is a bit verbose, it has the advantage of being relatively simple, and can easily be updated for new versions of PostgreSQL. Since the integration tests generate pg_control files for testing, teach Perl how to generate files with the correct offsets for both 32-bit and 64-bit architectures.	2018-09-25 10:24:42 +01:00
Cynthia Shang	880fbb5e57	Add checksum delta for incremental backups. Use checksums rather than timestamps to determine if files have changed. This is useful in cases where the timestamps may not be trustworthy, e.g. when performing an incremental after failing over to a standby. If checksum delta is enabled then checksums will be used for verification of resumed backups, even if they are full. Resumes have always used checksums to verify the files in the repository; enabling delta performs checksums on the database files as well. Note that the user must manually enable this feature in cases where it would be useful or just keep it enabled all the time. A future commit will address automatically enabling the feature in cases where it seems likely to be useful. Contributed by Cynthia Shang.	2018-09-19 11:12:45 -04:00
Cynthia Shang	b6b2c915b2	Allow hashSize() to run on remote storage. Apparently we never needed to run this function remotely. It will be needed by the backup checksum delta feature, so implement it now. Contributed by Cynthia Shang.	2018-09-18 11:39:48 -04:00
David Steele	e55d733041	Add -ftree-coalesce-vars option to unit test compilation. This is a workaround for inefficient handling of many setjmps in gcc >= 4.9. Setjmp is used in all error handling, but in the unit tests each test macro contains an error handling block so they add up pretty quickly for large unit tests. Enabling -ftree-coalesce-vars in affected versions reduces build time and memory requirements by nearly an order of magnitude. Even so, compiles are much slower than gcc <= 4.8. We submitted a bug for this at: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87316 Which was marked as a duplicate of: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63155	2018-09-17 11:38:10 -04:00
David Steele	b5f749b21c	Add CIFS driver to storage helper for read-only repositories. For read-only repositories the Posix and CIFS drivers behave exactly the same. Since that's all we support in C right now it's valid to treat them as the same thing. An assertion has been added to remind us to add the CIFS driver before allowing the repository to be writable. Mostly we want to make sure that the C code does not blow up when the repository type is CIFS.	2018-09-16 18:41:30 -04:00
David Steele	4119ce208d	Move test expect log out of the regular test directory. Storing the expect log (created by common/harnessLog) in the regular test directory was not ideal. It showed up in tests and made it difficult to clear the test directory between each run. Move the expect log to a purpose-built directory one level up so it does not interfere with regular testing.	2018-09-16 15:58:46 -04:00
David Steele	f0ed89f21f	Allow C or Perl coverage to run on more than one VM. C or Perl coverage tests can now be run on any VM provided a recent enough version of Devel::Cover or lcov is available. For now, leave u18 as the only VM to run coverage tests due to some issues with older versions of lcov.	2018-09-15 13:27:06 -04:00
David Steele	31cdd9d20b	Remove compiler warnings that are not valid for u16.	2018-09-15 08:23:55 -04:00
David Steele	aeb1fa3dfb	Don't perform valgrind when requested. The --no-valgrind flag was not being honored. It's not clear if this flag ever worked, but it does now.	2018-09-13 19:12:40 -04:00
Cynthia Shang	e351b8c67c	Improve info command to display the stanza cipher type. Contributed by Cynthia Shang. Suggested by Douglas J Hunley.	2018-09-10 13:09:45 -04:00
David Steele	c688bc8627	Improve support for special characters in filenames. % characters caused issues in backup/restore due to filenames being appended directly into a format string. Reserved XML characters (<>&') caused issues in the S3 driver due to improper escaping. Add a file with all common special characters to regression testing.	2018-09-10 10:54:34 -04:00
David Steele	f7fc8422f7	Make Valgrind return an error even when a non-fatal issue is detected. By default Valgrind does not exit with an error code when a non-fatal error is detected, e.g. unfreed memory. Use the --error-exitcode option to enable this behavior. Update some minor issues discovered in the tests as a result. Luckily, no issues were missed in the core code.	2018-09-07 16:50:01 -07:00
David Steele	de1b74da0c	Move encryption in mock/archive tests to remote tests. The new archive-get C code can't run (yet) when encryption is enabled. Therefore move the encryption tests so we can test the new C code. We'll move it back when encryption is enabled in C. Also, push one WAL segment with compression to test decompression in the C code.	2018-09-06 09:35:34 -07:00
David Steele	6361a06181	Fix incorrectly reported error return in info logging. A return code of 1 from the archive-get was being logged as an error message at info level but otherwise worked correctly. Also improve info messages when an archive segment is or is not found.	2018-09-04 21:46:41 -04:00
David Steele	375ff9f9d2	Ignore all files in a linked tablespace directory except the subdirectory for the current version of PostgreSQL. Previously an error would be generated if other files were present and not owned by the PostgreSQL user. This hasn't been a big deal in practice but it could cause issues. Also add tests to make sure the same logic applies with links to files, i.e. all other files in the directory should be ignored. This was actually working correctly, but there were no tests for it before.	2018-08-31 16:06:40 -04:00
David Steele	d41570c37a	Improve log file names for remote processes started by locals. The log-subprocess feature added in `22765670` failed to take into account the naming for remote processes spawned by local processes. Not only was the local command used for the naming of log files but the process id was not passed through. This meant every remote log was named "[stanza]-local-remote-000" which is confusing and meant multiple processes were writing to the same log. Instead, pass the real command and process id to the remote. This required a minor change in locking to ignore locks if process id is greater than 0 since remotes started by locals never lock.	2018-08-31 11:31:13 -04:00
David Steele	70514061fd	Fix issue where relative links in $PGDATA could be stored in the backup with the wrong path. Relative link paths were being combined with the paths of previous links (relative or absolute) due to the $strPath variable being modified in the current iteration rather than simply being passed to the next level of recursion. This issue did not affect absolute links and relative tablespace links were caught by other checks, though the error was confusing. Reported by Cynthia Shang.	2018-08-30 16:27:36 -04:00
David Steele	c638490451	Documentation updates for exclude feature based on review. Reviewed by Cynthia Shang.	2018-08-28 16:49:29 -04:00
David Steele	14cde54b37	Limit manifest build recursion (i.e. links followed) to sixteen levels to detect link loops.	2018-08-28 16:27:10 -04:00
David Steele	a6cecf7d5e	Prevent manifest from being built more than once.	2018-08-28 16:22:30 -04:00
David Steele	bef58a7974	Allow arbitrary directories and/or files to be excluded from a backup. Misuse of this feature can lead to inconsistent backups so read the --exclude documentation carefully before using.	2018-08-27 15:51:05 -04:00
Cynthia Shang	eb30d88b6a	Allow zero-size files in backup manifest to reference a prior manifest regardless of timestamp delta. Contributed by Cynthia Shang.	2018-08-24 16:50:33 -04:00
David Steele	0ed37ab9e7	Update Archive::Info->archiveIdList() to return a valid error code instead of unknown.	2018-08-24 12:13:10 -04:00
David Steele	2276567027	Add log-subprocess option to allow file logging for local and remote subprocesses.	2018-08-22 20:05:49 -04:00
David Steele	8a8738308c	Enable -Wvla.	2018-08-22 14:48:37 -04:00

703 Commits