The errorInternalThrowSys*() functions were marked as returning during coverage testing even when they had no possibility of returning, i.e. when the error parameter was set to constant true. This meant the compiler would treat the functions as returning even when they could not.
Instead create completely separate functions for coverage to use for THROW_ON_SYS_ERROR*() that can return and leave the regular functions marked __noreturn__.
This abstraction allows the session code to be shared between the socket client and (upcoming) server code. There should be no difference in how the code works -- only the organization has changed. Note that no changes to the tests were required.
This same abstraction will be required for TlsClient but that will be done in a separate commit because it requires test changes.
These forks were done in a custom way (not sure why) and lacked the ability the standard macros provide for the parent to wait for the child to exit.
This meant that the server would continue to run after the tests were complete and that multiple servers could run at once. This caused subtle timing and connection issues that required larger timeouts to resolve.
Don't change the timeouts here since they need to be adjusted in future commits anyway.
It is pretty much impossible for a static IP to not resolve to an address but in theory the error could catch other conditions so it seems best to keep it.
When the Vagrantfile was updated to use pgbackrest/ instead of /backrest/ as the location for executing tests and building the documentation, parts of contributing.xml (and hence CONTRIBUTING.md) were not updated. Some parts of the document are actually executed when CONTRIBUTING.md is built from contributing.xml and those were updated, but the parts that are not executed were missed.
This commit fixes the contributing.xml issue but also removes test/README.md as its contents were out of date and redundant given that they are covered in CONTRIBUTING.md.
This limitation forced extra logic in cases where zero wait times were needed.
Remove the limitation and the extra logic in cases where zero wait times are possible.
Help identify whether errors are happening in the forked server or the main test by showing the line number where the server was forked off in the stack trace.
If these are not reset then an error not wrapped in a TEST_ERROR*() macro may show the line number of the previous error in a stack trace, which is confusing.
It is better for the line number to be unreported than wrong.
The default process id was previously always 0 but there are cases where it is useful to be able to set the default.
Currently the only use case is for testing but the upcoming server code will also make use of it.
The storage driver requires two list functions to be implemented, list and infoList. But the former is a subset of the latter so implementing both in every driver is wasteful. The reason both exist is that in Posix it is cheaper to get a list of names than it is to stat files to get size, time, etc. In S3 these operations are equivalent.
Introduce storageInfoLevelType to determine the amount of information required by the caller. That way Posix can work efficiently and all drivers can return only the data required which saves some bandwidth. The storageList() and storageInfoList() functions remain in the storage interface since they are useful -- the only change is simplifying the drivers with no external impact.
Note that since list() accepted an expression infoList() must now do so. Checking the expression is optional for the driver but can be used to limit results or save IO costs.
Similarly, exists() and pathExists() are just specialized forms of info() so adapt them to call info() instead.
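As an illustration, the level type might look something like the sketch below; the names are illustrative rather than necessarily the exact ones in the code.

```c
// Illustrative sketch of an info level -- each driver fetches only what the
// caller asked for
typedef enum
{
    storageInfoLevelExists,     // only whether the file exists
    storageInfoLevelBasic,      // type, size, time
    storageInfoLevelDetail      // plus mode, owner, group, link destination
} StorageInfoLevel;
```

On Posix the driver can avoid the expensive per-file work when only the lower levels are requested, while on S3 all levels cost about the same.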
It's better to start out with plural forms rather than flip back and forth as functions are added and subtracted. So, use "Constructors" instead of "Constructor".
Use "Getters/Setters" rather than "Getters" or "Setters" to avoid similar churn.
This has been the policy for some time but due to migration pressure only new functions and refactors have been following this rule. Now it seems sensible to make a clean sweep and move all the comments that have not been moved already (i.e. most of them).
Only obvious typos and gross inaccuracies in the comments have been fixed. For the most part this was a copy and paste operation.
Useless comments, e.g. "New object", were not copied. Even so, there are surely many deficient comments left.
Some rearranging was done where needed and functions were placed in the proper sections, e.g. "Constructors", "Functions", etc.
A few function prototypes were found that no longer had an implementation. These were removed, but there may be more.
The coding document has been updated to reflect this policy, which is not new but has never been documented.
Prior to performing a backup or expiring backups, the backup.info file is validated by reconstructing it from the backups in the repository. When a backup had already been removed from the repo, it was removed from the backup.info file but its dependents were not.
Now, the dependent backups will also be removed from backup.info and only backups in the repo that have their full dependency chain will be added to backup.info if they are missing.
These functions accepted const Buffer objects and returned non-const pointers which is definitely not a good idea. Add bufPtrConst() to handle cases where only a const return value is needed and update call sites.
Use UNCONSTIFY() in cases where library code out of our control requires a non-const pointer. This includes the already-documented exception in command/backup/pageChecksum and input buffers in the gzCompress and gzDecompress filters.
Allows casting const-ness away from an expression, but doesn't allow changing the type. Enforcement of the latter currently only works for gcc-like compilers.
Note that it is not safe to cast const-ness away if the result will ever be modified (it would be undefined behavior). Doing so can cause compiler mis-optimizations or runtime crashes (by modifying read-only memory). It is only safe to use when the result will not be modified, but API design or language restrictions prevent you from declaring that (e.g. because a function returns both const and non-const variables).
Note that this only works in function scope, not for global variables (it would be nice, but not trivial, to improve that).
UNCONSTIFY() requires static assert which is a feature in its own right.
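A minimal sketch of how such a macro can be built on gcc-like compilers, along the lines of PostgreSQL's unconstify():

```c
// Statement expression + _Static_assert: reject any cast that changes more
// than const-ness; gcc/clang only since it relies on compiler builtins
#define UNCONSTIFY(type, expr)                                                 \
    (__extension__({                                                           \
        _Static_assert(                                                        \
            __builtin_types_compatible_p(__typeof(expr), const type),          \
            "UNCONSTIFY type mismatch");                                       \
        (type)(expr);                                                          \
    }))

// Usage -- only safe because raw is never written through
static void
example(void)
{
    const char *value = "example";
    char *raw = UNCONSTIFY(char *, value);
    (void)raw;
}
```

The statement expression is also why this works only in function scope.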
PostgreSQL enables this option when available which seems like a good idea since we also do not share connections between processes.
Note that as in PostgreSQL there is no way to disable this option.
PostgreSQL enables this option when available which seems like a good idea since we also buffer transmissions.
Note that as in PostgreSQL there is no way to disable this option.
This is really a socket option so the new name is clearer.
Since common/io/socket/tcp will contain a mix of options it makes sense to rename it to socket and cascade name changes as needed.
Prior to 2.25 the individual TCP keep-alive options were not being configured due to a missing header. In 2.25 they were being configured incorrectly due to a disconnect between the timeout specified in ms and what was expected by the TCP options, i.e. seconds.
Instead make the TCP keep-alive options directly configurable, with correct units and better testing. Keep-alive is enabled by default (though it can be defaulted to the system setting instead) and the rest of the options are not set by default. This is in line with what PostgreSQL does, though PostgreSQL does not allow keep-alive to be defaulted.
Also move configuration of TCP options before connect() as PostgreSQL does.
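For reference, a sketch of the socket setup this implies; the values are illustrative and error handling is elided:

```c
#include <netinet/in.h>
#include <netinet/tcp.h>
#include <sys/socket.h>

static void
sckOptionSet(int fd)
{
    int on = 1;

    // Enable keep-alive itself
    setsockopt(fd, SOL_SOCKET, SO_KEEPALIVE, &on, sizeof(on));

#ifdef TCP_KEEPIDLE
    int idle = 60;                          // seconds before the first probe
    setsockopt(fd, IPPROTO_TCP, TCP_KEEPIDLE, &idle, sizeof(idle));
#endif

#ifdef TCP_KEEPINTVL
    int interval = 10;                      // seconds between probes
    setsockopt(fd, IPPROTO_TCP, TCP_KEEPINTVL, &interval, sizeof(interval));
#endif

#ifdef TCP_KEEPCNT
    int count = 5;                          // probes before giving up
    setsockopt(fd, IPPROTO_TCP, TCP_KEEPCNT, &count, sizeof(count));
#endif

    // connect() is called after the options are set, as in PostgreSQL
}
```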
This functionality was embedded into TlsClient but that was starting to get unwieldy.
Add SocketClient to contain all socket-related client functionality.
Documentation builds and tests have only a few packages in common so rearrange packages to save some time and clarify dependencies.
Remove the libperl-dev package which became obsolete when the LibC module was removed in 79cfd3ae.
Add a few comments for good measure.
The primary purpose of this test (currently) is to measure the performance of storageRemoteInfoList(), which is critical for building a manifest when the PostgreSQL host is remote.
The starting baseline of 1 million files is perhaps a bit aggressive but it makes it very likely the test will blow up if there are performance regressions.
Recent performance improvements allow increasing the baseline of this test.
In general it is best if the baseline is large enough to cause the test to blow up if there are performance regressions.
Features:
* Add lz4 compression support. Note that setting compress-type=lz4 will make new backups and archive incompatible (unrestorable) with prior versions of pgBackRest. (Reviewed by Cynthia Shang.)
* Add --dry-run option to the expire command. Use dry-run to see which backups/archive would be removed by the expire command without actually removing anything. (Contributed by Cynthia Shang, Luca Ferrari.)
Improvements:
* Improve performance of remote manifest build. (Suggested by Jens Wilke.)
* Fix detection of keepalive options on Linux. (Contributed by Marc Cousin.)
* Add configure host detection to set standards flags correctly. (Contributed by Marc Cousin.)
* Remove compress/compress-level options from commands where unused. These commands (e.g. restore, archive-get) never used the compress options but allowed them to be passed on the command line. Now they will error when these options are passed on the command line. If these errors occur then remove the unused options. (Reviewed by Cynthia Shang.)
* Limit backup file copy size to size reported at backup start. If a file grows during the backup it will be reconstructed by WAL replay during recovery so there is no need to copy the additional data. (Reviewed by Cynthia Shang.)
If the tests are running quickly then the time target might end up the same as the end time of the prior full backup. That means restore auto-select will not pick it as a candidate and will restore the last backup instead, causing the restore compare to fail.
So, sleep one second.
Add functions to select a current backup by label and to retrieve a backup dependency list for any given backup.
Update the expire code to utilize the new functions and to expire backup sets from newest dependency to oldest.
Decisions about when to optimize or enable debug code were spread out in too many places making it hard to keep them consistent.
Centralize the logic as much as possible to make it easier to maintain.
Append N characters from a zero-terminated string.
Note that the string does not actually need to be zero-terminated as long as N is <= the end of the string being concatenated.
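A minimal sketch of the idea with a hypothetical growable string type:

```c
#include <stdlib.h>
#include <string.h>

typedef struct
{
    char *buffer;       // always zero-terminated
    size_t used;        // bytes in use, excluding the terminator
    size_t allocated;   // bytes allocated
} Str;

// Append exactly size bytes from cat; cat need not be zero-terminated as
// long as size bytes are readable (error handling elided)
void
strCatZN(Str *this, const char *cat, size_t size)
{
    if (this->used + size + 1 > this->allocated)
    {
        this->allocated = (this->used + size + 1) * 2;
        this->buffer = realloc(this->buffer, this->allocated);
    }

    // memcpy() never looks for a terminator so only size bytes are read
    memcpy(this->buffer + this->used, cat, size);
    this->used += size;
    this->buffer[this->used] = '\0';
}
```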
In the ExpireEnvTest.pm backupCreate() function, backup-prior was incorrectly set for diff backups to the previous backup regardless of what backup type the previous backup was. This did not cause any issues in the Mock Expire tests before because it was not being checked. However, in order to reduce churn in the expect logs for a new feature where the backup-prior is utilized, this is being fixed so that the full backup is always used as backup-prior.
The major bottleneck was finding the memory allocation to be resized since it required a sequential search through a list.
Instead, put the allocation header at the beginning of the allocation and return an offset to the user for their buffer. This allows us to use pointer arithmetic to get back to the allocation header quickly when resizing. A side effect is to make memFree() faster as well. The downside is we won't detect garbage pointers passed to memResize()/memFree(), which is also true for MemContext pointers.
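The layout change boils down to something like this (simplified; the real header tracks more state):

```c
#include <stdlib.h>

typedef struct AllocHeader
{
    size_t size;
} AllocHeader;

static void *
allocNew(size_t size)
{
    // Error handling elided
    AllocHeader *header = malloc(sizeof(AllocHeader) + size);
    header->size = size;

    // The caller sees only the buffer that follows the header
    return header + 1;
}

static void
allocFree(void *buffer)
{
    // Pointer arithmetic recovers the header: O(1), no list search
    free((AllocHeader *)buffer - 1);
}
```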
The performance benefits can be pretty large in certain cases, in particular when loading and saving manifests. The following are the before and after performance tests on a 900K file manifest.
Before:
run 003 - manifestNewLoad()/manifestSave()
000.000s l0125 - generate manifest
183.411s l0236 - 101.2MB manifest generated with 900000 files
183.411s l0239 - load manifest
403.816s l0243 - completed in 220405ms
403.816s l0245 - check file total
403.816s l0248 - save manifest
670.217s l0253 - completed in 266401ms
670.217s l0256 - find all files
671.263s l0266 - completed in 1046ms
After:
run 003 - manifestNewLoad()/manifestSave()
000.000s l0125 - generate manifest
007.730s l0236 - 101.2MB manifest generated with 900000 files
007.730s l0239 - load manifest
033.431s l0243 - completed in 25701ms
033.431s l0245 - check file total
033.431s l0248 - save manifest
057.755s l0253 - completed in 24324ms
057.755s l0256 - find all files
058.689s l0266 - completed in 934ms
* Fix a few issues with file names being truncated introduced in 787d3fd6.
* Use function line info from the lcov file to calculate which lines to show for uncovered functions. This is more accurate than what we were doing before and function comment headers are now excluded which reduces clutter in the report.
The prior macros had grown over time to be pretty significant pieces of code that required a lot of compile time, though runtime was efficient.
Move most of the macro code into functions to reduce compile time, perhaps at a slight expense to runtime. The overall performance benefit is 10-15% so this seems like a good tradeoff.
Add TEST_RESULT_UINT_INT() to safely compare uint to int with range checking.
Upcoming changes to the TEST_RESULT_* macros are more type safe and identified that the wrong macros were being used to test results in many cases.
Commit these changes separately to verify that they work with the current macro versions.
Note that no core bugs were exposed by these changes.
TRY...CATCH blocks are fairly expensive and when all the TEST_RESULT*() macros succeed they are not needed.
Instead just record info at the start of the result test so a detailed exception can be thrown in test.c in the rare case where an exception occurs.
This is helpful for test macros that know the line number.
The line number can now be non-zero below the top of the stack without WITH_BACKTRACE so instead ignore the line number for output when it is zero.
This was passing since we don't test WITH_BACKTRACE in CI because it is used only for test builds.
Ideally we would test this but it doesn't seem worth the trouble at the moment.
Building the contributing document has some special requirements because it runs Docker in Docker so the repo path must align on the host and all Docker containers. Run `pgbackrest/doc/doc.pl` from within the home directory of the user that will do the doc build, e.g. `/home/vagrant`. If the repo is not located directly in the home directory, e.g. `/home/vagrant/pgbackrest`, then a symlink may be used, e.g. `ln -s /path/to/repo /home/vagrant/pgbackrest`.
Mount the repo in the Vagrantfile at /home/vagrant/pgbackrest but provide a link from the old location at /backrest to make the transition less painful.
The old coverage data has been recorded so it is no longer needed. In newer versions of gcc leaving this file around can lead to an error when writing profile data after forking off to a non-pgbackrest binary (which we do in some unit tests).
* Show all uncovered branch parts even when there are more than two parts per branch. This is the way gcc9 reports coverage so it needs to work even if it doesn't make as much sense as the old way.
* Show covered branches in functions where coverage is missing. Showing just the uncovered branches can be confusing because it's not always clear how the coverage relates to the code. By showing all branch coverage (+ or -) this correspondence is made easier.
We don't report branch coverage on test modules (e.g. test/src/module/common/errorTest.c) but the code that excluded branch coverage from the test module would also exclude it from all core modules if the test module was included in the lcov report due to lack of function/line coverage.
Adjust the coverage code to only exclude branches during the extraction of test module coverage.
For some reason gcc9 would not do -O0 builds in combination with one of the options that libperl required. Now that libperl is gone this exception is no longer required.
If a file grows during the backup it will be reconstructed by WAL replay during recovery so there is no need to copy the additional data.
This also reduces the likelihood of seeing torn pages during the copy. Torn pages can still occur in the middle of the file, though, so they must be handled.
The manifest is excellent for validation but including the entire manifest is too noisy and some values are architecture/algorithm dependent.
Output a redacted version that contains the most important information which can be improved on over time.
This macro will automatically do key replacement before the comparison. This saves the indentation required for an embedded function call.
Possibly TEST_RESULT_Z_KEYRPL() would also be useful but it will be added when needed.
The current use case is reading files from the PostgreSQL cluster during backup.
A file may grow during backup but we only need to copy the number of bytes that were reported during the manifest build. The rest will be rebuilt from the WAL during recovery so copying more is just a waste of space.
Limiting the copy sizes in backup will be part of a future commit.
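A minimal sketch of the limiting logic, assuming a plain file descriptor rather than the real read interface:

```c
#include <stdint.h>
#include <unistd.h>

// Cap reads at the remaining limit and report eof once it is reached, even
// if the underlying file has grown past the limit
static ssize_t
readLimited(int fd, void *buffer, size_t bufferSize, uint64_t *limitRemaining)
{
    if (*limitRemaining == 0)
        return 0;                           // logical eof for the caller

    if ((uint64_t)bufferSize > *limitRemaining)
        bufferSize = (size_t)*limitRemaining;

    ssize_t actual = read(fd, buffer, bufferSize);

    if (actual > 0)
        *limitRemaining -= (uint64_t)actual;

    return actual;
}
```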
When multiple files were missing coverage it could be hard to locate the coverage report for a specific file.
Add links for uncovered files to make this easier.
Also move table titles out of the table so they are valid html.
These days it is better to include the module in define.yaml when we need to poke at the internal implementation.
This doesn't quite work for the log test harness, so for now some variables will need to remain extern'd in debug builds.
Enhance dry-run support added in 2fa69af8 by forbidding writes in the storage layer and adding prefixes to log messages.
The former will protect against mistakes in dry-run implementations and the latter will make it clear when a command was executed in dry-run mode.
Update expire unit tests with the new log prefix.
These results were stored in the vagrant path along with a full copy of src.
Instead store the raw coverage data in test/result/raw and change source references to the files that already exist in [test-path]/repo.
It makes more sense to build in the test path since many developers won't have a vagrant path. Anyway, it's better not to modify the vagrant path since it belongs to vagrant.
Instead of installing the binary just mount it into the container from where it was built. This saves a bit of time and space.
When pgbackrest was present this test behaved unexpectedly.
While the binary is not currently required for this test it might be in the future, so fix the test to prevent a regression.
Building packages is not a normal part of development so don't build packages by default. Instead build them in CI as needed.
Do the builds in test/result instead of .vagrant to be friendlier with hosts that are not running vagrant. Anyway, it's probably not a good idea to be creating files in the .vagrant path.
Building the configure.ac script can take multiple seconds depending on the state of the autoconf cache. Use a checksum to only rebuild when configure.ac has changed no matter how the timestamps have changed.
Configure:
* Use standard make variables, e.g. CFLAGS, rather than our own, e.g. CINCLUDE
* Add PG_CONFIG var for configuring custom pg_config location
* Don't error if xml_config or pg_config is missing (but error if libs/headers not found)
* Check for zlib.h header
* Check for lz4frame.h header when liblz4 is present
Make:
* Use gcc-style auto dependencies
* Put src list at the top since it is most frequently modified
* Add clean-all target to also remove auto-generated config files
This file is used to generate src/configure and is not required to make pgbackrest since src/configure is updated before distribution.
Move to src/build so it is out of the way.
Changes to reference.xml can affect the command-line documentation built into the binary so changes must trigger an auto-generated code build during smart builds.
The prior method was to build a special container to hold these files which meant they would get stale on development systems. On CI the container was always rebuilt so failures would be seen there even when dev seemed to be working.
Instead get the package source when the package is built to ensure it is as up-to-date as possible.
This change was prompted by failures on the Ubuntu 12.04 container while getting the package source, probably due to an ancient version of git. Package builds are no longer supported on that platform with the addition of lz4 compression so it didn't seem worth fixing.
The primary source for project info is now src/version.h.
The pgBackRestDoc::ProjectInfo module loads the project info from src/version.h at runtime so there is no need to update it.
This is consistent with the way BackRest and BackRest test were renamed way back in 18fd2523.
More modules will be moving to pgBackRestDoc soon so renaming now reduces churn later.
This directory was once the home of the production Perl code but since f0ef73db this is no longer true.
Move the modules to test in most cases, except where the module is expected to be useful for the doc engine beyond the expected lifetime of the Perl test code (about a year if all goes well).
The exception is pgBackRest::Version which requires more work to migrate since it is used to track pgBackRest versions.
LZ4 compresses data faster than gzip but at a lower ratio. This can be a good tradeoff in certain scenarios.
Note that setting compress-type=lz4 will make new backups and archive incompatible (unrestorable) with prior versions of pgBackRest.
This was the interface between Perl and C introduced in 36a5349b but since f0ef73db has only been used by the Perl integration tests. This is expensive code to maintain just for testing.
The main dependency was the interface to storage, no matter where it was located, e.g. S3. Replace this with the newly-introduced repo commands (d3c83453) that allow access to repo storage via the command line.
The other dependency was on various cfgOption* functions and CFGOPT_ constants that were convenient but not necessary. Replace these with hard-coded strings in most places and create new constants for commonly used values.
Remove all auto-generated Perl code. This means that the error list will no longer be maintained automatically so copy used errors to Common::Exception.pm. This file will need to be maintained manually going forward but there is not likely to be much churn as the Perl integration tests are being retired.
Update test.pl and related code to remove LibC builds.
Ding, dong, LibC is dead.
These commands are generally useful but more importantly they allow removing LibC by providing the Perl integration tests an alternate way to work with repository storage.
All the commands are currently internal only and should not be used on production repositories.
If the command was passed a file it would return no results since it was originally intended to list files when passed a path.
However, as a general purpose command working directly with files makes sense.
This command only makes sense for the repository storage since other storage (e.g. pg and spool) must be located on a local Posix filesystem and can be listed using standard unix commands. Since the repo storage can be located in many places, having a common way to list it makes sense.
Prefix with repo- to make the scope of this command clear.
Update documentation to reflect this change.
Add compress-type option and deprecate compress option. Since the compress option is boolean it won't work with multiple compression types. Add logic to cfgLoadUpdateOption() to update compress-type if it is not set directly. The compress option should no longer be referenced outside the cfgLoadUpdateOption() function.
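The remapping amounts to something like the following sketch; the helper shape is hypothetical since the real code works through the config option functions:

```c
#include <stdbool.h>

// Derive compress-type when it was not set directly
static const char *
compressTypeUpdate(bool compressTypeSet, const char *compressType, bool compress)
{
    // An explicit compress-type always wins
    if (compressTypeSet)
        return compressType;

    // Otherwise map the legacy boolean: compress=y means gz, compress=n none
    return compress ? "gz" : "none";
}
```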
Add common/compress/helper module to contain interface functions that work with multiple compression types. Code outside this module should no longer call specific compression drivers, though it may be OK to reference a specific compression type using the new interface (e.g., saving backup history files in gz format).
Unit tests only test compression using the gz format because other formats may not be available in all builds. It is the job of integration tests to exercise all compression types.
Additional compression types will be added in future commits.
All the methods in this module will need to be implemented via the command-line in order to get rid of LibC, so the first step is to reduce the code in the module as much as possible.
First remove storageDb() and use storageTest() instead. Then create storageTest() using pgBackRestTest::Common::Storage which has no dependencies on LibC. Now the only storage using the LibC interface is storageRepo().
Remove all link functions since those operations cannot be performed on a repo unless it is Posix, in which case the LibC interface is not needed. Same for owner().
Remove pathSync() because syncs are not required in the tests. No test data is reused after a crash.
Path create/exists functions should never be explicitly performed on a repo so remove those. File exists can be implemented by calling info() instead.
Remove encryption detection functions which were only used by Backup/Archive::Info reconstruct() which are now obsolete.
Remove all filters except pgBackRest::Storage::Filter::CipherBlock since they are not being used. That also means there are no filters returning results so remove all the result code.
Move hashSize() and pathAbsolute() into pgBackRest::Storage::Base where they can be shared between pgBackRest::Storage::Storage and pgBackRestTest::Common::Storage.
This was mostly dead code except the DB_BACKUP_ADVISORY_LOCK constant, moved to the real/all test module, and the function that pulls info from pg_control, moved to ExpireEnvTest.pm.
The postgres/pageChecksum module was designed as an interface to the C structs for the Perl code. The new C code can do this directly so no need for an interface.
Move the remaining test for pgPageChecksum() into the postgres/interface test module.
We were using a customized version which worked fine but was hard to merge with upstream changes. Now this code is maintained much like the types in static.auto.h that we copy and check with each release.
The goal is to eventually build directly against PostgreSQL (either source or libcommon) and this brings us one step closer.
All-zero pages should not have checksums. Not only is this test invalid but it will not work with the stock page checksum implementation in PostgreSQL, which checks for zero pages. Since we will be using that code verbatim soon this test needs to go.
Using static values serves as a better cross-check against the page checksum code. The downside is that these checksums may not work with some big endian systems but in that case neither will the unit tests.
We can also remove the page checksum interface from LibC which brings us one step closer to eliminating it.
Page size is passed around a lot but in fact it can only have one value, PG_PAGE_SIZE_DEFAULT, which is checked when pg_control is loaded. There may be an argument for supporting multiple page sizes in the future but for now just use the constant to simplify the code.
There is also a significant performance benefit. Because pageSize was being used in pageChecksumBlock() the main loop was neither unrolled nor vectorized (-funroll-loops -ftree-vectorize) as it is now with a constant loop boundary.
This function made validation faster in Perl because fewer calls (and buffer transformations) were required when all checksums were valid.
In C calling pageChecksumTest() directly is just as efficient so there is no longer a need for pageChecksumBufferTest().
These data structures were copied a few places (but only once in the core code) so put them in a place where everyone can use them.
To do this create a new file, static.auto.h, to contain data types and macros that have stayed the same through all the versions of PostgreSQL that we support. This allows us to have single, non-versioned set of headers and code for stable data structures like page headers.
Migrate a few types from version.auto.h that are required for page header structures and pull the remaining types from PostgreSQL directly.
We had previously renamed xlog to wal so update those where required since we won't be modifying the PostgreSQL names anymore.
The S3 driver depends on being able to generate a common prefix to limit the number of results from list commands, which saves on bandwidth.
The prior implementation could be tricked by an expression like ^ABC|^DEF where there is more than one possible prefix. To fix this disallow any prefix when another ^ anchor is found in the expression. [^ and \^ are OK since they are not anchors.
Note that this was not an active bug because there are currently no expressions with multiple ^ anchors.
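The check reduces to scanning for a second unescaped anchor, roughly as below (ignoring multi-character escapes for brevity):

```c
#include <stdbool.h>

// A prefix is only usable when no second anchor appears after the leading ^
static bool
prefixUsable(const char *expression)
{
    // expression[0] is the leading ^ that made a prefix possible
    for (const char *chr = expression + 1; *chr != '\0'; chr++)
    {
        // [^ is bracket negation and \^ is escaped -- neither is an anchor
        if (*chr == '^' && chr[-1] != '[' && chr[-1] != '\\')
            return false;   // e.g. ^ABC|^DEF -- more than one possible prefix
    }

    return true;
}
```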
The restore test function was passing strBackup to the restoreCompare function but when the restore is expected to pick a backup based on a timestamp, then strBackup may not be the one chosen.
Modified the code so that strBackupExpected is set based on the parameters passed to the function and this is then passed to restoreCompare.
This option was used for boolean testing but it will soon be deprecated and the semantics changed. To reduce churn it seems easiest to just use other options for testing. This will also be helpful when the option is eventually removed.
These commands (e.g. restore, archive-get) never used the compress options but allowed them to be passed on the command line. Now they will error when these options are passed on the command line. If these errors occur then remove the unused options.
This was a minor optimization used in protocol layer compression. Even though it was slightly faster, it omitted the crc-32 that is generated during normal compression which could lead to corrupt data after a bad network transmission. This would be caught on restore by our checksum but it seems better to catch an issue like this early.
The raw option also made the function signature different than future compression formats which may not support raw, or require different code to support raw.
In general, it doesn't seem worth the extra testing to support a format that has minimal benefit and is seldom used, since protocol compression is only enabled when the transmitted data is uncompressed.
"gz" was used as the extension but "gzip" was generally used for function and type naming.
With a new compression format on the way, it makes sense to standardize on a single abbreviation to represent a compression format in the code. Since the extension is standard and we must use it, also use the extension for all naming.
The prior code used TRY...CATCH blocks to cleanup mem contexts when an error occurred. This included freeing new mem contexts that were still being initialized when the error occurred and ensuring that the prior memory context was restored.
This worked fine in production but it involved a lot of setjmp()/longjmp() calls that resulted in longer compilation times and sluggish performance under valgrind, profiling, and coverage testing.
Instead maintain a stack of new contexts and context switches that can be used to do cleanup after an error. Normally, the stack is not used for this purpose and pushing/popping is a cheap operation. In the prior implementation most of the TRY...CATCH logic needed to be run even on success.
One bonus is that the binary is about 8% smaller after this change. Another benefit is that new contexts *must* be explicitly freed/discarded or an error will occur. See info/manifest.c for an example of where this is useful outside the standard macros.
If PostgreSQL crashes it can leave behind a pg_internal.init temp file with the pid as the extension, as discussed in https://www.postgresql.org/message-id/flat/20200131045352.GB2631%40paquier.xyz#7700b9481ef5b0dd5f09cc410b4750f6. On restart this file is not cleaned up so it can persist for the lifetime of the cluster or until another process with the same id happens to write pg_internal.init.
This is arguably a bug in PostgreSQL, but in any case it makes sense not to backup this file.
This error was lost during the migration to C. The error that occurred instead (generally an SSH auth error) was hard to debug.
Restore the original behavior by throwing an error immediately if pg1-host is configured for any of these commands. reset-pg1-host can be used to suppress the error when required.
The main improvement is a double-fork to prevent zombie processes if the parent process exits after the (child) async process. This is a real possibility since the parent process sticks around to monitor the results of the async process.
In the first fork, ignore SIGCHLD in the very unlikely case that the async process exits before the first fork. This is probably only possible if the async process exits immediately, perhaps due to a chdir() failure. Set SIGCHLD back to default in the async process so waitpid() will work as expected.
Also update the comment on chdir() to more accurately reflect what is happening.
Finally, add a test in certain debug builds to ensure the first fork exits very quickly. This only works when valgrind is not in use because valgrind makes forking so slow that it is hard to tell if the async process performed work or not (in the case that the second fork goes missing and the async process is a direct child).
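The shape of the double-fork, with error handling trimmed for brevity:

```c
#include <signal.h>
#include <sys/wait.h>
#include <unistd.h>

static void
asyncStart(void)
{
    pid_t pid = fork();

    if (pid == 0)
    {
        // First child: ignore SIGCHLD so the async process is auto-reaped in
        // the unlikely case it exits before this process does
        signal(SIGCHLD, SIG_IGN);

        if (fork() == 0)
        {
            // Async process: restore default SIGCHLD so waitpid() works
            signal(SIGCHLD, SIG_DFL);

            // ... async work happens here ...
            _exit(0);
        }

        // Exit quickly so the async process is reparented to init and can
        // never become a zombie of the original parent
        _exit(0);
    }

    // Parent: reap the short-lived first child and continue monitoring
    waitpid(pid, NULL, 0);
}
```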
In this case the resumable backup should be ignored, but the C code was not able to load the partial manifest written by Perl since the format differs slightly. Add validations to catch this case and continue gracefully.
2a06df93 removed the error file so an old error would not be reported before the async process had a chance to try again. However, if the async process was already running this might lead to a timeout error before reporting the correct error.
Instead, remove the error files once we know that the async process will start, i.e. after the archive lock has been acquired.
This effectively reverts 2a06df93.
Generally, the content-length or transfer-encoding headers will be used to determine how much content should be expected.
There is a special case where the server sends 'Connection: close' without the content headers and the content may be read up until eof.
This appears to be an atypical usage but it is required by the specification.
Auto-selection is performed only when --set is not specified. If a backup set for the given target time cannot be found, the latest (default) backup set will be used.
Currently a limited number of date formats are recognized and timezone names are not allowed, only timezone offsets.
Add tzPartsValid() and tzOffsetSecond() to calculate timezone offsets from user provided values.
Update epochFromParts() to accept a timezone offset in seconds.
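One well-known way to do the timezone-independent part is Howard Hinnant's civil-date algorithm; the user-provided offset is then a simple subtraction. This is a sketch of the approach, not necessarily how the functions above are implemented:

```c
#include <time.h>

// Days since 1970-01-01 without consulting the local timezone
static long
daysFromCivil(int year, int month, int day)
{
    year -= month <= 2;
    const long era = (year >= 0 ? year : year - 399) / 400;
    const unsigned yearOfEra = (unsigned)(year - era * 400);
    const unsigned dayOfYear =
        (153U * (unsigned)(month + (month > 2 ? -3 : 9)) + 2U) / 5U + (unsigned)day - 1U;
    const unsigned dayOfEra = yearOfEra * 365U + yearOfEra / 4U - yearOfEra / 100U + dayOfYear;

    return era * 146097L + (long)dayOfEra - 719468L;
}

// The offset (e.g. +04:30 = 16200 seconds) is subtracted to get UTC
static time_t
epochFromParts(int year, int month, int day, int hour, int minute, int second, int tzOffsetSecond)
{
    return
        (time_t)daysFromCivil(year, month, day) * 86400 +
        hour * 3600 + minute * 60 + second - tzOffsetSecond;
}
```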
The test that checks for no output from the server was leaving a connection open which valgrind was complaining about.
Wait on the server long enough to cause the error on the client then close the connection to free the memory.
Validate that checksums exist for zero size files. This means that the checksums for zero size files are explicitly set by backup even though they'll always be the same. Also validate that zero length files have the correct checksum.
Validate that repo size is > 0 if size is > 0. No matter what compression type is used a non-zero amount of data cannot be stored in zero bytes.
This is a modest start but it addresses the specific issue that was caused by the bug fixed in 45ec694a. This validation will produce an immediate error rather than erroring out partway through the restore.
More validations are planned but this is the most important one and seems safest for this release.
If a file was removed by PostgreSQL during the backup (or was missing from the standby) then the next file might not be copied and updated in the manifest. If this happened then the backup would error when restored.
The issue was that removing files from the manifest invalidated the pointers stored in the processing queues. When a file was removed, all the pointers shifted to the next file in the list, causing a file to be unprocessed. Since the unprocessed file was still in the manifest it would be saved with no checksum, causing a failure on restore.
When process-max was > 1 then the bug would often not express since the file had already been pulled from the queue and updates to the manifest are done by name rather than by pointer.
pkg-config is a generic way to get build options rather than relying on a package-specific utility.
XML2_CONFIG can be used to override this utility for systems that do not ship pkg-config.
Previously memNew() used memset() to initialize all struct members to 0, NULL, false, etc. While this appears to work in practice, it is a violation of the C specification. For instance, NULL == 0 must be true, but neither NULL nor 0 need be represented with all zero bits.
Instead use designated initializers to initialize structs. These guarantee that struct members will be properly initialized even if they are not specified in the initializer. Note that due to a quirk in the C99 specification at least one member must be explicitly initialized even if it needs to be the default value.
Since pre-zeroed memory is no longer required, adjust memAllocInternal()/memReallocInternal() to return raw memory and update dependent functions accordingly. All instances of memset() have been removed except in debug/test code where needed.
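Together the pattern looks like this, assuming the project's memNew() allocator and a hypothetical struct:

```c
#include <stdbool.h>
#include <stddef.h>

typedef struct Example
{
    int count;
    const char *name;
    bool done;
} Example;

Example *
exampleNew(void)
{
    // memNew() now returns raw, uninitialized memory
    Example *this = memNew(sizeof(Example));

    // Designated initializer: name and done are guaranteed NULL/false even
    // though unspecified; at least one member must appear (C99 quirk)
    *this = (Example){.count = 0};

    return this;
}
```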
Add memNewPtrArray() to allocate an array of pointers and automatically set all pointers to NULL.
Rename memGrowRaw() to the more logical memResize().
The timeline is required to verify WAL segments in the archive after a backup. The conversion was performed base 10 instead of 16, which led to errors when the timeline was ≥ 0xA.
This macro block encapsulates the common pattern of switching to the prior (formerly called old) mem context to return results from a function.
Also rename MEM_CONTEXT_OLD() to memContextPrior(). This violates our convention of macros being in all caps but memContextPrior() will become a function very soon so this will reduce churn.
A few places were using just memContextNew(), probably because they did not immediately need to create anything in the new context, but it's better if we use the same pattern everywhere, even if it results in a few extra mem context switches.
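Roughly what the resulting pattern looks like; doExpensiveWork() is a hypothetical helper:

```c
String *
exampleFunc(void)
{
    String *result = NULL;

    MEM_CONTEXT_TEMP_BEGIN()
    {
        String *temp = doExpensiveWork();

        // Switch to the prior context just long enough to copy out the result
        MEM_CONTEXT_PRIOR_BEGIN()
        {
            result = strDup(temp);
        }
        MEM_CONTEXT_PRIOR_END();
    }
    MEM_CONTEXT_TEMP_END();

    return result;
}
```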
Bug Fixes:
* Fix options being ignored by asynchronous commands. The asynchronous archive-get/archive-push processes were not loading options configured in command configuration sections, e.g. [global:archive-get]. (Reviewed by Cynthia Shang. Reported by Urs Kramer.)
* Fix handling of \ in filenames. \ was not being properly escaped when calculating the manifest checksum which prevented the manifest from loading. Since instances of \ in cluster filenames should be rare to nonexistent this does not seem likely to be a serious problem in the field.
Features:
* pgBackRest is now pure C.
* Add pg-user option. Specifies the database user name when connecting to PostgreSQL. If not specified pgBackRest will connect with the local OS user or PGUSER, which was the previous behavior. (Contributed by Mike Palmiotto.)
* Allow path-style URIs in S3 driver.
Improvements:
* The backup command is implemented entirely in C. (Reviewed by Cynthia Shang.)
The local, remote, archive-get-async, and archive-push-async commands were used to run functionality that was not directly available to the user. Unfortunately that meant they would not pick up options from the command that the user expected, e.g. backup, archive-get, etc.
Remove the internal commands and add roles which allow pgBackRest to determine what functionality is required without implementing special commands. This way the options are loaded from the expected command section.
Since remote is no longer a specific command with its own options, more manipulation is required when calling remote. This might be something we can improve in the config system but it may be worth leaving as is because it is a one-off, for now at least.
Although path-style URIs (e.g. https://s3.amazonaws.com/bucket/file rather than https://bucket.s3.amazonaws.com/file) have been deprecated by AWS, they may still be used with products like Minio because no additional DNS configuration is required.
Path-style URIs must be explicitly enabled since it is not clear how they can be auto-detected reliably. More importantly, faulty detection could cause regressions in current installations.
Time is supported in all drivers with the update to S3 at 61538f93, so it is now possible to add time to the ls command and have it work on all repo types.
Parameter lists that are passed directly to exec*() do not need quoting when spaces are present. Worse, the quotes will not be stripped and the option value will be garbled.
Unfortunately this still does not fix all issues with quoting since we don't know how it might need to be escaped to work with SSH command configuration. The answer seems to be to pass the options in the protocol layer but that's beyond the scope of this commit.
This option was overloaded on the general type option but it makes sense to split this out since the meaning is pretty different.
Rename the values to conform to current standards, i.e. pg and repo, now that the Perl code won't care anymore.
Previously dates were not being filled by these functions which was fine since dates were not used.
We plan to use dates for the ls command plus it makes sense for the driver to be complete since it will be used as an example.
These are similar to what mktime() and strptime() do but they ignore the local system timezone which saves having to munge the TZ env variable to do time conversions.
This macro was created before the String object existed so subsequent usage with String always included a lot of strPtr() wrapping.
TEST_RESULT_STR_Z() had already been introduced but a wholesale replacement of TEST_RESULT_STR() was not done since the priority was on the C migration.
Update all calls to (old) TEST_RESULT_STR() with one of the following variants: (new) TEST_RESULT_STR(), TEST_RESULT_STR_Z(), TEST_RESULT_Z(), TEST_RESULT_Z_STR().
Specifies the database user name when connecting to PostgreSQL.
If not specified pgBackRest will connect with the local OS user or PGUSER, which was the previous behavior.
Most of these tests are just checking that errors are thrown when required. These are well covered in various unit tests.
The "cannot resume" tests are also well covered in the backup unit tests.
Finally, config warnings are well covered in the config unit tests.
There is more to be done here, but this accounts for the low-hanging fruit.
Set log-level-file=off when more than one test will run. In this case it is impossible to see the logs anyway since they will be automatically cleaned up after the test. This improves performance pretty dramatically since trace-level logging is expensive. If a single integration test is run then log-level-file is trace by default but can be changed with the --log-level-test-file option.
Reduce buffer-size to 64k to save memory during testing and allow more processes to run in parallel.
Update log replacement rules so that these options can change without affecting expect logs.
The co6 tests were occasionally running out of space so bump up the size of the ramdisk a bit to hopefully prevent this.
A longer term solution would be to disable the trace-level file logs when running on Travis CI since they seem to be using most of the space.
PostgreSQL >= 9.6 uses non-exclusive backup which has implicit stop-auto since the backup will stop when the connection is terminated.
The warning was made more verbose in 1f2ce45e but this now seems like a bad idea since there are likely users with mixed version environments where stop-auto is enabled globally. There's no reason to fill their logs with warnings over a harmless option. If anything we should warn when stop-auto is explicitly set to false but this doesn't seem very important either.
Revert to the prior behavior, which is to warn and reset when stop-auto is enabled on PostgreSQL < 9.3.
\ was not being properly escaped when calculating the manifest checksum which prevented the manifest from loading.
Use jsonFromStr() to properly quote and escape \.
Since instances of \ in cluster filenames should be rare to nonexistent this does not seem likely to be a serious problem in the field.
Remove embedded Perl from the distributed binary. This includes code, configure, Makefile, and packages. The distributed binary is now pure C.
Remove storagePathEnforceSet() from the C Storage object which allowed Perl to write outside of the storage base directory. Update mock/all and real/all integration tests to use storageLocal() where they were violating this rule.
Remove "c" option that allowed the remote to tell if it was being called from C or Perl.
Code to convert options to JSON for passing to Perl (perl/config.c) has been moved to LibC since it is still required for Perl integration tests.
Update build and installation instructions in the user guide.
Remove all Perl unit tests.
Remove obsolete Perl code. In particular this included all the Perl protocol code which required modifications to the Perl storage, manifest, and db objects that are still required for integration testing but only run locally. Any remaining Perl code is required for testing, documentation, or code generation.
Rename perlReq to binReq in define.yaml to indicate that the binary is required for a test. This had been the actual meaning for quite some time but the key was never renamed.
For the most part this is a direct migration of the Perl code into C except as noted below.
A backup can now be initiated from a linked directory. The link will not be stored in the manifest or recreated on restore. If a link or directory does not already exist in the restore location then a directory will be created.
The logic for creating backup labels has been improved and it should no longer be possible to get a backup label earlier than the latest backup even with timezone changes or clock skew. This has never been an issue in the field that we know of, but we found it in testing.
For online backups all times are fetched from the PostgreSQL primary host (before only copy start was). This doesn't affect backup integrity but it does prevent clock skew between hosts affecting backup duration reporting.
Archive copy now works as expected when the archive and backup have different compression settings, i.e. when one is compressed and the other is not. This was a long-standing bug in the Perl code.
Resume will now work even if hardlink settings have been changed.
Reviewed by Cynthia Shang.
/ takes precedence over & but the appropriate parens were not provided, e.g. `a & b / c` parses as `a & (b / c)`.
By some bad luck the tests worked either way, so add a new test that only works the correct way to prevent a regression.
Bug Fixes:
* Fix archive-push/archive-get when PGDATA is symlinked. These commands tried to use cwd() as PGDATA but this would disagree with the path configured in pgBackRest if PGDATA was symlinked. If cwd() does not match the pgBackRest path then chdir() to the path and make sure the next cwd() matches the result from the first call. (Reported by Stephen Frost, Milosz Suchy.)
* Fix reference list when backup.info is reconstructed in expire command. Since the backup command is still using the Perl version of reconstruct this issue will not express unless 1) there is a backup missing from backup.info and 2) the expire command is run directly instead of running after backup as usual. This unlikely combination of events means this is probably not a problem in the field.
* Fix segfault on unexpected EOF in gzip decompression. (Reported by Stephen Frost.)
The TZ environment variable was not reliably pushed down to the test processes.
Instead pass TZ via a command line parameter and set explicitly in the test process.
Commit 7168e074 tried to use cwd() as PGDATA but this would disagree with the path configured in pgBackRest if PGDATA was symlinked.
If cwd() does not match the pgBackRest path then chdir() to the path and make sure the next cwd() matches the result from the first call.
If the compressed stream terminated early then the decompression process would get a flush request (NULL input buffer) since the filter was not marked as done. This could happen on a zero-length or truncated (i.e. invalid) compressed file.
Change the existing assertion to an error to catch this condition in production gracefully.
82df7e6f and 9856fef5 updated tests that used test points in preparation for the feature not being available in the C code.
Since test points are no longer used remove the infrastructure.
Also remove one stray --test option in mock/all that was essentially a noop but no longer works now that the option has been removed.
Pq script errors are now printed in test output in case they are being masked by a later error.
Once a script error occurs, the same error will be thrown forever rather than throwing a new error on the next item in the script.
HRNPQ_MACRO_CLOSE() is not required in scripts unless harnessPqScriptStrictSet(true) is called. Most higher-level tests should not need to run in strict mode.
The command/check test seems to require strict mode but there's no apparent reason why it should. This would be a good thing to look into at some point.
Some log output (e.g. time) is hard to test because the values can change between tests.
Add expressions to replace substrings in the log with predictable values to simplify testing.
This is similar to the log replacement facility available for Perl expect log testing.
A recopy would occur if the size or checksum was invalid but on error the backup would terminate.
Instead, recopy the resumed file on any error. If the error is systemic (e.g. network failure) then it should show up again during the recopy.
These were not getting updated to match the directory name when the manifests were copied.
The Perl code didn't care but the C code expects labels to be set correctly.
For now this is only used in testing but there are places where it could be useful in the core code.
Even if that turns out not to be true, it doesn't seem worth implementing a new version in testing just to capture a few values that we already have.
This is to maintain compatibility with the older Perl code that returned the lowest sorted order item in a tie.
For other datatypes the C code returns the same value, often enough at least to not cause churn in the expect tests.
Adding a manifest to backup.info was migrated to C in 4e4d1f41 but deduplication of the references was missed leading to a reference for every file being added to backup.info.
Since the backup command is still using the Perl version of reconstruct this issue will not express unless 1) there is a backup missing from backup.info and 2) the expire command is run directly instead of running after backup as usual.
This unlikely combination of events means this is probably not a problem in the field.
Archive check does not run when in offline backup mode but the option was set to true in the manifest. It's harmless since these options are informational only but it could cause confusion when debugging.
Test points are not supported by the new C code so these will be replaced with unit tests.
The fact that the tests still pass even when the changes aren't made mid-backup (except application_name) shows how weak they were in the first place.
Even so, this does represent a regression in (soon to be removed) Perl coverage.
Test points will not be available in the C code so update these tests as best as possible without using them.
This represents a loss of coverage for the Perl code (soon to be removed) which will be made up in the C code with unit tests.
These tests require test points which are not being implemented in the C code.
This functionality is fully tested in the command/control unit tests so integration tests are no longer required.
This expression determines which files contain page checksums but it was also including the directory above the relation directories. In a real PostgreSQL installation this is not a problem because these directories don't contain any files.
However, our tests place a file in `base` which the Perl code thought should have page checksums while the new C code says no.
Update the expression to document the change and avoid churn in the expect logs later.
Installing lcov 1.14 everywhere turned out to be a problem just as using 1.13 on Ubuntu 19.04 was.
Since we primarily use Ubuntu 18.04 for coverage testing and reporting, we definitely want to make sure that works. So, revert to using the default packaged lcov except when specified otherwise in VmTest.pm.
PostgreSQL minor version releases are also included since all containers have been rebuilt.
The issue "fixed" in f01aa586 was caused by treating all strings as format strings while logging, which was fixed in 0c05df45.
Revert because there no longer seems a reason for the extra logic, and it was only partially applied, i.e. not to env vars, command-line options, or config options.
Using the same macros for formatted and unformatted logging had several disadvantages.
First, the compiler was unable to verify the format string against the parameters.
Second, legitimate % characters in messages were being interpreted as format characters with garbage output ensuing.
Add _FMT() variants and update all call sites to use the correct variant.
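For example, assuming the project's log macros:

```c
static void
logExample(unsigned int pctComplete)
{
    LOG_INFO("backup 100% complete");                   // literal %, safe
    LOG_INFO_FMT("backup %u%% complete", pctComplete);  // format checked
}
```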
This character causes problems in C and in the shell if we try to output it in an error message.
Forbid it completely and spell it out in error messages to avoid strange effects.
There is likely a better way to deal with the issue but this will do for now.
The protocol timeout tests have been superseded by unit tests.
The TEST_BACKUP_RESUME test point was incorrectly included into a number of tests, probably a copy pasto. It didn't hurt anything but it did add 200ms to each test where it appeared.
Catalog and control version tests were redundant. The database version and system id tests covered the important code paths and the C code gets these values from a lookup table.
Finally, fix an incomplete update to the backup.info file while munging for tests.
It is occasionally useful to get information about a file outside of the base storage path. storageLocal() can be used in some cases but when the storage is remote it doesn't seem worth creating a separate storage object for adhoc info requests.
storageInfo() is a read-only operation so this seems pretty safe. The noPathEnforce parameter will make auditing exceptions easy.
The ability to disable enforcement (i.e., the requested absolute path is within the storage path) globally will be removed after the Perl migration.
The feature will still be needed occasionally so allow it in an adhoc fashion.
A . in a link will always lead to an error since the destination will be inside PGDATA. However, it is accepted symlink syntax so it's better to resolve it and get the correct error message.
Also, we may have other uses for this function in the future.
Adding a dummy column which is always set by the P() macro allows a single macro to be used for parameters or no parameters without violating C's prohibition on the {} initializer.
-Wmissing-field-initializers remains disabled because it still gives wildly different results between versions of gcc.
It wasn't practical for the main process to be ignorant of the remote path, and in any case knowing the path makes debugging easier.
Pull the remote path when connecting and pass the result of local storagePath() to the remote when making calls.
Pushing output through a JSON blob is not practical if the output is extremely large, e.g. a backup manifest with 100K+ files.
Add read/write routines so that output can be returned in chunks but errors will still be detected.
Previously, options were being filtered based on what was currently valid. For chained commands (e.g. backup then expire) some options may be valid for the first command but not the second.
Filter based on the command definition rather than what is currently valid to avoid logging options that are not valid for subsequent commands. This reduces the number of options logged and will hopefully help avoid confusion and expect log churn.
Bug Fixes:
* Fix remote timeout in delta restore. When performing a delta restore on a largely unchanged cluster the remote could timeout if no files were fetched from the repository within protocol-timeout. Add keep-alives to prevent remote timeout. (Reported by James Sewell, Jens Wilke.)
* Fix handling of repeated HTTP headers. When HTTP headers are repeated they should be considered equivalent to a single comma-separated header rather than generating an error, which was the prior behavior. (Reported by donicrosby.)
Improvements:
* JSON output from the info command is no longer pretty-printed. Monitoring systems can more easily ingest the JSON without linefeeds. External tools such as jq can be used to pretty-print if desired. (Contributed by Cynthia Shang.)
* The check command is implemented entirely in C. (Contributed by Cynthia Shang.)
Documentation Improvements:
* Document how to contribute to pgBackRest. (Contributed by Cynthia Shang.)
* Document maximum version for auto-stop option. (Contributed by Brad Nicholson.)
Test Suite Improvements:
* Fix container test path being used when --vm=none. (Suggested by Stephen Frost.)
* Fix mismatched timezone in expect test. (Suggested by Stephen Frost.)
* Don't autogenerate embedded libc code by default. (Suggested by Stephen Frost.)
When HTTP headers are repeated they should be considered equivalent to a single comma-separated header rather than generating an error, which was the prior behavior.
Reported by donicrosby.
We had some problems with newer versions so we had held off on updating. Those problems appear to have been resolved.
In addition, the --compat flag is no longer required. Prior versions of MinIO required all parts of a multi-part upload (except the last) to be of equal size. The --compat flag was introduced to restore the default S3 behavior. Now --compat is only required when ETag is being used for MD5 verification, which we don't do.
Previously we were using int64_t to debug time_t but this may not be right depending on how the compiler represents time_t, e.g. it could be a floating-point type.
Since a mismatch would have caused a compiler error we are not worried that this has actually happened, and anyway the worst case is that the debug log would be wonky.
The primary benefit, aside from correctness, is that it makes choosing a parameter debug type for time_t obvious.
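For instance (illustrative only), code that logs a time_t should not assume its representation:

    #include <stdio.h>
    #include <time.h>

    int main(void)
    {
        time_t now = time(NULL);

        // Cast explicitly to a known type instead of assuming time_t is
        // int64_t -- or better, give time_t its own debug type as this
        // commit does.
        printf("now = %lld\n", (long long)now);
        return 0;
    }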
Previously the mock integration tests would be skipped for VMs other than the standard four used in CI. Now VMs outside the standard four will run the same tests as VM4 (currently U18).
Using pg1-path, as we were doing previously, could lead to WAL being copied to/from unexpected places. PostgreSQL sets the current working directory to PGDATA so we can use that to resolve relative paths.
This makes configuring tests easier.
Also add a parameter for tests that require sudo. This should be retired at some point but some tests still require it.
This will likely improve performance, but it also makes the filesystem consistent between platforms.
A number of tests were failing on shiftfs, which was the default for arm64 on Travis.
1.13 is not compatible with gcc 8 which is what ships with newer distributions. Build from source to get a more recent version.
1.13 is not compatible with gcc 9 so we'll need to address that at a later date.
This user was created before we tested in containers to ensure isolation between the pg and repo hosts which were then just directories. The downside is that this resulted in a lot of sudos to set the pgbackrest user and to remove files which did not belong to the main test user.
Containers provide isolation without needing separate users so we can now safely remove the pgbackrest user. This allows us to remove most sudos, except where they are explicitly needed in tests.
While we're at it, remove the code that installed the Perl C library (which also required sudo) and simply add the build path to @INC instead.
This test was not creating recovery.signal when testing with --type=preserve. The preserve recovery type only keeps existing files and does not create any.
RC1 was just ignoring recovery.signal and going right into recovery. Weirdly, 12.0 used restore_command to do crash recovery which made the problem harder to diagnose, but this has now been fixed in PostgreSQL and should be released in 12.1.
This is only needed when new code is added to the Perl C library, which is becoming rare as the migration progresses.
Also, the code will vary slightly based on the Perl version used for generation so for normal users it is just noise.
Suggested by Stephen Frost.
A number of tests have been updated and Fedora 30 has been added to the test suite so the unit tests can run on gcc 9.
Stop running unit tests on co6/7 since we appear to have ample unit test coverage.
This tool was only being used in a few places but was a pretty large dependency.
Rework the forceStorageMove() code using our storage layer and replace one aws cli cp with a storage put.
Also, remove the Dockerfile that was once used to build the Scality S3 test container.
Now that our tests are more diversified it makes sense to load only the packages that are needed for each test.
Move the package loads from .travis.yml to test/travis.pl where we have more control over what is loaded.
Note that building the manifest on each host has been temporarily removed.
This feature will likely be brought back as a non-default option (after the manifest code has been fully migrated to C) since it can be fairly expensive.
Check the backup.info file against the backup path. Add any backups that are missing and remove any backups that no longer exist.
It's important to run this before backup or expire to be sure we are using the most up-to-date list of backups.
Three major changes were required to get this working:
1) Provide the path to pgbackrest in the build directory when running outside a container. Tests in a container will continue to install and run against /usr/bin/pgbackrest.
2) Set a per-test lock path so tests don't conflict on the default /tmp/pgbackrest path. Also set a per-test log-path while we are at it.
3) Use localhost instead of a custom host for TLS test connections. Tests in containers will continue to update /etc/hosts and use the custom host.
Add infrastructure and update harnessCfgLoad*() to get the correct exe and paths loaded for testing.
Since new tests are required to verify that running outside a container works, also rework the tests in Travis CI to provide coverage within a reasonable amount of time. Mainly, break up the doc tests by VM and run an abbreviated unit test suite on co6 and co7.
Recovery settings are now written into postgresql.auto.conf instead of recovery.conf. Existing recovery_target* settings will be commented out to help avoid conflicts.
A comment is added before recovery settings to identify them as written by pgBackRest since it is unclear how, in general, old settings will be removed.
recovery.signal and standby.signal are automatically created based on the recovery settings.
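The written block might look roughly like this (the stanza name, comment wording, and target setting are illustrative, not the exact generated output):

    # Recovery settings generated by pgBackRest restore.
    restore_command = 'pgbackrest --stanza=demo archive-get %f "%p"'
    recovery_target_time = '2021-01-01 00:00:00'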
The additional details include databases that can be used for selective restore and a list of tablespaces and symlinks with their default destinations.
This information is not included in the JSON output because it requires reading the manifest which is too IO intensive to do for all manifests. We plan to include this information for JSON in a future release.
Scaling allows the starting values to be increased from the command-line without code changes.
Also suppress valgrind and assertions when running performance testing. Optimization is left at -O0 because we should not be depending on compiler optimizations to make our code performant, and it makes profiling more informative.
bsearch() is far more efficient than an iterative approach except in the most trivial cases.
For now insert will reset the sort order to none and the list will need to be resorted before bsearch() can be used. This is necessary because item pointers are not stable after a sort, i.e. they can move around. Until lists are stable it's not a good idea to surprise the caller by mixing up their pointers on insert.
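The standard library's bsearch() illustrates the principle the list code now follows (sort first, then search):

    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>

    static int compare(const void *a, const void *b)
    {
        return strcmp(*(const char *const *)a, *(const char *const *)b);
    }

    int main(void)
    {
        const char *list[] = {"restore", "backup", "expire", "archive"};

        // The list must be sorted before bsearch() can be used
        qsort(list, 4, sizeof(const char *), compare);

        const char *key = "expire";
        const char **found = bsearch(&key, list, 4, sizeof(const char *), compare);

        printf("%s\n", found != NULL ? *found : "not found");
        return 0;
    }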
PostgreSQL 12 will shut down in these cases, which seems to be the correct action (according to the documentation) when hot_standby = off, but older versions are promoting instead. Set target_action explicitly so all versions will behave the same way.
This does beg the question of whether the PostgreSQL 12 behavior is wrong (though it matches the docs) or the previous versions are.
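In recovery.conf terms (these are standard PostgreSQL settings, shown only to illustrate setting the action explicitly):

    recovery_target = 'immediate'
    recovery_target_action = 'shutdown'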
Separate the generation of recovery values and formatting them into recovery.conf format. This is generally a good idea, but also makes the code ready to deal with a different recovery file in PostgreSQL 12.
Also move the recovery file logic out of cmdRestore() into restoreRecoveryWrite().
This restore type automatically adds standby_mode=on to recovery.conf.
This could be accomplished previously by setting --recovery-option=standby_mode=on but PostgreSQL 12 requires standby mode to be enabled by a special file named standby.signal.
The new restore type allows us to maintain a common interface between PostgreSQL versions.
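Usage is then uniform across versions (the stanza name here is illustrative):

    pgbackrest --stanza=demo --type=standby restore

On PostgreSQL 12 this enables standby mode via standby.signal; on earlier versions it adds standby_mode=on to recovery.conf.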