pgbackrest

mirror of https://github.com/pgbackrest/pgbackrest.git synced 2024-12-14 10:13:05 +02:00

Author	SHA1	Message	Date
David Steele	3181cd0ab8	Set WAL long header flag in test function missed in `e938a892`. This was missed because the unit tests were reusing a buffer without resetting it to zero, so this flag ended up still set when the test function was called. This was not a live issue since it only expressed in tests and this code is not used in master yet.	2019-03-26 18:46:44 +02:00
David Steele	5ee8388f48	Build test harness with the same warnings as code being tested. The test harness was not being built with warnings which caused some wackiness with an improperly structured switch. Just use the same warnings as the code being tested. Also enable warnings on code that is not directly being tested since other code modules are frequently modified during testing.	2019-03-26 08:20:55 +02:00
David Steele	abba2bd132	Add strLstMergeAnti() for merge anti-joins. We deal with some pretty big lists in archive-push so a nested-loop anti-join looked like it would not be efficient enough. This merge anti-join should do the trick even though both lists must be sorted first.	2019-03-25 20:35:20 +04:00
David Steele	538e2c7474	Add storagePg()/storagePgWrite() to storage helper. Add read/write storage helper functions for the PostgreSQL data directory.	2019-03-25 09:45:36 +04:00
David Steele	444b4f8640	Allow warnings to be written by archiveAsyncStatusOkWrite(). Migrate this feature to the C code since archive-push will need it.	2019-03-25 09:17:15 +04:00
David Steele	8820d69574	Use a single file to handle global errors in async archiving. The prior behavior on a global error (i.e. not file specific) was to write an individual error file for each WAL file being processed. On retry each of these error files would be removed, and if the error was persistent, they would then be recreated. In a busy environment this could mean tens or hundreds of thousands of files. Another issue was that the error files could not be written until a list of WAL files to process had been generated. This was easy enough for archive-get but archive-push requires more processing and any errors that happened when generating the list would only be reported in the pgBackRest log rather than the PostgreSQL log. Instead write a global.error file that applies to any WAL file that does not have an explicit ok or error file. This reduces churn and allows more errors to be reported directly to PostgreSQL.	2019-03-25 08:12:38 +04:00
David Steele	1fff60ad2a	Remove unused infoArchiveCheckPg() function. The idea was to use this function for archive-push, but in the end archive-push required two very different versions of the function.	2019-03-24 14:34:45 +04:00
David Steele	7cf7373761	Refactor PostgreSQL interface to remove most code duplication. Having a copy per version worked well until it was time to add new features or modify existing functions. Then it was necessary to modify every version and try to keep them all in sync. Consolidate all the PostgreSQL types into a single file using #if for type versions. Many types do not change or change infrequently so this cuts down on duplication. In addition, it is far easier to see what has changed when a new version is added. Use macros to write the interface functions. There is still duplication here since some changes require a new copy of the macro, but it is far less than before.	2019-03-21 21:11:36 +04:00
David Steele	e938a89250	Add WAL info to PostgreSQL interface. This allows the WAL header to be read for any supported version on PostgreSQL.	2019-03-19 19:44:06 +04:00
David Steele	e26d510d0c	Use restore command for remote performances tests. Since archive-push is being moved to C, the Perl remote will no longer work with that command. Eventually this module will need to be rewritten in C, but for now just use the restore command which is planned to be migrated last.	2019-03-17 22:11:35 +04:00
David Steele	856a369b86	Add file write to the S3 storage driver. Now that repositories are writable the storage drivers that don't yet support file writes need to be updated to do so. Note that the part size for multi-part upload has not been defined as a proper constant. This will become an option in the near future so it doesn't seem worth creating a constant that we might then forget to remove.	2019-03-17 22:00:54 +04:00
David Steele	7193738288	Add document creation to XML objects. The xml objects only exposed read methods of the underlying libxml2. This worked for S3 commands that only received data but to send data we need to be able to create XML documents from scratch. Add the ability to create empty documents and add nodes and contents.	2019-03-17 21:38:07 +04:00
David Steele	8ebc6d6c34	Add file write to the remote storage driver. Now that repositories are writable the storage drivers that don't yet support file writes need to be updated to do so.	2019-03-16 21:50:19 +04:00
David Steele	2d386cd266	Move WAL path prefix logic into walPath(). This logic is used by both archive-push and archive-get.	2019-03-16 16:14:10 +04:00
David Steele	12273a1034	Add storageRepoWrite() to storage helper. Writable repos are allowed with the addition of the CIFS driver, so add a helper function.	2019-03-16 15:48:50 +04:00
David Steele	66c2f4cd2e	Make notion of current PostgreSQL info ID in C align with Perl. The C code was assuming that the current PostgreSQL version in archive.info/backup.info was the most recent item in the history, but this is not always the case with some stanza-upgrade scenarios. If a cluster is restored from before the upgrade and stanza-upgrade is run again, it will revert db-id to the original history item. Instead, load db-id from the db section explicitly as the Perl code does. This did not affect archive-get since it does a reverse scan through the history versions and does not rely on the current version.	2019-03-16 15:27:38 +04:00
David Steele	b2b2cf0511	Fix issues with remote/local command logging options. Logging was being enable on local/remote processes even if --log-subprocess was not specified, so fix that. Also, make sure that stderr is enabled at error level as it was on Perl. This helps expose error information for debugging. For remotes, suppress log and lock paths since these are not applicable on remote hosts. These options should be set in the local config if they need to be overridden.	2019-03-16 15:00:02 +04:00
David Steele	d377e926c8	httpClientRequest() accepts a body parameter. None of our C HTTP requests have needed to output a body, but they will with the migration of archive-push. Also, add constants that are useful when POSTing/PUTing data.	2019-03-16 13:07:43 +04:00
David Steele	078df66e2c	Add httpHeaderDup(). Allow duplication of headers and updating of redactions.	2019-03-16 12:51:37 +04:00
David Steele	9c1549585e	Add command for removing all Docker containers to the Vagrantfile.	2019-03-15 22:18:56 +04:00
David Steele	b68d1d60a1	Add cvtSSizeToZ() and debug macros.	2019-03-15 22:16:50 +04:00
David Steele	5554377632	Make strLstDup() null-tolerant. Duping a NULL StringList without checking if it is NULL is a useful capability.	2019-03-15 10:20:52 +04:00
David Steele	9382283586	Fix issues when a path option is / terminated. This condition was not being properly checked for in the C code and it caused problems in the info command, at the very least. Instead of applying a local fix, introduce a new path option type that will rigorously check the format of any incoming paths. Reported by Marc Cousin.	2019-03-14 13:48:33 +04:00
David Steele	b8ebea6b1c	Add separate archive-push-async command. This command was previously forked off from the archive-push command which required a bit of artificial option and log manipulation. A separate command is easier to test and will work on platforms that don't have fork(), e.g. Windows.	2019-03-14 13:38:55 +04:00
David Steele	982b47c5ec	Add CIFS storage driver. This driver borrows heavily from the Posix driver. At this point the only difference is that CIFS does not allow explicit directory fsyncs so they need to be suppressed. At some point the CIFS diver will also omit link support. With the addition of this driver repository storage is now writable.	2019-03-14 13:28:33 +04:00
David Steele	941dbb4731	Remove package patch merged to upstream.	2019-03-14 13:02:19 +04:00
David Steele	68d20edea6	v2.11: C Implementation of Archive Get Bug Fixes: * Fix possible truncated WAL segments when an error occurs mid-write. (Reported by blogh.) * Fix info command missing WAL min/max when stanza specified. (Fixed by Stefan Fercot.) * Fix non-compliant JSON for options passed from C to Perl. (Reported by Leo Khomenko.) Improvements: * The archive-get command is implemented entirely in C. * Enable socket keep-alive on older Perl versions. (Contributed by Marc Cousin.) * Error when parameters are passed to a command that does not accept parameters. (Suggested by Jason O'Donnell.) * Add hints when unable to find a WAL segment in the archive. (Suggested by Hans-Jürgen Schönig.) * Improve error when hostname cannot be found in a certificate. (Suggested by James Badger.) * Add additional options to backup.manifest for debugging purposes. (Contributed by blogh.)	2019-03-10 18:56:00 +02:00
David Steele	2ef5ad70a2	Move crypto module to common/crypto. It makes sense for the crypto code to be in common since it is not pgBackRest-specific. Also combine the crypto tests into a single module.	2019-03-10 13:27:30 +02:00
David Steele	95597be81e	Move compress module to common/compress. It makes sense for the compression code to be in common since it is not pgBackRest-specific.	2019-03-10 13:11:20 +02:00
David Steele	2f63babe9d	Move help/help test module to command/help.	2019-03-10 11:55:01 +02:00
blogh	e4e2606fce	Add additional options to backup.manifest for debugging purposes. Add the buffer-size, compress-level, compress-level-network, and process-max options to the backup:option section in backup.manifest to aid in debugging. It may also make sense to propagate these options up to backup.info so they can be displayed in the info command, but for now this is deemed sufficient. Contributed by blogh.	2019-03-10 11:03:52 +02:00
David Steele	21f56f64eb	Add hints when unable to find a WAL segment in the archive. When this error happens in the context of a backup it can be a bit mystifying as to why the backup is failing. Add some hints to get the user started. These hints will appear any time a WAL segment can't be found, which makes the hint about the check command redundant when the user is actually running the check command, but it doesn't seem worth trying to exclude the hint in that case. Suggested by Hans-Jürgen Schönig.	2019-03-10 10:38:12 +02:00
Douglas J Hunley	786e77a9fe	Make DESTDIR fully-configurable in the Makefile. DESTDIR always had /usr/bin appended which was a problem systems that don't use /usr/bin as the install location for binaries. Instead, use the value of DESTDIR exactly and update the Debian packages accordingly. Contributed by Douglas J Hunley.	2019-03-09 11:40:55 +02:00
David Steele	60fe5b7365	Error when parameters are passed to a command that does not accept parameters. This behavior allowed a command like this to run without error: pgbackrest backup --stanza=db full Even though it actually performed an incremental backup in most circumstances because the `full` parameter was ignored. Instead, output an error and exit. Suggested by Jason O'Donnell.	2019-03-09 11:03:47 +02:00
David Steele	cad6fedb7b	Prevent option warning from being output when running help command. This warning was being output when getting help if retention was not set: WARN: option repo1-retention-full is not set, the repository may run out of space Suppress this when getting help since the warning will display by default on a system that is not completely configured.	2019-03-07 15:54:59 +02:00
David Steele	d441061168	Create test matrix for mock/all to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-02 15:01:02 +02:00
David Steele	f7d1d4400f	Create test matrix for mock/expire to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests.	2019-03-01 19:04:26 +02:00
David Steele	91622942c2	Create test matrix for mock/archive-stop to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-03-01 17:12:41 +02:00
David Steele	c307d9fdf9	Don't make a copy of the context name in the MemContext module. This is very inefficient in terms of memory and time and dynamic context names were never utilized. Just require that context names be valid for the life of the context. In practice they are all static strings.	2019-03-01 15:58:52 +02:00
David Steele	90709dfd21	Improve performance of context and memory allocations in MemContext module. Allocations required a sequential scan through the allocation list for both contexts and memory. This was very inefficient since for the most part individual memory allocations are seldom freed directly, rather they are freed when their context is freed. For both types of allocations track an index for the lowest free position. After an allocation of the free position, a sequential search will be required for the next allocation but this is still far better than doing a scan for every allocation. With a moderately-sized dataset (500 history entries in backup.info), there is a 237X performance improvement when combined with the `f74e88bb` refactor. Before: % cumulative self time seconds seconds name 65.11 331.37 331.37 memContextAlloc 16.19 413.78 82.40 memContextCurrent 14.74 488.81 75.03 memContextTop 2.65 502.29 13.48 memContextNewIndex 1.18 508.31 6.02 memFind After: % cumulative self time seconds seconds name 94.69 2.14 2.14 memFind Finding memory allocations in order to free or resize them is the next bottleneck, but this does not seem to be a major issue presently.	2019-03-01 14:57:01 +02:00
Marc Cousin	cb3b4fa24b	Enable socket keep-alive on older Perl versions. The prior method depended on IO:Socket:SSL to push the keep-alive options down to the socket but it only worked for recent versions of the module. Instead, create the socket directly using IO::Socket::IP if available or IO:Socket:INET as a fallback. The keep-alive option is set directly on the socket before it is passed to IO:Socket:SSL. Contributed by Marc Cousin.	2019-02-28 14:33:29 +02:00
David Steele	0913523096	Cleanup local/remote protocol interaction from `9367cc46`. The command option was not being set correctly when a remote was started from a local. It was being set as 'local' rather than the command that the local was running as. Also automatically select the remote protocol id based on whether it is started from a local (use the local protocol id) or from the main process (use 0). These were not live issues but could cause strange behaviors as new features are added that might be hard to diagnose.	2019-02-28 09:51:19 +02:00
David Steele	db4b447be8	The archive-get command is implemented entirely in C. This new implementation should behave exactly like the old Perl code with the exception of a few updated log messages. Remove as much of the Perl code as possible without breaking other commands.	2019-02-27 23:03:02 +02:00
David Steele	9367cc461c	Migrate local command to C. The C local is only used for C commands in the main process. Some tweaking of the existing protocolGet() command was required. Originally the idea was to share the function for local and remote requests but the differences (as in Perl) were too great to make that practical.	2019-02-27 22:34:21 +02:00
David Steele	35abd4cd95	Add ProtocolParallel* objects for parallelizing commands. Allows commands to be easily parallelized if the jobs are broken up into discrete, non-overlapping chunks.	2019-02-27 21:10:52 +02:00
David Steele	35acfae7c2	Add ProtocolCommand object. This formalizes the creation of protocol commands, which was previously done by creating KeyValue objects manually.	2019-02-27 19:48:30 +02:00
David Steele	dee90d3e60	Expose handle (file descriptor) from IoRead when applicable. Some IO objects have file descriptors which can be useful for monitoring with select(). It might also be useful to expose handles for write objects but there is currently no use case.	2019-02-27 18:11:09 +02:00
David Steele	b1957b07f3	Improve null-handling of varToLog().	2019-02-27 18:10:18 +02:00
David Steele	ea2aef1d0c	Add lstRemove() to List object. Because it is sometimes useful to remove items from a list.	2019-02-27 18:09:21 +02:00
David Steele	4be271ea2a	Improve fork harness to allow multiple children and setup pipes automatically. There was a lot of extra boilerplate involved in setting up pipes so that is now automated. In some cases testing with multiple children is useful so allow that as well.	2019-02-27 18:07:16 +02:00
David Steele	18b62a4220	Only run test-level stack trace by default for unit-tested modules. This amends `70c30dfb` which disabled test tracing in general. Instead, only enable test tracing by default for modules that are being unit tested. This saves lots of time but still ensures that test tracing is working and helps with debugging in unit tests. Also rename the option to --debug-test-trace for a clarity.	2019-02-27 17:09:19 +02:00
David Steele	3a05359087	Create test matrix for mock/stanza to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-24 07:42:41 +02:00
David Steele	6d3e18b181	Reduce expect log level in mock/stanza tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the stanza code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-24 06:55:59 +02:00
David Steele	2f081f3ec7	Rename test modules for consistency. The conventions for command and info tests have shifted in the C modules, though not even all the C modules got the message.	2019-02-23 18:51:52 +02:00
David Steele	d489eb87f7	Create test matrix for mock/archive to increase coverage and reduce tests. The same test configurations are run on all four test VMs, which seems a real waste of resources. Vary the tests per VM to increase coverage while reducing the total number of tests. Be sure to include each major feature (remote, s3, encryption) in each VM at least once.	2019-02-23 15:59:39 +02:00
David Steele	4a7588e604	Create aliases for test VMs ordered by age. This will allow for smarter allocation of tests in the next commit.	2019-02-23 15:13:23 +02:00
David Steele	59d7958914	Reduce expect log level in mock/archive tests. The expect tests were originally a rough-and-ready type of unit test so monitoring changes in the expect log helped us detect changes in behavior. Now the archive code is heavily unit-tested so the detailed logs mainly cause churn and don't have any measurable benefit. Reduce the log level to DETAIL to make the logs less verbose and volatile, yet still check user-facing log messages.	2019-02-23 15:05:06 +02:00
David Steele	a9cbf23f4c	Improve error when hostname cannot be found in a certificate. Update error message with the hostname and more detail about what went wrong. Hopefully this will help in diagnosing certificate/hostname issues. Suggested by James Badger.	2019-02-23 07:28:27 +02:00
David Steele	1f66bda02e	Fix non-compliant JSON for options passed from C to Perl. We have been using a hacked-up JSON generator to pass options from C to Perl since the C binary was introduced. This generator was not very compliant which led to issues with \n, ", etc. inside strings. We have a fully-compliant JSON generator now so use that instead. Reported by Leo Khomenko.	2019-02-22 12:02:26 +02:00
David Steele	70c30dfb61	Disable test-level stack trace by default. Detailed stack traces for low-level functions (e.g. strCat, bufMove) can be very useful for debugging but leaving them on for all tests has become quite burdensome in terms of time. Complex operations like generating JSON on a large KevValue can lead to timeouts even with generous values. Add a new param, --debug-trace, to enable test-level stack trace, but leave it off by default.	2019-02-22 11:40:30 +02:00
David Steele	ae86e6d5b2	Add missing ToLog() coverage to String, List, and PgControl. Missing coverage is exposed in the next commit which disables test tracing by default.	2019-02-22 11:31:37 +02:00
David Steele	e14c0eeb65	Use driver for remote protocol introduced in `da628be8`. The remote protocol was calling into the Storage object but this required some translation which will get more awkward as time goes by. Instead, call directly into the local driver so the communication is directly driver to driver. This still requires resolving the path and may eventually have more duplication with the Storage object methods but it seems the right thing to do.	2019-02-21 16:20:46 +02:00
David Steele	b1eb8af7d5	Resolve storage path expressions before passing to remote. Expressions such as <REPO:ARCHIVE> require a stanza name in order to be resolved correctly. However, if the stanza name is passed to the remote then that remote will only work correctly for that one stanza. Instead, resolved the expressions locally but still pass a relative path to the remote. That way, a storage path that is only configured on the remote does not need to be known locally.	2019-02-21 15:40:21 +02:00
David Steele	b4d4680f8c	Allow cfgExecParam() to exclude options. It is useful to be able to exclude an option even if it is shared by both commands.	2019-02-21 14:40:35 +02:00
David Steele	be6a3f131e	Improve null-handling of strToLog(). NULL was returning {"(null)"} which was comprehensible but not very pretty. Instead return null on NULL.	2019-02-21 14:26:06 +02:00
David Steele	1fd89f05af	Add kvKeyExists() to KeyValue object. Check if a key exists even if the value is NULL, which is the same result for a missing key.	2019-02-21 14:16:17 +02:00
Stefan Fercot	80df1114bd	Fix info command missing WAL min/max when stanza specified. This issue was a result of STORAGE_REPO_PATH prepending an extra stanza when the stanza was specified on the command line. The tests missed this because by some strange coincidence the WAL dirs were empty for each test that specified a stanza. Add new tests to prevent a regression. Fixed by Stefan Fercot.	2019-02-21 12:09:12 +02:00
David Steele	1519f5b045	Add storageHelperFree() to storage helper. Free all cached objects in the storage helper, especially the stanza name. This clears the storage environment for tests that switch stanza names or go from a stanza name to no stanza name or vice versa. This is only useful for testing right now, but may be used in the future for commands than act on multiple stanzas.	2019-02-21 11:40:30 +02:00
David Steele	b0b5989aca	Migrate remote archive-get command to C. All required protocol commands are implemented so this is mostly a matter of enabling the feature and updating expect logs.	2019-02-20 22:57:18 +02:00
David Steele	6866ff031a	Add exists() to remote storage.	2019-02-20 22:43:02 +02:00
David Steele	73be64ce49	Add separate archive-get-async command. This command was previously forked off from the archive-get command which required a bit of artificial option and log manipulation. A separate command is easier to test and will work on platforms that don't have fork(), e.g. Windows.	2019-02-20 15:52:07 +02:00
David Steele	da628be8a8	Migrate remote command to C. Prior to this the Perl remote was used to satisfy C requests. This worked fine but since the remote needed to be migrated to C anyway there was no reason to wait. Add the ProtocolServer object and tweak ProtocolClient to work with it. It was also necessary to add a mechanism to get option values from the remote so that encryption settings could be read and used in the storage object. Update the remote storage objects to comply with the protocol changes and add the storage protocol handler. Ideally this commit would have been broken up into smaller chunks but there are cross-dependencies in the protocol layer and it didn't seem worth the extra effort.	2019-02-19 20:57:38 +02:00
David Steele	d211c2b8b5	Fix possible truncated WAL segments when an error occurs mid-write. The file write object destructors called close() and finalized the file even if it was not completely written. This was an issue in both the C and Perl code. Rewrite the destructors to simply free resources (like file handles) rather than calling the close() method. This leaves the temp file in place for filesystems that use temp files. Add unit tests to prevent regression. Reported by blogh.	2019-02-15 11:52:39 +02:00
David Steele	057e2e2782	Add unimplemented S3 driver method required for archive-get. This was not being caught because the integration tests for S3 were running remotely and going through the Perl code rather than the new C code. Implement the exists method for the S3 driver and add tests to prevent a regression. Reported by mibiio.	2019-02-09 18:57:30 +02:00
David Steele	be8f31a952	JSON improvements. Optimize the parser implementation and make the renderer more null tolerant. Also make some string and variant constructors null tolerant.	2019-02-05 15:34:18 +02:00
David Steele	b29a8dd9c5	Automatically adjust db-timeout when protocol-timeout is smaller. This already worked in reverse, but this case is needed when a command that only uses protocol-timeout (e.g. info) calls a remote process where protocol-timeout and db-timeout can be set. If protocol-timeout was set to less than the default db-timeout then an error resulted.	2019-02-02 15:03:19 +02:00
David Steele	aa3e5b8c72	Allow primary gid for the test user to be different from uid. Apparently up until now they have always been the same, which is pretty typical. However, if they were not then ContainerTest.pm was not happy.	2019-01-30 17:03:17 +02:00
David Steele	b5a103f2df	Improve P/PP type macro handling. Rather than create _P/_PP variants for every type that needs to pass/return pointers, create FUNCTION_*_P/PP() macros that will properly pass or return any single/double pointer types. There remain a few unresolved edge cases such as CHARPY but this handles the majority of types well.	2019-01-28 22:33:29 +02:00
David Steele	d29aa61286	Allocate extra space for concatenations in the String object. The string object was reallocating memory with every concatenation which is not very efficient. This is especially true for JSON rendering which does a lot of concatenations. Instead allocate a pool of extra memory on the first concatenation (50% of size) to be used for future concatenations and reallocate when needed. Also add a 1GB size limit to ensure that there are no overflows.	2019-01-27 11:50:09 +02:00
David Steele	82c2d615b3	Move MACRO_TO_STR() to common/debug.h. This macro is handy for constructing static message strings so move it where the core code can see it.	2019-01-27 11:34:12 +02:00
David Steele	8f6d324b2c	Fix issue with multiple async status files causing a hard error. Multiple status files were being created by asynchronous archiving if a high-level error occurred after one or more WAL segments had already been transferred successfully. Error files were being written for every file in the queue regardless of whether it had already succeeded. To fix this, add an option to skip writing error files when an ok file already exists. There are other situations where both files might exist (various fsync and filesystem error scenarios) so it seems best to retry in the case that multiple status files are found rather than throwing a hard error (which then means that archiving is completely stuck). In the case of multiple status files, a warning will be logged to alert the user that something unusual is happening and the command will be retried. Reported by fpa-postgres, Joe Ayers, Douglas J Hunley.	2019-01-26 16:59:54 +02:00
David Steele	053972bfe0	Update address lookup in C TLS client to use modern methods. The implementation using gethostbyname() was only intended to be used during prototyping but was forgotten when the code was finalized. Replace it with gettaddrinfo() which is more modern and supports IPv6. Suggested by Bruno Friedmann.	2019-01-26 12:01:18 +02:00
David Steele	4d1060ea6b	Replace FileOpenError with HostConnectError in TlsClient. HostConnectError is more appropriate in this context.	2019-01-26 10:39:51 +02:00
David Steele	db08656537	Rename FUNCTION_DEBUG_* and consolidate ASSERT_* macros for consistency. Rename FUNCTION_DEBUG_* macros to FUNCTION_LOG_* to more accurately reflect what they do. Further rename FUNCTION_DEBUG_RESULT* macros to FUNCTION_LOG_RETURN* to make it clearer that they return from the function as well as logging. Leave FUNCTION_TEST_* macros as they are. Consolidate the various ASSERT* macros into a single ASSERT macro that is always compiled out of production builds. It was difficult to figure out when an assert would be checked with all the different types in play. When ASSERTs are compiled in they will always be checked regardless of the log level -- tying these two concepts together was not a good idea.	2019-01-21 17:41:59 +02:00
David Steele	d245f8eb42	The info command is implemented entirely in C. The C info code has already been committed but this commit wires it into main. Also remove the info Perl code and tests since they are no longer called.	2019-01-21 13:51:45 +02:00
David Steele	f79af47bd4	Increase timeout in storage/s3 module to improve reliability.	2019-01-19 20:44:40 +02:00
David Steele	e57f1b5293	Fix escaping in JSON string parser.	2019-01-19 12:57:40 +02:00
David Steele	7355248d6b	Add remote storage objects. This is a partial implementation of remote storage with just enough functionality to get the info command working. The client is written in C but the server is still in Perl, which limits progress until a C server is written.	2019-01-18 22:04:37 +02:00
David Steele	88201f37a3	Add ProtocolClient object and helper functions. This is a complete protocol client implementation in C. Currently there is no C server implementation so the C client is talking to a Perl server. This won't work very long, though, as the protocol format, even though in JSON, has a lot of language-specific structure. While it would be possible to maintain compatibility between C and Perl it's probably not worth the effort in the long run. Just as in Perl there are helper functions to make constructing protocol objects easier. Currently only repository remotes are supported.	2019-01-18 21:32:51 +02:00
David Steele	0986db630c	Fix comment typo.	2019-01-18 13:15:43 +02:00
David Steele	9cac403f61	Add Exec object. Executes a child process and allows the calling process to communicate with it using read/write io. This object is specially tailored to implement the protocol layer and may or may not be generally applicable to general purpose execution.	2019-01-18 11:45:40 +02:00
David Steele	797f8098d1	Add ioReadBuf() to easily read into a buffer. Moves some boilerplate into a function and makes it easier to get coverage in cases where a single buffer read captures all the data.	2019-01-18 11:14:44 +02:00
David Steele	06d41b4dc0	Add cfgExecParam() to generate parameters for executing commands. Parameters for the local/remote commands are based on parameters that are passed to the current command. Generate parameters for the new command based on the intersection of parameters between the current command and the command to be executed.	2019-01-17 22:29:19 +02:00
David Steele	ecd56105e6	Add IoHandleRead and IoHandleWrite objects. General i/o objects for reading and writing file descriptors, in particular those that can block. In other words, these are not generally to be used with file descriptors for actual files, but rather pipes, sockets, etc.	2019-01-17 22:08:31 +02:00
David Steele	7d4bbf290c	Fix difference in cipher type reporting missed in `8304d452`. The C code can't get the cipher type from the storage object because the C storage object does not have encryption baked in like the Perl code does. Instead, check backup.info to see if encryption is enabled. This will need to rethought if another cipher type is added but for now it works fine.	2019-01-16 22:16:50 +02:00
David Steele	e68d1e7304	Simplify info command text message when no stanza are present. Replace the repository path with just "the repository". The path is not important in this context and it is clearer to state where the stanzas are missing from.	2019-01-16 19:23:10 +02:00
David Steele	0014e15944	Fix escaping in JSON string renderer.	2019-01-16 18:34:50 +02:00
David Steele	c0a9048536	Fix null output in JSON renderer. In some cases nulls were being rendered as "(null)" because no special logic existed to handle them.	2019-01-16 09:11:01 +02:00
David Steele	ec26a0594e	Allow string Variant objects to contain null. This is more useful than the variant itself being null as it saves conditionals when creating.	2019-01-16 09:05:13 +02:00
David Steele	1b334da87f	Add kvMove() and varLstMove(). Allow these objects to be moved to another mem context.	2019-01-14 22:02:23 +02:00
David Steele	aab9e38b9a	Return UnknownError from errorTypeFromCode() for invalid error codes. The prior behavior was to throw an exception but this was not very helpful when something unexpected happened. Better to at least emit the error message even if the error code is not very helpful.	2019-01-14 21:34:22 +02:00
David Steele	2b02d37602	Allow empty arrays in JSON parser.	2019-01-14 21:21:37 +02:00
David Steele	8304d452b3	Make the C version of the info command conform to the Perl version. There were some small differences in ordering and how the C version handled missing directories. It may be that the C version is more consistent, but for now it is more important to be compatible with the Perl version. These differences were missed because the C info command was not wired into main.c so it was not being tested in regression. This commit does not fix the wiring issue because there will likely be a release soon and it is too big a change to put in at the last moment.	2019-01-13 22:44:58 +02:00
David Steele	f314a1f8aa	Improve accuracy of strSizeFormat(). Casting to int caused large values to be slightly inaccurate so cast to uint64_t instead. Also, use multiplication where possible since the compiler should precompute multiplied values.	2019-01-13 22:22:15 +02:00
David Steele	cebcb3f027	Null-terminate list returned by strLstPtr(). The null-terminator is required by many C functions that accept a string list so just add it in all cases rather than making it optional.	2019-01-10 09:49:28 +02:00
David Steele	ade2c3102d	Ignore SIGPIPE signals and check EPIPE result instead. SIGPIPE immediately terminates the process but we would rather catch the EPIPE error and gracefully shutdown. Ignore SIGPIPE and throw the EPIPE error via normal error handling.	2019-01-06 17:42:44 +02:00
David Steele	9560baf659	Move C module include in test.c above headers included for testing. Including the C module after the headers required for testing meant that if headers were missing from the C module they were not caught while directly testing the C module. The missing headers were caught in general testing, but it is frustrating to get an error in a module that has already passed while testing another module or running CI. Move the C module include to the very top so missing headers cause immediate failures.	2019-01-06 15:52:59 +02:00
David Steele	1de22cac2b	Rename common/io/handle module to common/io/handleWrite. ioHandleWriteOneStr() will become a helper function for the IoHandleWrite object.	2019-01-06 14:37:39 +02:00
David Steele	842147321f	Fix typo in error message.	2019-01-05 14:43:40 +02:00
David Steele	db24ff8df4	v2.08: Minor Improvements and Bug Fixes Bug Fixes: * Remove request for S3 object info directly after putting it. (Reported by Matt Kunkel.) * Correct archive-get-queue-max to be size type. (Reported by Ronan Dunklau.) * Add error message when current user uid/gid does not map to a name. (Reported by Camilo Aguilar.) * Error when --target-action=shutdown specified for PostgreSQL < 9.5. Improvements: * Set TCP keepalives on S3 connections. (Suggested by Ronan Dunklau.) * Reorder info command text output so most recent backup is output last. (Contributed by Cynthia Shang. Suggested by Ryan Lambert.) * Change file ownership only when required. * Redact authentication header when throwing S3 errors. (Suggested by Brad Nicholson.)	2019-01-02 22:04:47 +02:00
David Steele	26c888873e	Merge common/typeVariantListTest module into common/typeVariantTest. These modules are closely related so it makes sense for them to be merged.	2019-01-01 18:14:43 +02:00
David Steele	07b9176f25	Merge common/typeStringListTest module into common/typeStringTest. These modules are closely related so it makes sense for them to be merged.	2019-01-01 18:05:13 +02:00
David Steele	493fb78787	Add strLstInsert() and strLstInsertZ() to StringList object. Use lstInsert() to implement insertions into any position in a StringList.	2019-01-01 17:31:23 +02:00
Cynthia Shang	72865ca33b	Add admonitions to documentation renderers. Admonitions call out places where the user should take special care. Support added for HTML, PDF, Markdown and help text renderers. XML files have been updated accordingly. Contributed by Cynthia Shang.	2018-12-30 16:40:20 +02:00
Cynthia Shang	35bbb5bd68	Reorder info command text output so most recent backup is output last. After a stanza-upgrade backups for the old cluster are displayed until they expire. Cluster info was output newest to oldest which meant after an upgrade the most recent backup would no longer be output last. Update the text output ordering so the most recent backup is always output last. Contributed by Cynthia Shang. Suggested by Ryan Lambert.	2018-12-14 18:25:31 -05:00
Cynthia Shang	205525b607	Migrate local info command to C. The info command will only be executed in C if the repository is local, i.e. not located on a remote repository host. S3 is considered "local" in this case. This is a direct migration from Perl to integrate as seamlessly with the remaining Perl code as possible. It should not be possible to determine if the C version is running unless debug-level logging is enabled. Contributed by Cynthia Shang.	2018-12-13 16:22:34 -05:00
Cynthia Shang	e6ef40e8a3	Add infoBackup object to encapsulate the backup.info file. The infoBackup object is the counterpart to the infoArchive object which encapsulates the archive.info file. Currently the object is read-only, i.e. it is not possible to create a new or modify an existing backup.info file. There a number of constants that will also be used in the infoManifest object so go ahead and create a module to contain them so they don't need to be moved later. Contributed by Cynthia Shang.	2018-12-13 15:46:18 -05:00
Cynthia Shang	56d466ce4b	Add null support to jsonToKv(). Contributed by Cynthia Shang.	2018-12-13 15:20:57 -05:00
Cynthia Shang	2f15a90d18	Add infoArchiveIdHistoryMatch() to the InfoArchive object. Match a PostgreSQL system identifier and version to a pgBackRest archive id. Contributed by Cynthia Shang.	2018-12-10 18:45:57 -05:00
Cynthia Shang	e3435ec386	Add path expression for repository backup to the storage helper. This is the counterpart to the archive path expression and constructs paths into the backup part of the repository. Contributed by Cynthia Shang.	2018-12-10 17:35:31 -05:00
Cynthia Shang	4f539db8d9	Allow NULL stanza in storage helper. Some commands (e.g. info) do not take a stanza or the stanza is optional. In that case it is the job of the command to construct the repository path with a stanza as needed. Update helper functions to omit the stanza from the constructed path when it is NULL. Contributed by Cynthia Shang.	2018-12-10 17:01:33 -05:00
Cynthia Shang	cbf514e191	Improve info error messages introduced in `74b72df9`. - Add detail to errors when info files are loaded with incorrect encryption settings. - Throw FileMissingError rather than FileOpenError when both copies of the info file are missing. - If one file is present (but errors) and the other is missing, then return the error for the file that was present. Contributed by Cynthia Shang.	2018-12-10 16:32:41 -05:00
Cynthia Shang	80a3e21521	Add strSizeFormat() to String object. Converts sizes in bytes to a more human-readable form, .e.g. 1KB, 1.1GB. Contributed by Cynthia Shang.	2018-12-10 16:11:51 -05:00
David Steele	e73416e9e3	Change file ownership only when required. Previously chown() would be called even when no ownership changes were required. In most cases changes are not required and it seems better to perform an extra stat() rather than an extra chown(). Also add unit tests for owner() since there weren't any.	2018-12-05 17:56:47 -05:00
David Steele	cc6447356e	Fix test binary name for gprof. This got missed in `1f8931f7` when the test binary was renamed. Also output call graph along with the flat report. The flat report is generally most useful but it doesn't hurt to have both.	2018-12-05 09:15:45 -05:00
David Steele	14190f9e6c	Update URL for Docker install. As usual the old URL started providing a broken version of Docker rather than producing a clear error message. This happens once a year or so.	2018-12-03 12:41:53 -05:00
David Steele	74b72df9db	Improve error message when info files are missing/corrupt. The previous error message only showed the last error. In addition, some errors were missed (such as directory permission errors) that could prevent the copy from being checked. Show both errors below a generic "unable to load" error. Details are now given explaining exactly why the primary and copy failed. Previously if one file could not be loaded a warning would be output. This has been removed because it is not clear what the user should do in this case. Should they do a stanza-create --force? Maybe the best idea is to automatically repair the corrupt file, but on the other hand that might just spread corruption if pgBackRest makes the wrong choice.	2018-11-28 18:41:21 -05:00
David Steele	47687dd13a	Add xmlNodeAttribute() to XmlNode object. Retrieves a node attribute unless it is the root node.	2018-11-28 18:10:54 -05:00
David Steele	7c2fcb63e4	Enable encryption for archive-get command in C. The decryption filter was added in archiveGetFile() and archiveGetCheck() was modified to return the WAL decryption key stored in archive.info. The rest was plumbing. The mock/archive/1 integration test added encryption to provide coverage for the new code paths while mock/archive/2 dropped encryption to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-28 14:56:26 -05:00
David Steele	6c23830991	Add encryption capability to Info* objects. Info objects can be stored encrypted and loaded with a cipher type and passphrase.	2018-11-28 14:41:10 -05:00
David Steele	410a04a58e	Allow arbitary InOut filters to be chained in IoFilterGroup. If InOut filters were placed next to each other then the second filter would never get a NULL input signaling it to flush. This arrangement only worked if the second filter had some other indication that it should flush, such as a decompression filter where the flush is indicated in the input stream. This is not a live issue because currently no InOut filters are chained together.	2018-11-28 14:20:12 -05:00
David Steele	838cfa44b7	Allow arbitrary multiplier and flush character in IoTestFilterMultiply. This allows for more complex test filter chains. Rename from IoTestFilterDouble to reflect the new functionality.	2018-11-28 14:02:14 -05:00
David Steele	3e254f4cff	Add IoFilter interface to CipherBlock object. This allows CipherBlock to be used as a filter in an IoFilterGroup. The C-style functions used by Perl are now deprecated and should not be used for any new code. Also add functions to convert between cipher names and CipherType.	2018-11-28 12:42:36 -05:00
David Steele	c3a84ccae0	Merge cipher.h into crypto.h. There is not enough in cipher.h to make it worth having a separate header.	2018-11-28 12:03:27 -05:00
David Steele	7df7cf6e6d	Rename cipherBlock() functions to cipherBlockC(). Some of the old names conflict with the new functions that must be created to implement the filter. Rename these to cipherBlock*C() to indicate that they take C-style parameters. These functions are only used by the Perl LibC code and will be removed or refactored eventually.	2018-11-27 22:49:24 -05:00
David Steele	fea27dbd7e	Return IoFilterGroup * from ioFilterGroupAdd(). This allows filters adds to be chained.	2018-11-27 22:02:08 -05:00
David Steele	56ce98b2f0	Explicitly compile with Posix 2001 standard. This standard was being selectively applied in modules that needed it. Instead, apply the standard to all compilation for consistency.	2018-11-25 10:06:31 -05:00
David Steele	315aa2c451	Conditional compilation of Perl logic in exit.c. This file is the only one to contain Perl logic outside of the perl module. Make the Perl logic conditional to improve reusability.	2018-11-25 08:39:41 -05:00
David Steele	78fe642eae	Remove extraneous use/include statements. Use conditional loading to make docs work in the absence of LibC. Somehow this also required a use statement to be added. Perl, go figure.	2018-11-24 20:31:35 -05:00
David Steele	801e2a5a2c	Rename PGBACKREST/BACKREST constants to PROJECT. This brings consistency between the C and Perl constants and allows for easier code reuse.	2018-11-24 19:05:03 -05:00
David Steele	7a1a688835	Add EOF detection to content read in HttpClient. If the connection closed before all content was sent httpClientRead() would get stuck in an infinite loop waiting for it to arrive. EOF should never be reached during content read so immediately error if EOF is detected.	2018-11-24 09:12:44 -05:00
Cynthia Shang	f4a1751abc	Improve JSON to Variant conversion and add Variant to JSON conversion. Add boolean and one-dimensional list types to jsonToKv(). Add varToJson() and kvToJson() to convert Variants and KeyValues to JSON. Contributed by Cynthia Shang.	2018-11-23 16:02:33 -05:00
David Steele	beae375330	Enable S3 storage for archive-get command in C. The only change required was to remove the filter that prevented S3 storage from being used. The archive-get command did not require any modification which demonstrates that the storage interface is working as intended. The mock/archive/3 integration test was modified to run S3 storage locally to provide coverage for the new code paths while mock/stanza/3 was modified to run S3 storage remotely to provide coverage for the existing code paths. This caused some churn in the expect logs but there was no change in behavior.	2018-11-23 12:18:07 -05:00
David Steele	b5690e21a4	Allow I/O read interface to explicitly request blocking reads. TlsClient introduced a non-blocking read which is required to read protocol messages that are linefeed-terminated rather than a known size. However, in many cases the expected number of bytes is known in advance so in that case it is more efficient to have tlsClientRead() block until all the bytes are read. Add block parameter to all read functions and use it when a blocking read is required. For most read functions this is a noop, i.e. if the read function never blocks then it can ignore the parameter. In passing, set the log level of storageNew*() functions to debug to expose more high-level I/O operations.	2018-11-23 12:01:36 -05:00
David Steele	256b727a3d	Add S3 storage driver. Only the storageNewRead() and storageList() functions are currently implemented, but this is enough to enable S3 for the archive-get command.	2018-11-21 19:32:49 -05:00
David Steele	72252ed2a1	Add HttpClient object. A robust HTTP client with pipelining support and automatic retries. Using a single object to make multiple requests is more efficient because requests are pipelined whenever possible. Requests are automatically retried when the connection has been closed by the server. Any 5xx response is also retried. Only the HTTPS protocol is currently supported.	2018-11-21 19:11:45 -05:00
David Steele	1dd06a6e46	Add TlsClient object. A simple, secure TLS client intended to allow access to services that are exposed via HTTPS. We call it TLS instead of SSL because SSL methods are disabled so only TLS connections are allowed. This object is intended to be used for multiple TLS connections against a service so tlsClientOpen() can be called each time a new connection is needed. By default, an open connection will be reused for pipelining so the user must be prepared to retry their transaction on a read/write error if the server closes the connection before it can be reused. If this behavior is not desirable then tlsClientClose() may be used to ensure that the next call to tlsClientOpen() will create a new TLS session. Note that tlsClientRead() is non-blocking unless there are zero bytes to be read from the session in which case it will raise an error after the defined timeout. In any case the tlsClientRead()/tlsClientWrite()/tlsClientEof() functions should not generally be called directly. Instead use the read/write interfaces available from tlsClientIoRead()/tlsClientIoWrite().	2018-11-21 18:43:25 -05:00
David Steele	ac426bc456	New test containers with static test certificates. Test certificates were generated dynamically but there are advantages to using static certificates. For example, it possible to use the same certificate between container versions. Mostly, it is easier to document the certificates if they are not buried deep in the container code. The new test certificates are initially intended to be used with the C unit tests but they will eventually be used for integration tests as well. Two new certificates have been defined. See test/certificate/README.md for details. The old dynamic certificates will be retained until they are replaced.	2018-11-21 18:13:37 -05:00
David Steele	bc25db5667	Add interface objects for libxml2. Add XmlDocument, XmlNode, and XmlNodeList objects as a thin interface layer on libxml2. This interface is not intended to be comprehensive. Only a few libxml2 capabilities are exposed but more can be added as needed.	2018-11-20 20:40:11 -05:00
David Steele	f743d4e924	Add testRepoPath() to let C unit tests know where the code repository is located. This allows a C unit test to access data in the code repository that might be useful for testing. Add testRepoPathSet() to set the repository path. In passing remove extra whitespace in the TEST_RESULT_VOID() macro.	2018-11-20 15:48:56 -05:00
David Steele	d0e553e4c2	Reduce debug level for infoIni() to test. Getters should generally not be logging at debug or trace level since it clutters the log. In passing move the destructor to the end of the file.	2018-11-16 16:05:06 -05:00
David Steele	04d9e4d5a8	v2.07: Automatic Backup Checksum Delta Bug Fixes: * Fix issue with archive-push-queue-max not being honored on connection error. (Reported by Lardière Sébastien.) * Fix static WAL segment size used to determine if archive-push-queue-max has been exceeded. * Fix error after log file open failure when processing should continue. (Reported by vthriller.) Features: * Automatically enable backup checksum delta when anomalies (e.g. timeline switch) are detected. (Contributed by Cynthia Shang.) Improvements: * Retry all S3 5xx errors rather than just 500 internal errors. (Suggested by Craig A. James.)	2018-11-16 09:50:50 -05:00
David Steele	332a68ea8d	Fix incorrect config constant introduced in `5e3b7cbe`. This commit introduced PGBACKREST_CONFIG_ORIG_PATH_FILE_STR as a String constant for PGBACKREST_CONFIG_ORIG_PATH_FILE but failed to get the value correct. Also, no test was added for PGBACKREST_CONFIG_ORIG_PATH_FILE_STR to prevent regressions as there is for PGBACKREST_CONFIG_ORIG_PATH_FILE.	2018-11-16 08:48:02 -05:00
David Steele	b6f7cbc315	Add destructors to IoRead and IoWrite objects. These interfaces previously used the memory context of the object they were associated with and did not have their own destructors. There are times when it is useful to free the interface without also freeing the underlying object so give IoRead and IoWrite their own memory contexts and destructors. In passing fix a comment type in bufferRead.c.	2018-11-15 16:25:46 -05:00
David Steele	480e1da798	Add ioWriteFlush() to flush pending output. By default the IoWrite object does not write until the output buffer is full but this is a problem for protocol messages that must be sent in order to get a response. ioWriteFlush() is not called internally by IoWrite but can be used at any time to immediately write all bytes from the output buffer without closing the IoWrite object.	2018-11-14 08:53:42 -05:00
David Steele	14d2f8b86c	Reword misleading message in stack trace when parameter buffer is full. The prior message stated that there had been a buffer overrun which is not true since the code prevents that. In fact, this message means the parameter buffer filled while building the parameter list. Rather than display a partial list we output this message instead. Also remove !!! which by convention we use as a marker for code that needs attention before it can be committed to master.	2018-11-14 08:11:11 -05:00
David Steele	8c7e97a369	Clarify comment about main.c being excluded from unit testing. Also remove !!! which by convention we use as a marker for code that needs attention before it can be committed to master.	2018-11-14 08:08:26 -05:00
David Steele	7310203491	Remove obsolete comments.	2018-11-14 07:42:28 -05:00
David Steele	a0b2af567a	Add TEST_LOG() and TEST_LOG_FMT() macros. These macros provide a convenient way to output debug information in tests. They are not intended to be left in test code when it is committed to master.	2018-11-13 20:58:32 -05:00
David Steele	acb579c469	Tighten limits on code coverage context selection. If the last } of a function was marked as uncovered then the context selection would overrun into the next function. Start checking context on the current line to prevent this. Make the same change for start context even though it doesn't seem to have an issue.	2018-11-13 10:37:58 -05:00
David Steele	086bc35ddc	Make ioReadLine() read less aggressively. ioReadLine() calls ioRead(), which aggressively tries to fill the output buffer, but this doesn't play well with blocking reads. Give ioReadLine() an option that tells it to read only what is available. That doesn't mean the function will never block but at least it won't do so by reading too far.	2018-11-12 21:18:53 -05:00
David Steele	a2672557f8	Allow storage path and file mode to be 0. Not all storage drivers support Posix-style permissions, e.g. S3.	2018-11-12 20:00:57 -05:00
David Steele	7107cc68d2	Expand context shown in coverage and update colors. Too few lines were shown for coverage context so show the entire function if it has any missing coverage. Update colors to work with light and dark browser modes.	2018-11-12 18:11:16 -05:00
David Steele	aff9693e64	Add strEmpty() to String object. strEmpty() is more concise and explanatory than strSize() == 0.	2018-11-12 14:13:55 -05:00
David Steele	22ecbc153a	New, concise coverage report for C. The report HTML generated by lcov is overly verbose and cumbersome to navigate. Since we maintain 100% coverage it's far more interesting to look at what is not covered than what is. The new report presents all missing coverage on a single page and excludes code that is covered for brevity.	2018-11-11 17:32:42 -05:00
David Steele	8f857a975e	Add constant macros to String object. There are many places (and the number is growing) where a zero-terminated string constant must be transformed into a String object to be usable. This pattern wastes time and memory, especially since the created string is generally used in a read-only fashion. Define macros to create constant String objects that are initialized at compile time rather than at run time.	2018-11-10 09:37:12 -05:00
David Steele	df200bee2a	Add regExpPrefix() to aid in static prefix searches. The storageList() command accepts a regular expression as a filter. This works fine for local filesystems where it is relatively cheap to get a complete list of files and filter them in code. However, for remote filesystems like S3 it can be expensive to fetch a complete list of files only to discard the bulk of them locally. S3 does not filter on regular expressions but it can accept a static prefix so this function extracts a prefix from a regular expression when possible. Even a few characters can drastically reduce the amount of data that must be fetched remotely so the function does not try to be too clever. It requires a ^ anchor and stops scanning when the first special character is found.	2018-11-09 16:50:22 -05:00
David Steele	8c504bd2f9	Improve alignment of expected vs. got error test results. It is easier to compare the error messages when they start at the same column.	2018-11-09 10:37:59 -05:00
David Steele	b5a3c8c84b	Add strCatChr() to String object.	2018-11-09 09:54:55 -05:00
David Steele	3e695af961	New test containers. * Add libxml2 library needed for S3 development. * Minor version updates for PostgreSQL. * Remove PostgreSQL 11 beta/rc repository.	2018-11-08 21:41:41 -05:00
David Steele	7cf1f42c5c	Add time since the beginning of the run to each test statement. Output the time in seconds of each test statement since the start of the test run. This helps find individual tests that are running slowly.	2018-11-08 19:50:31 -05:00
David Steele	12b3be1d8e	Limit usable Buffer size without changing allocated size. Allow buffers to report a lower size than their allocated size. This means a larger buffer can be used to do the work of a smaller buffer without having to create a new buffer and concatenate. This is useful for blocking I/O where the buffer may be too large for the amount of data that is available to read.	2018-11-08 16:48:33 -05:00
David Steele	edb2c6eb26	Construct Wait object in milliseconds instead of fractional seconds. The Wait object accepted a double in the constructor for wait time but used TimeMSec internally. This was done for compatibility with the Perl code. Instead, use TimeMSec in the Wait constructor and make changes as needed to calling code. Note that Perl still uses a double for its Wait object so translation is needed in some places. There are no plans to update the Perl code as it will become obsolete.	2018-11-08 08:37:57 -05:00
David Steele	a9feaba9e5	Add memContextCallbackClear() to prevent double free() calls. If an object free() method was called manually when a callback was set then the callback would call free() again. This meant that each free() method had to protect against a subsequent call. Instead, clear the callback (if present) before calling memContextFree(). This is faster (since there is no unecessary callback) and removes the need for semaphores to protect against a double free().	2018-11-07 08:51:32 -05:00
David Steele	48d2795f31	Merge crypto/random module into crypto/crypto. There wasn't enough code to justify a separate module/test and it seems to fit just fine in crypto/crypto.	2018-11-06 20:04:16 -05:00
David Steele	8efa5e6a6a	Rename CipherError to CryptoError. This aligns with the general renaming from cipher to crypto.	2018-11-06 19:38:38 -05:00
David Steele	2cb312ef5a	Add cryptoError() and update crypto code to use it. This adds detail to error messages when available and improves code coverage.	2018-11-06 19:16:00 -05:00
David Steele	bef31f1802	Add base variants to all integer to string conversion functions. Previously these functions were fixed at base 10 conversion. Add variants that can convert from any base.	2018-11-06 18:32:44 -05:00
David Steele	7de9584435	Ignore deleted files in rsync to test/repo. Deleted files are showing up in git ls-files (added `57d78092`) but they don't actually exist on disk. If there is someway to exclude deleted files from ls-files then I can't find it, so just tell rsync to ignore missing files.	2018-11-04 08:39:48 -05:00
David Steele	57d7809297	Improve efficiency of code generation. Code generation saved files even when they had not changed, which often caused code generation cascades. So, don't save files unless they have changed. Use rsync to determine which files have changed since the last test run. The manifest of changed files is saved and not removed until all code generation and builds have completed. If an error occurs the work will be redone on the next run. The eventual goal is to do all the builds from the test/repo directory created by rsync but for now it is only used to track changes.	2018-11-03 19:52:46 -04:00
David Steele	1f8931f732	Improve single test run performance. Improve on `7794ab50` by including the build flag files directly into the Makefile as dependencies (even though they are not includes). This simplifies some of the rsync logic and allows make to do what it does best. Also split build flag files into test, harness, and build to reduce rebuilds. Test flags are used to build test.c, harness flags are used to build the rest of the files in the test harness, and build flags are used for the files that are not directly involved in testing.	2018-11-03 16:34:04 -04:00
David Steele	7794ab50dc	Preserve contents of C unit test build directory between test.pl executions. The contents were already preserved between tests in a single test.pl run but for a separate execution the entire project had to be built from scratch, which was getting slower as we added code. Save the important build flags in a file so the new execution knows whether the build contents can be reused.	2018-11-02 11:56:13 -04:00
David Steele	1a98bd41b4	Mount tmpfs in Vagrantfile instead test.pl. Mounting/unmounting tmpfs on /home/[user]/test takes time, forces at least 3GB of memory to be available for tests, and makes it harder to preserve data between tests. Instead, move mounting of tmpfs to the Vagrantfile and add it to fstab so it survives reboots.	2018-11-02 08:37:27 -04:00
Cynthia Shang	34c63276cd	Automatically enable backup checksum delta when anomalies (e.g. timeline switch) are detected. There are a number of cases where a checksum delta is more appropriate than the default time-based delta: * Timeline has switched since the prior backup * File timestamp is older than recorded in the prior backup * File size changed but timestamp did not * File timestamp is in the future compared to the start of the backup * Online option has changed since the prior backup A practical example is that checksum delta will be enabled after a failover to standby due to the timeline switch. In this case, timestamps can't be trusted and our recommendation has been to run a full backup, which can impact the retention schedule and requires manual intervention. Now, a checksum delta will be performed if the backup type is incr/diff. This means more CPU will be used during the backup but the backup size will be smaller and the retention schedule will not be impacted. Contributed by Cynthia Shang.	2018-11-01 11:31:25 -04:00
David Steele	cca7a4ffd4	Retry all S3 5xx errors rather than just 500 internal errors. We were already retrying 500 errors but 503 (rate-limiting) errors were not being retried and would cause an instant failure which aborted the command. There are only two 5xx errors currently implemented by S3 but instead of adding 503 simply retry all 5xx errors. This is consistent with the http definition of this error class, "the server failed to fulfill an apparently valid request." Suggested by Craig A. James.	2018-10-30 16:45:42 -04:00
David Steele	286f7e5011	Fix static WAL segment size used to determine if archive-push-queue-max has been exceeded. This calculation was missed when the WAL segment size was made dynamic in preparation for PostgreSQL 11. Fix the calculation by checking the actual WAL file sizes instead of using an estimate based on WAL segment size. This is more accurate because it takes into account .history and .backup files, which are smaller. Since the calculation is done in the async process the additional processing time should not adversely affect performance. Remove the PG_WAL_SIZE constant and instead use local constants where the old value is still required. This is only the case for some tests and PostgreSQL 8.3 which does not provide a way to get the WAL segment size from pg_control.	2018-10-27 20:00:00 +01:00
David Steele	41b00dc204	Fix issue with archive-push-queue-max not being honored on connection error. If an error occurred while acquiring a lock on a remote server the error would be reported correctly, but the queue max detection code was not reached. The tests failed to detect this because they fixed the connection before queue max, allowing the ccde to be reached. Move the queue max code before the lock so it will run even when remote connections are not working. This means that no attempt will be made to transfer WAL once queue max has been exceeded, but it makes it much more likely that the code will be reach without error. Update tests to continue errors up to the point where queue max is exceeded. Reported by Lardière Sébastien.	2018-10-27 16:57:57 +01:00
David Steele	03b9db9aa2	Fix error after log file open failure when processing should continue. The C code was warning on failure and continuing but the Perl logging code was never updated with the same feature. Rather than add the feature to Perl, just disable file logging if the log file cannot be opened. Log files are always opened by C first, so this will eliminate the error in Perl. Reported by vthriller.	2018-10-25 14:58:25 +01:00
Cynthia Shang	d301720c58	Change infoArchiveCheckPg() to display the PostgreSQL version as a string in errors. Previously an integer representation (e.g. 90400) would be displayed which is correct but not as friendly to the user. Contributed by Cynthia Shang.	2018-10-25 10:48:37 +01:00
Cynthia Shang	5590bfa84c	Add tests for InfoPg history fixes introduced in `070455ce`. The existing tests were not adequate to ensure the history was being added in the correct order when some entries were loaded from a file and others added with infoPgAdd(). Contributed by Cynthia Shang.	2018-10-19 18:31:05 +02:00
David Steele	070455ce44	Correct current history item in InfoPg to always be in position 0. The InfoPg object was partially modified in `960ad732` to place the current history item in position 0, but infoPgDataCurrent() didn't get updated correctly. Remove this->indexCurrent and make the current position always equal 0. Use the new lstInsert() function when adding new history items via infoPgAdd(), but continue to use lstAdd() when loading from a file for efficiency. This does not appear to be a live bug because infoPgDataCurrent() and infoPgAdd() are not yet used in any production code. The archive-get command is the only C code using InfoPG and it always looks at the entire list of items rather than just the current item. Suggested by Cynthia Shang.	2018-10-19 12:31:56 +02:00
David Steele	f345db3f7c	Add lstInsert() to List object. Add general purpose insert function and make lstAdd() a special insert case.	2018-10-19 11:52:17 +02:00
David Steele	9ae3d8c46a	Install nodejs from deb.nodesource.com. The standard npm packages on Ubuntu 18.04 suddenly required libssl1.0 which broke the pgbackrest package builds. Installing nodejs from deb.nodesource.com seems to work fine with standard libssl. This package is required by ScalityS3 which is used for local S3 testing.	2018-10-15 23:13:08 +01:00
David Steele	68110d04b2	Add ioReadLine()/ioWriteLine() to IoRead/IoWrite objects. Allow a single linefeed-terminated line to be read or written. This is useful for various protocol implementations, including HTTP and pgBackRest's protocol. On read the maximum line size is limited to buffer-size to prevent runaway memory usage in case a linefeed is not found. This seems fine for HTTP but we may need to revisit this decision when implementing the pgBackRest protocol. Another option would be to increase the minimum buffer size (currently 16KB).	2018-10-07 17:50:10 +01:00
David Steele	db8dce7adc	Disable flapping archive/get unit on CentOS 6. This test has been flapping since `9b9396c7`. It seems to be some kind of timing issue since all integration tests pass and this unit passes on all other VMs. It only happens on Travis and is not reproducible in any development environment that we have tried. For now, disable the test since the constant flapping is causing major delays in testing and quite a bit of time has been spent trying to identify the root cause. We are actively developing these tests and hope the issue will be identified during the course of normal development. A number of improvements were made to the tests while searching for this issue. While none of them helped, it makes sense to keep the improvements.	2018-10-02 17:54:43 +01:00
David Steele	5404628148	Fix incorrect error message for duplicate options in configuration files. Duplicating a non-multi-value option was not throwing the correct message when the option was a boolean. The reason was that the option was being validated as a boolean before the multi-value check was being done. The validation code assumed it was operating on a string but was instead operating on a string list causing an assertion to fail. Since it's not safe to do the multi-value check so late, move it up to the command-line and configuration file parse phases instead. Reported by Jesper St John.	2018-09-27 17:48:40 +01:00
David Steele	be2271f6d3	Add cfgDefOptionMulti() to identify multi-value options. Previously this was done in two separate places by checking if an option was type hash or list. Bad enough that it was in two places, but an upcoming bug fix will add another instance so make it a function.	2018-09-27 17:24:06 +01:00
David Steele	e66e68e324	Add cryptoHmacOne() for HMAC support. There doesn't seem to be any need to implement this as a filter since current use cases (S3 authentication) work on small datasets. So, use the single function method provided by OpenSSL for simplicity.	2018-09-27 09:20:47 +01:00
David Steele	6470f03fa2	Comment fixes.	2018-09-26 22:36:42 +01:00
David Steele	bcca625062	Add bufHex()to Buffer object. A general-purpose function for converting buffers to hex strings.	2018-09-26 22:33:48 +01:00
David Steele	51484a008f	Add bufNewZ() to Buffer object. This constructor creates a Buffer object directly from a zero-terminated string. The old way was to create a String object first, then convert that to a Buffer using bufNewStr(). Updated in all places that used the old pattern.	2018-09-26 18:46:52 +01:00
David Steele	d038b9a029	Support configurable WAL segment size. PostgreSQL 11 introduces configurable WAL segment sizes, from 1MB to 1GB. There are two areas that needed to be updated to support this: building the archive-get queue and checking that WAL has been archived after a backup. Both operations require the WAL segment size to properly build a list. Checking the archive after a backup is still implemented in Perl and has an active database connection, so just get the WAL segment size from the database. The archive-get command does not have a connection to the database, so get the WAL segment size from pg_control instead. This requires a deeper inspection of pg_control than has been done in the past, so it seemed best to copy the relevant data structures from each version of PostgreSQL and build a generic interface layer to address them. While this approach is a bit verbose, it has the advantage of being relatively simple, and can easily be updated for new versions of PostgreSQL. Since the integration tests generate pg_control files for testing, teach Perl how to generate files with the correct offsets for both 32-bit and 64-bit architectures.	2018-09-25 10:24:42 +01:00
David Steele	c0b0b4e541	PostgreSQL 11 Beta 4 support. Catalog version changed for this release, so update it. Also update and upload a new container with beta 4 installed.	2018-09-21 13:25:27 -04:00
Cynthia Shang	880fbb5e57	Add checksum delta for incremental backups. Use checksums rather than timestamps to determine if files have changed. This is useful in cases where the timestamps may not be trustworthy, e.g. when performing an incremental after failing over to a standby. If checksum delta is enabled then checksums will be used for verification of resumed backups, even if they are full. Resumes have always used checksums to verify the files in the repository, enabling delta performs checksums on the database files as well. Note that the user must manually enable this feature in cases were it would be useful or just keep in enabled all the time. A future commit will address automatically enabling the feature in cases where it seems likely to be useful. Contributed by Cynthia Shang.	2018-09-19 11:12:45 -04:00
Cynthia Shang	b6b2c915b2	Allow hashSize() to run on remote storage. Apparently we never needed to run this function remotely. It will be needed by the backup checksum delta feature, so implement it now. Contributed by Cynthia Shang.	2018-09-18 11:39:48 -04:00
David Steele	03003562d8	Merge all posix storage tests into a single unit. As we add storage drivers it's important to keep the tests for each completely separate. Rather than have three tests for each driver, standardize on having a single test unit for each driver.	2018-09-17 11:45:41 -04:00
David Steele	e55d733041	Add -ftree-coalesce-vars option to unit test compilation. This is a workaround for inefficient handling of many setjmps in gcc >= 4.9. Setjmp is used in all error handling, but in the unit tests each test macro contains an error handling block so they add up pretty quickly for large unit tests. Enabling -ftree-coalesce-vars in affected versions reduces build time and memory requirements by nearly an order of magnitude. Even so, compiles are much slower than gcc <= 4.8. We submitted a bug for this at: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87316 Which was marked as a duplicate of: https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63155	2018-09-17 11:38:10 -04:00
David Steele	b5f749b21c	Add CIFS driver to storage helper for read-only repositories. For read-only repositories the Posix and CIFS drivers behave exactly the same. Since that's all we support in C right now it's valid to treat them as the same thing. An assertion has been added to remind us to add the CIFS driver before allowing the repository to be writable. Mostly we want to make sure that the C code does not blow up when the repository type is CIFS.	2018-09-16 18:41:30 -04:00
David Steele	a6c346cb04	Clear test directory between test runs. Previously it was the responsibility of the individual tests to clean up after themselves. Now the test harness now does the cleanup automatically. This means that some paths/files need to be recreated with each run but that doesn't happen very often. An attempt has been made to remove all redundant cleanup code but it's hard to know if everything has been caught. No issues will be caused by anything that was missed, but they will continue to chew up time in the tests.	2018-09-16 17:26:04 -04:00
David Steele	4119ce208d	Move test expect log out of the regular test directory. Storing the expect log (created by common/harnessLog) in the regular test directory was not ideal. It showed up in tests and made it difficult to clear the test directory between each run. Move the expect log to a purpose-built directory one level up so it does not interfere with regular testing.	2018-09-16 15:58:46 -04:00
David Steele	8852622fa2	Fix missing test caused by a misplaced YAML tag.	2018-09-16 15:53:19 -04:00
David Steele	c47fc28c8b	Make comment blocks consistent across all tests.	2018-09-16 14:15:21 -04:00
David Steele	84ab787b1a	Merge protocol storage helper into storage helper. These are separated the same way in the Perl code where the remote storage driver is located in the Protocol module. However, in the C code the intention is to implement the remote storage driver as a regular driver in the storage layer rather than making a special case out of it. So, merge the storage helpers. This also has the benefit of making the code a bit simpler. Also separate storageSpool() and storageSpoolWrite() to make it clearer which operations require write access and to maintain consistency with the other storage helper functions.	2018-09-16 14:12:53 -04:00
David Steele	3014b05dab	Fix buffer underrun in configuration test harness. If the total bytes read from the expect log file was 0 then the last byte of whatever was in memory before harnessLogBuffer would be set to 0. On 32-bit systems this expressed as the high order byte of a pointer being cleared and wackiness (in the form of segfaults) ensued.	2018-09-16 13:41:49 -04:00
David Steele	0c02481d6e	Update all interfaces to use variable parameter constructors. Fixed parameter constructors made adding new interface functions a burden, so we switched to using structs to define interfaces in the storage module at `c49eaec7`. While propagating this pattern to the IO interfaces it became obvious that the existing variable parameter function pattern (begun in the storage module) was more succinct and consistent with the existing code. So, use variable parameter functions to define all interfaces. This assumes that the non-interface parameters will be fixed, which seems reasonable for low-level code.	2018-09-15 21:07:00 -04:00
David Steele	f0ed89f21f	Allow C or Perl coverage to run on more than one VM. C or Perl coverage tests can now be run on any VM provided a recent enough version of Devel::Cover or lcov is available. For now, leave u18 as the only VM to run coverage tests due to some issues with older versions of lcov.	2018-09-15 13:27:06 -04:00
David Steele	31cdd9d20b	Remove compiler warnings that are not valid for u16.	2018-09-15 08:23:55 -04:00
David Steele	c49eaec776	Full abstraction of storage driver interface. The external storage interfaces (Storage, StorageFileRead, etc.) have been stable for a while, but internally they were calling the posix driver functions directly. Create driver interfaces for storage, fileRead, and fileWrite and remove all references to the posix driver outside storage/driver/posix (with the exception of a direct call to pathRemove() in Perl LibC). Posix is still the only available driver so more adjustment may be needed, but this should represent the bulk of the changes.	2018-09-14 16:08:33 -04:00
David Steele	aeb1fa3dfb	Don't perform valgrind when requested. The --no-valgrind flag was not being honored. It's not clear if this flag ever worked, but it does now.	2018-09-13 19:12:40 -04:00
David Steele	fd14ceb399	Rename posix driver files/functions for consistency. The posix driver was developed over time and the naming is not very consistent. Rename the files and functions to work well with other drivers and generally favor longer names since the driver functions are seldom (eventually never) used outside the driver itself.	2018-09-13 18:58:22 -04:00
David Steele	ab1762663c	Don't use negations in objects below Storage. The Storage object represents some some optional parameters as negated if the default is true. This allows sensible defaults without having to specify most optional parameters. However, there's no need to propagate this down to functions that require all parameters to be passed -- it makes the code and logging more confusing. Rename the parameters and update logic to remove negations.	2018-09-13 17:53:48 -04:00
David Steele	5aa458ffae	Simplify debug logging by allowing log functions to return String objects. Previously, debug log functions had to handle NULLs and truncate output to the available buffer size. This was verbose for both coding and testing. Instead, create a function/macro combination that allows log functions to return a simple String object. The wrapper function takes care of the memory context, handles NULLs, and truncates the log string based on the available buffer size.	2018-09-11 18:32:56 -04:00
David Steele	9b9396c7b7	Migrate local, unencrypted, non-S3 archive-get command to C. The archive-get command will only be executed in C if the repository is local, unencrypted, and type posix or cifs. Admittedly a limited use case, but this is just the first step in migrating the archive-get command entirely into C. This is a direct migration from the Perl code (including messages) to integrate as seamlessly with the remaining Perl code as possible. It should not be possible to determine if the C version is running unless debug-level logging is enabled.	2018-09-11 15:42:31 -04:00
David Steele	787e7c295f	Change locking around async process forking to be more test friendly. The lock is now released before the fork and reacquired after the fork so the parent process no longer needs to worry about clearing the lock. This is the same locking mechanism that will be used once archive-get-async is exec'd as a separate command, so introduce it now to simplify testing.	2018-09-11 14:43:16 -04:00
David Steele	9e574a37dc	Make archive-get info messages consistent between C and Perl implementations. The info messages were spread around and logged differently based on the execution path and in some cases logged nothing at all. Temporarily track the async server status with a flag so that info messages are not output in the async process. The async process will be refactored as a separate command to be exec'd in a future commit.	2018-09-11 12:30:48 -04:00
David Steele	6c1d48b018	Command-line help update missed in `cabb3f857`.	2018-09-11 10:55:55 -04:00
Cynthia Shang	e351b8c67c	Improve info command to display the stanza cipher type. Contributed by Cynthia Shang. Suggested by Douglas J Hunley.	2018-09-10 13:09:45 -04:00
David Steele	c688bc8627	Improve support for special characters in filenames. % characters caused issues in backup/restore due to filenames being appended directly into a format string. Reserved XML characters (<>&') caused issues in the S3 driver due to improper escaping. Add a file with all common special characters to regression testing.	2018-09-10 10:54:34 -04:00
David Steele	f7fc8422f7	Make Valgrind return an error even when a non-fatal issue is detected. By default Valgrind does not exit with an error code when a non-fatal error is detected, e.g. unfreed memory. Use the --error-exitcode option to enabled this behavior. Update some minor issues discovered in the tests as a result. Luckily, no issues were missed in the core code.	2018-09-07 16:50:01 -07:00
David Steele	faaa9a91fd	Fix typo in unit test error messages, EXECTED => EXPECTED.	2018-09-07 08:03:56 -07:00
David Steele	6e9b6fdca9	Migrate control functions to detect stop files to C from Perl. Basic functions to detect the presence of stanza or all stop files and error when they are present. The functionality to detect stop files without error was not migrated. This functionality is only used by stanza-delete and will be migrated with that command.	2018-09-07 08:03:05 -07:00
David Steele	5bdaa35fa5	Migrate walIsPartial(), walIsSegment(), and walSegmentFind() from Perl to C. Also refactor regular expression defines to make them more reusable.	2018-09-07 08:00:18 -07:00
David Steele	9660076093	Add helper for repository storage. Implement rules for generating paths within the archive part of the repository. Add a helper function, storageRepo(), to create the repository storage based on configuration settings. The repository storage helper is located in the protocol module because it will support remote file systems in the future, just as the Perl version does. Also, improve the existing helper functions a bit using string functions that were not available when they were written.	2018-09-07 07:58:08 -07:00
David Steele	960ad73298	Info objects now parse JSON and use specified storage. Use JSON code now that it is available and remove temporary hacks used to get things working initially. Use passed storage objects rather than using storageLocal(). All storage objects in C are still local but this won't always be the case. Also, move Postgres version conversion functions to postgres/info.c since they have no dependency on the info objects and will likely be useful elsewhere.	2018-09-06 10:12:14 -07:00
David Steele	de1b74da0c	Move encryption in mock/archive tests to remote tests. The new archive-get C code can't run (yet) when encryption is enabled. Therefore move the encryption tests so we can test the new C code. We'll move it back when encryption is enabled in C. Also, push one WAL segment with compression to test decompression in the C code.	2018-09-06 09:35:34 -07:00
David Steele	6361a06181	Fix incorrectly reported error return in info logging. A return code of 1 from the archive-get was being logged as an error message at info level but otherwise worked correctly. Also improve info messages when an archive segment is or is not found.	2018-09-04 21:46:41 -04:00
David Steele	800afeef70	Posix file functions now differentiate between open and missing errors. The Perl functions do so and the integration tests rely on checking for these errors. This has been exposed as more functionality is moved into C. Passing the errors types is now a bit complicated so instead use a flag to determine which errors to throw.	2018-09-04 17:47:23 -04:00
David Steele	375ff9f9d2	Ignore all files in a linked tablespace directory except the subdirectory for the current version of PostgreSQL. Previously an error would be generated if other files were present and not owned by the PostgreSQL user. This hasn't been a big deal in practice but it could cause issues. Also add tests to make sure the same logic applies with links to files, i.e. all other files in the directory should be ignored. This was actually working correctly, but there were no tests for it before.	2018-08-31 16:06:40 -04:00
David Steele	bc7462d86d	v2.05: Environment Variable Options and Exclude Temporary/Unlogged Relations Bug Fixes: * Fix issue where relative links in $PGDATA could be stored in the backup with the wrong path. This issue did not affect absolute links and relative tablespace links were caught by other checks. (Reported by Cynthia Shang.) * Remove incompletely implemented online option from the check command. Offline operation runs counter to the purpose of this command, which is to check if archiving and backups are working correctly. (Reported by Jason O'Donnell.) * Fix issue where errors raised in C were not logged when called from Perl. pgBackRest properly terminated with the correct error code but lacked an error message to aid in debugging. (Reported by Douglas J Hunley.) * Fix issue when a boolean option (e.g. delta) was specified more than once. (Reported by Yogesh Sharma.) Features: * Allow any option to be set in an environment variable. This includes options that previously could only be specified on the command line, e.g. stanza, and secret options that could not be specified on the command-line, e.g. repo1-s3-key-secret. * Exclude temporary and unlogged relation (table/index) files from backup. Implemented using the same logic as the patches adding this feature to PostgreSQL, 8694cc96 and 920a5e50. Temporary relation exclusion is enabled in PostgreSQL ≥ 9.0. Unlogged relation exclusion is enabled in PostgreSQL ≥ 9.1, where the feature was introduced. (Contributed by Cynthia Shang.) * Allow arbitrary directories and/or files to be excluded from a backup. Misuse of this feature can lead to inconsistent backups so read the --exclude documentation carefully before using. (Reviewed by Cynthia Shang.) * Add log-subprocess option to allow file logging for local and remote subprocesses. * PostgreSQL 11 Beta 3 support. Improvements: * Allow zero-size files in backup manifest to reference a prior manifest regardless of timestamp delta. (Contributed by Cynthia Shang.) * Improve asynchronous archive-get/archive-push performance by directly checking status files. (Contributed by Stephen Frost.) * Improve error message when a command is missing the stanza option. (Suggested by Sarah Conway.)	2018-08-31 13:19:43 -04:00
David Steele	d41570c37a	Improve log file names for remote processes started by locals. The log-subprocess feature added in `22765670` failed to take into account the naming for remote processes spawned by local processes. Not only was the local command used for the naming of log files but the process id was not pass through. This meant every remote log was named "[stanza]-local-remote-000" which is confusing and meant multiple processes were writing to the same log. Instead, pass the real command and process id to the remote. This required a minor change in locking to ignore locks if process id is greater than 0 since remotes started by locals never lock.	2018-08-31 11:31:13 -04:00
David Steele	c2d0a21d63	Allow secrets to be passed via environment variables. When environment variables were added in `d0b9f986` they were classified as cfgSourceParam, but one of the restrictions on this type is that they can't pass secrets because they might be exposed in the process list. The solution is to reclassify environment variables as cfgSourceConfig. This allows them to handle secrets because they will not pass values to subprocesses as parameters. Instead, each subprocess is expected to check the environment directly during configuration parsing. In passing, move the error about secrets being passed on the command-line up to command-line parsing and make the error more generic with respect to the configuration file now that multiple configuration files are allowed.	2018-08-30 18:44:40 -04:00
David Steele	70514061fd	Fix issue where relative links in $PGDATA could be stored in the backup with the wrong path. Relative link paths were being combined with the paths of previous links (relative or absolute) due to the $strPath variable being modified in the current iteration rather than simply being passed to the next level of recursion. This issue did not affect absolute links and relative tablespace links were caught by other checks, though the error was confusing. Reported by Cynthia Shang.	2018-08-30 16:27:36 -04:00
David Steele	c638490451	Documentation updates for exclude feature based on review. Reviewed by Cynthia Shang.	2018-08-28 16:49:29 -04:00
David Steele	14cde54b37	Limit manifest build recursion (i.e. links followed) to sixteen levels to detect link loops.	2018-08-28 16:27:10 -04:00
David Steele	a6cecf7d5e	Prevent manifest from being built more than once.	2018-08-28 16:22:30 -04:00
David Steele	bef58a7974	Allow arbitrary directories and/or files to be excluded from a backup. Misuse of this feature can lead to inconsistent backups so read the --exclude documentation carefully before using.	2018-08-27 15:51:05 -04:00
David Steele	4c3cf435c0	Add cvtZToUInt() to convert string to unsigned int.	2018-08-24 17:08:00 -04:00
Cynthia Shang	eb30d88b6a	Allow zero-size files in backup manifest to reference a prior manifest regardless of timestamp delta. Contributed by Cynthia Shang.	2018-08-24 16:50:33 -04:00
David Steele	0ed37ab9e7	Update Archive::Info->archiveIdList() to return a valid error code instead of unknown.	2018-08-24 12:13:10 -04:00
David Steele	2276567027	Add log-subprocess option to allow file logging for local and remote subprocesses.	2018-08-22 20:05:49 -04:00
David Steele	512da125ad	Improve error reporting for TEST_ASSIGN() and TEST_RESULT_VOID() macros.	2018-08-22 19:03:13 -04:00
David Steele	ad989068d2	Fix issue when a boolean option (e.g. delta) was specified more than once. Reported by Yogesh Sharma.	2018-08-22 16:45:25 -04:00
David Steele	8a8738308c	Enable -Wvla.	2018-08-22 14:48:37 -04:00
David Steele	de5614db6b	PostgreSQL 11 Beta 3 support.	2018-08-15 14:05:39 -04:00
David Steele	d0b9f986a0	Allow any option to be set in an environment variable. This includes options that previously could only be specified on the command line, e.g. stanza.	2018-08-15 10:52:53 -04:00
David Steele	4a822d3032	Correct OptionInvalidError to OptionInvalidValueError in boolean option parsing.	2018-08-15 10:32:18 -04:00
David Steele	cb4b715533	Add strReplaceChr() to String object.	2018-08-14 16:49:38 -04:00
David Steele	4a176681c3	Add cvtCharToZ() and macro for debugging char params.	2018-08-14 16:18:17 -04:00
David Steele	9e3273fdf9	Define cipher magic size with sizeof() rather than using a constant.	2018-08-14 16:08:58 -04:00
David Steele	6643afe9a8	Add gzip compression/decompression filters for C.	2018-08-14 14:56:59 -04:00
David Steele	e3ff6b209d	Filters can now produce output that differs from input. This allows filters such as compression, encryption, etc. to be implemented.	2018-08-14 14:21:53 -04:00
David Steele	3434240097	Remove incompletely implemented online option from the check command. Offline operation runs counter to the purpose of this command, which is to check if archiving and backups are working correctly. Reported by Jason O'Donnell.	2018-08-12 19:24:21 -04:00
David Steele	cd5df3570b	Validate configuration options in a single pass. By pre-calculating and storing the option dependencies in parse.auto.c validation can be completed in a single pass, which is both simpler and faster.	2018-08-11 12:55:33 -04:00
David Steele	f06bf9e832	Improve error message when a command is missing the stanza option. Suggested by Sarah Conway.	2018-08-11 09:47:07 -04:00
Cynthia Shang	8ab2e72960	Migrate minimum set of code for reading archive.info files from Perl to C. Contributed by Cynthia Shang.	2018-08-09 08:57:21 -04:00
David Steele	7993f1a966	Add basic C JSON parser.	2018-08-09 08:06:23 -04:00
David Steele	31167d8f98	Enable -Wduplicated-branches, and -Wduplicated-cond.	2018-08-05 08:35:18 -04:00
David Steele	14f21a8f6c	Enable -Wpointer-arith.	2018-08-04 19:37:28 -04:00
David Steele	429a356e33	Enable -Wstrict-prototypes and update all void functions to conform.	2018-08-03 19:19:14 -04:00
Cynthia Shang	bec4c176dc	Exclude temporary and unlogged relation (table/index) files from backup. Implemented using the same logic as the patches adding this feature to PostgreSQL, 8694cc96 and 920a5e50. Temporary relation exclusion is enabled in PostgreSQL ≥ 9.0. Unlogged relation exclusion is enabled in PostgreSQL ≥ 9.1, where the feature was introduced. Contributed by Cynthia Shang.	2018-07-30 18:53:34 -04:00
David Steele	7a00bbebfa	Do nothing in memContextMove() when the context is already in the specified parent.	2018-07-28 09:04:06 -04:00
David Steele	666537cbde	Update code count for new file types and exclusions.	2018-07-28 08:13:30 -04:00
David Steele	01aea0c067	Implement filters that do not modify the buffer. Update cryptoHash to use the new interface.	2018-07-24 21:08:27 -04:00
David Steele	ae72772e5b	Fix typo in `18626306`.	2018-07-23 07:17:24 -04:00
David Steele	1862630629	Use pre-built images from Docker Hub when the container definition has not changed. Downloading an image is quite a bit faster than building a new image from scratch and saves minutes per test run in CI.	2018-07-21 17:02:42 -04:00
David Steele	8568622a6f	Add --log-level-test option. This allows setting the test log level independently from the general test harness setting, but current only works for the C tests. It is useful for seeing log output from functions on the console while a test is running.	2018-07-20 19:03:46 -04:00
David Steele	58e9f1e50c	Refactor the common/log tests to not depend on common/harnessLog. common/harnessLog was not ideally suited for general testing and made all the tests quite awkward. Instead, move all code used to test the common/log module into the logTest module and repurpose common/harnessLog to do log expect testing for all other tests in a cleaner way. Add a few exceptions for config testing since the log levels are reset by default in config/parse.	2018-07-20 18:51:42 -04:00
David Steele	1359e2908c	Fix issue where errors raised in C were not logged when called from Perl. pgBackRest properly terminated with the correct error code but lacked an error message to aid in debugging. Reported by Douglas J Hunley.	2018-07-20 08:11:34 -04:00
David Steele	d3cfeebdf9	Rename error-handling variables in Main.pm to conform to standard.	2018-07-20 08:03:44 -04:00
David Steele	fa53e2eddb	Show exact log level required for stack trace param output instead of just "debug".	2018-07-20 07:15:28 -04:00
David Steele	0ac176b722	Abstract IO layer out of the storage layer. This allows the routines to be used for IO objects that do not have a storage representation. Implement buffer read and write IO objects.	2018-07-19 16:04:20 -04:00
David Steele	5dc8a2ec08	storageFileRead() accepts a buffer for output rather than creating one. This is more efficient overall and allows the caller to specify how many bytes will be read on each call. Reads are appended if the buffer already contains data but the buffer size will never increase. Allow Buffer object "used size" to be different than "allocated size". Add functions to manage used size and remaining size and update automatically when possible.	2018-07-17 19:01:54 -04:00
Cynthia Shang	0acf705416	Require PostgreSQL catalog version when instantiating a Manifest object (and not loading it from disk). Contributed by Cynthia Shang.	2018-07-16 17:25:15 -04:00
Cynthia Shang	4e38cbaea9	Add iniSectionList() to Ini object and remove dead code. Contributed by Cynthia Shang.	2018-07-12 15:28:46 -04:00
Cynthia Shang	0e6b927a17	Add uint64 variant type and supporting conversion functions. Contributed by Cynthia Shang. Reviewed by Stephen Frost.	2018-07-12 15:23:18 -04:00
David Steele	0e331b12ba	Improve performance of string to int conversion. Use strtoll() instead of sprintf() for conversion. Also use available integer min/max constants rather than hard-coded values. Reviewed by Stephen Frost. Suggested by Stephen Frost.	2018-07-12 15:15:32 -04:00
David Steele	cf889790e6	Error if LibC build is performed outside test environment. LibC is no longer required for production builds.	2018-07-10 15:39:03 -04:00
David Steele	22adb395db	Add zero-length file to mock/all test.	2018-07-05 15:40:50 -04:00
David Steele	db17973cd0	Fix critical bug in resume that resulted in inconsistent backups. A regression in v0.82 removed the timestamp comparison when deciding which files from the aborted backup to keep on resume. All resumed backups should be considered inconsistent. A resumed backup can be identified by checking the log for the message "aborted backup of same type exists, will be cleaned to remove invalid files and resumed". Reported by David Youatt, Yogesh Sharma, Stephen Frost.	2018-07-03 14:01:57 -04:00
David Steele	3d07e01fe9	Make ls ordering deterministic in mock/all test.	2018-07-01 20:02:37 -04:00
Andrew Schwartz	1bd98b61df	Fix non-compliant ISO-8601 timestamp format in S3 authorization headers. AWS and some gateways were tolerant of space rather than zero-padded hours while others were not. Fixed by Andrew Schwartz.	2018-07-01 08:17:27 -04:00
David Steele	7e65ddad34	PostgreSQL 11 Beta 2 support.	2018-06-30 14:55:25 -04:00
David Steele	7b0e65d488	Improve the HTTP client to set content-length to 0 when not specified by the server. S3 (and gateways) always set content-length or transfer-encoding but HTTP 1.1 does not require it and proxies (e.g. HAProxy) may not include either. Suggested by Adam K. Sumner.	2018-06-26 17:27:22 -04:00
David Steele	e8c68b9c66	Update parameters for VBoxService start.	2018-06-26 16:56:05 -04:00
David Steele	7fee739710	Remove Debian package patch since it has been committed upstream.	2018-06-20 18:26:53 -04:00
David Steele	3793ae1e4f	Convert the not very portable uint type to unsigned int. Suggested by Devrim Gündüz.	2018-06-14 17:59:10 -04:00
David Steele	d55e609959	Use a prebuilt s3 server container for documentation and tests.	2018-06-12 13:43:15 -04:00
David Steele	350b30fa49	Move cryptographic hash functions to C using OpenSSL.	2018-06-11 14:52:26 -04:00
David Steele	064ec757e9	Rename cipher module to the more general crypto.	2018-06-11 10:53:16 -04:00
David Steele	835396751f	Document generator improvements. * Build containers from scratch for more accurate testing. * Allow environment load to be skipped. * Allow bash wrapping to be skipped. * Allow forcing a command to run as a user without sudo.	2018-06-10 14:13:56 -04:00

... 4 5 6 7 8 ...

1142 Commits