History

Ed Santiago 0f78f345d9 Fix race condition in kill test leading to hang When you open a FIFO for reading, but there's no writer, you hang. This is just one of those obscure UNIXisms we all know but just forget all too often. My last PR was guilty of introducing such a condition; I caught it by accident while testing other stuff. In short, the signal container was doing 'echo DONE' as its last step, and we (BATS) were reading the FIFO to check for it; but if the container exited before we opened the FIFO for read, the open would hang. This is not a hang that we can catch in the test: it would hang the entire job forever. CI would presumably time out eventually, but with no useful indication of the cause of the error. Solution: use 'exec' to open the FIFO early and keep it open, and use 'read -u FD' instead of 'read <$fifo': the former reads from an open FD, the latter forces a new open() each time. There is a shorter, more maintainable solution -- see #4755 -- but that suffers from the same hanging problem in the (unlikely) case where the signal-handling container exits, e.g. if signal handling is broken in podman. The test would hang, with no helpful indicator. Although this PR is a little more advanced scripting, I have commented the relevant code well and believe the maintenance cost is worth the risk of undebuggable hangs. There is still a hang risk: if 'podman logs -f' fails and exits immediately, the 'exec' will hang. I can't think of a non-racy way to prevent that, and choose to live with that risk. Tested by temporarily including 9 (SIGKILL) in the signals list. The read timeout triggers, and the end user has a fair chance of tracking down the root cause. Signed-off-by: Ed Santiago <santiago@redhat.com>		2019-12-28 08:00:05 -07:00
..
000-TEMPLATE	BATS: new tests, and improvements to existing ones	2019-03-18 15:21:52 -06:00
001-basic.bats	implement 'make remotesystem'	2019-08-08 11:44:55 -06:00
005-info.bats	Add a MissingRuntime implementation	2019-10-15 15:59:20 -04:00
010-images.bats	podman images history test - clean up	2019-12-12 08:56:44 -07:00
015-help.bats	stats: list all running containers unless specified otherwise	2019-10-23 13:48:54 +02:00
030-run.bats	system tests: run test: reenable and fix	2019-09-24 08:46:40 -06:00
035-logs.bats	bats - various small updates	2019-04-18 15:39:21 -06:00
040-ps.bats	Implement review feedback	2019-03-07 14:09:00 -07:00
050-stop.bats	Implement review feedback	2019-03-07 14:09:00 -07:00
055-rm.bats	system tests: reenable skipped tests	2019-10-02 06:22:36 -06:00
060-mount.bats	BATS tests: start supporting podman-remote	2019-04-10 08:19:09 -06:00
065-cp.bats	system tests: reenable skipped tests	2019-10-02 06:22:36 -06:00
070-build.bats	test for #3920 (improper caching of tarballs in build)	2019-12-05 07:28:18 -07:00
075-exec.bats	System tests: make sure exec pid hash w/o leaking	2019-10-18 15:04:09 +08:00
110-history.bats	BATS tests: start supporting podman-remote	2019-04-10 08:19:09 -06:00
120-load.bats	BATS tests - get working again	2019-06-11 06:53:03 -06:00
130-kill.bats	Fix race condition in kill test leading to hang	2019-12-28 08:00:05 -07:00
200-pod-top.bats	implement 'make remotesystem'	2019-08-08 11:44:55 -06:00
250-generate-systemd.bats	Move skipping systemd tests to early setup.	2019-07-06 20:45:26 +03:00
300-cli-parsing.bats	BATS: new tests, and improvements to existing ones	2019-03-18 15:21:52 -06:00
400-unprivileged-access.bats	System-tests: Use bash explicitly	2019-10-02 14:32:31 -04:00
README.md	implement 'make remotesystem'	2019-08-08 11:44:55 -06:00
TODO.md	System-test: Documentation and TODO list	2019-03-06 12:14:49 -05:00
helpers.bash	podman images history test - clean up	2019-12-12 08:56:44 -07:00
helpers.t	new system tests under BATS	2019-03-07 13:09:54 -07:00

README.md

Quick overview of podman system tests. The idea is to use BATS, but with a framework for making it easy to add new tests and to debug failures.

Quick Start

Look at 030-run.bats for a simple but packed example. This introduces the basic set of helper functions:

setup (implicit) - resets container storage so there's one and only one (standard) image, and no running containers.
parse_table - you can define tables of inputs and expected results, then read those in a while loop. This makes it easy to add new tests. Because bash is not a programming language, the caller of parse_table sometimes needs to massage the returned values; 015-run.bats offers examples of how to deal with the more typical such issues.
run_podman - runs command defined in $PODMAN (default: 'podman' but could also be './bin/podman' or 'podman-remote'), with a timeout. Checks its exit status.
is - compare actual vs expected output. Emits a useful diagnostic on failure.
die - output a properly-formatted message to stderr, and fail test
skip_if_rootless - if rootless, skip this test with a helpful message.
skip_if_remote - like the above, but skip if testing podman-remote
random_string - returns a pseudorandom alphanumeric string

Test files are of the form NNN-name.bats where NNN is a three-digit number. Please preserve this convention, it simplifies viewing the directory and understanding test order. In particular, 00x tests should be reserved for a first-pass fail-fast subset of tests:

bats test/system/00*.bats || exit 1
bats test/system

...the goal being to provide quick feedback on catastrophic failures without having to wait for the entire test suite.

Running tests

To run the tests locally in your sandbox, you can use one of these methods:

make;PODMAN=./bin/podman bats ./test/system/070-build.bats # runs just the specified test
make;PODMAN=./bin/podman bats ./test/system # runs all

To test as root:

$ PODMAN=./bin/podman sudo --preserve-env=PODMAN bats test/system

Analyzing test failures

The top priority for this scheme is to make it easy to diagnose what went wrong. To that end, podman_run always logs all invoked commands, their output and exit codes. In a normal run you will never see this, but BATS will display it on failure. The goal here is to give you everything you need to diagnose without having to rerun tests.

The is comparison function is designed to emit useful diagnostics, in particular, the actual and expected strings. Please do not use the horrible BATS standard of [ x = y ]; that's nearly useless for tracking down failures.

If the above are not enough to help you track down a failure:

Debugging tests

Some functions have dprint statements. To see the output of these, set PODMAN_TEST_DEBUG="funcname" where funcname is the name of the function or perhaps just a substring.

Requirements

The jq tool is needed for parsing JSON output.

Further Details

TBD. For now, look in helpers.bash; each helper function has (what are intended to be) helpful header comments. For even more examples, see and/or run helpers.t; that's a regression test and provides a thorough set of examples of how the helpers work.