automation-tests

Commit Graph

Author	SHA1	Message	Date
Paul Holzinger	b237b4dc2a	libpod: journald do not lock thread This is not needed and was added by during debugging but it turned out to be something else. We should not lock the thread unless needed because this just raises question why it is here otherwise. Also the lock would not do much as we spawn a goroutine below anyway so it runs on another thread no matter what. From the review comment by Miloslav but it was merged before I had the chance to fix it: https://github.com/containers/podman/pull/24406#discussion_r1828102666 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-11-05 11:32:51 +01:00
Paul Holzinger	768ad8653a	rework event code to improve API errors One of the problems with the Events() API was that you had to call it in a new goroutine. This meant the the error returned by it had to be read back via a second channel. This cuased other bugs in the past but here the biggest problem is that basic errors such as invalid since/until options were not directly returned to the caller. It meant in the API we were not able to write http code 200 quickly because we always waited for the first event or error from the channels. This in turn made some clients not happy as they assume the server hangs on time out if no such events are generated. To fix this we resturcture the entire event flow. First we spawn the goroutine inside the eventer Read() function so not all the callers have to. Then we can return the basic error quickly without the goroutine. The caller then checks the error like any normal function and the API can use this one to decide which status code to return. Second we now return errors/event in one channel then the callers can decide to ignore or log them which makes it a bit more clear. Fixes `c46884aa93` ("podman events: check for an error after we finish reading events") Fixes #23712 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-11-01 18:54:13 +01:00
Jan Rodák	de856dab99	Add --health-max-log-count, --health-max-log-size, --health-log-destination flags These flags can affect the output of the HealtCheck log. Currently, when a container is configured with HealthCheck, the output from the HealthCheck command is only logged to the container status file, which is accessible via `podman inspect`. It is also limited to the last five executions and the first 500 characters per execution. This makes debugging past problems very difficult, since the only information available about the failure of the HealthCheck command is the generic `healthcheck service failed` record. - The `--health-log-destination` flag sets the destination of the HealthCheck log. - `none`: (default behavior) `HealthCheckResults` are stored in overlay containers. (For example: `$runroot/healthcheck.log`) - `directory`: creates a log file named `<container-ID>-healthcheck.log` with JSON `HealthCheckResults` in the specified directory. - `events_logger`: The log will be written with logging mechanism set by events_loggeri. It also saves the log to a default directory, for performance on a system with a large number of logs. - The `--health-max-log-count` flag sets the maximum number of attempts in the HealthCheck log file. - A value of `0` indicates an infinite number of attempts in the log file. - The default value is `5` attempts in the log file. - The `--health-max-log-size` flag sets the maximum length of the log stored. - A value of `0` indicates an infinite log length. - The default value is `500` log characters. Add --health-max-log-count flag Signed-off-by: Jan Rodák <hony.com@seznam.cz> Add --health-max-log-size flag Signed-off-by: Jan Rodák <hony.com@seznam.cz> Add --health-log-destination flag Signed-off-by: Jan Rodák <hony.com@seznam.cz>	2024-09-25 14:01:35 +02:00
Paul Holzinger	9ee96a9569	properly implement pull-error event status Commit `03f6589f3` added basic support for pull-error event from libimage but it contains several problems: 1. storing the error as error type prevents it from being unmarshalled, thus change it to a string 2. the error was never propagated from the libimage event to the podman event struct 3. the error message was not wired into the cli and API This commit fixes these problems. Fixes #21458 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2024-03-05 10:50:42 +01:00
Matt Heon	03f6589f34	Fix events by fully adding the new PullError event Signed-off-by: Matt Heon <mheon@redhat.com>	2024-02-29 15:23:17 -05:00
Matt Heon	72f1617fac	Bump Go module to v5 Moving from Go module v4 to v5 prepares us for public releases. Move done using gomove [1] as with the v3 and v4 moves. [1] https://github.com/KSubedi/gomove Signed-off-by: Matt Heon <mheon@redhat.com>	2024-02-08 09:35:39 -05:00
Oleksandr Redko	2a2d0b0e18	chore: delete obsolete // +build lines Signed-off-by: Oleksandr Redko <Oleksandr_Redko@epam.com>	2024-01-04 11:53:38 +02:00
Chetan Giradkar	572f38c0db	Set correct exitcode in remove events and change ContainerExitCode from int to int ptr Added additional check for event type to be remove and set the correct exitcode. While it was getting difficult to maintain the omitempty notation for Event->ContainerExitCode, changing the type from int to int ptr gives us the ability to check for ContainerExitCode to be not nil and continue operations from there. closes #19124 Signed-off-by: Chetan Giradkar <cgiradka@redhat.com>	2023-11-28 13:31:18 +00:00
Daniel J Walsh	c2de6d34ca	Run codespell on podman Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2023-11-06 09:23:16 -06:00
Paul Holzinger	99a14332ef	healthcheck: make sure to always show health_status events This fixes a regression caused by commit `7e6e267329`, unfortunately this was not caught during review as for some reason this works fine rootless and only fails as root. Because we set the systemd log level to notice in order to hide the unit started/stopped messages to prevent spamming the journal the issue is that this now also causes systemd to ignore the events we write to journald as we also send them as info level. To fix this we simply send health_status events now on notice level. I decided against sending all events on notice as I think info is fine for them. Whenever the notice level is right is of course debatable but given it may contain the unhealthy message I think having this a notice should be ok. The main reason this made it through testing is because we do not rely on the systemd unit to fire healthchecks in the tests as this is flaky. There is one test were we rely on it though and I added a check there to make sure events are displayed correctly when trigger via systemd. Fixes #20342 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2023-10-12 15:02:32 +02:00
Erik Sjölund	b5ce0ab2de	Fix language, typos and markdown layout [NO NEW TESTS NEEDED] Signed-off-by: Erik Sjölund <erik.sjolund@gmail.com>	2023-07-24 11:18:25 +02:00
Paul Holzinger	2ab90f2ed6	journald: podman events only show events for current user I noticed this while running some things in parallel, podman events would show events from other users. Because all events are written to the journal everybody can see them. So when we read the journal we must filter events for only the current UID. To reproduce run `podman events` as user then in another window create a container as root for example. After this patch it will correctly ignore these events from other users. [NO NEW TESTS NEEDED] I don't think we can test with two users at the same time. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2023-01-27 16:20:32 +01:00
Paul Holzinger	767947ab88	journald: podman logs --until --follow exit after time When you use podman logs with --until and --follow it should exit after the requested until time and not keep hanging forever. To make this work I reworked the code to use the better journald event reading code for logs as well. this correctly uses the sd_journal API without having to compare the cursors to find the EOF. The same problems exists for the k8s-file driver, I will fix this in the next commit. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2023-01-06 15:19:22 +01:00
Paul Holzinger	c674b3dd83	journald: seek to time when --since is used Instead of reading the full journal which can be expensive we can seek based on the time. If you have a journald with many podman events just compare the time `time podman events --since 1s --stream=false` with and without this patch. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2023-01-06 15:19:22 +01:00
Matej Vasek	04ea8eadea	fix: event read from syslog When labes map is too big we may get syslog entry truncated. This breaks JSON parsing making event loading impossible. [NO NEW TESTS NEEDED] Signed-off-by: Matej Vasek <mvasek@redhat.com>	2022-12-08 13:08:02 +01:00
Valentin Rothberg	71f92d263c	container create: add inspect data to event When the new `events_container_create_inspect_data` option is enabled in containers.conf set the `ContainersInspectData` event field for each container-create event. The data was requested for the purpose of auditing (e.g., intrusion detection). Jira: https://issues.redhat.com/browse/RUN-1702 Signed-off-by: Valentin Rothberg <vrothberg@redhat.com>	2022-11-29 10:40:24 +01:00
Matthew Heon	6ee582315b	Events for containers in pods now include the pod's ID This allows tools like Cockpit to know that the pod in question has also been updated, so they can refresh the list of containers in the pod. Fixes #15408 Signed-off-by: Matthew Heon <matthew.heon@pm.me>	2022-09-22 14:18:56 -04:00
Paul Holzinger	76980a2226	event backend journald: fix problem with empty journal Currently podman events will just fail with `Error: failed to get journal cursor: failed to get cursor: cannot assign requested address` when the journal contains zero podman events. The problem is that we are using the journal accessors wrong. There is no need to call GetCursor() and compare them manually. The Next() return an integer which tells if it moved to the next or not. This means the we can remove GetCursor() which would fail when there is no entry. This also includes another bug fix. Previously the logic called Next() twice for the first entry which caused us to miss the first entry. To reproduce this issue you can run the following commands: ``` sudo journalctl --rotate sudo journalctl --vacuum-time=1s ``` Note that this will delete the full journal. Now run podman events and it fails but with this patch it works. Now generate a single event, i.e. podman pull alpine, and run podman events --until 1s. I am not sure how to get a reliable test into CI, I really do not want to delete the journal and developer or CI systems. Fixes second part of #15688 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2022-09-12 18:05:17 +02:00
Paul Holzinger	4e72aa5860	fix goroutine leaks in events and logs backend When running a single podman logs this is not really important since we will exit when we finish reading the logs. However for the system service this is very important. Leaking goroutines will cause an increased memory and CPU ussage over time. Both the the event and log backend have goroutine leaks with both the file and journald drivers. The journald backend has the problem that journal.Wait(IndefiniteWait) will block until we get a new journald event. So when a client closes the connection the goroutine would still wait until there is a new journal entry. To fix this we just wait for a maximum of 5 seconds, after that we can check if the client connection was closed and exit correctly in this case. For the file backend we can fix this by waiting for either the log line or context cancel at the same time. Currently it would block waiting for new log lines and only check afterwards if the client closed the connection and thus hang forever if there are no new log lines. [NO NEW TESTS NEEDED] I am open to ideas how we can test memory leaks in CI. To test manually run a container like this: `podman run --log-driver $driver --name test -d alpine sh -c 'i=1; while [ "$i" -ne 1000 ]; do echo "line $i"; i=$((i + 1)); done; sleep inf'` where `$driver` can be either `journald` or `k8s-file`. Then start the podman system service and use: `curl -m 1 --output - --unix-socket $XDG_RUNTIME_DIR/podman/podman.sock -v 'http://d/containers/test/logs?follow=1&since=0&stderr=1&stdout=1' &>/dev/null` to get the logs from the API and then it closes the connection after 1 second. Now run the curl command several times and check the memory usage of the service. Fixes #14879 Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2022-07-20 12:55:34 +02:00
Sascha Grunert	251d91699d	libpod: switch to golang native error wrapping We now use the golang error wrapping format specifier `%w` instead of the deprecated github.com/pkg/errors package. [NO NEW TESTS NEEDED] Signed-off-by: Sascha Grunert <sgrunert@redhat.com>	2022-07-05 16:06:32 +02:00
openshift-ci[bot]	278afae1de	Merge pull request #14705 from jakecorrenti/show-health-status-event Show Health Status events	2022-06-27 17:49:27 +00:00
Jake Correnti	0c1a3b70f5	Show Health Status events Previously, health status events were not being generated at all. Both the API and `podman events` will generate health_status events. ``` {"status":"health_status","id":"ae498ac3aa6c63db8b69a37583a6eae1a9cefbdbdbeeadcf8e1d66d745f0df63","from":"localhost/healthcheck-demo:latest","Type":"container","Action":"health_status","Actor":{"ID":"ae498ac3aa6c63db8b69a37583a6eae1a9cefbdbdbeeadcf8e1d66d745f0df63","Attributes":{"containerExitCode":"0","image":"localhost/healthcheck-demo:latest","io.buildah.version":"1.26.1","maintainer":"NGINX Docker Maintainers \u003cdocker-maint@nginx.com\u003e","name":"healthcheck-demo"}},"scope":"local","time":1656082205,"timeNano":1656082205882271276,"HealthStatus":"healthy"} ``` ``` 2022-06-24 11:06:04.886238493 -0400 EDT container health_status ae498ac3aa6c63db8b69a37583a6eae1a9cefbdbdbeeadcf8e1d66d745f0df63 (image=localhost/healthcheck-demo:latest, name=healthcheck-demo, health_status=healthy, io.buildah.version=1.26.1, maintainer=NGINX Docker Maintainers <docker-maint@nginx.com>) ``` Signed-off-by: Jake Correnti <jcorrenti13@gmail.com>	2022-06-27 10:44:53 -04:00
Paul Holzinger	90a669594a	fix CI: golangci-lint is broken on main The merge of both 528739cef3d2 and `1b62e45438` at the same time created a lint error on main. Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2022-06-15 14:57:10 +02:00
Paul Holzinger	1b62e45438	golangci-lint: add systemd build tag Lint the systemd code and fix the reported problems. The remoteclient tag is no longer used so I just removed it. [NO NEW TESTS NEEDED] Signed-off-by: Paul Holzinger <pholzing@redhat.com>	2022-06-14 15:28:45 +02:00
Valentin Rothberg	ea08765f40	go fmt: use go 1.18 conditional-build syntax Signed-off-by: Valentin Rothberg <vrothberg@redhat.com>	2022-03-18 09:11:53 +01:00
Valentin Rothberg	bd09b7aa79	bump go module to version 4 Automated for .go files via gomove [1]: `gomove github.com/containers/podman/v3 github.com/containers/podman/v4` Remaining files via vgrep [2]: `vgrep github.com/containers/podman/v3` [1] https://github.com/KSubedi/gomove [2] https://github.com/vrothberg/vgrep Signed-off-by: Valentin Rothberg <rothberg@redhat.com>	2022-01-18 12:47:07 +01:00
Daniel J Walsh	1c4e6d8624	standardize logrus messages to upper case Remove ERROR: Error stutter from logrus messages also. [ NO TESTS NEEDED] This is just code cleanup. Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2021-09-22 15:29:34 -04:00
Daniel J Walsh	4216f7b7f4	Add no-trunc support to podman-events Standardize on no-trunc through the code. Alias notruncate where necessary. Standardize on the man page display of no-trunc. Fixes: https://github.com/containers/podman/issues/8941 Signed-off-by: Daniel J Walsh <dwalsh@redhat.com>	2021-09-16 09:41:29 -04:00
cdoern	d06d285e66	logFile until flag issue we were adding a negative duration in podman events, causing inputs like -5s to be correct and 5s to be incorrect. fixes #11158 Signed-off-by: cdoern <cdoern@redhat.com>	2021-08-23 12:51:56 -04:00
Valentin Rothberg	37f39eefee	events: support disjunctive filters While different filters are applied in conjunction, the same filter (but with different values) should be applied in disjunction. This allows, for instance, to query the events of two containers. Fixes: #10507 Signed-off-by: Valentin Rothberg <rothberg@redhat.com>	2021-06-02 13:37:56 +02:00
Valentin Rothberg	5dded6fae7	bump go module to v3 We missed bumping the go module, so let's do it now :) * Automated go code with github.com/sirkon/go-imports-rename * Manually via `vgrep podman/v2` the rest Signed-off-by: Valentin Rothberg <rothberg@redhat.com>	2021-02-22 09:03:51 +01:00
baude	002d0d6ee6	close journald when reading when reading from journald, we need to close the journal handler for events and logging. Fixes: #8864 Signed-off-by: baude <bbaude@redhat.com>	2021-01-04 13:27:38 -06:00
Paul Holzinger	74fcd9fef3	podman events allow future time for --until The podman events aren't read until the given timestamp if the timestamp is in the future. It just reads all events until now and exits afterwards. This does not make sense and does not match docker. The correct behavior is to read all events until the given time is reached. This fixes a bug where the wrong event log file path was used when running first time with a new storage location. Fixes #8694 This also fixes the events api endpoint which only exited when an error occurred. Otherwise it just hung after reading all events. Signed-off-by: Paul Holzinger <paul.holzinger@web.de>	2020-12-11 23:15:09 +01:00
baude	a3e0b7d117	add network connect\|disconnect compat endpoints this enables the ability to connect and disconnect a container from a given network. it is only for the compatibility layer. some code had to be refactored to avoid circular imports. additionally, tests are being deferred temporarily due to some incompatibility/bug in either docker-py or our stack. Signed-off-by: baude <bbaude@redhat.com>	2020-11-19 08:16:19 -06:00
Paul Holzinger	692559c8c5	Improve the journal event reading Fix the AddMatch/SeekTail conflict. This prevents reading unnecessary journal entries which could cause errors. Also wrap the sdjournal errors to provide better error messages. Fixes #8125 Signed-off-by: Paul Holzinger <paul.holzinger@web.de>	2020-10-27 14:10:55 +01:00
baude	c593e49701	filter events by labels adding the ability to filter evens by the container labels. this requires that container labels be added to the events data being recorded and subsequently read. Signed-off-by: baude <bbaude@redhat.com>	2020-10-23 12:07:34 -05:00
Sascha Grunert	fef3e2da6a	Remove some unnecessary []byte to string conversions Some calls to `Sprintf("%s")` can be avoided by using direct string type assertions. Signed-off-by: Sascha Grunert <sgrunert@suse.com>	2020-08-03 09:13:04 +02:00
Valentin Rothberg	d856e87f40	events endpoint: fix panic and race condition Fix a potential panic in the events endpoint when parsing the filters parameter. Values of the filters map might be empty, so we need to account for that instead of uncondtitionally accessing the first item. Also apply a similar for race conditions as done in commit f4a2d25c0fca: Fix a race that could cause read errors to be masked. Masking such errors is likely to report red herrings since users don't see that reading failed for some reasons but that a given event could not be found. Another race was the handler closing event channel, which could lead to two kinds of panics: double close, send to close channel. The backend takes care of that. However, make sure that the backend stops working in case the context has been cancelled. Fixes: #6899 Signed-off-by: Valentin Rothberg <rothberg@redhat.com>	2020-07-20 11:37:05 +02:00
Matthew Heon	9e4cf6ca51	Fix `system service` panic from early hangup in events We weren't actually halting the goroutine that sent events, so it would continue sending even when the channel closed (the most notable cause being early hangup - e.g. Control-c on a curl session). Use a context to cancel the events goroutine and stop sending events. Fixes #6805 Signed-off-by: Matthew Heon <matthew.heon@pm.me>	2020-07-02 09:57:39 -04:00
Valentin Rothberg	450361fc64	update systemd & dbus dependencies Update the outdated systemd and dbus dependencies which are now provided as go modules. This will further tighten our dependencies and releases and pave the way for the upcoming auto-update feature. Signed-off-by: Valentin Rothberg <rothberg@redhat.com>	2020-03-10 18:34:55 +01:00
Giuseppe Scrivano	276d68c8f5	events: make sure the write channel is always closed in case of errors, the channel is not closed, blocking the reader indefinitely. Closes: https://bugzilla.redhat.com/show_bug.cgi?id=1767663 Signed-off-by: Giuseppe Scrivano <gscrivan@redhat.com>	2019-11-06 17:14:44 +01:00
baude	bbfc80c579	performance fix for podman events with large journalds in the case where the host has a large journald, iterating the journal without using a Match is very poor performance. this might be a temporary fix while we figure out why the systemd library does not seem to behave properly. Signed-off-by: baude <bbaude@redhat.com>	2019-08-14 09:30:12 -05:00
Matthew Heon	62e8c328d8	Small optimization - only store exit code when nonzero JSON optimizes it out in that case anyways, so don't waste cycles doing an Itoa (and Atoi on the decode side). Signed-off-by: Matthew Heon <matthew.heon@pm.me>	2019-08-12 19:05:44 -04:00
Matthew Heon	d3a433181d	Fix container exit code with Journald backend We weren't actually storing this, so we'd lose the exit code for containers run with --rm or force-removed while running if the journald backend for events was in use. Fixes #3795 Signed-off-by: Matthew Heon <matthew.heon@pm.me>	2019-08-12 19:05:44 -04:00
baude	62a04f2b03	systemd library conflict with seektail and addmatch it looks like the core-os systemd library has some issue when using seektail and add match. this patch works around that shortcoming for the time being. Fixes: #3616 Signed-off-by: baude <bbaude@redhat.com>	2019-08-06 14:58:26 -05:00
baude	63eef5a234	add eventlogger to info to help with future debugging, we now display the type of event logger being used inside podman info -> host. Signed-off-by: baude <bbaude@redhat.com>	2019-08-02 20:05:27 -05:00
baude	d252100c94	get last container event an internal change in libpod will soon required the ability to lookup the last container event using the continer name or id and the type of event. this pr is in preperation for that need. Signed-off-by: baude <bbaude@redhat.com>	2019-07-07 08:54:20 -05:00
baude	7a58c6601b	set default event logger based on build tags once the default event logger was removed from libpod.conf, we need to set the default based on whether the systemd build tag is used or not. Signed-off-by: baude <bbaude@redhat.com>	2019-05-14 16:17:11 -05:00
Sascha Grunert	d1fc3fc702	Add `systemd` build tag If the systemd development files are not present on the system which builds podman, then `podman events` will error on runtime creation. Beside this, a warning will be printed when compiling podman. This commit mainly exists because projects which depend on libpod would not need the podman event support and therefore do not need to rely on the systemd headers. Signed-off-by: Sascha Grunert <sgrunert@suse.com>	2019-05-13 14:00:27 +02:00
Matthew Heon	e05af78329	Do not hard fail on non-decodable events Signed-off-by: Matthew Heon <mheon@redhat.com>	2019-04-26 10:50:57 -04:00

1 2

51 Commits