summaryrefslogtreecommitdiffstats
path: root/roles/openshift_health_checker
Commit message (Collapse)AuthorAgeFilesLines
* Merge pull request #5365 from sosiouxme/20170908-disconnected-image-checkOpenShift Bot2017-09-1212-144/+184
|\ | | | | Merged by openshift-bot
| * openshift_health_check: allow disabling all checksLuke Meyer2017-09-122-3/+8
| | | | | | | | | | | | | | | | Can now set openshift_disable_check=* to disable all checks without needing to know their names. fixes bug 1462106 https://bugzilla.redhat.com/show_bug.cgi?id=1462106
| * docker_image_availability: fix local image searchLuke Meyer2017-09-121-5/+9
| | | | | | | | | | | | An image in the docker index may be tagged by name or by registry plus name. In order to find the image correctly locally and prevent looking for it externally, make sure all possible variations are searched.
| * docker_image_availability: probe registry connectivityLuke Meyer2017-09-122-122/+132
| | | | | | | | | | | | | | | | | | | | | | | | Probe whether the host has connectivity to the registry before trying to inspect it for images, and remember the result. Also if later inspection fails due to timeout, mark registry as unreachable. Note in failure output if any registries were unreachable. Registry order should match what is configured into docker now as well. Fixes bug 1480195 https://bugzilla.redhat.com/show_bug.cgi?id=1480195
| * openshift_checks: add retries in pythonLuke Meyer2017-09-1210-22/+43
| |
* | Skip failure dedup instead of crashingRodolfo Carvalho2017-09-112-2/+29
|/ | | | | | This makes the callback plugin behave better when dedup is not possible: work with the original list of failures instead of raising an unhandled exception and producing confusing output for users.
* Merge pull request #5296 from nak3/skeopeo-command-outputOpenShift Bot2017-09-061-4/+6
|\ | | | | Merged by openshift-bot
| * output skopeo image check commandKenjiro Nakayama2017-09-051-4/+6
| |
* | openshift_checks aos_version: also check installed under yumLuke Meyer2017-09-063-17/+21
| | | | | | | | | | | | | | | | | | Tweaks to the logic around using yum vs dnf; now uses ansible_pkg_mgr to determine which is in effect for a host. Also, extended the yum logic to check installed packages in addition to available packages in the aos_version module so that disconnected installs and others with weird repo configs need not disable the package_version check.
* | Import dnf only if importing yum failsJakub Hadvig2017-09-051-6/+12
|/
* Merge pull request #5035 from ↵Rodolfo Carvalho2017-08-311-1/+1
|\ | | | | | | | | Miciah/openshift_checks-ignore-hidden-files-in-checks-directory openshift_checks: ignore hidden files in checks dir
| * openshift_checks: ignore hidden files in checks dirMiciah Masters2017-08-081-1/+1
| | | | | | | | load_checks: Ignore hidden files when scanning the directory for checks.
* | Merge pull request #5271 from sosiouxme/20170830-disk-avail-bugRodolfo Carvalho2017-08-311-4/+1
|\ \ | | | | | | disk_availability: fix bug where msg is overwritten
| * | disk_availability: fix bug where msg is overwrittenLuke Meyer2017-08-301-4/+1
| | |
* | | Merge pull request #5228 from sosiouxme/20170825-timeout-skopeoRodolfo Carvalho2017-08-301-1/+4
|\ \ \ | | | | | | | | docker_image_availability: timeout skopeo inspect
| * | | docker_image_availability: timeout skopeo inspectLuke Meyer2017-08-281-1/+4
| | | | | | | | | | | | | | | | | | | | Set a 10 second timeout when using skopeo to inspect remote registries, so that it does not wait for a tcp timeout to fail if they are unreachable.
* | | | Update error message: s/non-unique/duplicateRodolfo Carvalho2017-08-242-6/+4
| | | |
* | | | Make pylint disables more specificRodolfo Carvalho2017-08-241-15/+26
| | | | | | | | | | | | | | | | And beautify the code a bit.
* | | | Handle exceptions in failure summary cb pluginRodolfo Carvalho2017-08-241-2/+11
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This serves two purposes: - Gracefully omit the summary if there was an error computing it, no confusion to the regular end user. - Provide a stacktrace of the error when running verbose, giving developers or users reporting bugs a better insight of what went wrong, as opposed to Ansible's opaque handling of errors in callback plugins.
* | | | Rewrite failure summary callback pluginRodolfo Carvalho2017-08-243-119/+243
| | | | | | | | | | | | | | | | | | | | The intent is to deduplicate similar errors that happened in many hosts, making the summary more concise.
* | | | Handle more exceptions when running checksRodolfo Carvalho2017-08-241-19/+30
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This prevents an exception in one check from interfering with other checks. Skips checks that raise an exception in their is_active method. Whenever capturing a broad exception in the `is_action` or `run` methods, include traceback information that can be useful in bug reports.
* | | | List known checks/tags when check name is invalidRodolfo Carvalho2017-08-242-16/+22
| | | |
* | | | List existing health checks when none is requestedRodolfo Carvalho2017-08-242-8/+43
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | This is a simple mechanism to learn what health checks are available. Note that we defer task_vars verification, so that we can compute requested_checks and resolved_checks earlier, allowing us to list checks even if openshift_facts has not run.
* | | | Add playbook for running arbitrary health checksRodolfo Carvalho2017-08-241-1/+1
|/ / / | | | | | | | | | | | | | | | | | | | | | This is useful on its own, and also aids in developing/testing new checks that are not part of any playbook. Since the intent when running this playbook is to execute checks, opt for a less verbose explanation on the error summary.
* | | Merge pull request #5101 from maxamillion/add-dnf-supportScott Dodson2017-08-231-17/+49
|\ \ \ | |/ / |/| | Add dnf support
| * | remove out of scope variable from exception messageAdam Miller2017-08-181-1/+0
| | | | | | | | | | | | Signed-off-by: Adam Miller <maxamillion@fedoraproject.org>
| * | raise AosVersionException if no expected packages found by dnf queryAdam Miller2017-08-181-0/+8
| | | | | | | | | | | | Signed-off-by: Adam Miller <maxamillion@fedoraproject.org>
| * | add dnf support to roles/openshift_health_checker/library/aos_version.pyAdam Miller2017-08-161-17/+42
| | | | | | | | | | | | Signed-off-by: Adam Miller <maxamillion@fedoraproject.org>
* | | etc_traffic check: factor away short_versionLuke Meyer2017-08-152-6/+4
|/ /
* | Merge pull request #5036 from ↵Scott Dodson2017-08-154-11/+17
|\ \ | | | | | | | | | | | | Miciah/openshift_checks-support-ovs-2.7-on-ocp-3.5-and-3.6 openshift_checks: allow OVS 2.7 on OCP 3.5 and 3.6
| * | openshift_checks: allow OVS 2.7 on OCP 3.5 and 3.6Miciah Masters2017-08-114-11/+17
| |/ | | | | | | | | | | | | | | | | rpm_version: Allow package_list items to specify a list value for version. If a list value is provided for a package, pass the check if any version in that list is found. ovs_version: Specify both 2.6 and 2.7 as allowed versions of OVS for OpenShift versions 3.5 and 3.6.
* | Merge pull request #4770 from kwoodson/openshift_repos_refactorScott Dodson2017-08-151-2/+1
|\ \ | | | | | | Removing openshift_repo dependencies. Moving to beginning of openshift_cluster.
| * | Removing dependencies for openshift_repos and setting them up early in the ↵Kenny Woodson2017-08-141-2/+1
| | | | | | | | | | | | cluster build.
* | | Merge pull request #4944 from sosiouxme/20170728-refactor-ansible-mountsScott Dodson2017-08-118-88/+73
|\ \ \ | |/ / |/| | openshift_checks: refactor find_ansible_mount
| * | openshift_checks: refactor find_ansible_mountLuke Meyer2017-08-088-88/+73
| |/ | | | | | | Reuse the code for finding the ansible_mounts mount for a path.
* | Merge pull request #4922 from sosiouxme/20170728-improve-get-varsScott Dodson2017-08-093-7/+73
|\ \ | | | | | | openshift_checks: enable variable conversion
| * | openshift_checks: enable variable conversionLuke Meyer2017-08-013-7/+73
| | |
* | | Merge pull request #4812 from kwoodson/openshift_version_refactorOpenShift Bot2017-08-081-0/+3
|\ \ \ | |_|/ |/| | Merged by openshift-bot
| * | Refactor of openshift_version.Kenny Woodson2017-08-081-0/+3
| | |
* | | Merge pull request #4913 from sosiouxme/20170720-refactor-check-resultsRodolfo Carvalho2017-08-0822-643/+730
|\ \ \ | |/ / |/| | openshift_checks: refactor check results
| * | openshift_checks: refactor logging checksLuke Meyer2017-08-0215-616/+699
| | | | | | | | | | | | | | | | | | | | | Turn failure messages into exceptions that tests can look for without depending on text meant for humans. Turn logging_namespace property into a method. Get rid of _exec_oc and just use logging.exec_oc.
| * | openshift_checks: add property to track 'changed'Luke Meyer2017-08-0212-40/+44
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | Introduced the 'changed' property for checks that can make changes to track whether they did or not. Rather than the check's own logic having to track this and include it in the result hash, just set the property and have the action plugin insert it in the result hash after running (even if there is an exception). Cleared out a lot of crufty "changed: false" hash entries.
* | | Merge pull request #4739 from kwoodson/openshift_facts_refactorScott Dodson2017-08-081-4/+0
|\ \ \ | | | | | | | | Reduce openshift_facts dependencies.
| * | | integration tests: keep openshift_version happyLuke Meyer2017-08-071-4/+0
| | | |
* | | | Merge pull request #4960 from ↵OpenShift Bot2017-08-072-5/+98
|\ \ \ \ | |/ / / |/| | | | | | | | | | | juanvallejo/jvallejo/verify-disk-memory-before-upgrade-no-flake Merged by openshift-bot
| * | | add pre-flight checks to ugrade pathjuanvallejo2017-08-022-5/+98
| | |/ | |/|
* | | Merge pull request #4969 from sosiouxme/20170801-tolerate-ovs-beyond-36OpenShift Bot2017-08-022-90/+80
|\ \ \ | |_|/ |/| | Merged by openshift-bot
| * | package_version check: tolerate release version 3.7Luke Meyer2017-08-022-90/+80
| |/ | | | | | | | | | | | | | | | | | | | | Addresses issue https://github.com/openshift/openshift-ansible/issues/4967 For now, any version >= 3.6 is handled as if it were 3.6. We may want to keep that or fine-tune it later. Also, the ovs_version check is not updated. This is a post-install health check (does not block install/upgrade) with an update already in progress so will be addressed there.
* / add fluentd logging driver config checkjuanvallejo2017-08-0111-109/+563
|/
* Normalize list of checks passed to action pluginRodolfo Carvalho2017-07-271-5/+15
| | | | | | | | | This allows users to pass either a string with comma-separated check names or a proper list of strings, whatever is more convenient in the context. For consistency, the same normalization is applied to the 'openshift_disable_check' variable.