Add flexibility to collectors & analysers to allow running external programs #1078

banjoh · 2023-03-24T18:35:33Z

Describe the rationale for the suggested feature.

To make the support-bundle & preflight binaries more extensible, it would be good to be able to run other programs available on a system without relying only on stdout/stderr/exit codes. Some example use cases that come to mind are:

Allowing cluster administrators who have home-built tools that collect and analyse data from hosts or k8s clusters to embed these tools without needing to rewrite them as specs to fit troubleshoot's current model.
Allowing embedding in automation pipelines, e.g. CI, which come with a plethora of tools.

Describe the feature

The "how" part is still open, but here is a suggestion that has been touched on in a community meeting (notes can be found here) and other various discussions.

Extend the run collector

Here is my fictional collector that collects audit event logs, enriches them with user data and stores the output in $WORKSPACE_OUTPUT. $WORKSPACE_OUTPUT is a unique directory created by the framework for this collector instance. Its contents are then copied over to the bundle once the collector finishes executing.

apiVersion: troubleshoot.sh/v1beta2
kind: SupportBundle
metadata:
  name: run
spec:
  hostCollectors:
    - run:
        collectorName: "enriched-audit-logs"
        command: "python3"
        args: ["--timeout", "10m", "--output-dir", "$WORKSPACE_OUTPUT"]
        # Arbitrary parameters which get stored on disk as YAML/JSON and passed
        # on to the command via a $CONFIG env variable
        config:   # perhaps call it "params"?
          username: postgres
          password: <my-pass>
          dbHost: <hostname>
          map:
            key: value
          list:
            - val1
            - val2
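
To illustrate the plugin side of this proposal, here is a minimal sketch of what the script launched by the collector might do. It assumes that $CONFIG holds the path to the parameters file, that the file is written as JSON, and that $WORKSPACE_OUTPUT already points at the per-collector directory. None of this is settled in the proposal, and `summary.json` and the function names are purely illustrative.

```python
import json
import os
from pathlib import Path


def load_config(env=os.environ):
    """Read the parameters file whose path is in $CONFIG (assumed to be JSON)."""
    config_path = env.get("CONFIG")
    if not config_path:
        return {}
    with open(config_path, encoding="utf-8") as fh:
        return json.load(fh)


def write_output(name, content, env=os.environ):
    """Write a file into $WORKSPACE_OUTPUT so the framework can copy it to the bundle."""
    out_dir = Path(env["WORKSPACE_OUTPUT"])
    out_dir.mkdir(parents=True, exist_ok=True)
    target = out_dir / name
    target.write_text(content, encoding="utf-8")
    return target


def main(env=os.environ):
    config = load_config(env)
    # Hypothetical payload: a real tool would query and enrich audit logs here.
    # The password is deliberately left out of anything written to the bundle.
    summary = {"dbHost": config.get("dbHost"), "username": config.get("username")}
    return write_output("summary.json", json.dumps(summary, indent=2), env)


if __name__ == "__main__":
    main()
```

Passing the environment in as a parameter keeps the script easy to exercise outside the framework, which matters for home-built tools that already run standalone.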

Create a similar analyser which takes in arbitrary parameters to operate on

Open questions

What would this look like in-cluster? Extending the run pod collector?
Is this the only way to allow extensibility?

Inspirations

https://k9scli.io/topics/plugins/
https://docs.github.com/en/actions/using-workflows/workflow-commands-for-github-actions#setting-an-environment-variable - GH action runners pass environment variables to jobs for them to store information that persists across jobs.


mhrabovcin · 2023-03-27T13:12:29Z

The plugin interface should define a standard way of retrieving the kubeconfig. support-bundle supports reading the kubeconfig from the KUBECONFIG env variable or from a CLI argument. If it's passed as an argument, the plugin would have no way of retrieving the kubeconfig path. Maybe the interface should guarantee that KUBECONFIG is populated?
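
One way to picture that guarantee is a small resolution step the framework runs before launching any plugin. This is only a sketch: the precedence (CLI argument, then the KUBECONFIG variable, then the conventional ~/.kube/config default) and the function names are assumptions, not how support-bundle is known to behave.

```python
import os
from pathlib import Path


def resolve_kubeconfig(cli_value=None, env=os.environ):
    """Pick a kubeconfig path: CLI argument, then $KUBECONFIG, then ~/.kube/config."""
    if cli_value:
        return cli_value
    if env.get("KUBECONFIG"):
        return env["KUBECONFIG"]
    return str(Path.home() / ".kube" / "config")


def plugin_env(cli_value=None, env=os.environ):
    """Return a copy of the environment in which KUBECONFIG is always set."""
    child_env = dict(env)
    child_env["KUBECONFIG"] = resolve_kubeconfig(cli_value, env)
    return child_env
```

With this in place a plugin only ever needs to read KUBECONFIG, regardless of how the user supplied the path.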

banjoh · 2023-03-27T16:44:03Z

The plugin interface should define standard way of retrieving kubeconfig.

Very correct. KUBECONFIG and a few other variables will be part of the constants passed to plugins. Here's a non-exhaustive list:

KUBECONFIG - how to connect to the cluster
Config parameters file, e.g. PLUGIN_CONFIG. We might want to provide a parameter (json|yaml) to define the file format on disk
WORKSPACE - a place the plugin can run in. It would be the plugin's $CWD on launch. WORKSPACE/output can then contain all collected files. TBD
....

Inspiration from helm: https://helm.sh/docs/topics/plugins/#environment-variables
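
To make the list above concrete, here is a rough sketch of how a launcher could wire those variables up. It assumes the config is serialised as JSON to a file named `config.json` inside the workspace and that outputs land in WORKSPACE/output. Both are still TBD in this thread, and `run_plugin` is a hypothetical helper, not part of troubleshoot.

```python
import json
import os
import subprocess
import tempfile
from pathlib import Path


def run_plugin(command, args, config, kubeconfig):
    """Launch a plugin with KUBECONFIG, PLUGIN_CONFIG and WORKSPACE set."""
    # A unique workspace per plugin run; it is also the plugin's working directory.
    workspace = Path(tempfile.mkdtemp(prefix="plugin-"))
    output_dir = workspace / "output"
    output_dir.mkdir()

    # Arbitrary parameters are stored on disk and referenced by path.
    config_file = workspace / "config.json"
    config_file.write_text(json.dumps(config), encoding="utf-8")

    env = dict(os.environ)
    env.update(
        {
            "KUBECONFIG": kubeconfig,
            "PLUGIN_CONFIG": str(config_file),
            "WORKSPACE": str(workspace),
        }
    )

    result = subprocess.run(
        [command, *args],
        cwd=workspace,
        env=env,
        capture_output=True,
        text=True,
        check=False,
    )

    # Everything under WORKSPACE/output would be copied into the bundle.
    collected = sorted(p.name for p in output_dir.iterdir())
    return result.returncode, collected
```

Returning the exit code alongside the collected file names keeps stdout/stderr/exit codes available while no longer making them the only channel.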

banjoh · 2023-11-16T13:19:33Z

Addressed by #1376

banjoh · 2023-11-17T16:45:07Z

Reopening since we might want to put in some thought on the analyser side of things. The ability to inject custom analyser logic without needing to write new analysers is worth considering.

banjoh changed the title ~~Add flexibility of collectors & analysers to allow running external programs~~ Add flexibility to collectors & analysers to allow running external programs Jun 21, 2023

cwyl02 mentioned this issue Aug 2, 2023

feat: save host run file output #1288

Closed

6 tasks

cwyl02 mentioned this issue Oct 19, 2023

feat: save host run file output #1376

Merged

6 tasks

banjoh closed this as completed Nov 16, 2023

banjoh reopened this Nov 17, 2023
