question-mark
Stuck on an issue?

Lightrun Answers was designed to reduce the constant googling that comes with debugging 3rd party libraries. It collects links to all the places you might be looking at while hunting down a tough bug.

And, if you’re still stuck at the end, we’re happy to hop on a call to see how we can help out.

Failing action should fail experiment

See original GitHub issue

I have set up the following experiment:

{
    "version": "1.0.0",
    "title": "Memory leak",
    "description": "We shold be able to have a memoryleak in a pod",
    "tags": [
        "basecase"
    ],
    "steady-state-hypothesis": {
        "title": "All deployments are OK",
        "probes": [
            {
                "type": "probe",
                "tolerance": true,
                "provider": {
                  "type": "python",
                  "module": "chaosk8s.probes",
                  "func": "all_microservices_healthy"
                },
                "name": "all-microservices-healthy"
            }
        ]
    },
    "method": [
        {
            "type": "action",
            "provider": {
              "type": "python",
              "module": "chaosk8s.actions",
              "arguments": {
                "spec_path": "./memleak-deployment.yaml"
              },
              "func": "start_microservice"
            },
            "name": "start-microservice"
        }
    ],
    "rollbacks": [
        {
            "type": "action",
            "provider": {
              "type": "python",
              "module": "chaosk8s.actions",
              "arguments": {
                "name": "memory-leak"
              },
              "func": "kill_microservice"
            },
            "name": "kill-microservice"
          }
    ]
}```
For some reason reason, I get a failure in the "start_microservice" action (but that is a separate issue). The problem is that the experiment does not fail:

![image](https://user-images.githubusercontent.com/1544931/60712491-71927a80-9f17-11e9-95c5-0ea87b73cd94.png)


I expect it to fail, and it is important that it does.

Issue Analytics

  • State:closed
  • Created 4 years ago
  • Comments:5 (2 by maintainers)

github_iconTop GitHub Comments

1reaction
Lawouachcommented, Jul 6, 2019

I think there is some truth in that last statement. The trick is to define “failing condition” though. Is it failing because the system is broken? Is it failing because the action doesn’t work as it should (like buggy input…)?

How to evaluate the action is working in the expected boundaries from when it actually misbehaves?

I find that discussion quite exciting to be honest and it fits some larger discussion around taking the whole chaos experiment further via the Open Chaos project (https://openchaos.io/).

I’ll cc @russmiles on this one.

0reactions
github-actions[bot]commented, Aug 22, 2021

This Issue was closed because it was not reactivated after 7 days of being marked Stale.

Read more comments on GitHub >

github_iconTop Results From Across the Web

Learning: Experiments vs. Failure - LinkedIn
While there are a lot of advocates for “failing fast”, failure is something to be avoided if possible. Experiments and failures are both ......
Read more >
10 Reasons You Can't Be Afraid to Experiment & Fail with ...
Failure isn't always a bad thing. Here are 10 reasons why you shouldn't be afraid to experiment and fail with the content you...
Read more >
Strategies for Learning from Failure - Harvard Business Review
To understand why, look at the exhibit “A Spectrum of Reasons for Failure,” which lists causes ranging from deliberate deviation to thoughtful experimentation....
Read more >
The Most Famous Failed Experiment - YouTube
Use code ACTIONLAB to get $50 off your Kamikoto knives! https://kamikoto.com/actionlabI talk about the Michelson-Morley experiment and then ...
Read more >
Why the Freedom to Experiment and Fail Is Important for Every ...
Give yourself permission to fail. Failure avoidance will only lead to inaction. The hard truth though is that, acceptance of failure leads to...
Read more >

github_iconTop Related Medium Post

No results found

github_iconTop Related StackOverflow Question

No results found

github_iconTroubleshoot Live Code

Lightrun enables developers to add logs, metrics and snapshots to live code - no restarts or redeploys required.
Start Free

github_iconTop Related Reddit Thread

No results found

github_iconTop Related Hackernoon Post

No results found

github_iconTop Related Tweet

No results found

github_iconTop Related Dev.to Post

No results found

github_iconTop Related Hashnode Post

No results found