Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Probability of Unsuccessful Repair #17

Open
christianadriano opened this issue Jun 13, 2020 · 1 comment
Open

Probability of Unsuccessful Repair #17

christianadriano opened this issue Jun 13, 2020 · 1 comment
Assignees
Labels
question Further information is requested

Comments

@christianadriano
Copy link
Member

christianadriano commented Jun 13, 2020

Hi @MrBanhBao and @2start, could you please give me some feedback on the ideas below? This is only a reflection, it is not going to become a feature to implement.

  1. What is the probability that fixing this component will be succeful?
    Success of fixing a component depends on hidden information, the dependencies that this component has on other components which might be failing.

  2. If a component is failing, what is the probability that the failure is in the dependent components instead?
    If one component fails only because of other components, then we do not need to waste effort repairing the failing component. This probability can be used to make a decision of whether it is worth repairing something that might not be broken. If the probability is low, then it is better to repair the component.

2.1 What do we want to know?
Given an observation (component 1 and 2 failing), what is the probability that fixing component 2 will automatically fix component 1?

2.2 Why we want to know this?
Because we can make less expensive and faster repair decisions by repairing only the root-causes of the failure.

  1. How to learn these probabilities?

3.1 Model - We can use a hidden Markov Model to express the states of a component and how these states depend on the states of other components. These states are hidden, but we can learn their transitions by observing how the components fail to be fixed or the situations in which they succeed to be fixed. The transitions between hidden states would consist of the probability of a failure propagating from one component to another.

3.3 Usage - We can use these probabilities to compute expected utilities (probability of being an isolated failure in the current component x the utility_increase for that component).

@christianadriano christianadriano added the question Further information is requested label Jun 13, 2020
@2start
Copy link
Collaborator

2start commented Jun 19, 2020

I think that modeling dependent failures inside of the environment would certainly enhance the similarity to real world environments and can be learned by the agent. Regarding concrete probabilities: I don't have a clue how to choose them without looking at real world data and I don't know whether such information is available.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants