A redundant computer system consist of identical two-compute nodes, which each fail with an intensity of λ {\displaystyle \lambda } . When failed, they are repaired one at the time by a single repairman with negative exponential distributed repair times with expectation μ − 1 {\displaystyle \mu ^{-1}} .
Intensities from state 0 and state 1 are 2 λ {\displaystyle 2\lambda } , since each compute node has a failure intensity of λ {\displaystyle \lambda } . Intensity from state 1 to state 2 is λ {\displaystyle \lambda } . Transitions from state 2 to state 1 and state 1 to state 0 represent the repairs of the compute nodes and have the intensity μ {\displaystyle \mu } , since only a single unit is repaired at the time.
The asymptotic availability, i.e. availability over a long period, of the system is equal to the probability that the model is in state 1 or state 2.
This is calculated by making a set of linear equations of the state transition and solving the linear system.
The matrix is constructed with a row for each state. In a row, the intensity into the state is set in the column with the same index, with a negative term.
The identities cells balance the sum of their column to 0:
In addition the equality clause must be taken into account:
By solving this equation, the probability of being in state 1 or state 2 can be found, which is equal to the long-term availability of the service.
The reliability of the system is found by making the failure states absorbing, i.e. removing all outgoing state transitions.
For this system the function is:
Finite state models of systems are subject to state explosion. To create a realistic model of a system one ends up with a model with so many states that it is infeasible to solve or draw the model.
Bjarne E. Helvik (2007). Dependable Computing Systems and Communication Networks. Gnist Tapir. ↩