Continuous-time Markov chain
A continuous-time Markov chain (CTMC) is a continuous-time stochastic process in which, for each state, the process holds its current state for an exponentially distributed random time and then moves to a different state as specified by the probabilities of a stochastic matrix. An equivalent formulation describes the process as changing state according to the least value of a set of exponential random variables, one for each possible state it can move to, with the parameters determined by the current state.
An example of a CTMC with three states $\{0, 1, 2\}$ is as follows: the process makes a transition after the amount of time specified by the holding time, an exponential random variable $E_i$, where $i$ is its current state. Each random variable is independent and such that $E_0 \sim \text{Exp}(6)$, $E_1 \sim \text{Exp}(12)$ and $E_2 \sim \text{Exp}(18)$. When a transition is to be made, the process moves according to the jump chain, a discrete-time Markov chain with stochastic matrix:

$$\begin{bmatrix} 0 & \tfrac{1}{2} & \tfrac{1}{2} \\ \tfrac{1}{3} & 0 & \tfrac{2}{3} \\ \tfrac{5}{6} & \tfrac{1}{6} & 0 \end{bmatrix}$$
Equivalently, by the theory of competing exponentials, this CTMC changes state from state $i$ according to the minimum of two random variables $E_{i,j}$, which are independent and such that $E_{i,j} \sim \text{Exp}(q_{i,j})$ for $i \neq j$, where the parameters are given by the Q-matrix

$$Q = \begin{bmatrix} -6 & 3 & 3 \\ 4 & -12 & 8 \\ 15 & 3 & -18 \end{bmatrix}$$
Each non-diagonal entry $q_{i,j}$ can be computed as the product of the original state's holding rate (the parameter of its exponential holding time) with the jump-chain probability of moving from state $i$ to state $j$; for example, $q_{0,1} = 6 \cdot \tfrac{1}{2} = 3$. The diagonal values are chosen so that each row sums to 0.
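To make the construction concrete, the following sketch (assuming Python with NumPy; the variable names are illustrative) assembles the Q-matrix above from the holding rates and the jump-chain probabilities:

```python
import numpy as np

# Holding rates for states 0, 1, 2 (parameters of the exponential holding times)
rates = np.array([6.0, 12.0, 18.0])

# Jump chain: stochastic matrix of the embedded discrete-time chain
jump = np.array([[0,   1/2, 1/2],
                 [1/3, 0,   2/3],
                 [5/6, 1/6, 0  ]])

# Off-diagonal entries: q_ij = rate_i * jump_ij; the diagonal makes each row sum to 0
Q = rates[:, None] * jump
np.fill_diagonal(Q, -rates)

print(Q)                                  # [[-6. 3. 3.] [4. -12. 8.] [15. 3. -18.]]
assert np.allclose(Q.sum(axis=1), 0.0)    # rows of a Q-matrix sum to zero
```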
A CTMC satisfies the Markov property, that its behavior depends only on its current state and not on its past behavior, due to the memorylessness of the exponential distribution and of discrete-time Markov chains.
Definition
A continuous-time Markov chain (Xt)t ≥ 0 is defined by:[1]
- a finite or countable state space S;
- a transition rate matrix Q with dimensions equal to the size of S; and
- an initial state $x_0 \in S$ such that $X_0 = x_0$, or a probability distribution for this first state.
For i ≠ j, the elements $q_{ij}$ are non-negative and describe the rate at which the process transitions from state i to state j. The elements $q_{ii}$ could be chosen to be zero, but for mathematical convenience a common convention is to choose them such that each row of $Q$ sums to zero, that is:

$$q_{ii} = -\sum_{j \neq i} q_{ij}.$$
Note how this differs from the definition of the transition matrix for discrete-time Markov chains, where the row sums are all equal to one.
There are three other definitions of the process, equivalent to the one above.[2]
Transition probability definition
Another common way to define continuous-time Markov chains is to, instead of the transition rate matrix $Q$, use the following:[1]
- $s_i$, for $i \in S$, representing the decay rate (of an exponential distribution) of the time the system stays in state $i$ once it enters it; and
- $p_{ij}$, for $i, j \in S$ with $i \neq j$, representing the probability that the system goes to state $j$, given that it is currently leaving state $i$.

Naturally, $p_{ii}$ must be zero for all $i$.
The values $s_i$ and $p_{ij}$ are closely related to the transition rate matrix $Q$, by the formulas:

$$q_{ij} = \begin{cases} s_i \, p_{ij} & \text{if } i \neq j, \\ -s_i & \text{if } i = j. \end{cases}$$
Consider an ordered sequence of time instants $t_0 < t_1 < \cdots < t_N$ and the states recorded at these times $i_0, i_1, \ldots, i_N$; then it holds that:

$$\Pr(X_{t_N} = i_N \mid X_{t_{N-1}} = i_{N-1}) = p_{i_{N-1} i_N}(t_N - t_{N-1}),$$

where each $p_{ij}(t)$ is the solution of the forward equation (a first-order differential equation):

$$P'(t) = P(t) Q,$$
with initial condition P(0) being the identity matrix.
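As an illustration, the forward equation can be integrated numerically. The following is a minimal sketch (assuming Python with NumPy and SciPy, and reusing the three-state Q-matrix from the introduction); it checks the result against the matrix exponential, which is introduced formally below:

```python
import numpy as np
from scipy.integrate import solve_ivp
from scipy.linalg import expm

Q = np.array([[ -6.0,   3.0,   3.0],
              [  4.0, -12.0,   8.0],
              [ 15.0,   3.0, -18.0]])

def forward(t, p_flat):
    # Kolmogorov forward equation P'(t) = P(t) Q, with P flattened to a vector
    P = p_flat.reshape(3, 3)
    return (P @ Q).ravel()

t_end = 0.5
sol = solve_ivp(forward, (0.0, t_end), np.eye(3).ravel(), rtol=1e-10, atol=1e-12)
P_ode = sol.y[:, -1].reshape(3, 3)

# The closed-form solution of the forward equation is P(t) = exp(tQ)
assert np.allclose(P_ode, expm(t_end * Q), atol=1e-6)
print(P_ode)
```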
Infinitesimal definition
Let $X_t$ be the random variable describing the state of the process at time $t$, and assume the process is in a state $i$ at time $t$. By definition of the continuous-time Markov chain, $X_{t+h}$ is independent of values prior to instant $t$; that is, it is independent of $X_s$ for $s < t$. With that in mind, for all $j$, for all $t$ and for small values of $h > 0$, the following holds:
$$\Pr(X_{t+h} = j \mid X_t = i) = \delta_{ij} + q_{ij} h + o(h),$$

where $\delta_{ij}$ is the Kronecker delta and the little-o notation has been employed.
The above equation shows that $q_{ij}$ can be seen as measuring how quickly the transition from $i$ to $j$ happens for $i \neq j$, and how quickly the transition away from $i$ happens for $i = j$.
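This infinitesimal description can be checked numerically. A small sketch (again assuming Python with NumPy/SciPy and the same illustrative Q-matrix) compares the exact transition probabilities over a short interval h with the first-order approximation $\delta_{ij} + q_{ij}h$:

```python
import numpy as np
from scipy.linalg import expm

Q = np.array([[ -6.0,   3.0,   3.0],
              [  4.0, -12.0,   8.0],
              [ 15.0,   3.0, -18.0]])

h = 1e-5
P_h = expm(h * Q)                  # exact transition probabilities over time h
approx = np.eye(3) + h * Q         # first-order (infinitesimal) approximation

# The difference shrinks faster than h, consistent with the o(h) term
print(np.abs(P_h - approx).max())  # on the order of h**2
```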
Jump chain/holding time definition
Define a discrete-time Markov chain $Y_n$ to describe the $n$th jump of the process and variables $S_1, S_2, S_3, \ldots$ to describe holding times in each of the states, where $S_i$ follows the exponential distribution with rate parameter $-q_{Y_i Y_i}$.
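This definition translates directly into a simulation algorithm: draw an exponential holding time in the current state, then move according to the jump chain. Below is a minimal sketch (assuming Python with NumPy; simulate_ctmc is an illustrative helper name, not from the original text):

```python
import numpy as np

rng = np.random.default_rng(0)

def simulate_ctmc(Q, x0, t_end):
    """Simulate a CTMC path: exponential holding times, then jump-chain moves."""
    Q = np.asarray(Q, dtype=float)
    times, states = [0.0], [x0]
    t, x = 0.0, x0
    while True:
        rate = -Q[x, x]                    # holding rate in the current state
        if rate == 0.0:
            break                          # absorbing state: no further jumps
        t += rng.exponential(1.0 / rate)   # exponential holding time
        if t >= t_end:
            break
        probs = Q[x].copy()
        probs[x] = 0.0
        probs /= rate                      # jump-chain probabilities q_xj / s_x
        x = rng.choice(len(probs), p=probs)
        times.append(t)
        states.append(x)
    return times, states

Q = [[-6, 3, 3], [4, -12, 8], [15, 3, -18]]
times, states = simulate_ctmc(Q, x0=0, t_end=1.0)
print(list(zip(times, states)))
```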
Properties
Communicating classes
Communicating classes, transience, recurrence and positive and null recurrence are defined identically to those for discrete-time Markov chains.
Transient behaviour
Write P(t) for the matrix with entries $p_{ij}(t) = \Pr(X_t = j \mid X_0 = i)$. Then the matrix P(t) satisfies the forward equation, a first-order differential equation

$$P'(t) = P(t) Q,$$

where the prime denotes differentiation with respect to $t$. The solution to this equation is given by a matrix exponential

$$P(t) = e^{tQ}.$$
Consider the simple case of a CTMC on the state space {1,2}. The general Q-matrix for such a process is the following 2 × 2 matrix with α, β > 0:

$$Q = \begin{pmatrix} -\alpha & \alpha \\ \beta & -\beta \end{pmatrix}.$$

The forward equation can be solved explicitly in this case to give

$$P(t) = \begin{pmatrix} \frac{\beta}{\alpha+\beta} + \frac{\alpha}{\alpha+\beta} e^{-(\alpha+\beta)t} & \frac{\alpha}{\alpha+\beta} - \frac{\alpha}{\alpha+\beta} e^{-(\alpha+\beta)t} \\[4pt] \frac{\beta}{\alpha+\beta} - \frac{\beta}{\alpha+\beta} e^{-(\alpha+\beta)t} & \frac{\alpha}{\alpha+\beta} + \frac{\beta}{\alpha+\beta} e^{-(\alpha+\beta)t} \end{pmatrix}.$$
However, direct solutions are complicated to compute for larger matrices. Instead, the fact that Q is the generator for a semigroup of matrices,

$$P(t+s) = P(t) P(s) = e^{(t+s)Q},$$

is used, so that P(t) can be evaluated numerically via the matrix exponential.
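As an illustration of the numerical route, the following sketch (assuming Python with SciPy; the values of α, β and t are arbitrary choices for illustration) compares scipy.linalg.expm with the explicit two-state solution above:

```python
import numpy as np
from scipy.linalg import expm

alpha, beta, t = 2.0, 5.0, 0.3   # arbitrary illustrative parameters
Q = np.array([[-alpha, alpha],
              [ beta, -beta]])

s = alpha + beta
e = np.exp(-s * t)
P_exact = np.array([[beta/s + alpha/s*e, alpha/s - alpha/s*e],
                    [beta/s - beta/s*e,  alpha/s + beta/s*e]])

# Numerical matrix exponential agrees with the closed-form solution
assert np.allclose(expm(t * Q), P_exact)
print(expm(t * Q))
```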
Stationary distribution
The stationary distribution for an irreducible recurrent CTMC is the probability distribution to which the process converges for large values of t. Observe that for the two-state process considered earlier, with P(t) as given above, as t → ∞ the distribution tends to

$$P_\infty = \begin{pmatrix} \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \\[4pt] \frac{\beta}{\alpha+\beta} & \frac{\alpha}{\alpha+\beta} \end{pmatrix}.$$
Observe that each row of this limit matrix is the same distribution, since the limit does not depend on the starting state. The row vector π may be found by solving[3]

$$\pi Q = 0,$$

with the additional constraint that

$$\sum_{i \in S} \pi_i = 1.$$
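Numerically, this amounts to solving a singular linear system together with the normalization constraint. A minimal sketch (assuming Python with NumPy; stationary_distribution is an illustrative helper name) appends the constraint as an extra equation and solves by least squares:

```python
import numpy as np

def stationary_distribution(Q):
    """Solve pi Q = 0 with sum(pi) = 1 via an augmented least-squares system."""
    Q = np.asarray(Q, dtype=float)
    n = Q.shape[0]
    # Transpose so pi is a column unknown: Q^T pi^T = 0, plus the row 1^T pi = 1
    A = np.vstack([Q.T, np.ones(n)])
    b = np.zeros(n + 1)
    b[-1] = 1.0
    pi, *_ = np.linalg.lstsq(A, b, rcond=None)
    return pi

Q = np.array([[-6.0, 3.0, 3.0], [4.0, -12.0, 8.0], [15.0, 3.0, -18.0]])
pi = stationary_distribution(Q)
assert np.allclose(pi @ Q, 0.0, atol=1e-10)
print(pi)
```

Appending the normalization as an extra row makes the otherwise rank-deficient system uniquely solvable for an irreducible chain.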
Example 1
Consider a continuous-time Markov chain with state-space {Bull market, Bear market, Stagnant market} and transition rate matrix

$$Q = \begin{pmatrix} -0.025 & 0.02 & 0.005 \\ 0.3 & -0.5 & 0.2 \\ 0.02 & 0.4 & -0.42 \end{pmatrix}.$$

The stationary distribution of this chain can be found by solving $\pi Q = 0$, subject to the constraint that the elements must sum to 1, to obtain

$$\pi = \begin{pmatrix} 0.885 & 0.071 & 0.044 \end{pmatrix}.$$
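This result can be reproduced numerically with the same least-squares approach sketched above, for instance:

```python
import numpy as np

Q = np.array([[-0.025, 0.02,  0.005],
              [ 0.3,  -0.5,   0.2  ],
              [ 0.02,  0.4,  -0.42 ]])

# Solve pi Q = 0 with the normalization sum(pi) = 1 (same approach as above)
A = np.vstack([Q.T, np.ones(3)])
b = np.array([0.0, 0.0, 0.0, 1.0])
pi, *_ = np.linalg.lstsq(A, b, rcond=None)
print(pi.round(3))   # approximately [0.885 0.071 0.044]
```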
Example 2
Consider a discrete-time Markov chain modeling Pac-Man with state-space {1,2,3,4,5,6,7,8,9}. The player controls Pac-Man through a maze, eating pac-dots. Meanwhile, he is being hunted by ghosts. For convenience, the maze is a small 3×3 grid (numbered row by row, so that state 5 is the centre) and the ghosts move randomly in horizontal and vertical directions. A secret passageway between states 2 and 8 can be used in both directions. Entries with probability zero are removed in the following transition matrix:

$$P = \begin{pmatrix}
 & \tfrac{1}{2} & & \tfrac{1}{2} & & & & & \\
\tfrac{1}{4} & & \tfrac{1}{4} & & \tfrac{1}{4} & & & \tfrac{1}{4} & \\
 & \tfrac{1}{2} & & & & \tfrac{1}{2} & & & \\
\tfrac{1}{3} & & & & \tfrac{1}{3} & & \tfrac{1}{3} & & \\
 & \tfrac{1}{4} & & \tfrac{1}{4} & & \tfrac{1}{4} & & \tfrac{1}{4} & \\
 & & \tfrac{1}{3} & & \tfrac{1}{3} & & & & \tfrac{1}{3} \\
 & & & \tfrac{1}{2} & & & & \tfrac{1}{2} & \\
 & \tfrac{1}{4} & & & \tfrac{1}{4} & & \tfrac{1}{4} & & \tfrac{1}{4} \\
 & & & & & \tfrac{1}{2} & & \tfrac{1}{2} &
\end{pmatrix}$$

This Markov chain is irreducible, because the ghosts can move from every state to every state in a finite number of transitions. Due to the secret passageway, the Markov chain is also aperiodic, because the ghosts can move from any state to any state both in an even and in an odd number of state transitions. Therefore, a unique stationary distribution exists and can be found by solving $\pi P = \pi$, subject to the constraint that the elements must sum to 1. The solution of this linear equation subject to the constraint is

$$\pi = \left( \tfrac{1}{13}, \tfrac{2}{13}, \tfrac{1}{13}, \tfrac{3}{26}, \tfrac{2}{13}, \tfrac{3}{26}, \tfrac{1}{13}, \tfrac{2}{13}, \tfrac{1}{13} \right) \approx (0.077, 0.154, 0.077, 0.115, 0.154, 0.115, 0.077, 0.154, 0.077).$$

The central state 5 and the border states 2 and 8 of the adjacent secret passageway are visited most and the corner states are visited least.
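This stationary distribution can be verified numerically. The sketch below (assuming Python with NumPy) rebuilds the transition matrix from the grid's adjacency structure, including the 2–8 passageway, runs power iteration, and confirms that π is proportional to each state's number of exits:

```python
import numpy as np

# Neighbours in the 3x3 grid (states 1..9), plus the secret passage 2 <-> 8
neighbours = {1: [2, 4], 2: [1, 3, 5, 8], 3: [2, 6],
              4: [1, 5, 7], 5: [2, 4, 6, 8], 6: [3, 5, 9],
              7: [4, 8], 8: [2, 5, 7, 9], 9: [6, 8]}

P = np.zeros((9, 9))
for i, nbrs in neighbours.items():
    for j in nbrs:
        P[i - 1, j - 1] = 1.0 / len(nbrs)   # uniform move to each neighbour

# Power iteration: pi P = pi at the fixed point (valid here since the chain
# is irreducible and aperiodic)
pi = np.full(9, 1.0 / 9.0)
for _ in range(10_000):
    pi = pi @ P

degrees = np.array([len(neighbours[i]) for i in range(1, 10)])
assert np.allclose(pi, degrees / degrees.sum())   # pi is proportional to degree
print(pi.round(3))
```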
Time reversal
For a CTMC Xt, the time-reversed process is defined to be $\hat X_t = X_{T-t}$ for a fixed time T. By Kelly's lemma this process has the same stationary distribution as the forward process.
A chain is said to be reversible if the reversed process is the same as the forward process. Kolmogorov's criterion states that the necessary and sufficient condition for a process to be reversible is that the product of transition rates around a closed loop must be the same in both directions.
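As a concrete check of Kolmogorov's criterion, the sketch below (assuming Python with NumPy, and reusing the three-state Q-matrix from the introduction, which was not chosen to be reversible) compares the products of rates around the loop 0 → 1 → 2 → 0 in both directions:

```python
import numpy as np

Q = np.array([[ -6.0,   3.0,   3.0],
              [  4.0, -12.0,   8.0],
              [ 15.0,   3.0, -18.0]])

forward  = Q[0, 1] * Q[1, 2] * Q[2, 0]   # rates around 0 -> 1 -> 2 -> 0
backward = Q[0, 2] * Q[2, 1] * Q[1, 0]   # rates around 0 -> 2 -> 1 -> 0

# Equal products for every loop would imply reversibility; here they differ,
# so this particular chain is not reversible.
print(forward, backward)   # 360.0 vs 36.0
```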
Embedded Markov chain
One method of finding the stationary probability distribution, π, of an ergodic continuous-time Markov chain, Q, is by first finding its embedded Markov chain (EMC). Strictly speaking, the EMC is a regular discrete-time Markov chain, sometimes referred to as a jump process. Each element of the one-step transition probability matrix of the EMC, S, is denoted by $s_{ij}$, and represents the conditional probability of transitioning from state i into state j. These conditional probabilities may be found by

$$s_{ij} = \begin{cases} \dfrac{q_{ij}}{\sum_{k \neq i} q_{ik}} & \text{if } i \neq j, \\ 0 & \text{otherwise.} \end{cases}$$
From this, S may be written as

$$S = I - \left( \operatorname{diag}(Q) \right)^{-1} Q,$$

where I is the identity matrix and diag(Q) is the diagonal matrix formed by selecting the main diagonal from the matrix Q and setting all other elements to zero.
To find the stationary probability distribution vector, we must next find $\varphi$ such that

$$\varphi S = \varphi,$$

with $\varphi$ being a row vector, such that all elements in $\varphi$ are greater than 0 and $\|\varphi\|_1 = 1$. From this, π may be found as

$$\pi = \frac{-\varphi \left( \operatorname{diag}(Q) \right)^{-1}}{\left\| \varphi \left( \operatorname{diag}(Q) \right)^{-1} \right\|_1}.$$
(S may be periodic, even if Q is not. Once π is found, it must be normalized to a unit vector.)
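A minimal end-to-end sketch of this procedure (assuming Python with NumPy and reusing the three-state Q-matrix from the introduction) computes S, finds φ as a left eigenvector, and rescales to recover π:

```python
import numpy as np

Q = np.array([[ -6.0,   3.0,   3.0],
              [  4.0, -12.0,   8.0],
              [ 15.0,   3.0, -18.0]])

# Embedded Markov chain: S = I - diag(Q)^{-1} Q (zero diagonal, rows sum to 1)
D_inv = np.diag(1.0 / np.diag(Q))
S = np.eye(3) - D_inv @ Q

# Find phi with phi S = phi, i.e. a left eigenvector of S for eigenvalue 1
eigvals, eigvecs = np.linalg.eig(S.T)
phi = np.real(eigvecs[:, np.argmin(np.abs(eigvals - 1.0))])
phi = phi / phi.sum()              # normalise so that ||phi||_1 = 1

# Undo the time-rescaling: pi = -phi diag(Q)^{-1}, renormalised
pi = -phi @ D_inv
pi /= pi.sum()
assert np.allclose(pi @ Q, 0.0, atol=1e-10)
print(pi)
```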
Another discrete-time process that may be derived from a continuous-time Markov chain is a δ-skeleton—the (discrete-time) Markov chain formed by observing X(t) at intervals of δ units of time. The random variables X(0), X(δ), X(2δ), ... give the sequence of states visited by the δ-skeleton.
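The one-step transition matrix of the δ-skeleton is simply P(δ) = e^{δQ}; a short sketch (assuming Python with SciPy, with δ chosen arbitrarily):

```python
import numpy as np
from scipy.linalg import expm

Q = np.array([[-6.0, 3.0, 3.0], [4.0, -12.0, 8.0], [15.0, 3.0, -18.0]])
delta = 0.1

P_delta = expm(delta * Q)                      # one-step matrix of the delta-skeleton
assert np.allclose(P_delta.sum(axis=1), 1.0)   # a valid stochastic matrix
print(P_delta)
```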
Notes
- Ross, S.M. (2010). Introduction to Probability Models (10 ed.). Elsevier. ISBN 978-0-12-375686-2.
- Norris, J. R. (1997). "Continuous-time Markov chains I". Markov Chains. pp. 60–107. doi:10.1017/CBO9780511810633.004. ISBN 9780511810633.
- Norris, J. R. (1997). "Continuous-time Markov chains II". Markov Chains. pp. 108–127. doi:10.1017/CBO9780511810633.005. ISBN 9780511810633.