Stochastic Matrix

In mathematics, a stochastic matrix is a square matrix used to describe the transitions of a Markov chain.

Each of its entries is a nonnegative real number representing a probability.: 9–11  It is also called a probability matrix, transition matrix, substitution matrix, or Markov matrix.: 9–11  The stochastic matrix was first developed by Andrey Markov at the beginning of the 20th century, and has found use throughout a wide variety of scientific fields, including probability theory, statistics, mathematical finance and linear algebra, as well as computer science and population genetics.: 1–8  There are several different definitions and types of stochastic matrices:: 9–11 

    A right stochastic matrix is a real square matrix, with each row summing to 1.
    A left stochastic matrix is a real square matrix, with each column summing to 1.
    A doubly stochastic matrix is a square matrix of nonnegative real numbers with each row and column summing to 1.

In the same vein, one may define a stochastic vector (also called probability vector) as a vector whose elements are nonnegative real numbers which sum to 1. Thus, each row of a right stochastic matrix (or column of a left stochastic matrix) is a stochastic vector.: 9–11  A common convention in English language mathematics literature is to use row vectors of probabilities and right stochastic matrices rather than column vectors of probabilities and left stochastic matrices; this article follows that convention.: 1–8  In addition, a substochastic matrix is a real square matrix whose row sums are all

History

Stochastic Matrix 
Andrey Markov in 1886

The stochastic matrix was developed alongside the Markov chain by Andrey Markov, a Russian mathematician and professor at St. Petersburg University who first published on the topic in 1906.: 1–8  His initial intended uses were for linguistic analysis and other mathematical subjects like card shuffling, but both Markov chains and matrices rapidly found use in other fields.: 1–8 

Stochastic matrices were further developed by scholars such as Andrey Kolmogorov, who expanded their possibilities by allowing for continuous-time Markov processes. By the 1950s, articles using stochastic matrices had appeared in the fields of econometrics and circuit theory. In the 1960s, stochastic matrices appeared in an even wider variety of scientific works, from behavioral science to geology to residential planning. In addition, much mathematical work was also done through these decades to improve the range of uses and functionality of the stochastic matrix and Markovian processes more generally.

From the 1970s to present, stochastic matrices have found use in almost every field that requires formal analysis, from structural science to medical diagnosis to personnel management. In addition, stochastic matrices have found wide use in land change modeling, usually under the term Markov matrix.

Definition and properties

A stochastic matrix describes a Markov chain Xt over a finite state space S with cardinality α.

If the probability of moving from i to j in one time step is Pr(j|i) = Pi,j, the stochastic matrix P is given by using Pi,j as the i-th row and j-th column element, e.g.,

    Stochastic Matrix 

Since the total of transition probability from a state i to all other states must be 1,

    Stochastic Matrix 

thus this matrix is a right stochastic matrix.: 1–8 

The above elementwise sum across each row i of P may be more concisely written as P1 = 1, where 1 is the α-dimensional column vector of all ones. Using this, it can be seen that the product of two right stochastic matrices P and P′′ is also right stochastic: PP′′ 1 = P′ (P′′ 1) = P1 = 1. In general, the k-th power Pk of a right stochastic matrix P is also right stochastic. The probability of transitioning from i to j in two steps is then given by the (i, j)-th element of the square of P:

    Stochastic Matrix 

In general, the probability transition of going from any state to another state in a finite Markov chain given by the matrix P in k steps is given by Pk.

An initial probability distribution of states, specifying where the system might be initially and with what probabilities, is given as a row vector.

A stationary probability vector π is defined as a distribution, written as a row vector, that does not change under application of the transition matrix; that is, it is defined as a probability distribution on the set {1, …, n} which is also a row eigenvector of the probability matrix, associated with eigenvalue 1:

    Stochastic Matrix 

It can be shown that the spectral radius of any stochastic matrix is one. By the Gershgorin circle theorem, all of the eigenvalues of a stochastic matrix have absolute values less than or equal to one. Additionally, every right stochastic matrix has an "obvious" column eigenvector associated to the eigenvalue 1: the vector 1 used above, whose coordinates are all equal to 1. As left and right eigenvalues of a square matrix are the same, every stochastic matrix has, at least, a row eigenvector associated to the eigenvalue 1 and the largest absolute value of all its eigenvalues is also 1. Finally, the Brouwer Fixed Point Theorem (applied to the compact convex set of all probability distributions of the finite set {1, …, n}) implies that there is some left eigenvector which is also a stationary probability vector.

On the other hand, the Perron–Frobenius theorem also ensures that every irreducible stochastic matrix has such a stationary vector, and that the largest absolute value of an eigenvalue is always 1. However, this theorem cannot be applied directly to such matrices because they need not be irreducible.

In general, there may be several such vectors. However, for a matrix with strictly positive entries (or, more generally, for an irreducible aperiodic stochastic matrix), this vector is unique and can be computed by observing that for any i we have the following limit,

    Stochastic Matrix 

where πj is the j-th element of the row vector π. Among other things, this says that the long-term probability of being in a state j is independent of the initial state i. That both of these computations give the same stationary vector is a form of an ergodic theorem, which is generally true in a wide variety of dissipative dynamical systems: the system evolves, over time, to a stationary state.

Intuitively, a stochastic matrix represents a Markov chain; the application of the stochastic matrix to a probability distribution redistributes the probability mass of the original distribution while preserving its total mass. If this process is applied repeatedly, the distribution converges to a stationary distribution for the Markov chain.: 55–59 

Example: Cat and Mouse

Suppose there is a timer and a row of five adjacent boxes. At time zero, a cat is in the first box, and a mouse is in the fifth box. The cat and the mouse both jump to a random adjacent box when the timer advances. For example, if the cat is in the second box and the mouse is in the fourth, the probability that the cat will be in the first box and the mouse in the fifth after the timer advances is one fourth. If the cat is in the first box and the mouse is in the fifth, the probability that the cat will be in box two and the mouse will be in box four after the timer advances is one. The cat eats the mouse if both end up in the same box, at which time the game ends. Let the random variable K be the time the mouse stays in the game.

The Markov chain that represents this game contains the following five states specified by the combination of positions (cat,mouse). Note that while a naive enumeration of states would list 25 states, many are impossible either because the mouse can never have a lower index than the cat (as that would mean the mouse occupied the cat's box and survived to move past it), or because the sum of the two indices will always have even parity. In addition, the 3 possible states that lead to the mouse's death are combined into one:

  • State 1: (1,3)
  • State 2: (1,5)
  • State 3: (2,4)
  • State 4: (3,5)
  • State 5: game over: (2,2), (3,3) & (4,4).

We use a stochastic matrix, Stochastic Matrix  (below), to represent the transition probabilities of this system (rows and columns in this matrix are indexed by the possible states listed above, with the pre-transition state as the row and post-transition state as the column).: 1–8  For instance, starting from state 1 – 1st row – it is impossible for the system to stay in this state, so Stochastic Matrix ; the system also cannot transition to state 2 – because the cat would have stayed in the same box – so Stochastic Matrix , and by a similar argument for the mouse, Stochastic Matrix . Transitions to states 3 or 5 are allowed, and thus Stochastic Matrix  .

    Stochastic Matrix 

Long-term averages

No matter what the initial state, the cat will eventually catch the mouse (with probability 1) and a stationary state π = (0,0,0,0,1) is approached as a limit.: 55–59  To compute the long-term average or expected value of a stochastic variable Stochastic Matrix , for each state Stochastic Matrix  and time Stochastic Matrix  there is a contribution of Stochastic Matrix . Survival can be treated as a binary variable with Stochastic Matrix  for a surviving state and Stochastic Matrix  for the terminated state. The states with Stochastic Matrix  do not contribute to the long-term average.

Phase-type representation

Stochastic Matrix 
The survival function of the mouse. The mouse will survive at least the first time step.

As State 5 is an absorbing state, the distribution of time to absorption is discrete phase-type distributed. Suppose the system starts in state 2, represented by the vector Stochastic Matrix . The states where the mouse has perished don't contribute to the survival average so state five can be ignored. The initial state and transition matrix can be reduced to,

    Stochastic Matrix 

and

    Stochastic Matrix 

where Stochastic Matrix  is the identity matrix, and Stochastic Matrix  represents a column matrix of all ones that acts as a sum over states.

Since each state is occupied for one step of time the expected time of the mouse's survival is just the sum of the probability of occupation over all surviving states and steps in time,

    Stochastic Matrix 

Higher order moments are given by

    Stochastic Matrix 

See also

References

Tags:

Stochastic Matrix HistoryStochastic Matrix Definition and propertiesStochastic Matrix Example: Cat and MouseStochastic Matrix

🔥 Trending searches on Wiki English:

American Civil WarPassover Seder plateCecil Williams (pastor)Benjamin BrandWorld War IMike TysonCassandra NovaMount TakaheGWikiIndiaBob Marley2024 Indian general election in Uttar PradeshInter Miami CFTamim bin Hamad Al ThaniAzerbaijanIndian National CongressJoe AlwynNot Without My Daughter (film)Jason MomoaCharles IIIRyan GarciaAadhaarNag AshwinHenry VIIIRobin WilliamsScottie SchefflerRobert Downey Jr.Anya Taylor-JoyOpinion polling for the 2024 Indian general electionRiley KeoughThe Big Bang TheoryMia KhalifaTrap (2024 film)OpenAIList of Italian football championsKanye West2024 Indian Premier LeagueTakeoff (rapper)FeyenoordAriana GrandeHogwartsLinkedInMichael Douglas2024 ICC Men's T20 World CupMatthew Perry2024AEW DynastyShōgun (2024 miniseries)JapanWomen's World Championship (WWE)Miles McBrideEarthAmazon (company)FA CupAnimal (2023 Indian film)Rajasthan RoyalsInterstellar (film)Mexico2024 Mutua Madrid Open – Women's singlesPremier LeagueGeri HalliwellPatti SmithFrom (TV series)WrestleMania XLTupac ShakurMrBeastDiljit DosanjhSunny LeoneLimoneneList of Indian Premier League seasons and resultsNew ZealandIndian Premier LeagueRed Eye (British TV series)Jennifer Pan2023–24 UEFA Champions LeagueFallout 4Pornhub🡆 More