A Markov Chain or Markov Process models a system that changes from state to state over time.
Example States:
Weather: Sunny, Cloudy, Rainy
Traffic signal: Green, Yellow, Red
Digital circuit: High, Low
Communication Channel: Good, Noisy
Transition Probabilities and Matrices
If a Markov chain has \(k\) possible states, labeled \(1, 2, \ldots, k\):
Transition Probability (\(p_{ij}\)):
The probability that the system is in state \(i\) at the next observation, given it was in state \(j\) at the preceding observation.
Transition Matrix (\(P\)):
\(P = [p_{ij}]\)
Columns represent preceding states.
Rows represent next states.
\[
P = \left[ \begin{array}{llll}
p_{11} & p_{12} & \dots & p_{1k} \\
p_{21} & p_{22} & \dots & p_{2k} \\
\vdots & \vdots & \ddots & \vdots \\
p_{k1} & p_{k2} & \dots & p_{kk}
\end{array} \right] \begin{array}{l}
\longleftarrow \text{ Next State 1} \\
\longleftarrow \text{ Next State 2} \\
\longleftarrow \vdots \\
\longleftarrow \text{ Next State k}
\end{array}
\]\[\uparrow \uparrow \quad \uparrow\]\[\text{Prev. State 1} \quad \text{Prev. State 2} \quad \text{Prev. State k}\]
Example 1: Car Rental Agency
A car rental agency has three locations (1, 2, 3). The manager finds customer return patterns:
From Location 1:
Return to 1: 80%
Return to 2: 10%
Return to 3: 10%
From Location 2:
Return to 1: 30%
Return to 2: 20%
Return to 3: 50%
From Location 3:
Return to 1: 20%
Return to 2: 60%
Return to 3: 20%
Transition Matrix (\(P\)):\[
P = \left[ \begin{array}{ccc}
.8 & .3 & .2 \\
.1 & .2 & .6 \\
.1 & .5 & .2
\end{array} \right]
\] This matrix is a Stochastic Matrix: all entries are non-negative, and columns sum to 1. \[
p_{1j} + p_{2j} + \dots + p_{kj} = 1
\]
State Vectors
Definition 2: State Vector (\(\mathbf{x}\)) A column vector \(\mathbf{x}\) whose \(i\)-th component \(x_i\) is the probability that the system is in the \(i\)-th state at a given time.
Example for a 3-state system: \[
\mathbf{x} = \left[ \begin{array}{l}
x_{1} \\
x_{2} \\
x_{3}
\end{array} \right]
\]
\(x_i \ge 0\) for all \(i\).
\(\sum x_i = 1\) (the system must be in some state).
A vector with these properties is called a probability vector.
Theorem 10.4.1: Future State Prediction If \(P\) is the transition matrix of a Markov chain and \(\mathbf{x}^{(n)}\) is the state vector at the \(n\)-th observation, then: \[ \mathbf{x}^{(n+1)} = P \mathbf{x}^{(n)} \] This implies: \[ \mathbf{x}^{(n)} = P^n \mathbf{x}^{(0)} \]
Example 2: Alumni Donation (Probabilistic Evolution)
A traffic officer moves between 8 intersections. They remain at the current intersection or move to an adjacent one, choosing randomly (equally likely).
Definition 3: Regular Transition Matrix A transition matrix \(P\) is regular if some integer power of it, \(P^m\), has all positive entries (no zeros). * Examples 1, 2, 5 were regular (\(m=1\) or \(m=4\)). This ensures convergence.
Limiting Behavior and Steady-State Vectors
Theorem 10.4.3: Behavior of \(P^n \mathbf{x}\) as \(n \rightarrow \infty\) If \(P\) is a regular transition matrix and \(\mathbf{x}\) is any probability vector, then as \(n \rightarrow \infty\): \[ P^n \mathbf{x} \rightarrow \mathbf{q} \] where \(\mathbf{q}\) is a fixed probability vector, called the steady-state vector.
\(\mathbf{q}\) is independent of the initial state \(\mathbf{x}^{(0)}\).
All entries of \(\mathbf{q}\) are positive.
Calculating the Steady-State Vector (\(\mathbf{q}\))
Theorem 10.4.4: Steady-State Vector Property The steady-state vector \(\mathbf{q}\) of a regular transition matrix \(P\) is the unique probability vector that satisfies the equation: \[ P \mathbf{q} = \mathbf{q} \] This is an eigenvector problem! \(\mathbf{q}\) is an eigenvector of \(P\) corresponding to the eigenvalue \(\lambda = 1\).
Rearranging the equation: \[ (I - P) \mathbf{q} = \mathbf{0} \] This is a homogeneous linear system. We need to find the unique probability vector (entries sum to 1) in the null space of \((I-P)\).
Example 7: Alumni Donation (Steady-State Calculation)
If the agency has 1000 cars, roughly 557 will be at Location 1, 230 at Location 2, and 213 at Location 3 in the long run. This informs optimal parking space allocation.
Example 9: Traffic Officer (Steady-State)
Let’s find the long-term probability distribution for the traffic officer’s location.
The resulting steady-state vector shows the proportion of time the officer spends at each intersection over the long term. Are these proportions equal? Why or why not?