0. Logistical Info
- Section date: 11/8
- Associated lectures: 10/31, 11/2
- Associated pset: Pset 8, due 11/10
- Office hours on 11/8 from 9-11pm at Quincy Dining Hall
- Remember to fill out the attendance form
0.1 Summary + Practice Problem PDFs
Summary + Practice Problems PDF
Practice Problem Solutions PDF
1. Multinomial
We first generalize the notion of Bernoulli trials to many categories; this "categorical trials" vocabulary is not standard or required for the class, I introduce it only to help define the Multinomial.
Consider categorical trials, where the outcome of a trial falls into one of $k$ categories (e.g., the roll of a die has $6$ categories, the flip of a coin has $2$, etc.). Let $\mathbf p \in \mathbb R^k$ be a probability vector (where each entry is in $[0, 1]$ and the entries add up to $1$), where $p_i$ is the probability that the outcome falls into the $i^{\text{th}}$ category. $\newcommand{\Mult}{\mathrm{Mult}}\newcommand{\cov}{\mathrm{Cov}}\newcommand{\Pois}{\mathrm{Pois}}\newcommand{\Bin}{\mathrm{Bin}}$
Multinomial story: Suppose we run $n$ independent and identically distributed (i.i.d.) categorical trials with $k$ categories and probability vector $\mathbf p$. Let $\mathbf X$ (a $k$-dimensional random vector) count the number of trials that fell into each category. Then $\mathbf X$ is distributed Multinomial: $\mathbf X \sim \Mult_k(n, \mathbf p)$.
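To make the story concrete, here is a minimal simulation sketch (using NumPy; the fair-die setup, sample size, and seed are illustrative choices, not from the notes): tallying the outcomes of $n$ i.i.d. categorical trials produces a draw with the same distribution as a single call to NumPy's multinomial sampler.

```python
import numpy as np

rng = np.random.default_rng(0)
k, n = 6, 60                 # illustrative: a fair die, 60 rolls
p = np.full(k, 1 / k)        # probability vector: each entry 1/6

# Run n i.i.d. categorical trials and count how many fell in each category
trials = rng.choice(k, size=n, p=p)
counts = np.bincount(trials, minlength=k)

# The count vector has the same distribution as one draw from Mult_k(n, p)
direct = rng.multinomial(n, p)
print(counts)
print(direct)
```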
1.1 Multinomial Properties
- Marginal: For $\mathbf X \sim \Mult_k(n, \mathbf p)$, $X_j \sim \Bin(n, p_j)$.
- Conditioning: For $\mathbf X \sim \Mult_k(n, \mathbf p)$, \begin{align*} (X_2, \ldots, X_k) | X_1 = x_1 &\sim \Mult_{k-1}\left(n-x_1, \left(\frac{p_2}{1 - p_1}, \ldots, \frac{p_k}{1-p_1} \right) \right). \end{align*}
- Lumping: Suppose $\mathbf X \sim \Mult_k(n, \mathbf p)$. Then we can group (lump) categories in any way to get a new Multinomial random variable by adding up the associated probabilities. For example, if $(X_1, X_2, X_3, X_4, X_5) \sim \Mult_5\left( n, (p_1, p_2, p_3, p_4, p_5) \right)$, then some valid examples are \begin{align*} (X_1+X_4, X_2, X_3+X_5) &\sim \Mult_3\left(n, ( p_1+p_4, p_2, p_3+p_5 )\right),\\ (X_1+X_2, X_3, X_4, X_5) &\sim \Mult_4\left(n, ( p_1+p_2, p_3, p_4, p_5 )\right). \end{align*}
- Covariance: For $\mathbf X \sim \Mult_k(n, \mathbf p)$, $\cov(X_i, X_j) = -np_ip_j$ for $i \neq j$ (for $i = j$, $\mathrm{Var}(X_i) = np_i(1-p_i)$, the Binomial variance). Both this formula and the lumping property are checked numerically in the sketch after this list.
- Chicken-Egg extension: Suppose $N \sim \Pois(\lambda)$ and $\mathbf X | N = n \sim \Mult_k (n, \mathbf p)$ where $k, \mathbf p$ don't depend on $n$. Then $X_1, \ldots, X_k$ are independent, with \begin{align*} X_j &\sim \Pois(\lambda p_j) \end{align*} for $j = 1, 2, \ldots, k$.
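As a sanity check on these properties, here is a small Monte Carlo sketch (NumPy; the specific $n$, $\mathbf p$, and seed are illustrative) comparing simulated draws against the covariance formula and the lumping property:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 100
p = np.array([0.1, 0.2, 0.3, 0.25, 0.15])
draws = rng.multinomial(n, p, size=100_000)   # shape (100000, 5)

# Covariance: Cov(X_1, X_2) should be close to -n * p_1 * p_2
print(np.cov(draws[:, 0], draws[:, 1])[0, 1], -n * p[0] * p[1])

# Lumping: (X_1 + X_4, X_2, X_3 + X_5) ~ Mult_3(n, (p_1+p_4, p_2, p_3+p_5));
# compare component means against direct draws from the lumped Multinomial
lumped = np.stack([draws[:, 0] + draws[:, 3],
                   draws[:, 1],
                   draws[:, 2] + draws[:, 4]], axis=1)
direct = rng.multinomial(n, [p[0] + p[3], p[1], p[2] + p[4]], size=100_000)
print(lumped.mean(axis=0), direct.mean(axis=0))
```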
2. Multivariate Normal
Suppose $\mathbf X$ is a $k$-dimensional random vector. Then $\mathbf X$ follows a Multivariate Normal (MVN) distribution if for any constants $t_1, \ldots, t_k \in \mathbb R$, \begin{align*} t_1 X_1 + \cdots + t_k X_k \end{align*} is Normal (where the constant $0$ is considered to follow a degenerate Normal distribution). The $k=2$ case is called the Bivariate Normal.
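One standard way to construct an MVN is as an affine transformation $\mathbf X = \boldsymbol\mu + A\mathbf Z$ of i.i.d. standard Normals. The sketch below (NumPy; the particular $\boldsymbol\mu$, $A$, and $\mathbf t$ are arbitrary illustrative choices) checks that a linear combination $t_1 X_1 + \cdots + t_k X_k$ of such a vector has the mean $\mathbf t^\top \boldsymbol\mu$ and variance $\mathbf t^\top A A^\top \mathbf t$ that the definition predicts.

```python
import numpy as np

rng = np.random.default_rng(0)

# Build a 3-dimensional MVN as X = mu + A Z with Z i.i.d. standard Normal
# (mu and A are illustrative, not from the notes)
mu = np.array([1.0, -2.0, 0.5])
A = np.array([[1.0, 0.0, 0.0],
              [0.5, 2.0, 0.0],
              [-1.0, 0.3, 1.5]])
Z = rng.standard_normal((100_000, 3))
X = mu + Z @ A.T

# Any linear combination t . X should be Normal with mean t . mu and
# variance t^T (A A^T) t
t = np.array([2.0, -1.0, 3.0])
combo = X @ t
print(combo.mean(), t @ mu)             # empirical vs. theoretical mean
print(combo.var(), t @ (A @ A.T) @ t)   # empirical vs. theoretical variance
```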
2.1 Multivariate Normal Properties
- Uncorrelated MVN implies independence: Suppose $(X, Y)$ is bivariate normal with $\cov(X, Y) = 0$ (i.e., $X$ and $Y$ are uncorrelated). Then $X$ and $Y$ are independent.
More generally, if $\mathbf X$ and $\mathbf Y$ (potentially vectors) are components of the same MVN and $X_i$ and $Y_j$ are uncorrelated for all $i, j$, then $\mathbf X$ and $\mathbf Y$ are independent.
- Independence of sum and difference: Suppose $X \sim \mathcal N(\mu_1, \sigma^2)$ and $Y \sim \mathcal N(\mu_2, \sigma^2)$ are independent (note the equal variances). Then $X+Y$ and $X-Y$ are independent; a numerical check appears after this list.
- Concatenation: Suppose $\mathbf X = (X_1, \ldots, X_n)$ and $\mathbf Y = (Y_1, \ldots, Y_m)$ are both Multivariate Normal with $\mathbf X, \mathbf Y$ independent of each other. Then $(X_1, \ldots, X_n, Y_1, \ldots, Y_m)$ is also Multivariate Normal.
- Subvector: Suppose $(X, Y, Z)$ is Multivariate Normal. Then $(X, Y)$ is also Multivariate Normal. In general, any subvector of a Multivariate Normal still follows a Multivariate Normal distribution.
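Here is a quick numerical check of the sum/difference property (NumPy; the means, variance, and seed are illustrative). The reasoning ties the properties together: $(X+Y, X-Y)$ is a linear transformation of the MVN $(X, Y)$, so it is itself Bivariate Normal, and $\cov(X+Y, X-Y) = \mathrm{Var}(X) - \mathrm{Var}(Y) = 0$ when the variances are equal, so uncorrelated implies independent.

```python
import numpy as np

rng = np.random.default_rng(0)

# Independent Normals with equal variance sigma^2 (values illustrative)
mu1, mu2, sigma = 3.0, -1.0, 2.0
X = rng.normal(mu1, sigma, size=100_000)
Y = rng.normal(mu2, sigma, size=100_000)

# (S, D) is Bivariate Normal with Cov(S, D) = Var(X) - Var(Y) = 0,
# so S and D are independent, not merely uncorrelated
S, D = X + Y, X - Y
print(np.corrcoef(S, D)[0, 1])  # should be approximately 0
```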