Week 8: Multinomial, Multivariate Normal

0. Logistical Info

  • Section date: 11/8
  • Associated lectures: 10/31, 11/2
  • Associated pset: Pset 8, due 11/10
  • Office hours on 11/8 from 9-11pm at Quincy Dining Hall
  • Remember to fill out the attendance form

0.1 Summary + Practice Problem PDFs

Summary + Practice Problems PDF

Practice Problem Solutions PDF

1. Multinomial

We first generalize the notion of Bernoulli trials to many categories; this vocabulary for “categorical trials” is not standard/necessary for the class, just introduced by me to help define the Multinomial.

Consider categorical trials, where the outcome of a trial falls into one of k categories (e.g., the roll of a die has 6 categories, the flip of a coin has 2, etc.). Let pRk be a probability vector (where each entry is in [0,1] and the entries add up to p), where pi is the probability that the outcome falls into the ith category.

Multinomial story: Suppose we run n independent and identically distributed (i.i.d.) categorical trials with k categories and probability vector p. Let X (a k-dimensional random vector) count the number of trials that fell into each category. Then X is distributed Multinomial: XMultk(n,p).

1.1 Multinomial Properties

  • Marginal: For XMultk(n,p), XjBin(n,pj).
  • Conditioning: For XMultk(n,p), (X2,,Xn)|X1=x1Multk1(nx1,(p21p1,,pn1p1)).
  • Lumping: Suppose XMultk(n,p). Then we can group (lump) categories in any way to get a new Multinomial random variable by adding up the associated probabilities. For example, if (X1,X2,X3,X4,X5)Mult5(n,(p1,p2,p3,p4,p5)), then some valid examples are (X1+X4,X2,X3+X5)Mult3(n,(p1+p4,p2,p3+p5)),(X1+X2,X3,X4,X5)Mult4(n,(p1+p2,p3,p4,p5)).
  • Covariance: For XMultk(n,p), Cov(Xi,Xj)=npipj.
  • Chicken-Egg extension: Suppose NPois(λ) and X|N=nMultk(n,p) where k,p don’t depend on n. Then for j=1,2,,k, XjPois(λpj).

2. Multivariate Normal

Suppose X is a k-dimensional random vector. Then X follows Multivariate Normal (MVN) distribution if for any constants t1,,tkR, t1X1++tkXk is Normal (where 0 is consider to follow a degenerate Normal distribution). The k=2 case is called the Bivariate Normal.

2.1 Multivariate Normal Properties

  • Uncorrelated MVN implies independence: Suppose (X,Y) is bivariate normal with Cov(X,Y)=0 (i.e., X and Y are uncorrelated). Then X and Y are independent.
    More generally, if X and Y (potentially vectors) are components of the same MVN and Xi,Yj are uncorrelated for any i,j, then X and Y are independent.
Please note the specific conditions under which this result holds. It is always true that independent random variables are uncorrelated, but the converse is not a general truth. For example, two uncorrelated Normal random variables are not necessarily independent; we could only make that statement if we knew they were components of the same MVN.
  • Independence of sum and difference: Suppose XN(μ1,σ2) and YN(μ2,σ2) are independent. Then X+Y and XY are also independent.
  • Concatenation: Suppose X=(X1,,Xn) and Y=(Y1,Ym) are both Multivariate Normal with X,Y independent of each other. Then (X1,,Xn,Y1,,Ym) is also Multivariate Normal.
  • Subvector: Suppose (X,Y,Z) is Multivariate Normal. Then (X,Y) is also Multivariate Normal. In general, any subvector of a Multivariate Normal still follows a Multivariate Normal distribution.
Previous
Next