9
Dec 16

## Ditch statistical tables if you have a computer

You don't need statistical tables if you have Excel or Mathematica. Here I give the relevant Excel and Mathematica functions described in Chapter 14 of my book. You can save all the formulas in one spreadsheet or notebook and use it multiple times.

### Cumulative Distribution Function of the Standard Normal Distribution

For a given real $z$, the value of the distribution function of the standard normal is
$F(z)=\frac{1}{\sqrt{2\pi }}\int_{-\infty }^{z}\exp (-t^{2}/2)dt.$

In Excel, use the formula =NORM.S.DIST(z,TRUE).

In Mathematica, enter CDF[NormalDistribution[0,1],z]

### Probability Function of the Binomial Distribution

For given number of successes $x,$ number of trials $n$ and probability $p$ the probability is

$P(Binomial=x)=C_{x}^{n}p^{x}(1-p)^{n-x}$.

In Excel, use the formula =BINOM.DIST(x,n,p,FALSE)

In Mathematica, enter PDF[BinomialDistribution[n,p],x]

### Cumulative Binomial Probabilities

For a given cut-off value $x,$ number of trials $n$ and probability $p$ the cumulative probability is

$P(Binomial\leq x)=\sum_{t=0}^{x}C_{t}^{n}p^{t}(1-p)^{n-t}.$
In Excel, use the formula =BINOM.DIST(x,n,p,TRUE).

In Mathematica, enter CDF[BinomialDistribution[n,p],x]

### Values of the exponential function $e^{-\lambda}$

In Excel, use the formula =EXP(-lambda)

In Mathematica, enter Exp[-lambda]

### Individual Poisson Probabilities

For given number of successes $x$ and arrival rate $\lambda$ the probability is

$P(Poisson=x)=\frac{e^{-\lambda }\lambda^{x}}{x!}.$
In Excel, use the formula =POISSON.DIST(x,lambda,FALSE)

In Mathematica, enter PDF[PoissonDistribution[lambda],x]

### Cumulative Poisson Probabilities

For given cut-off $x$ and arrival rate $\lambda$ the cumulative probability is

$P(Poisson\leq x)=\sum_{t=0}^{x}\frac{e^{-\lambda }\lambda ^{t}}{t!}.$
In Excel, use the formula =POISSON.DIST(x,lambda,TRUE)

In Mathematica, enter CDF[PoissonDistribution[lambda],x]

### Cutoff Points of the Chi-Square Distribution Function

For given probability of the right tail $\alpha$ and degrees of freedom $\nu$, the cut-off value (critical value) $\chi _{\nu,\alpha }^{2}$ is a solution of the equation
$P(\chi _{\nu}^{2}>\chi _{\nu,\alpha }^{2})=\alpha .$
In Excel, use the formula =CHISQ.INV.RT(alpha,v)

In Mathematica, enter InverseCDF[ChiSquareDistribution[v],1-alpha]

### Cutoff Points for the Student’s t Distribution

For given probability of the right tail $\alpha$ and degrees of freedom $\nu$, the cut-off value $t_{\nu,\alpha }$ is a solution of the equation $P(t_{\nu}>t_{\nu,\alpha })=\alpha$.
In Excel, use the formula =T.INV(1-alpha,v)

In Mathematica, enter InverseCDF[StudentTDistribution[v],1-alpha]

### Cutoff Points for the F Distribution

For given probability of the right tail $\alpha$, degrees of freedom $v_1$ (numerator) and $v_2$ (denominator), the cut-off value $F_{v_1,v_2,\alpha }$ is a solution of the equation $P(F_{v_1,v_2}>F_{v_1,v_2,\alpha })=\alpha$.

In Excel, use the formula =F.INV.RT(alpha,v1,v2)

In Mathematica, enter InverseCDF[FRatioDistribution[v1,v2],1-alpha]

29
Sep 16

## Definitions of chi-square, t statistic and F statistic

Definitions of the standard normal distribution and independence can be combined to produce definitions of chi-square, t statistic and F statistic. The similarity of the definitions makes them easier to study.

### Independence of continuous random variables

Definition of independent discrete random variables easily modifies for the continuous case. Let $X,Y$ be two continuous random variables with densities $p_X,\ p_Y$, respectively. We say that these variables are independent if the density $p_{X,Y}$ of the pair $(X,Y)$ is a product of individual densities:

(1) $p_{X,Y}(s,t)=p_X(s)p_Y(t)$ for all $s,t.$

As in this post, equation (1) can be understood in two ways. If (1) is given, then $X,Y$ are independent. Conversely, we if want them to be independent, we can define the density of the pair by equation (1). This definition readily generalizes for the case of many variables. In particular, if we want variables $z_1,...,z_n$ to be standard normal and independent, we say that each of them has density defined here and the joint density $p_{z_1,...,z_n}$ is a product of individual densities.

### Definition of chi-square variable

Figure 1. chi-square with 1 degree of freedom

Let $z_1,...,z_n$ be standard normal and independent. Then the variable $\chi^2_n=z_1^2+...+z_n^2$ is called a chi-square variable with $n$ degrees of freedom. Obviously, $\chi^2_n\ge 0$, which means that its density is zero to the left of the origin. For low values of degrees of freedom, the density is not bounded near the origin, see Figure 1.

### Definition of t distribution

Figure 2. t distribution and standard normal compared

Let $z_0,z_1,...,z_n$ be standard normal and independent. Then the variable $t_n=\frac{z_0}{\sqrt{(z_1^2+...+z_n^2)/n}}$ is called a t distribution with $n$ degrees of freedom. The density of the t distribution is bell-shaped and for low $n$ has fatter tails than the standard normal. For high $n$, it approaches that of the standard normal, see Figure 2.

### Definition of F distribution

Figure 3. F distribution with (1,m) degrees of freedom

Let $u_1,...,u_n,v_1,...,v_m$ be standard normal and independent. Then the variable $F_{n,m}=\frac{(u_1^2+...+u_n^2)/n}{(v_1^2+...+v_m^2)/m}$ is called an F distribution with $(n,m)$ degrees of freedom. It is nonnegative and its density is zero to the left of the origin. When $n$ is low, the density is not bounded in the neighborhood of zero, see Figure 3.

The Mathematica file and video illustrate better the densities of these three variables.

### Consequences

1. If $\chi^2_n$ and $\chi^2_m$ are independent, then $\chi^2_n+\chi^2_m$ is $\chi^2_{n+m}$ (addition rule). This rule is applied in the theory of ANOVA models.
2. $t_n^2=F_{1,n}$. This is an easy proof of equation (2.71) from Introduction to Econometrics, by Christopher Dougherty, published by Oxford University Press, UK, in 2016.