4.2 Estimates, standard errors, and confidence intervals
Jargon
Parameters are the unknown numbers that determine the statistical model
Coefficients are, for example, the slope and intercept
scale or variance is the measurement error
estimand or quantity of interest is some summary of parameters or data of interest
We use data to contruct estimates of parameters or other quantities of interest.
standard error is the estimated standard deviation of an estimate.
Confidence interval represents a range of values of a parameter or quantity of interest that are roughly consistent with the data, given the assumed sampling distribution. If the model is correct, then in repeated applications the 50% and 95% confidence intervals will include the true value 50% and 95% of the time.
When the sampling distribution is a normal distribution with mean \(\mu\) and standard deviation \(\sigma\), and \(n\) draws (data
) are made from this distribution, then the estimate for \(\mu\) is just the mean(data)
, the standard error is the sd(data)/sqrt(n)
, and confidence intervals can be estimated using quantiles.
If the normal distribution is a good approximation:
- 2 standard errors ~ 95% quantile
- 2/3 standard errors ~ 50% quantile