Intro and Basics Reference Page
How Deviant and Mean Are Your Data?
- Video links go directly to the YouTube video.
- JTF (Just the Facts) videos are the streamlined versions: greenscreen LaTeX equations, clean graphics, minimal narrative. Think efficient and to the point. Each has a companion TFS video.
- TFS (The Full Story) videos include interviews, additional explanation, and equations written on whiteboards. Same math, more context, more personality. Each has a matching JTF version.
- Info Page links lead to definitions, expanded explanations, and related material—because sometimes you really do need to explain it.
- Visuals links contain the greenscreen or whiteboard materials used in the video, for those who like to see the scaffolding.
- Wondering about the titles? See Appendix B: If you have to explain it, it’s no longer funny.
- Videos marked This is NOT a DOFPro video were not produced by DOFPro but are included because they are relevant to the topic. They are shown in red so no one calls the academic integrity police.
Intro to Basic Statistics
Engineers must collect, analyze, and interpret large amounts of data. To evaluate a data set, a number of basic statistical calculations are commonly used, including the determination of the mean, the standard deviation, and the confidence interval around the mean.
This set of videos introduces the fundamentals of these calculations and demonstrates how to perform them by hand, on a spreadsheet, with MATLAB, and with R.
Part 1 Just the Facts
Part 1 of Just the Facts introduces the concepts of the population (or true) mean and standard deviation, as well as the sample mean and sample standard deviation, and explains how to calculate them.
Part 1 The Full Story
Part 1 of The Full Story examines common conceptions and misconceptions about the mean and standard deviation, explains how to calculate both population and sample statistics, and prepares the viewer for their later use in uncertainty analysis.
How Deviant and Mean Are Your Data? Part 2
Part 2 explains how to calculate the standard error, confidence interval, and Student’s \(t\), and how those quantities are used in practice.
Basic Statistics by Hand
Basic Statistics by Hand demonstrates how to calculate the sample mean, sample standard deviation, standard error, and confidence interval by hand using only a calculator and a Student’s \(t\)-table.
Basic Statistics on a Spreadsheet
Basic Statistics on a Spreadsheet demonstrates how to calculate the sample mean, sample standard deviation, standard error, and confidence interval first by using the spreadsheet as a calculator and then by using built-in statistical functions.
Basic Statistics with MATLAB
This is NOT a DOFPro video. It may contain copyrighted material and references to proprietary software.
Basic Statistics in MATLAB demonstrates how to calculate the sample mean, sample standard deviation, standard error, and confidence interval using both the command line and an .m file.
Basic Statistics in R
Basic Statistics in R demonstrates how to calculate the sample mean, sample standard deviation, standard error, and confidence interval using both the command line and a script.
Examples and Definitions
Example Links
- Basic Statistical Example (Cereal Data) – An R script used to generate the animations in the basic statistics videos
- Student’s t-Table – A spreadsheet containing one-sided and two-sided \(t\) values
- Basic Statistics on a Spreadsheet Two Ways – The spreadsheet used in the spreadsheet video
- Command Line Input for Basic Statistics in R
- R Script for Basic Statistics Example – The R script used in the R video
Definitions
- True or Population Mean, \(\mu\)
- The arithmetic mean of an entire population. In practice, it is usually unknown and must be estimated from sample data.
- Sample Mean, \(\bar{x}\)
- The arithmetic mean of a sample of \(N\) measurements:
\[ \bar{x}=\frac{1}{N}\sum_{i=1}^{N} x_i \]
- True Variance, \(\sigma^2\)
- A measure of the spread of an entire population about the population mean:
\[ \sigma^2=\frac{1}{N}\sum_{i=1}^{N}(x_i-\mu)^2 \]
- True or Population Standard Deviation, \(\sigma\)
- The square root of the population variance:
\[ \sigma = \sqrt{\sigma^2} \]
- Sample Variance, \(S^2\)
- A measure of the spread of a sample about the sample mean:
\[ S^2=\frac{1}{N-1}\sum_{i=1}^{N}(x_i-\bar{x})^2 \]
- Sample Standard Deviation, \(S\)
- The square root of the sample variance:
\[ S = \sqrt{S^2} \]
- Error, \(\varepsilon_i\)
- The difference between an observed value and the true or population mean:
\[ \varepsilon_i = x_i - \mu \]
- Residual, \(e_i\)
- The difference between an observed value and the sample mean:
\[ e_i = x_i - \bar{x} \]
- Estimated Standard Error, \(S_{\bar{x}}\)
- An estimate of the standard deviation of the sample mean:
\[ S_{\bar{x}}=\frac{S}{\sqrt{N}} \]
- Student’s \(t\)
- A probability distribution used when estimating confidence intervals from small samples, especially when the population standard deviation is unknown. As the sample size increases, the \(t\)-distribution approaches the normal distribution.
- Confidence Interval, \(\bar{x} \pm \lambda\)
- An interval centered on the sample mean that is expected, with a specified probability, to contain the true mean.
For the form used in these videos,
\[ \lambda = t S_{\bar{x}} \]
so that the confidence interval is
\[ \bar{x} \pm t S_{\bar{x}} \]
Other Links and Videos
Other Links
Previous and Following Videos
Two videos back
How to Have a Quickie with Your Data Part 2
Nonlinear least squares
Video, Info Page, Visuals
Previous video
Solving Nonlinear Equations with Spreadsheets
See Title
Video, Info Page, Visuals
Next video
This is NOT a DOFPro video.
Linear Regression
Derivation of the equations for linear regression
Video, Info Page
Two videos forward
This is NOT a DOFPro video.
Linear Regression Spreadsheet
Performing linear regression with a spreadsheet
Video, Info Page