The Mad Hatter’s Guide to Data Viz and Stats in R
  1. Data Viz and Stats
  2. Inference
  • Data Viz and Stats
    • Tools
      • Introduction to R and RStudio
    • Descriptive Analytics
      • Data
      • Inspect Data
      • Graphs
      • Summaries
      • Counts
      • Quantities
      • Groups
      • Distributions
      • Groups and Distributions
      • Change
      • Proportions
      • Parts of a Whole
      • Evolution and Flow
      • Ratings and Rankings
      • Surveys
      • Time
      • Space
      • Networks
      • Miscellaneous Graphing Tools, and References
    • Inference
      • Basics of Statistical Inference
      • 🎲 Samples, Populations, Statistics and Inference
      • Basics of Randomization Tests
      • Inference for a Single Mean
      • Inference for Two Independent Means
      • Inference for Comparing Two Paired Means
      • Comparing Multiple Means with ANOVA
      • Inference for Correlation
      • Testing a Single Proportion
      • Inference Test for Two Proportions
    • Modelling
      • Modelling with Linear Regression
      • Modelling with Logistic Regression
      • 🕔 Modelling and Predicting Time Series
    • Workflow
      • Facing the Abyss
      • I Publish, therefore I Am
      • Data Carpentry
    • Arts
      • Colours
      • Fonts in ggplot
      • Annotating Plots: Text, Labels, and Boxes
      • Annotations: Drawing Attention to Parts of the Graph
      • Highlighting parts of the Chart
      • Changing Scales on Charts
      • Assembling a Collage of Plots
      • Making Diagrams in R
    • AI Tools
      • Using gander and ellmer
      • Using Github Copilot and other AI tools to generate R code
      • Using LLMs to Explain Stat models
    • Case Studies
      • Demo:Product Packaging and Elderly People
      • Ikea Furniture
      • Movie Profits
      • Gender at the Work Place
      • Heptathlon
      • School Scores
      • Children’s Games
      • Valentine’s Day Spending
      • Women Live Longer?
      • Hearing Loss in Children
      • California Transit Payments
      • Seaweed Nutrients
      • Coffee Flavours
      • Legionnaire’s Disease in the USA
      • Antarctic Sea ice
      • William Farr’s Observations on Cholera in London
    • Projects
      • Project: Basics of EDA #1
      • Project: Basics of EDA #2
      • Experiments

On this page

  • 1 What is Inference?
  • 2 An Idea to Encourage You: Stats Lessons from Sholay!!
  • 3 References
  • 4 Modules
Categories
All (12)
Bootstrap (1)
Central Limit Theorem (1)
Confidence Intervals (1)
Distributions (2)
Explanatory Variable (1)
Generating Parallel Worlds (3)
Inference (1)
Monte Carlo Simulation (2)
Null Distributions (2)
Permutation (3)
Random Number Generation (3)
Randomization (1)
Sampling (1)
Standard Error (1)
t.test (1)
  1. Data Viz and Stats
  2. Inference

Statistical Inference

Published

November 30, 2022

1 What is Inference?

Important

Statistical inference is the process of drawing conclusions about the entire population based on the information in a sample.

In this Section we will examine samples from populations and find procedures for estimating parameters such as means and sd. We will also devise procedures for comparing means and variances across more than one population. The conditions that make these procedures possible and accurate will also be studied and we will find alternative methods when those assumptions breakdown.

Based on our ideas of data and types of variables, here is a table of what we may infer, based on the underlying data:

Data Types and Inference
Variable(s) Estimating What? Population Parameter Sample Statistic
Single Qual variable Proportion p \(\hat{p}\)
Single Quant variable Mean \(\mu\) \(\bar{x}\)
Two Qual Variables Difference in Proportions \(p_1 -p_2\) \(\hat{p_1} - \hat{p_2}\)
One Qual, one Quant Difference in Means \(\mu_1 - \mu_2\) \(\bar{x_1}-\bar{x_2}\)
Two Quant variables Correlation \(\rho\) r

We will examine inference procedures for all these cases.

2 An Idea to Encourage You: Stats Lessons from Sholay!!

Gabbar: “Kitne Aadmi thay?
Stats Teacher: How many observations do you have? n < 30 is a joke.

Gabbar: Kya Samajh kar aaye thay? Gabbar khus hoga? Sabaasi dega kya?
Stats Teacher: What are the levels in your Factors? Are they binary? Don’t do ANOVA just yet!

Gabbar: (Fires off three rounds ) Haan, ab theek hai!
Stats Teacher: Yes, now the dataset is balanced wrt the factor (Treatment and Control).

Gabbar: Is pistol mein teen zindagi aur teen maut bandh hai. Dekhte hain kisko kya milega.
Stats Teacher: This is our Research Question, for which we will Design an Experiment.

Gabbar: (Twirls the chambers of his revolver) “Hume kuchh nahi pataa!”
Stats Teacher: Let us perform a non-parametric Permutation Test for this Factor!

Gabbar: “Kamaal ho gaya!”
Stats Teacher: Fantastic! Our p-value is so small that we can reject the NULL Hypothesis!!

Go and like this post at: https://www.linkedin.com/pulse/stat-lessons-from-sholay-arvind-venkatadri-wgtrf/?trackingId=c0b4UCTLRea6U%2Bj%2Bm4TCtw%3D%3D

3 References

  1. https://www.openintro.org/book/os/

4 Modules

Title Reading Time
Basics of Statistical Inference 8 min
🎲 Samples, Populations, Statistics and Inference 41 min
Uncertainty 5 min
Basics of Randomization Tests 12 min
Inference for a Single Mean 39 min
Inference for Two Independent Means 56 min
Inference for Comparing Two Paired Means 29 min
Comparing Multiple Means with ANOVA 49 min
Inference for Correlation 39 min
Testing a Single Proportion 20 min
Inference Test for Two Proportions 40 min
Permutation Tests 23 min
No matching items
Back to top
Miscellaneous Graphing Tools, and References
Basics of Statistical Inference

License: CC BY-SA 2.0

Website made with ❤️ and Quarto, by Arvind V.

Hosted by Netlify .