## Statistical Thinking

# Explaining p-values with puppies

You’ll find p-values lurking all over data science (and all the rest of science, for that matter). If you took STAT101, the explanation you probably heard runs something like this: *A p-value is the probability of observing a **statistic** at least as extreme as ours, conditional on the null hypothesis.* No wonder that didn’t stick! Let’s try it with puppies instead…

*(If you prefer to learn in video form, link’s at the bottom.)*

# Setting the (crime) scene

Imagine coming home and discovering *this* in your kitchen:

Let’s put this suspect on trial for the crime of sticking his head in the garbage bin!

We’ll work with a** **** default action** of

*not yelling at Fido*and a corresponding

**of “**

*null hypothesis**Fido is innocent.”*If you’re new to these concepts or unsure how to set up hypotheses, read this.

# Describe the null world

The first step in calculating a p-value is to take a deep breath and say, *“Okay, Fido, I’m going to imagine that you are **innocent**.”*

What we’re doing here is visualizing the null hypothesis world and figuring out how things work there so we can make a toy model of it. That’s what the calculations are all about.

# Does this evidence surprise you?

You’ve just finished imagining how your world works if **Fido never goes after garbage**.

“How surprising would this evidence be if Fido’s innocent?”

It’s time to ask the big question: How likely is this world to cough up something at least as damning the evidence we saw in real life?

When you answer that question with a number, that number *is* the p-value itself!

# A verdict based on surprise

If you live with an eight year old (that special sort of mischief), it’s plausible that an innocent Fido gets decorated with new collar (made of bin lid) every now and then. Your p-value might not be such a small number. Since the evidence then looks plausible under Fido’s innocence, you’ll see no reason to change your mind about calling Fido a good dog.

If you live alone with Fido, you could still imagine a way to get evidence at least this damning. Maybe your crazy neighbor climbed in through your window, ran all around your apartment, put the bin lid on the dog’s head …and jumped out the window again!

This is possible. It’s just not very *probable*. When you squint at that probability, you find the p-value so teensy tiny that continuing to entertain the dog’s innocence makes *you* feel ridiculous. So you say, “I reject the null hypothesis. I find you guilty. BAD DOG, FIDO!”

A p-value doesn’t *prove* anything. It’s simply a way to use surprise as a basis for making a reasonable decision.

It’s possible that you came to the wrong conclusion — uncertainty is a jerk that way. You won’t know whether you got it right until it’s too late. That’s life. We can only strive to do our best in an uncertain world. The p-value is simply a way to use surprise as a basis for making a reasonable decision. If you start expecting it to do something else for you, you’ll deserve all the scorn the internet loves to throw at p-value abusers.

# Summary

A p-value asks, *“If I’m living in a world where I should be taking my **default action**, how unsurprising is my evidence?”* The higher the p-value, the less ridiculous I’ll feel about persisting with my planned action. If the p-value is low enough, I’ll change my mind and do something else.

*If you want to learn about the **p-value controversy** and read my take on all the emotions the p-value causes, check out the next article in this series: **Why are p-values like needles?*

# Thanks for reading! How about an AI course?

If you had fun here and you’re looking for an applied AI course designed to be fun for beginners and experts alike, here’s one I made for your amusement: