P-Value Calculator
Calculate p-values from test statistics (z, t, chi-square, F). Determine statistical significance for hypothesis testing with left, right, and two-tailed options.
Common Test Statistics
Z-test: For large samples (n > 30) or known population σ
T-test: For small samples with unknown σ. df = n-1 (one sample) or n₁+n₂-2 (two samples)
Chi-square: For categorical data. df = (rows-1)(cols-1) for independence test
F-test: For comparing variances or ANOVA. df₁ = k-1, df₂ = N-k
About This Calculator
The p-value is the probability of obtaining results at least as extreme as the observed results, assuming the null hypothesis is true. It's the cornerstone of statistical hypothesis testing, helping researchers determine whether their findings are statistically significant. This calculator computes p-values from common test statistics.
What is a P-Value? A p-value quantifies the strength of evidence against the null hypothesis. Small p-values (typically < 0.05) suggest the observed data would be unlikely if the null hypothesis were true, leading us to reject it. Large p-values indicate the data is consistent with the null hypothesis.
Why P-Values Matter:
- Foundation of hypothesis testing in science
- Required for publishing research findings
- Guides decision-making in clinical trials
- Essential for quality control and A/B testing
Key Concepts:
- Null Hypothesis (H₀): The default assumption (usually "no effect")
- Alternative Hypothesis (H₁): The claim you're seeking evidence for
- Significance Level (α): Your threshold (usually 0.05)
- Test Statistic: Calculated value (z, t, χ², F)
This calculator supports z-tests, t-tests, chi-square tests, and F-tests. For specific tests, see our T-Test Calculator, Chi-Square Calculator, and ANOVA Calculator.
How to Use the P-Value Calculator
1. Select the test type matching your statistical test (z, t, χ², F).
2. For z and t tests, choose the tail direction (two-tailed, left, right).
3. Enter your calculated test statistic.
4. For t-tests, enter degrees of freedom (df = n - 1).
5. For chi-square tests, enter degrees of freedom.
6. For F-tests, enter both numerator and denominator df.
7. Select your significance level (α), typically 0.05.
8. Review the calculated p-value.
9. Check whether to reject or fail to reject H₀.
10. Consider practical significance alongside statistical significance.
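The steps above can be sketched in a few lines of Python using the standard library's `NormalDist` for the z case (function and variable names here are illustrative, not the calculator's actual code):

```python
from statistics import NormalDist

def z_p_value(z: float, tail: str = "two") -> float:
    """P-value for a z statistic under the standard normal distribution."""
    cdf = NormalDist().cdf
    if tail == "left":
        return cdf(z)                 # P(Z <= z)
    if tail == "right":
        return 1 - cdf(z)             # P(Z >= z)
    return 2 * (1 - cdf(abs(z)))      # two-tailed: P(|Z| >= |z|)

alpha = 0.05
p = z_p_value(1.96, tail="two")       # ~0.05, the boundary at alpha = 0.05
decision = "reject H0" if p <= alpha else "fail to reject H0"
```

The same structure extends to t, χ², and F statistics once their distribution functions are available (e.g., via SciPy).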
Understanding P-Values
The p-value is often misunderstood. Here's what it actually means.
Definition
P-value: The probability of observing data at least as extreme as what was observed, IF the null hypothesis is true.
It is NOT:
- The probability that H₀ is true
- The probability that H₁ is false
- The probability the results occurred by chance
Interpretation
| P-value | Evidence Against H₀ |
|---|---|
| > 0.10 | Weak or none |
| 0.05 - 0.10 | Marginal |
| 0.01 - 0.05 | Moderate |
| 0.001 - 0.01 | Strong |
| < 0.001 | Very strong |
Decision Rule
- If p ≤ α: Reject H₀ (statistically significant)
- If p > α: Fail to reject H₀ (not significant)
Example
Testing if a coin is fair (H₀: p = 0.5):
- You flip 100 times, get 60 heads
- Calculate test statistic, find p = 0.046
- At α = 0.05: p < α, so reject H₀
- Conclusion: Evidence suggests the coin is biased
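The coin example can be verified with a one-proportion z-test; this stdlib sketch reproduces the p ≈ 0.046 figure:

```python
from math import sqrt
from statistics import NormalDist

n, heads, p0 = 100, 60, 0.5
p_hat = heads / n                          # 0.60 observed proportion
se = sqrt(p0 * (1 - p0) / n)               # 0.05 standard error under H0
z = (p_hat - p0) / se                      # (0.60 - 0.50) / 0.05 = 2.0
p_value = 2 * (1 - NormalDist().cdf(z))    # two-tailed, ~0.0455
```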
One-Tailed vs. Two-Tailed Tests
The direction of your test affects the p-value calculation.
Two-Tailed Test
Used when: You want to detect a difference in either direction
H₀: μ = μ₀
H₁: μ ≠ μ₀
P-value = 2 × P(Z ≥ |z|)
Example: Testing if a new drug changes blood pressure (could increase OR decrease)
Right-Tailed Test
Used when: You want to detect an increase only
H₀: μ ≤ μ₀
H₁: μ > μ₀
P-value = P(Z ≥ z)
Example: Testing if a new teaching method improves test scores
Left-Tailed Test
Used when: You want to detect a decrease only
H₀: μ ≥ μ₀
H₁: μ < μ₀
P-value = P(Z ≤ z)
Example: Testing if a new process reduces defect rate
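Tail direction can change the conclusion for the same statistic. With an illustrative z = 1.8, the right-tailed test is significant at α = 0.05 while the two-tailed test is not:

```python
from statistics import NormalDist

cdf = NormalDist().cdf
z = 1.8                            # example statistic
p_right = 1 - cdf(z)               # ~0.036: significant at alpha = 0.05
p_two = 2 * (1 - cdf(abs(z)))      # ~0.072: not significant at alpha = 0.05
```

This is exactly why the test direction must be chosen before seeing the data.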
Choosing the Right Test
| Research Question | Test Type |
|---|---|
| "Is there a difference?" | Two-tailed |
| "Is it greater than?" | Right-tailed |
| "Is it less than?" | Left-tailed |
Important: Choose your test BEFORE looking at the data!
Common Test Statistics
Different situations require different test statistics.
Z-Test (Normal Distribution)
Use when:
- Large sample (n > 30)
- Population standard deviation known
- Testing proportions with large n
Formula: z = (x̄ - μ₀) / (σ / √n)
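A worked instance of the z formula with hypothetical numbers (x̄ = 105, μ₀ = 100, σ = 15, n = 36):

```python
from math import sqrt
from statistics import NormalDist

x_bar, mu0, sigma, n = 105.0, 100.0, 15.0, 36   # hypothetical sample
z = (x_bar - mu0) / (sigma / sqrt(n))           # 5 / 2.5 = 2.0
p_two = 2 * (1 - NormalDist().cdf(abs(z)))      # two-tailed, ~0.0455
```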
T-Test (Student's t Distribution)
Use when:
- Population σ unknown (estimated by the sample s)
- Sample is small (n ≤ 30); for larger n the t and z results converge
- Data approximately normal
Formula: t = (x̄ - μ₀) / (s / √n)
Degrees of freedom:
- One sample: df = n - 1
- Two sample: df = n₁ + n₂ - 2 (pooled)
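The t statistic itself needs only arithmetic; converting it to a p-value requires the t distribution's CDF, which the standard library lacks. A sketch with hypothetical numbers (x̄ = 105, μ₀ = 100, s = 15, n = 16):

```python
from math import sqrt

x_bar, mu0, s, n = 105.0, 100.0, 15.0, 16   # hypothetical small sample
t = (x_bar - mu0) / (s / sqrt(n))           # 5 / 3.75 ~= 1.333
df = n - 1                                  # 15
# With SciPy available, the two-tailed p-value would be
# scipy.stats.t.sf(abs(t), df) * 2.
```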
Chi-Square Test (χ²)
Use when:
- Testing categorical data
- Goodness of fit
- Test of independence
Formula: χ² = Σ(O - E)² / E
Degrees of freedom:
- Goodness of fit: df = k - 1
- Independence: df = (r-1)(c-1)
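The χ² formula applied to a hypothetical goodness-of-fit question: is a six-sided die fair after 60 rolls?

```python
# Observed counts per face vs. expected counts under H0 (fair die)
observed = [8, 12, 10, 9, 11, 10]
expected = [10] * 6                     # 60 rolls, 10 per face under H0
chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
df = len(observed) - 1                  # goodness of fit: k - 1 = 5
```

Here χ² = 1.0 with df = 5, far below the critical value, so the counts are consistent with a fair die.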
F-Test
Use when:
- Comparing variances
- ANOVA (comparing means of 3+ groups)
Formula: F = s₁² / s₂² or F = MSB / MSW
Degrees of freedom:
- df₁ = k - 1 (numerator)
- df₂ = N - k (denominator)
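A minimal sketch of the variance-ratio form F = s₁²/s₂², using made-up samples; note that for this two-sample form the degrees of freedom are n₁ - 1 and n₂ - 1, while the k - 1 and N - k values above apply to ANOVA's MSB/MSW form:

```python
from statistics import variance

# Hypothetical samples; by convention the larger variance goes on top
sample1 = [2, 4, 6, 8, 10]
sample2 = [1, 2, 3, 4, 5]
f = variance(sample1) / variance(sample2)    # 10.0 / 2.5 = 4.0
df1, df2 = len(sample1) - 1, len(sample2) - 1
```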
Type I and Type II Errors
Understanding the two types of errors in hypothesis testing.
Type I Error (False Positive)
Definition: Rejecting H₀ when it's actually true
Probability: α (significance level)
Example: Concluding a drug works when it doesn't
Consequences:
- Wasted resources on ineffective treatments
- Publishing false findings
- Policy decisions based on false effects
Type II Error (False Negative)
Definition: Failing to reject H₀ when it's actually false
Probability: β
Example: Missing a real drug effect
Consequences:
- Abandoning effective treatments
- Missing important discoveries
- Underestimating real effects
The Trade-off
| Decision | H₀ True | H₀ False |
|---|---|---|
| Reject H₀ | Type I (α) | Correct ✓ |
| Keep H₀ | Correct ✓ | Type II (β) |
Power = 1 - β (probability of correctly rejecting false H₀)
Balancing Errors
- Decreasing α increases β (and vice versa)
- α = 0.05 is conventional, not magical
- Critical decisions may need α = 0.01 or 0.001
- Increase sample size to reduce both errors
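The meaning of α as a long-run false-positive rate can be checked by simulation: when H₀ is true, a z-test at α = 0.05 should reject in roughly 5% of repeated experiments (seed and sample sizes here are arbitrary):

```python
import random
from math import sqrt
from statistics import mean, NormalDist

random.seed(42)                              # arbitrary seed for reproducibility
alpha, n, trials = 0.05, 30, 2000
crit = NormalDist().inv_cdf(1 - alpha / 2)   # ~1.96 two-tailed cutoff

rejections = 0
for _ in range(trials):
    sample = [random.gauss(0, 1) for _ in range(n)]   # H0 (mu = 0) is true
    z = mean(sample) / (1 / sqrt(n))
    if abs(z) > crit:
        rejections += 1

type_i_rate = rejections / trials            # close to alpha
```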
Common Misconceptions
P-values are frequently misinterpreted, even by researchers.
Misconception 1: "P = probability H₀ is true"
Wrong: P-value is NOT P(H₀ | data)
Right: P-value is P(data | H₀) - probability of data given H₀
This is a conditional probability inversion error.
Misconception 2: "p > 0.05 means no effect"
Wrong: Absence of evidence ≠ evidence of absence
Right: The study may lack power to detect an effect. Non-significant doesn't mean "no effect."
Misconception 3: "p = 0.05 means 5% chance results are due to chance"
Wrong: This reverses the conditional probability
Right: If H₀ is true, there's a 5% chance of seeing data this extreme or more
Misconception 4: "Smaller p = larger effect"
Wrong: P-value doesn't measure effect size
Right: A tiny effect with huge sample can have tiny p. Always report effect size.
Misconception 5: "p = 0.049 vs p = 0.051 are meaningfully different"
Wrong: Treating α = 0.05 as a cliff
Right: These are essentially identical evidence levels. Don't dichotomize.
Best Practices
- Report exact p-values, not just "p < 0.05"
- Include effect sizes and confidence intervals
- Consider practical significance
- Pre-register your analysis plan
- Replicate important findings
Beyond P-Values: Modern Statistical Practice
P-values are just one piece of the statistical puzzle.
Confidence Intervals
What they provide:
- Range of plausible values
- Measure of precision
- Effect size and uncertainty combined
Example: Mean difference = 5.2, 95% CI [2.1, 8.3]
- Effect size: 5.2
- Uncertainty: Could be as low as 2.1 or as high as 8.3
- Excludes 0: Significant at α = 0.05
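The interval above can be reconstructed from the point estimate and a standard error of about 1.58 (a value assumed here to reproduce the quoted CI):

```python
from statistics import NormalDist

mean_diff = 5.2
se = 1.58                                # assumed standard error
z_crit = NormalDist().inv_cdf(0.975)     # ~1.96 for a 95% CI
lower = mean_diff - z_crit * se          # ~2.1
upper = mean_diff + z_crit * se          # ~8.3
significant = lower > 0 or upper < 0     # CI excludes 0 -> p < 0.05
```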
Effect Size
Common measures:
- Cohen's d: (M₁ - M₂) / SD (small: 0.2, medium: 0.5, large: 0.8)
- r: Correlation coefficient
- η²: Proportion of variance explained (ANOVA)
- Odds ratio: For binary outcomes
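Cohen's d for two independent groups uses the pooled standard deviation; a sketch with hypothetical group statistics:

```python
from math import sqrt

# Hypothetical two-group comparison
m1, m2 = 80.0, 75.0          # group means
s1, s2 = 10.0, 10.0          # group standard deviations
n1, n2 = 25, 25              # group sizes

pooled_sd = sqrt(((n1 - 1) * s1**2 + (n2 - 1) * s2**2) / (n1 + n2 - 2))
d = (m1 - m2) / pooled_sd    # 5 / 10 = 0.5, a "medium" effect
```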
Bayesian Approaches
Instead of p-values, Bayesian analysis provides:
- Posterior probability of hypothesis
- Bayes Factor (evidence ratio)
- Direct probability statements about parameters
Practical Recommendations
- Report p-values AND effect sizes AND confidence intervals
- Consider practical significance: Is the effect large enough to matter?
- Be transparent: Pre-register, report all analyses
- Replicate: One significant p-value isn't enough
- Context matters: Medical decisions need different standards than exploratory research
Pro Tips
- 💡Set your significance level (α) BEFORE analyzing data.
- 💡Report exact p-values, not just "p < 0.05" or "n.s."
- 💡Always include effect sizes alongside p-values.
- 💡Use two-tailed tests unless you have strong theoretical justification.
- 💡Non-significant doesn't mean "no effect" - consider statistical power.
- 💡Very small p-values don't guarantee large or meaningful effects.
- 💡Confidence intervals often convey more information than p-values alone.
- 💡Don't treat α = 0.05 as a cliff - p = 0.049 and p = 0.051 are similar.
- 💡Pre-register your hypotheses and analysis plan when possible.
- 💡Replicate findings - one significant result isn't enough.
- 💡Consider practical significance, not just statistical significance.
- 💡Report degrees of freedom with t, chi-square, and F statistics.
Frequently Asked Questions
What does a p-value of 0.05 actually mean?
If the null hypothesis is true (no real effect), there's a 5% probability of observing data as extreme as or more extreme than what you found. It does NOT mean there's a 5% chance H₀ is true, or a 95% chance your finding is real. It's the probability of the data given H₀, not the probability of H₀ given the data.

