T-Test Calculator
Perform one-sample, two-sample, and paired t-tests online. Calculate t-statistic, p-value, and confidence intervals with step-by-step statistical analysis.
About This Calculator
The t-test is one of the most widely used statistical tests for comparing means. It helps determine whether there's a statistically significant difference between groups or whether a sample mean differs from a hypothesized value. This calculator performs one-sample, two-sample, and paired t-tests with complete statistical output.
What is a t-test? The t-test uses the t-distribution to test hypotheses about population means when the population standard deviation is unknown and the sample size is relatively small. It was developed by William Sealy Gosset (publishing under the pseudonym "Student") while working at Guinness Brewery.
Types of t-tests:
- One-sample t-test: Compare a sample mean to a known or hypothesized population mean
- Two-sample t-test: Compare means of two independent groups
- Paired t-test: Compare means of matched pairs (before/after, matched subjects)
When to use the t-test:
- Data is approximately normally distributed
- Data is continuous (interval/ratio scale)
- Sample size is sufficient (generally n ≥ 15-30)
- Observations are independent (except for paired test)
Key outputs:
- t-statistic: Measures how many standard errors the sample mean is from the hypothesized mean
- p-value: Probability of observing this result if null hypothesis is true
- Confidence interval: Range likely containing the true parameter
For comparing more than two groups, see our ANOVA Calculator. For correlation analysis, try our Correlation Calculator.
How to Use the T-Test Calculator
1. Select the appropriate t-test type for your data.
2. Choose the tail type (two-tailed for ≠, one-tailed for < or >).
3. Set your significance level (α), typically 0.05.
4. Enter the required statistics for your chosen test.
5. Review the t-statistic and p-value.
6. Check if the result is statistically significant.
7. Examine the confidence interval.
8. Consider the effect size (Cohen's d) for practical significance.
9. Interpret results in context of your research question.
10. Report all relevant statistics in your conclusions.
One-Sample t-test
Compare a sample mean to a known or hypothesized population value.
When to Use
Use a one-sample t-test when you want to determine if a sample mean differs significantly from a specific value.
Examples:
- Is the average height of students different from the national average?
- Does the mean test score differ from 100?
- Is the average processing time different from the target?
The Formula
t = (x̄ - μ₀) / (s / √n)
Where:
- x̄ = sample mean
- μ₀ = hypothesized population mean
- s = sample standard deviation
- n = sample size
- df = n - 1
Hypotheses
Two-tailed:
- H₀: μ = μ₀
- H₁: μ ≠ μ₀
One-tailed (right):
- H₀: μ ≤ μ₀
- H₁: μ > μ₀
Example
Research question: Is the average IQ of a group different from 100?
Data:
- Sample mean: 105.3
- Sample SD: 12.4
- Sample size: 36
Calculation:
- SE = 12.4 / √36 = 2.067
- t = (105.3 - 100) / 2.067 = 2.565
- df = 35
- p-value ≈ 0.015
Conclusion: At α = 0.05, we reject H₀ and conclude the mean IQ is significantly different from 100.
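The calculation above can be reproduced from summary statistics alone. A minimal Python sketch (the function name and the hard-coded critical value are illustrative; a p-value would normally come from the t-distribution CDF, e.g. `scipy.stats.t.sf`, omitted here to keep the sketch dependency-free):

```python
import math

def one_sample_t(x_bar, mu0, s, n):
    """t-statistic and degrees of freedom for a one-sample t-test."""
    se = s / math.sqrt(n)              # standard error of the mean
    return (x_bar - mu0) / se, n - 1

t, df = one_sample_t(x_bar=105.3, mu0=100, s=12.4, n=36)
print(round(t, 3), df)                 # 2.565 35, matching the worked example

# The two-tailed critical value for df = 35 at alpha = 0.05 is about 2.030,
# so |t| = 2.565 exceeds it and H0 is rejected.
```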
Two-Sample t-test
Compare means of two independent groups.
When to Use
Use a two-sample t-test when comparing means of two separate, unrelated groups.
Examples:
- Compare test scores between two teaching methods
- Compare recovery times between two treatments
- Compare salaries between two departments
Two Versions
Pooled (Equal Variance)
Assumes both groups have equal population variances.
Formula: t = (x̄₁ - x̄₂) / (sp × √(1/n₁ + 1/n₂))
Where sp = pooled standard deviation
Welch's (Unequal Variance)
Does not assume equal variances - generally more robust.
Formula: t = (x̄₁ - x̄₂) / √(s₁²/n₁ + s₂²/n₂)
Choosing Between Them
Use Welch's test when:
- Sample sizes are unequal
- Variances appear different
- You're uncertain about equal variance assumption
Rule of thumb: If larger variance / smaller variance > 2, use Welch's
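The rule of thumb can be encoded directly. A tiny sketch (the helper name and threshold default are illustrative):

```python
def prefer_welch(s1, s2, ratio_threshold=2.0):
    """Rule of thumb: larger variance / smaller variance > 2 -> use Welch's."""
    v1, v2 = s1 ** 2, s2 ** 2          # variances from the two sample SDs
    return max(v1, v2) / min(v1, v2) > ratio_threshold

print(prefer_welch(4.5, 5.2))          # False: 27.04 / 20.25 is about 1.34
print(prefer_welch(3.0, 5.0))          # True: 25 / 9 is about 2.78
```

In practice many analysts skip the check and default to Welch's test, as noted above.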
Example
Research question: Do two medications have different effects on blood pressure?
Drug A: n=25, mean=-8.2 mmHg, SD=4.5
Drug B: n=28, mean=-5.1 mmHg, SD=5.2
Using Welch's test:
- SE = √(4.5²/25 + 5.2²/28) = 1.33
- t = (-8.2 - (-5.1)) / 1.33 = -2.33
- df ≈ 51.0
- p-value ≈ 0.024
Conclusion: Drug A shows significantly greater blood pressure reduction.
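The Welch computation, including the Welch–Satterthwaite degrees of freedom, can be sketched as follows (the function name is illustrative; SciPy offers the equivalent `scipy.stats.ttest_ind_from_stats(..., equal_var=False)`, which also returns the p-value):

```python
import math

def welch_t(m1, s1, n1, m2, s2, n2):
    """Welch's t-statistic and Welch-Satterthwaite degrees of freedom."""
    v1, v2 = s1 ** 2 / n1, s2 ** 2 / n2      # per-group variance of the mean
    t = (m1 - m2) / math.sqrt(v1 + v2)
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return t, df

t, df = welch_t(-8.2, 4.5, 25, -5.1, 5.2, 28)
print(round(t, 2), round(df, 1))             # -2.33 51.0
```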
Paired t-test
Compare means of matched or paired observations.
When to Use
Use a paired t-test when observations come in pairs:
- Before/after measurements on same subjects
- Matched pairs (twins, matched controls)
- Two measurements on each subject
The Concept
Instead of comparing two groups, analyze the differences within each pair.
Why it's more powerful:
- Controls for individual variation
- Each subject serves as their own control
- Reduces variability in the comparison
The Formula
t = d̄ / (sd / √n)
Where:
- d̄ = mean of differences
- sd = standard deviation of differences
- n = number of pairs
- df = n - 1
Calculating Differences
| Subject | Before | After | Difference (d) |
|---|---|---|---|
| 1 | 150 | 142 | -8 |
| 2 | 165 | 158 | -7 |
| 3 | 145 | 140 | -5 |
| ... | ... | ... | ... |
Mean difference: d̄
SD of differences: sd
Example
Research question: Does a weight loss program reduce weight?
Data: 20 participants, before and after weights
- Mean difference: -3.5 kg
- SD of differences: 2.8 kg
Calculation:
- SE = 2.8 / √20 = 0.626
- t = -3.5 / 0.626 = -5.59
- df = 19
- p-value < 0.001
Conclusion: The program produces significant weight loss.
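Using the three subjects from the before/after table earlier in this section, the pair-to-difference reduction can be sketched with the standard library alone (function name is illustrative):

```python
import math
import statistics

def paired_t(before, after):
    """Paired t-test: reduce each pair to a difference, then run a
    one-sample t-test on the differences against zero."""
    d = [a - b for a, b in zip(after, before)]
    d_bar = statistics.mean(d)                 # mean difference
    s_d = statistics.stdev(d)                  # sample SD of differences ("sd" above)
    t = d_bar / (s_d / math.sqrt(len(d)))
    return t, len(d) - 1

# Subjects 1-3 from the table: before 150/165/145, after 142/158/140
t, df = paired_t([150, 165, 145], [142, 158, 140])
print(round(t, 2), df)                         # -7.56 2
```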
Understanding p-values
Interpreting the probability value from t-tests.
What p-value Means
The p-value is the probability of obtaining results at least as extreme as observed, assuming the null hypothesis is true.
NOT:
- The probability H₀ is true
- The probability the result is due to chance
- The effect size
Interpreting p-values
| p-value | Interpretation |
|---|---|
| p < 0.001 | Very strong evidence against H₀ |
| p < 0.01 | Strong evidence against H₀ |
| p < 0.05 | Moderate evidence against H₀ |
| p < 0.10 | Weak evidence against H₀ |
| p ≥ 0.10 | Little evidence against H₀ |
Common Significance Levels (α)
- α = 0.05: Most common, good balance
- α = 0.01: More conservative, fewer false positives
- α = 0.10: More liberal, fewer false negatives
One-tailed vs. Two-tailed
Two-tailed (default): Tests for any difference (≠)
- Use when direction of difference is unknown
One-tailed: Tests for specific direction (< or >)
- Use only when direction is predicted in advance
- p-value is half the two-tailed value (when the observed effect is in the predicted direction)
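The halving relationship can be captured in a hypothetical converter that also handles the wrong-direction case:

```python
def one_tailed_p(two_tailed_p, effect_in_predicted_direction):
    """Convert a two-tailed p-value to a one-tailed one.

    Halve it when the observed effect points the predicted way; otherwise
    the one-tailed p-value is the complement of that half.
    """
    half = two_tailed_p / 2
    return half if effect_in_predicted_direction else 1 - half

print(one_tailed_p(0.04, True))    # 0.02
print(one_tailed_p(0.04, False))   # 0.98
```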
Statistical vs. Practical Significance
Important: A statistically significant result may not be practically meaningful!
Example:
- Treatment reduces pain by 0.5 points (p = 0.01)
- Statistically significant? Yes
- Practically meaningful? Maybe not (small effect)
Always report effect size alongside p-value!
Effect Size: Cohen's d
Measuring the magnitude of the difference.
What is Effect Size?
Effect size quantifies the magnitude of a result independent of sample size. Cohen's d is the most common measure for mean differences.
Formula
Cohen's d = (Mean₁ - Mean₂) / Pooled SD
For one-sample: d = (x̄ - μ₀) / s
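Both variants take only a few lines; here the two-sample version is checked against the worked report in the Reporting Standards subsection (M = 45.2 vs 38.7) and the one-sample version against the IQ example (function names are illustrative):

```python
import math

def cohens_d_two_sample(m1, s1, n1, m2, s2, n2):
    """Cohen's d for two independent groups, using the pooled SD."""
    pooled_var = ((n1 - 1) * s1 ** 2 + (n2 - 1) * s2 ** 2) / (n1 + n2 - 2)
    return (m1 - m2) / math.sqrt(pooled_var)

def cohens_d_one_sample(x_bar, mu0, s):
    """Cohen's d for a one-sample test."""
    return (x_bar - mu0) / s

print(round(cohens_d_two_sample(45.2, 8.5, 25, 38.7, 9.1, 28), 2))   # 0.74
print(round(cohens_d_one_sample(105.3, 100, 12.4), 2))               # 0.43
```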
Interpretation
| Cohen's d | Interpretation | Example |
|---|---|---|
| 0.2 | Small | Barely noticeable |
| 0.5 | Medium | Noticeable |
| 0.8 | Large | Obvious |
| 1.2 | Very large | Substantial |
| 2.0 | Huge | Massive |
Why Effect Size Matters
Scenario 1: Large sample, small effect
- n = 10,000
- Difference = 1 point
- p < 0.001 (significant!)
- d = 0.1 (trivial effect)
Scenario 2: Small sample, large effect
- n = 20
- Difference = 15 points
- p = 0.06 (not significant)
- d = 1.5 (large effect)
Reporting Standards
Always report:
- Descriptive statistics (means, SDs, n)
- Test statistic (t)
- Degrees of freedom
- p-value
- Effect size (Cohen's d)
- Confidence interval
Example report: "The treatment group (M = 45.2, SD = 8.5, n = 25) scored significantly higher than the control group (M = 38.7, SD = 9.1, n = 28), t(51) = 2.78, p = .008, d = 0.74, 95% CI [1.77, 11.23]."
Assumptions and Violations
Understanding when t-tests are valid.
Key Assumptions
1. Normality
Data should be approximately normally distributed.
Checking:
- Histograms/Q-Q plots
- Shapiro-Wilk test
Violation handling:
- t-tests are robust to mild violations
- Use non-parametric tests (Mann-Whitney, Wilcoxon) for severe violations
- Large samples (n > 30) are generally fine
2. Independence
Observations should be independent.
Exceptions:
- Paired t-test handles dependence within pairs
- Two-sample: groups must be independent
Violation handling:
- Use paired test for matched data
- Consider mixed models for complex dependencies
3. Equal Variance (Two-sample)
Groups should have similar variances.
Checking:
- Levene's test
- Compare SD ratio
Violation handling:
- Use Welch's t-test
- Generally recommended as default
Sample Size Considerations
| Situation | Minimum n per group |
|---|---|
| Normal data, equal n | 10-15 |
| Slightly non-normal | 20-25 |
| Unequal groups | 15-20 in smaller group |
| Very non-normal | Use non-parametric |
Power Analysis
Before conducting study, determine needed sample size:
For α = 0.05, power = 0.80:
| Effect Size (d) | n per group |
|---|---|
| Small (0.2) | ~400 |
| Medium (0.5) | ~65 |
| Large (0.8) | ~25 |
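The table's values can be approximated without a statistics package via the normal approximation n ≈ 2((z_α/2 + z_β)/d)²; a sketch, noting that it slightly undershoots exact t-based calculations (such as statsmodels' `TTestIndPower`), which add a subject or two per group:

```python
import math

Z_ALPHA_2 = 1.959964   # inverse normal CDF at 0.975, for two-tailed alpha = 0.05
Z_BETA = 0.841621      # inverse normal CDF at 0.80, for power = 0.80

def approx_n_per_group(d):
    """Normal-approximation sample size per group for a two-sample t-test."""
    return math.ceil(2 * ((Z_ALPHA_2 + Z_BETA) / d) ** 2)

for d in (0.2, 0.5, 0.8):
    print(d, approx_n_per_group(d))   # 393, 63, 25 -- close to the table
```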
Pro Tips
- 💡 Always visualize your data before running statistical tests.
- 💡 Report effect size (Cohen's d) alongside p-values for complete interpretation.
- 💡 Use Welch's t-test as default for two-sample comparisons.
- 💡 Check assumptions: normality, independence, and equal variance when required.
- 💡 One-tailed tests require a priori justification - don't choose based on data.
- 💡 Large sample sizes can make trivial differences statistically significant.
- 💡 Confidence intervals provide more information than p-values alone.
- 💡 Use paired t-test when you have matched or repeated measurements.
- 💡 Statistical significance doesn't imply practical importance.
- 💡 Plan sample size before collecting data using power analysis.
- 💡 Report complete statistics: means, SDs, n, t, df, p, d, and CI.
- 💡 Consider multiple testing corrections when running many t-tests.
Frequently Asked Questions
What's the difference between a one-tailed and a two-tailed test?
A two-tailed test checks if the mean differs in either direction (≠), while a one-tailed test checks only one direction (< or >). Two-tailed is the default and more conservative. Use one-tailed only when you have a strong theoretical reason to predict the direction of the effect before seeing the data.

