Statistics

P-value Calculator

Estimate a p-value from a known test statistic, distribution choice, and tail direction.

Preparing P-value Calculator

Please wait ...

Input

Choose the test type, enter the test statistic, add degrees of freedom if needed, and pick the tail that matches your hypothesis.

Input summary

Your calculator summary shows here.

Step-by-Step Calculation

Step-by-step work appears here

Calculate first and the full working will be placed here.

Hypothesis test probability

Interpreting a p-value as evidence under a specific null hypothesis

A p-value is tied to the null model

A p-value measures how unusual the observed test statistic would be if the null hypothesis were true. It is not the probability that the null hypothesis is true. It is also not the probability that the result happened by random chance in a casual sense. The calculation only has meaning after the null hypothesis, test statistic, distribution, and tail direction are chosen.

This calculator is useful when those inputs are already known. If the test statistic has not been calculated yet, finish that step first. A p-value without the correct statistic or tail choice can look precise while answering the wrong test.

Tail direction changes the area being measured

A left-tailed test looks for unusually small values. A right-tailed test looks for unusually large values. A two-tailed test looks for values unusually far from the null expectation in either direction. The same test statistic can produce different p-values under different tail choices.

The tail should come from the alternative hypothesis, not from the data after seeing the result. Choosing the tail afterward makes the test easier to pass and weakens the interpretation.

Small p-values suggest tension with the null hypothesis

A small p-value means the observed statistic is unusual under the null model. Many courses compare it with a significance level such as 0.05. If the p-value is below that level, the result is called statistically significant for that test.

Statistical significance does not automatically mean practical importance. A tiny effect can be significant in a huge sample, while an important-looking effect may fail to reach significance in a small sample.

The significance level should be chosen before calculation

The alpha level is the cutoff used for the decision. Common choices include 0.05, 0.01, and 0.10. The correct choice depends on the context, the cost of a false positive, and the standards of the class or field.

Do not move the cutoff after seeing the p-value. A result of 0.047 compared with 0.05 and a result of 0.053 compared with 0.05 are close numerically, but the preselected decision rule treats them differently.

Z tests and t tests use different reference curves

Some p-values are based on the standard normal curve. Others use a t distribution with degrees of freedom. The calculator input must match the test that produced the statistic. Entering a t statistic as if it were a z statistic can produce the wrong area.

For standard-normal work, the Z-Score Calculator can help connect a raw value, mean, and standard deviation to the z scale. After the statistic is known, this page focuses on the tail probability.

Degrees of freedom are not decoration

When a t distribution is used, degrees of freedom control the curve shape. Smaller degrees of freedom create heavier tails, which can change the p-value noticeably. Copying the wrong degrees of freedom is a common reason two answers disagree.

Degrees of freedom usually come from the sample size and test design. For a one-sample t test, it is often n - 1. Other tests use different formulas, so the test type must be known.

P-values and confidence intervals tell related stories

A hypothesis test and a confidence interval often agree when they are built from the same assumptions. If a 95 percent confidence interval excludes the null value, a two-sided test at alpha 0.05 often rejects that null value. The two tools are not identical, but they are connected.

The Confidence Interval Calculator is useful when the question asks for a plausible range instead of a reject-or-fail decision. A p-value gives evidence against a point claim; an interval shows a range of compatible values.

Sample size affects how sensitive the test becomes

Larger samples can detect smaller differences because estimates become more stable. Smaller samples need stronger evidence to produce a small p-value. That is why the sample size should always be reported with a hypothesis test result.

If the question is about planning data collection before a study, the Sample Size Calculator belongs earlier in the workflow. P-value work happens after data and a test statistic exist.

Report the result with the test context

A useful p-value statement includes the test type, test statistic, tail choice, p-value, and conclusion against the chosen alpha level. Writing only "p = 0.03" leaves out too much information. Readers need to know what model produced that value and what decision rule was used.

Round p-values carefully. Very small values are often written as p < 0.001 instead of 0.000. Avoid saying a null hypothesis is proven false; hypothesis testing supports decisions under uncertainty rather than absolute proof.