Probability Distributions

Definition

A probability distribution is a mathematical function that describes the probability of different possible values of a variable. Probability distributions are often depicted using graphs or probability tables.

Common probability distributions include the binomial distribution, Poisson distribution, and uniform distribution. Certain types of probability distributions are used in hypothesis testing, including the standard normal distributionStudent’s t distribution, and the F distribution. 1

Before delving into probability distributions, there are prerequisite concepts that have to be initially understood: Discrete Data, Continuous Data and Random Variable.

Discrete Data

The term “Discrete” can be defined as “separate”, “distinct”, or “detached”. Discrete data can only take on particular values. Each value is distinct and there’s no grey area in between. Discrete data can be numeric — like numbers of apples — but it can also be categorical — like red or blue, or male or female, or good or bad. There are two questions you can ask yourself when deciding if data is discrete: 2

  • Can you count it?
  • Can it be divided into smaller and smaller parts?

Continuous Data

If a data point can take on any value between two specified values, it is considered to be continuous. Continuous data is often measurements on a scale, such as height, weight, and temperature. Continuous data is not restricted to defined separate values, but can occupy any value over a continuous range. Between any two continuous data values, there may be an infinite number of others. Continuous data is always essentially numeric. 2

Random Variable

A random variable or stochastic variable can be conceptualized informally as a variable whose values depend on outcomes of a random phenomenon. It is a way to map outcomes of random processes to numbers. In other words we are quantifying outcomes by mapping them to a number. For example, we can define a random variable X in which X =1 if we toss a fair coin and it lands on heads and X = 0 if it lands on tails. In statistics, we deem phenomenon to be random if individual outcomes are uncertain but there in nonetheless a regular distribution of outcomes in a large number of repetitions. “Random” in statistics is not a synonym for “haphazard” but a description of a kind of order that emerges only in the long run. Random variables are a useful mechanism to assign probabilities to sample outcomes. Suppose that to each point of a sample space we assign a number. We then have a function defined on the sample space. This function, called a random variable, is usually denoted by a capital letter such as X or Y. 2

Who

The following share how probability is used in real-life situations on a regular basis. 10

  • Weather Forecasting
  • Sports Betting
  • Politics
  • Sales Forecasting
  • Health Insurance
  • Grocery Store Staffing
  • Natural Disasters
  • Traffic
  • Investing
  • Statistician
  • Cost Estimator
  • Insurance Underwriter
  • Market Research Analyst
  • Atmospheric Scientists

What

Normal

In a normal distribution, data is symmetrically distributed with no skew. When plotted on a graph, the data follows a bell shape, with most values clustering around a central region and tapering off as they go further away from the center.

Normal distributions are also called Gaussian distributions or bell curves because of their shape. 1

Standard Normal

The standard normal distribution, also called the z-distribution, is a special normal distribution where the mean is 0 and the standard deviation is 1.

Any normal distribution can be standardized by converting its values into z-scores. Z-scores tell you how many standard deviations from the mean each value lies.

Converting a normal distribution into a z-distribution allows you to calculate the probability of certain values occurring and to compare different data sets. 3

Normal distribution vs the standard normal distribution

All normal distributions, like the standard normal distribution, are unimodal and symmetrically distributed with a bell-shaped curve. However, a normal distribution can take on any value as its mean and standard deviation. In the standard normal distribution, the mean and standard deviation are always fixed.

Every normal distribution is a version of the standard normal distribution that’s been stretched or squeezed and moved horizontally right or left.

The mean determines where the curve is centered. Increasing the mean moves the curve right, while decreasing it moves the curve left.

The standard deviation stretches or squeezes the curve. A small standard deviation results in a narrow curve, while a large standard deviation leads to a wide curve. 3

CurvePosition or shape (relative to standard normal distribution)
A (M = 0, SD = 1)Standard normal distribution
B (M = 0, SD = 0.5)Squeezed, because SD < 1
C (M = 0, SD = 2)Stretched, because SD > 1
D (M = 1, SD = 1)Shifted right, because M > 0
E (M = –1, SD = 1)Shifted left, because M < 0

Chi-Squared Distribution

A chi-square (Χ2) distribution is a continuous probability distribution that is used in many hypothesis tests.

The shape of a chi-square distribution is determined by the parameter k. The graph below shows examples of chi-square distributions with different values of k. 5

Chi-Squared Table

The chi-square (Χ2) distribution table is a reference table that lists chi-square critical values. A chi-square critical value is a threshold for statistical significance for certain hypothesis tests and defines confidence intervals for certain parameters.

Chi-square critical values are calculated from chi-square distributions. They’re difficult to calculate by hand, which is why most people use a reference table or statistical software instead. 6

You will need a chi-square critical value if you want to: 6

Poisson Distribution

Poisson distribution is a discrete probability distribution. It gives the probability of an event happening a certain number of times (k) within a given interval of time or space.

The Poisson distribution has only one parameter, λ (lambda), which is the mean number of events. The graph below shows examples of Poisson distributions with different values of λ. 4

t Distribution

The t-distribution, also known as Student’s t-distribution, is a way of describing data that follow a bell curve when plotted on a graph, with the greatest number of observations close to the mean and fewer observations in the tails.

It is a type of normal distribution used for smaller sample sizes, where the variance in the data is unknown. 7

In statistics, the t-distribution is most often used to: 7

t Table

Student’s table is a reference table that lists critical values of t. Student’s table is also known as the table, t-distribution table, t-score table, t-value table, or t-test table.

A critical value of t defines the threshold for significance for certain statistical tests and the upper and lower bounds of confidence intervals for certain estimates. It is most commonly used when:

  • Testing whether two means are significantly different (two-sample tests)
  • Testing whether two variables are significantly related (linear regression or correlation)
  • Calculating confidence intervals (of means or regression coefficients)

The critical values of t are calculated from Student’s distribution. Student’s t distribution is the distribution of the test statistic t. The critical values of t are difficult to calculate by hand, which is why most people use a table or computer software instead. 9

The Pre-requisites Required to Using a T-table

  • The number of tails: We need to know whether our t-test is one-tailed or two-tailed because we will use the respective one-tail or two-tail row to mark the alpha level. The alpha levels are listed at top of the table (0.50, 0.25, 0.20, 0.15…for the one-tail and 1.00, 0.50, 0.40, 0.30…for the two-tails) and as you can see they vary based on whether the t-test is one-tail or two-tails.
  • Degrees of freedom: The degrees of freedom (df) indicate the number of independent values that can vary in an analysis without breaking any constraints. The degrees of freedom will either be explicitly mentioned in the problem statement or if it is not explicitly mentioned, all you have to do is subtract one from your sample size (n – 1) and what you get will be your df or degrees of freedom.
  • Alpha level: The alpha level ( α ), also known as the significance level is the probability of rejecting the null hypothesis when it is true. The common alpha levels for t-test are 0.01, 0.05 and 0.10

Once you have all three, all you have to do is pick the respective column for one-tail or two-tail from the table and map the intersection of the values for the degrees of freedom (df) and the alpha level. 8

Why

See Theoretical Knowledge Vs Practical Application.

How

Many of the References and Additional Reading websites and Videos will assist you with understanding and applying probability distributions.

As some professors say: “It is intuitively obvious to even the most casual observer.

References

1 Turney, Shaun. 2022. “Probability Distribution | Formula, Types, & Examples”. Scribbr. https://www.scribbr.com/statistics/probability-distributions/.

2 Dagnechaw, Shiffraw. “What Is A Probability Distribution? And Why They Are Important?” 2020. Medium. https://shiffdag.medium.com/what-is-a-probability-distribution-and-why-they-are-important-dee5e5c1ba99.

3 Bhandari, Pritha. 2020. “The Standard Normal Distribution”. Scribbr. https://www.scribbr.com/statistics/standard-normal-distribution/.

4 Turney, Shaun. 2022. “Poisson Distributions | Definition, Formula & Examples”. Scribbr. https://www.scribbr.com/statistics/poisson-distribution/.

5 Turney, Shaun. 2022. “Chi-Square (Χ²) Distributions | Definition & Examples”. Scribbr. https://www.scribbr.com/statistics/chi-square-distributions/.

6 Turney, Shaun. 2022. “Chi-Square (Χ²) Table | Examples & Downloadable Table”. Scribbr. https://www.scribbr.com/statistics/chi-square-distribution-table/.

7 Bevans, Rebecca. 2020. “T-Distribution: What It Is And How To Use It”. Scribbr. https://www.scribbr.com/statistics/t-distribution/.

8 “T Table – T Table”. 2022. T Table. https://www.tdistributiontable.com/.

9 Turney, Shaun. 2022. “Student’s T Table (Free Download) | Guide & Examples”. Scribbr. https://www.scribbr.com/statistics/students-t-table/.

10 Zach. 2021. “10 Examples Of Using Probability In Real Life – Statology”. Statology. https://www.statology.org/probability-real-life-examples/.

Stelios Avramidis, and Robert Israel. 2015. “Why Is The Sum Of The Rolls Of Two Dices A Binomial Distribution? What Is Defined As A Success In This Experiment?”. Mathematics Stack Exchange. https://math.stackexchange.com/questions/1204396/why-is-the-sum-of-the-rolls-of-two-dices-a-binomial-distribution-what-is-define.

Additional Reading

“6 Useful Probability Distributions With Applications To Data Science Problems”. 2021. Medium. https://towardsdatascience.com/6-useful-probability-distributions-with-applications-to-data-science-problems-2c0bee7cef28.

Normal

Frost, Jim. 2018. “Normal Distribution In Statistics”. Statistics By Jim. https://statisticsbyjim.com/basics/normal-distribution/.

“Normal Distribution”. 2022. mathsisfun.com. https://www.mathsisfun.com/data/standard-normal-distribution.html.

“Normal Distribution”. 2022. stattrek.com. https://stattrek.com/probability-distributions/normal.

“Normal Distribution – Wikipedia”. 2022. en.wikipedia.org. https://en.wikipedia.org/wiki/Normal_distribution.

“Normal Distribution (Definition, Formula, Table, Curve, Properties & Examples)”. 2022. BYJUS. https://byjus.com/maths/normal-distribution/.

“Normal Distributions Review (Article) | Khan Academy”. 2022. Khan Academy. https://www.khanacademy.org/math/statistics-probability/modeling-distributions-of-data/normal-distributions-library/a/normal-distributions-review.

Standard Normal

“Standard Normal Distribution”. 2022. stattrek.com. https://stattrek.com/probability-distributions/standard-normal.

“Standard Normal Distribution Table”. 2022. mathsisfun.com. https://www.mathsisfun.com/data/standard-normal-distribution-table.html.

“Standard Normal Distribution – Z-Score, Area And Examples”. 2022. BYJUS. https://byjus.com/maths/standard-normal-distribution/.

“Using the Standard Normal Distribution Table”. 2022. ThoughtCo. https://www.thoughtco.com/standard-normal-distribution-table-3126264.

Zach. 2021. “Normal Distribution vs. Standard Normal Distribution: The Difference”. Statology. https://www.statology.org/normal-distribution-vs-standard-normal-distribution/.

Chi-Squared Distribution

“Chi Square Distribution”. 2022. onlinestatbook.com. https://onlinestatbook.com/2/chi_square/distribution.html.

“Chi-Square Distribution”. 2022. stattrek.com. https://stattrek.com/probability-distributions/chi-square.

“Chi-Square Statistic: How To Calculate It / Distribution”. 2022. Statistics How To. https://www.statisticshowto.com/probability-and-statistics/chi-square/.

Chi-Squared Table

“Chi-Square Table”. 2022. mathsisfun.com. https://www.mathsisfun.com/data/chi-square-table.html.

“Engineering Tables/Chi-Squared Distribution – Wikibooks, Open Books For An Open World”. 2022. en.wikibooks.org. https://en.wikibooks.org/wiki/Engineering_Tables/Chi-Squared_Distibution.

Shah, Piyush. 2022. ” Chi-Square Distribution Calculator “. atozmath.com. https://atozmath.com/CONM/DistributionTables.aspx?q=c.

Poisson Distribution

“Poisson Calculator”. 2022. stattrek.com. https://stattrek.com/online-calculator/poisson.

“Poisson Distribution”. 2022. stattrek.com. https://stattrek.com/probability-distributions/poisson.

“Poisson Distribution – Definition, Formula, Table, Examples”. 2022. CUEMATH. https://www.cuemath.com/data/poisson-distribution/.

“Poisson Distribution | Brilliant Math & Science Wiki”. 2022. brilliant.org. https://brilliant.org/wiki/poisson-distribution/.

“Poisson Distribution / Poisson Curve: Simple Definition”. 2022. Statistics How To. https://www.statisticshowto.com/probability-and-statistics/statistics-definitions/probability-distribution/poisson-distribution/.

“Understanding A Poisson Distribution”. 2022. Investopedia. https://www.investopedia.com/terms/p/poisson-distribution.asp.

t Distribution

Frost, Jim. 2022. “T Distribution: Definition & Uses”. Statistics By Jim. https://statisticsbyjim.com/probability/t-distribution/.

“Student’s T-Distribution In Statistics – GeeksforGeeks”. 2020. GeeksForGeeks. https://www.geeksforgeeks.org/students-t-distribution-in-statistics/.

“Student’s T-Distribution – Wikipedia”. 2022. en.wikipedia.org. https://en.wikipedia.org/wiki/Student%27s_t-distribution.

“T Distribution”. 2022. stattrek.com. https://stattrek.com/probability-distributions/t-distribution.

“T-Distribution / Student’s T: Definition, Step By Step Articles, Video”. 2022. Statistics How To. https://www.statisticshowto.com/probability-and-statistics/t-distribution/.

“Understanding T Distribution”. 2022. Investopedia. https://www.investopedia.com/terms/t/tdistribution.asp.

t Table

Frost, Jim. 2022. “T-Distribution Table Of Critical Values”. Statistics By Jim. https://statisticsbyjim.com/hypothesis-testing/t-distribution-table/.

“Statistics – T-Distribution Table”. 2022. tutorialspoint.com. https://www.tutorialspoint.com/statistics/t_distribution_table.htm.

Videos

Normal

The Normal Distribution, Clearly Explained!!!

The normal, or Gaussian, distribution is the most common distribution in all of statistics. Here I explain the basics of how these distributions are created and how they should be interpreted.

An Introduction to the Normal Distribution

An introduction to the normal distribution, often called the Gaussian distribution. The normal distribution is an extremely important continuous probability distribution that arises very frequently in probability and statistics.

Normal Distribution & Probability Problems

This calculus video tutorial provides a basic introduction into normal distribution and probability. It explains how to solve normal distribution problems using a simple chart and using calculus by evaluating the definite integral of the probability density function for a bell shaped curve or normal distribution curve. This video contains 1 practice problem in the form of a word problem with many parts giving you plenty of examples to master this topic. In this video, I explain how to evaluate the definite integral using wolfram’s alpha online calculator for definite integrals. You need to determine the population mean mu and standard deviation sigma as well as the lower and upper limits of integration in order to determine the probability of an event occurring within a certain range of X values where X is a continuous random variable. You need to be familiar with the 68-95-99.7 rule. Approximately 68% of the population lies within 1 standard deviation of the population mean or average. 95% of the population lies within 2 standard deviations of the mean and 99.7% lies within 3 standard deviations of the mean.

Standard Normal Distribution

Standard Normal Distribution Tables, Z Scores, Probability & Empirical Rule – Stats

This statistics video tutorial provides a basic introduction into standard normal distributions. It explains how to find the Z-score given a value of x as well as the mean and standard deviation. It explains how to determine the probability by finding the area under the curve represented by f(x) – the probability density function using the empirical rule and the Z-tables. This video contains plenty of examples and practice problems.

Standard Normal Distribution is the distribution we get after standardizing any Normal Distribution. But before we explore this concept, we first need to explain what a transformation is. So, a transformation is a way in which we can alter every element of a distribution to get a new distribution with similar characteristics. For Normal Distributions we can use addition, subtraction, multiplication and division without changing the type of the distribution. For instance, if we add a constant to every element of a Normal Distribution, the new distribution would still be Normal.

Chi-Squared Distribution

Intro to Chi Square

What is a chi-square test? Difference between chi square tests for independence and goodness of fit. Overview of formulas, advantages and disadvantages.

Chi-Square Test [Simply explained]

The chi-square test is a hypothesis test used for categorical variables with nominal or ordinal measurement scale. The chi-square test checks whether the frequencies occurring in the sample differ significantly from the frequencies one would expect. Thus, the observed frequencies are compared with the expected frequencies in the Chi2-Test and their deviations are examined.

Poisson Distribution

Introduction to Poisson Distribution – Probability & Statistics

This statistics video tutorial provides a basic introduction into the poisson distribution. It explains how to identify the mean with a changing time interval in order to calculate the probability of an event occurring.

An Introduction to the Poisson Distribution

An introduction to the Poisson distribution. I discuss the conditions required for a random variable to have a Poisson distribution. work through a simple calculation example, and briefly discuss the relationship between the binomial distribution and the Poisson distributions.

t Distribution

Introduction to the t Distribution (non-technical)

A brief non-technical introduction to the t distribution, how it relates to the standard normal distribution, and how it is used in inference for the mean.

Student’s T Distribution – Confidence Intervals & Margin of Error

This statistics video tutorial provides a basic introduction into the student’s t-distribution. It explains how to construct confidence intervals around a population mean using the student’s t-distribution as well as calculating the margin of error or error bound of the mean. It’s a good indication to use the t-distribution as opposed to the normal distribution when the sample size is less than 30 and when you’re given the sample standard deviation instead of the population standard deviation.

The t-distribution – why we need it | explained with confidence intervals

In this video, we will discuss the basics of the t-distribution and try to understand why we need it. To explain the basics of the t-distribution, we will use it to construct confidence intervals for small samples.

Website Powered by WordPress.com.

Up ↑