Statistical Tests

Photo by Coffee Geek on Unsplash

Generate Random Data

  • This is what the data looks like
  • Sampling few rows from our “Population”

Are these two samples from the same distribution?

• what if we didn’t know already

Is the difference between the means actually statistically significant

• Differences of means will be normally distributed if we repeatedly sample.

• Sampling distribution of difference of means will be normally distributed

• But since we don’t know the distribution parameters

• We assume T-distribution

Two sample T-test

Conclusions

• We can see that these two samples have similar means and variances therefore its safe to assume they come from the same distribution/population

--

--

School of Data Science @ University of North Carolina — Charlotte

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Abhijeet Pokhriyal

School of Data Science @ University of North Carolina — Charlotte