Lab 4: Inferential Statistics for Penguins#

Caution

This lab will provide an opportunity to practice inferential statistics in Python. Please start by opening this Google Colab notebook and copying it to your Google Drive.

Important

Please submit your lab by modifying the code in the in the GC above!

Warning

Please make sure to submit the lab by October 24/25, 2022, depending on your section.

Lab Instructions#

  1. Pre-load ‘taxis’ data set from Seaborn (already pre-loaded in GC template)

  2. Create histograms and QQ-plots for the following variables: distance, fare, tip, total

  3. Test whether the mean tip on the trip is $\mu \leq 2$ dollars.

  4. Compare mean of total depending on the type of payment (total_payment)

  5. Compare whether mean distance (distance) is different for all pick-up boroughs.

  6. Compare whether mean tips (tip) is different for all drop-off boroughs.

  7. Create scatter plots for distance versus fare. Calculate correlation of this relationship.

  8. Print your GC to pdf and submit to GauchoSpace.