Interim Report#

These guidelines will help you apply the methods learned in lecture and labs onto your geographical analysis project with the data that you collected for your Data Report. The key to successful interim report is being tedious. State your hypotheses, use visualizations to derive interesting insights from the data. Support your visual insights using statistical tests we have covered.

Warning

The Data Report is due 11/13/2022 by 23.59.


Requirements#

Treat this report as a standalone document to that you prepare for your supervisor at potential job. You are given the data set and your task is to tell interesting story using graphs, plots, maps, followed by statistical hypotheses to support your initial findings derived from visualizations.

  1. At least two scatter plots or bar charts or pie charts or time series plot (or any other type of a plot)

  2. At least one summary table (crosstab table). This can be the table that you would use for plotting. For instance aggregate statistic (mean, sum, etc.) per state that you will be using to create a choropleth map.

  3. At least one correlation coefficient

  4. At least one one sample t/z-test

  5. At least one two sample test

  6. At least one ANOVA test

  7. At least one Global Moran’s $I$

  8. At least one Moran scatter plot

  9. At least one choropleth map with clusters (hotspots/coldspots) and outliers

  10. At least one clustering partitioning on your data (clustering can be done on both: point pattern data and areal (polygon) data.)

Important

Make sure to follow the hypothesis steps outlined in the Hypothesis testing slides: state hypotheses, set your confidence level, run the test, get p-value, interpret your p-value by either failing to reject or rejecting null.


Project Instructions#

  1. Create a new Google Colab notebook

  2. Import the data from your interim report into the notebook

  3. Break your report into three parts: Visual Exploration, Statistical Tests, Geographic Analysis. Use level-1 heading for titles of these parts (i.e. # Part name).

  4. Add content to the report, by including all of the items found in the ‘Project Requirements’ section.

  5. Provide hypotheses and interpretations for corresponding parts.

  6. Weave the plots and interpretations into a coherent document. The data story should be easy to follow.

  7. Submit the following to GauchoSpace.

    1. Data (as either a csv or a geojson). This step is not required if you used the ‘gdown’ approach described above.

    2. Notebook in ipynb format.

    3. Notebook as exported pdf.

Caution

Your submitted notebook must run and/or have all of the cell output rendered, so that it could be graded.

Tip

Use the code from the recently announced example notebook and gdown.

Tip

Examples of well-written notebooks for exploratory data analysis: