This directory includes a few odds and ends:
jc1.txtis a dataset of Jeopardy Contestants crawled from j-archive.com and posted on Reddit.
jq2.txtis a dataset Jeopardy Questions also crawled from j-archive.com
California/Nevada Precipitation Data
simple_scrape.pyis a Python script to crawl data from the NOAA Website for the California-Nevada River Forecast Center, e.g. pages like https://www.cnrfc.noaa.gov/monthly_precip_2020.php). If you run it, it will create a subdirectory called
outputand store each year’s data there.
monthly_precip_full.csvis the concatenation of the outputs of
flow_CalDataEngExample.zipis a Trifacta flow export file. It contains all the recipes to take the
monthly_precip_full.csvfile and generate the remaining files:
mpf.txt. This is not a human-readable format—to make use of it, you need to go to Flows->Import Flow in Trifacta as described here.
mm.txtis a pivot table (matrix) of
mmp.txtis a pivot table of
mmr.txtis a un-pivoted (relational) version of
mpf.txtis a cleaned-up version of
monthly_precip_full.csvwith all the display junk from the web stripped out and the relevant fields replicated into each row.