Data Analytics for Economists [Fall 2018]
Monday and Wednesday 4:00PM-5:15PM @ Biochemistry 1120
Teaching Assistant: Dennis McWeeny
Office hours: M 9:30AM-11:30AM @ Sco Sci 6470
Professor: Kim J. Ruhl
Office hours: T&R 2:30PM-3:30PM @ Soc Sci 7444
As the project deadline nears, I have been receiving many requests from people for meetings who cannot make my regular office hours (T/R 2:30-3:30). To ease the burden on everyone, I will be devoting some time for each of our last three class periods to 'office hours.' My goal is to shoot for an end of class at about 4:50 or so. Everyone is free to leave then. Anyone who would like to ask questions and talk about projects can stay and we can meet. Since everyone has this time blocked out of their schedules, this should save some of you extra trips to my office.
In addition: Dennis will hold extra office hours this Friday, December 7, 9:30-11:30 AM and 1:00-3:00 PM. Dennis' office hours on Monday December 10 are cancelled.
December 4, 2018
Coding practice #5 is posted to our shared folder on winstat (9_coding_practice folder). It is also posted below, under week 14. It is due Monday, December 10 at the end of class.
November 26, 2018
The due date for the project proposal has been extended to the end of class on Wednesday 11/28.
November 15, 2018
Exam #2 is available in our shared folder on the X:\ drive and posted under week 11 below. The exam is open book and open internet, but you should not consult with others. Work through the exam on your own. As with the coding practice, print out the notebook and bring it to class on Monday. Early versions of the website had it due at 1:00. This is not the case. It is due by the end of class on Monday, November 19.
I am not going to sugar coat this: Dennis cooked up a tough problem for question 2. The file 'map.pdf' shows you what the end product should look like. It's a hard problem but the payoff is big!
November 8, 2018
Coding practice #4 is posted to our shared folder on winstat (9_coding_practice folder). It is also posted below, under week 10. It is due Monday, November 12 at the end of class. I have also added all the 'finished' notebooks to the 999_finished folder in our shared drive on winstat.
October 25, 2018
Coding practice #3 is posted to our shared folder on winstat (9_coding_practice folder). It is also posted below, under week 8. It is due Monday, October 29 at the end of class.
October 24, 2018
We will not hold class on the day before Thanksgiving, November 21.
October 16, 2018
A reminder that, due to technical difficulties, the exam is now due on Wednesday October 16 at the end of class.
October 8, 2018
Exam #1 will be available Thursday, October 11. The exam is open book and open internet, but you should not consult with others. Work through the exam on your own. As with the coding practice, print out the notebook and bring it to class on Monday. Early versions of the website had it due at 1:00. This is not the case. It is due by the end of class on Monday, October 15.
October 3, 2018
Coding practice #2 is posted. It requires two files with data: GDPA.csv and banks_and_branches.csv. They are all posted under week 5 below. Have a good weekend.
August 14, 2018
Welcome! This page is a work in progress; more coming soon. I am looking forward to meeting all of you, but until then, here are a few things you can do before we get started to smooth your transition into class.
- You will need a laptop. This is a hands-on course: we will be writing and debugging code together during class periods. If you do not have access to a laptop, please contact me immediately and we can work something out.
- You will need to be connected to the internet during class. Make sure your UWNet login is working.
- This course is about using data to answer questions. Start looking at some examples of people doing exactly this. FiveThirtyEight is a nice read, as is The Economist. On Twitter, check out @nytgraphics.
This is the big-picture. We will add a lot more detail as we progress. We will also dip in and out of topics as we go. For example, we will do some basic plotting throughout the entire course, but we will not dig into the details until later.
In the readings, McKinney refers to the book Python for Data Analysis (Second edition) by Wes McKinney. There is a lot more in this book than we will cover, but it is a good reference.
McKinney, parts of chapter 2
Using data to answer questions
Working on Winstat
Creating, opening, and saving our work
Documenting our work
McKinney, parts of chapters 2 and 3
Syntax, assignment, calculation
Data structures, types
Mckinney, parts of chapters 5-8
Working with data:
The pandas package
Loading and cleaning data, missing values
Merging and joining data sets
McKinney, parts of chapter 9
The matplotlib package
Scatter, bar, and line plots
Labeling and annotating plots
This will depend on timing, but we might cover map visualizations, time series analysis, web APIs, machine learning basics, web scraping, formal econometrics...other ideas?
This week-by-week schedule will be constantly updated. Some topics may take longer than scheduled (and others may take less) but exam, coding practice, and project due dates will not.
Installing and using winstat
McKinney Ch. 2.2 parts: "Running the Jupyter Notebook," "Tab Completion," "Introspection"
McKinney Ch. 2.3 up to "Control Flow": Everything except "Duck Typing" and "Bytes and Unicode"
Week 1: September 10 & 12
Winstat // Juypter notebooks // Markdown
Assignment // Calculation // Types // Strings
Week 2: September 17 & 19
Lists // Tuples // Dicts // More on types
Loops // Conditionals
Week 4: October 1 & 3
Matplotlib: Figures and axes // Plot types // Styles
Pandas: Reading and writing files
Week 7: October 22 & 24
Reshaping // Merge // Join
Week 9: November 5 & 7
Seaborn // Maps
Week 12: November 26 & 28
Probit & logit // AR(p)
Due November 26: Project proposal // Project information
Week 13: December 3 & 5
Intro to machine learning // cross validation