Institute for Data Science and Big Data

January 4-8 & 11-15, 2021 | Held virtually

This Institute covers the essential basis for doing data science as practiced in the 21st century. Data scientists are expected to know how to obtain relevant data for a specific problem, clean and explore data, create models using data, state inferences, make reliable predictions, and communicate findings to multiple, possibly non-technical, audiences. We will cover each of these steps in 10 intense working days.
Register as AU Student Register as Professional

The course consists of a dynamic mixture of theoretical lectures, guest speakers, and group assignments. The guest lectures include data science leaders from Washington’s unparalleled mixture of government, academia, and business. Statistical topics include: exploratory methods, graphics, regression, machine learning, ensembles, clustering analysis, and Bayesian approaches. Specific technical skills including: R, Python, Pandas, JavaScript, HTML/CSS, API Interactions, Social Media Mining, SQL, Tableau, GitHub, and more.

View the course flyer

Course Information

Instructor: Ryan Moore, Associate Professor, Department of Government
Course schedule: January 4-8 and January 11-15, 2021; 9 am to 5 pm every day
Course number: SPA-020/SPA-420/SPA-620

Please note that the dates have changed from the original scheduled dates of January 2-10, 2021. 

Participants will receive a certificate of completion.  

Registration for AU Students

AU students can register for the Data Science Institute as a regular course. Talk with your adviser about registering for SPA-420 (undergraduate) or SPA-620 (graduate). This course awards 4 credit hours, and is part of the academic Spring semester.

Registration for Professionals & Visiting Students

Professionals and visiting students must fill out the application form to be considered. We will follow up with you via e-mail to obtain your application materials. Once accepted, detailed information will be sent with instructions on how to register as a visiting student.

Application Deadline: December 11, 2020

Cost (non-credit): $1,500.00

All visiting students (new to AU), enrolled for credit or not-for-credit register for the class through AU Central. The non-credit course option is SPA-020. A visiting student fills out the course registration form and submits the form directly to AU Central. Tuition payment is due during registration. Instructions to register will be sent upon acceptance of your application.

The Institute is not open to registration through the Consortium of DC area schools. Interested students from local area schools can register for the class as a visiting student through AU Central. If taking the class for course credit, they would have to make arrangements to transfer the course credit back to their home school.

Frequently Asked Questions

The prerequisite is a basic statistics 101 class or equivalent knowledge. Some previous knowledge of R is useful but not required.

We will use R and Python at a fairly basic level for the most part, with some advanced usage carefully explained as if participants have not seen it before.

Yes, there will be a hands-on group project with real data.

Decisions are made on a rolling basis, and are currently being accepted. Non-AU students have the same preference as AU students.

2020 Syllabus

Review the course syllabus from January 2020 by instructor Ryan Moore.

Read Syllabus