COMP SCI 639: Foundations of Data Science (Spring 2022)

Instructor: Jelena Diakonikolas

Email: jelena at cs dot wisc dot edu

Office hours: Tuesday and Thursday after class (4-5pm), or by appointment. In person, subject to local health guidelines.

Communication policy: I try to respond to all emails, but my email load can become heavy during the semester, and I may miss some messages. If your question is urgent and I do not respond promptly, please send me a reminder. For all non-urgent class-related questions, please use the class Piazza (accessible from Canvas) and/or one of the office hours slots.

Teaching Assistants: Cheuk Yin (Eric) Lin and Xufeng Cai

Email: clin353 at wisc dot edu, xcai74 at wisc dot edu

Office hours: Monday 9-11am (Xufeng) and Wednesday 3-5pm (Eric)

Class Schedule

This class meets on Tuesdays and Thursdays in CS 1221, 2:30-3:45pm (75 min). Optional, supplementary discussion sessions are held by the TAs on Fridays, 2:30-3:45pm, in CS 1325.

General Course Information


Most of the class is theoretical and assumes mathematical maturity: you need to be comfortable with reading, understanding, and writing proofs. Courses in linear algebra and in probability and statistics, as well as a proof-based theoretical course (or instructor permission), are required.

Some of the homework problems will require coding in Python, and basic knowledge of Python is expected.

Course Material

There is no required textbook for this course.

Lecture notes for each lecture, as well as additional readings, are provided on the course Canvas page.

Course Outline

This class focuses on theoretical foundations of data science. While we will cover some applications and examples in Python, this class is not an applied, coding-based class.

This is a tentative list of topics that will be covered in class. Most of the topics listed here will be covered, and some other topics may be added.

Course Load/Assessment

All grades will be posted on Canvas. The information provided here is tentative and is subject to change.

Homework: There will be 5-6 homework assignments, accounting for ~50% of the grade. You may discuss problems with other students, but you need to declare it on your homework submission. Any discussion can be verbal only: you are required to work out and write the solutions on your own. Submitting someone else's work as your own constitutes academic misconduct. Academic honesty is taken very seriously in this class, and any breach of it will be treated according to the University Policy.

Homework assignments and solutions will be posted on Canvas.

Midterm: Date and Time: TBD. Held in person, in class. Accounts for ~20% of the grade.

Project: Done in pairs. Can be one of the following: (i) a lecture on a data science topic that we did not cover in class, (ii) a literature review of a specific data science topic, or (iii) a deep dive into a research question in data science. Accounts for ~30% of the grade.

Academic Policies

Academic Integrity

Disability Accommodation

Institutional Statement on Diversity

