Capstone Project

Columbia Data Science

Capstone Overview

This course provides a unique opportunity for students in the M.S. in Data Science program to apply their knowledge of the foundations, theory and methods of data science to address data driven problems in industry, government and the non-profit sector. The course activities focus on a semester-length project sponsored by a local organization. The project synthesizes the statistical, computational, engineering and social challenges involved in solving complex real-world problems. Typically, three or four students work together as a team on each project. Each team is supervised by a faculty mentor and projects typically progress through the following phases:

  1. Background and problem definition
  2. Data wrangling, munging and cleaning
  3. Exploratory Data Analysis
  4. Coding prototypes of algorithms and models
  5. Data Visualization
  6. Reporting and communicating
  7. Productionizing any models or algorithms if applicable

Big Data

Outline of course

Students will meet as a cohort once a week where they will share best practices, and discuss relevant readings on topics including (1) entrepreneurship, (2) ethics, especially the ethics of mathematical models and algorithms, and (3) process and design thinking.

Data Science

Please note: this information is subject to change based on the Faculty's discretion.

For more information, please contact us at datascience@columbia.edu.

Back to Top