CSC/DSP310 Homepage

Programming for Data Science

Spring 2021


Dr. Lutz Hamel
Tyler, Rm 251
Office Hours: MW 11-noon or by appointment, online only


Michael Eiger
Office Hours: Friday noon to 2 PM
Room: online only (for link info see BrightSpace)


Data science exists at the intersection of computer science, statistics, and machine learning. Writing programs that access and manipulate data, so that it becomes available for analysis with statistical and machine learning techniques, is therefore at the core of data science. Data scientists use their data and analytical abilities to find and interpret rich data sources; manage large amounts of data despite hardware, software, and bandwidth constraints; merge data sources; ensure consistency of datasets; create visualizations to aid in understanding data; build mathematical models using the data; and present and communicate the resulting insights and findings.

This course provides a survey of data science. Topics include data-driven programming in Python; data sets, file formats, and metadata; descriptive statistics, data visualization, and the foundations of predictive data modeling and machine learning; accessing web data and databases; and distributed data management. You will work on substantial weekly programming problems, such as accessing data in a database and visualizing it, or building machine learning models of a given data set.
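To give a feel for the kind of problem mentioned above, here is a minimal sketch in Python that stores a few rows in an in-memory SQLite database, reads them back, and computes descriptive statistics per group. It is only an illustration and not an actual course assignment: the table name `measurements`, its columns, the sample values, and the helper functions are all hypothetical.

```python
import sqlite3
import statistics


def load_measurements(conn):
    """Create a small hypothetical table and fill it with sample rows."""
    conn.execute("CREATE TABLE measurements (site TEXT, value REAL)")
    rows = [("A", 2.0), ("A", 4.0), ("A", 6.0), ("B", 1.0), ("B", 3.0)]
    conn.executemany("INSERT INTO measurements VALUES (?, ?)", rows)
    conn.commit()


def summarize_by_site(conn):
    """Return {site: (count, mean, median)} for the measurements table."""
    groups = {}
    query = "SELECT site, value FROM measurements ORDER BY site"
    for site, value in conn.execute(query):
        groups.setdefault(site, []).append(value)
    return {
        site: (len(values), statistics.mean(values), statistics.median(values))
        for site, values in groups.items()
    }


# Use an in-memory database so the sketch leaves no files behind.
conn = sqlite3.connect(":memory:")
load_measurements(conn)
summary = summarize_by_site(conn)

for site, (n, mean, median) in summary.items():
    print(f"{site}: n={n} mean={mean:.2f} median={median:.2f}")
```

Only the standard library is used here so the sketch runs anywhere; in practice the same summary would likely feed a plotting or modeling library.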

Upon completion of this course

Documents of Interest:



Jupyter/Colab Notebooks

Data Sets