Programming for Data Science
Dr. Lutz Hamel
Tyler, Rm 251
Office Hours: MW 11-noon or by appointment, online only
Office hours: Friday noon to 2PM
Room: online only (for link info see BrightSpace)
Data science exists at the intersection of computer science, statistics, and
machine learning. That means writing programs to access and manipulate data so
that it becomes available for analysis using statistical and machine learning
techniques is at the core of data science. Data scientists use their data and
analytical ability to find and interpret rich data sources; manage large amounts
of data despite hardware, software, and bandwidth constraints; merge data
sources; ensure consistency of datasets; create visualizations to aid in
understanding data; build mathematical models using the data; and present and
communicate the data insights/findings.
This course provides a survey of data science. Topics include data driven
programming in Python; data sets, file formats and meta-data; descriptive
statistics, data visualization, and foundations of predictive data modeling and
machine learning; accessing web data and databases; distributed data management.
You will work on weekly substantial programming problems such as accessing data
in database and visualize it or build machine learning models of a given data
Upon completion of this course
- You will have a detailed view of what data science entails.
- You will be able to access, visualize, build and evaluate models of data.
- You will be able to solve problems in data science using standard tools and techniques.
Documents of Interest: