MATH4100

Download as PDF

Introduction to Data Science

Mathematics SC - College of Science

Description

The course begins by bootstrapping student's coding skills in the programming language Python, followed by a review of the relevant concepts from statistics. After that, we will move through a series of data science methods using real-life, project-based, lectures and computer labs. The major goals of this course are to learn how to use tools for acquiring, cleaning, analyzing, exploring, and visualizing data; making data-driven inferences and decisions; and effectively communicating results. These will be accomplished through an in-depth sequence of topics which will introduce students to the following data preparation and analysis methods: Acquiring data through web-scraping and data APIs, Cleaning and reshaping messy datasets using methods such as data frames, regular expressions or dedicated tools, Exploratory data analysis and visualization, Hypothesis testing, Clustering and classification, Rating and ranking, Recommendations, Network analysis, Regression and statistical inference, Natural language processing, Working with large data: databases, parallel programming. A major component of this course will be learning how to use python-based programming tools to apply these methods to real-life datasets. Students should have a basic-level of programming experience before taking this course.

Minimum Credits

3

Maximum Credits

3

Repeat for Credit

No

Required Requisite(s):

008865

Course Attributes

BD - QI (Quantitative Intensive BS)

Semesters Typically Offered

Spring

Cross Listed Courses