ISE 291: Introduction to Data Science - Spring 2021 (T202)


Lecture: MW | 11:00 - 12:15 | online
Instructor: Ahmad Almulhem
Office: 22/407-2
Email: ahmadsm at kfupm
Office Hours: UT | 09:00 - 09:50, or by appointment (email me)

Description

An overview of the data-driven approach and the data analytics lifecycle. Basic statistics: variance, covariance, correlation, confidence intervals, and histograms. Data frames, series, slicing, and sorting. Relational databases with primary and foreign keys. SQL implementation in Python. Data acquisition, cleaning, scrubbing, and manipulation. Correlation analysis, PCA, linear regression, gradient descent, Bayesian classifiers, decision trees, K-means clustering, hierarchical clustering, big data, and high-dimensional data. Overview of MapReduce and Hadoop.

Textbook

Chirag Shah, “A Hands-On Introduction to Data Science,” Cambridge University Press, 2020

Resources