Data Science
Lecture, BUITEMS, Department of Software Engineering and Department of Computer Engineering, 2023
3 Credit Hours - Spring 2023
Welcome to Data Science! The course material and lectures will be posted on this site and students will be notified accordingly.
Reference Book |
---|
1. “Data Science from Scratch” by Joel Grus |
Supplementary Books |
---|
2. “Python for Data Analysis” by Wes McKinney (O’Reilly) |
3. “Python Data Science Handbook” by Jake VanderPlas |
Lectures
Date | Topic | Lecture Links | Assignments and Quizzes |
---|---|---|---|
(Week 1) | What is Data Science? Data Science Methodology | Lecture 1 | |
(Week 2) | Overview of Python for Data Science | Python Introduction, NumPy, Python Series and DataFrames - Book Chapter no. 2 | Assignment 1 |
(Week 3) | Data Types and Sources | Types of Data, Fetching Data Through API | QUIZ |
(Week 4) | Data Cleaning and Preprocessing | Pivot Table, Scales, Merging DataFrames, GroupBy | |
(Week 5) | Exploratory Data Analysis (EDA) | Basic Understanding of Data, Univariate Analysis, Bivariate Analysis | |
(Week 6) | Types of Charts and Graphs | GGPLOT | |
(Week 7) | Tools for Data Visualization | Data Visualization | QUIZ |
(Week 8) | Statistical Inference | Statistical Testing | |
(Week 9) | Introduction to Machine Learning | ML | QUIZ |
(Week 10) | Regression Analysis | Basic Linear Regression, Dataset, Polynomial Regression, Regression Metrics | |
(Week 11) | Classification Analysis | Binary Classification with Metrics, Multiclass Classification on Iris, Multiclass Classification on MNIST (image dataset) | |
(Week 12) | Decision Trees and Random Forest | Decision Trees and Random Forest, Understanding Random Forest, Notebook, Dataset, Feature Importance | |
(Week 13) | Unsupervised Learning: Clustering Analysis | Clustering Introduction, K-means Notebook, Dataset | QUIZ |
(Week 14) | Unsupervised Learning: Dimensionality Reduction | Dimensionality Reduction | |
(Week 15) | Big Data and Databases for Data Science | SQL for Data Science, Big Data and Data Science | |
(Week 16) | Ethics in Applied Data Science | Ethics and Its Importance, Fairness and Ethics Practice by Google | QUIZ |
Updates
Sessional Division
* 5 marks for quizzes.
* 10 marks for a professional, well-maintained GitHub repository with self-explanatory code and README files.
* 10 marks for assignments.
The first GitHub commit is due on March 19th (CE) and April 7th (SE).
* Office hours are on Tuesday, 11:45 am to 1 pm and 3 pm to 5 pm (by prior appointment).
* For questions, or to schedule a meeting if these times don’t work for you, please email.