Overview
According to Harvard Business Review, Data Science is the sexiest job of the 21st century.
A recent report by Google concluded that since the last 18 months, the interest in Machine Learning has doubled.
Learn Data Science with Python Certification
Python programming, in the recent years, has become one of the most preferred languages in Data Science. And when it comes to building Machine Learning systems, Python provides an ideally powerful and flexible platform to build on. Through a comprehensive, hands-on approach, this course provides you the opportunity you need to experiment with a wide variety of Data Science and Machine Learning algorithms. We believe that a practical, hands-on approach is the key to meaningful learning and skills advancement. With this in mind, we integrate real-life exercises and activities throughout our trainings, with long-term retention of learning and development in mind.
What You'll Learn
- Introducing data science, with a focus on the job outlook and market requirements
- Data Science Project Life Cycle
- Basics of Statistics – Measures of Central Tendency and Measures of Dispersion
- Discrete and Continuous Distribution Functions
- Advanced Statistics Concepts – Sampling, Statistical Inference and Testing of Hypothesis
- Introduction of Python Programming, Anaconda and Spyder
- Installation and Configuration of Python
- Control Structures and Data Structures in Python
- Hands-on Applied Statistics Concepts using Python
- Functions and Packages in Python
- Graphics and Data Visualization Libraries in Python
- Introduction to Machine Learning
- Machine Learning Models and Case Studies with Python
- Software developers and programmers who want to reap the benefits of a lucrative Data Science and Machine Learning career
- Data Analysts or Financial Analysts from the non-IT industry who want to make a transition to the IT industry
- Individuals, students and corporate professionals who want to upgrade their technical skill set
Curriculum
- Data Science Introduction
- Data Science Project Lifecycle – CRISP-DM Model
- Data Science Toolkit
- Job outlook
- Prerequisite& Target Audience
- Introduction to Python, Anaconda, Spyder & Jupyter Notebook
- Installation & Configuration
- Basic Python Programming Concepts
- Data Structures in Python
- List
- Tuples
- Dictionary
- NumPy Array & it’s applications
- Control Structures
- Creating Custom Functions
- Exception Handling
- Random Variable
- Type of Random variables
- Discrete & Continuous
- Nominal
- Ordinal
- Interval
- Ratio
- Central Tendencies
- Mean
- Mode
- Median
- Measurement of dispersion
- Variance
- Standard Deviation
- Basic Statistics using NumPy
- Introduction to Probability Theory
- Probability Distribution Analysis
- Probability Mass Function
- Probability Density Function
- Normal Distribution
- Standard Normal Distribution
- Covariance & Correlation
- Pandas Dataframes & its applications
- Importing tables from RDBMS
- Analytics & Data Visualization using Matplotlib
- Univariate & Bivariate Statistical Analysis using Matplotlib
- Line Plot
- Area Plot
- Histogram
- Box Plot
- Scatter Plot
- Sampling Analysis
- Inferential Statistics
- Sampling Distribution
- Central Limit Theorem
- Hypothesis Testing
- 1 tail test and 2 tail test
- Type I and Type II errors
- P value
- Level of Significance
- Confidence Interval
- Statistical Analysis using Seaborn
- KDE Plot
- RegPlot
- Joint Plot
- Heatmap
- Data Sampling
- Simulating Normal Distribution
- Calculating PDF & CDF
- Hypothesis Testing – Case Study
- Introduction to Machine Learning
- Estimation Function
- Reducible & Irreducible errors
- Supervised & Unsupervised
- ML Algorithms ML Model Training & Testing
- Parametric & Non- Parametric Algorithms
- Regression Analysis
- Simple Linear Regression
- Multiple Linear Regression
- Linear Regression methods
- Ordinary Least Square
- R Squared method
- Adjusted R Square
- Regression Evaluation Metrics – MSE,RMSE
- Bias & Variance
- Model Under fitting and Overfitting
- Feature Engineering
- Null Data Imputation Techniques
- Outlier Analysis
- Categorical Encoding
- Label Encoding
- One Hot Encoding
- Feature Selection Techniques
- Correlation Analysis
- Chi Square Test
- Machine Learning Case Study 1 – Multiple Linear Regression
- Logistic Regression
- Simple Logistic Regression
- Multiple Logistic Regression
- Logistic Regression Function
- ROC AUC Analysis
- Model Evaluation using Confusion Metrix
- Accuracy, Precision, Recall & Specificity
- Machine Learning Case Study 2– Multiple Logistic Regression
- Feature Scaling
- Addressing Imbalanced Data using SMOTE/MSMOTE
- Model Cross Validation using K- Fold Cross Validation Classification Analysis
- K Nearest Neighbor Classifier
- Decision Trees
- Classification and Regression Tree
- Random Forest
- Information Gain & Entropy
- Machine Learning Case Study 3 – Classification Analysis using KNN,
- Decision Tree & Random Forest
- Clustering Algorithms
- K Means Clustering
- Hierarchical Clustering
- Elbow Curve Graph
- Machine Learning Case Study 4 – Clustering Analysis using K-Means
- Clustering
- Recommendation Engines
- Collaborative filtering & Types
- Machine Learning Case Study 5 – Recommendation Engine using
- Collaborative filtering