Basic Statistics & Regression for Machine Learning in Python
Basic Statistics & Regression for Machine Learning in Python, available at $54.99, has an average rating of 4.65, with 66 lectures, based on 14 reviews, and has 2630 subscribers.
You will learn about Python Basics, Statistics and Regression behind Machine Learning in Python and also using Manual Calculations This course is ideal for individuals who are Beginner who wants to learn the mathematics for Machine Learning It is particularly useful for Beginner who wants to learn the mathematics for Machine Learning.
Enroll now: Basic Statistics & Regression for Machine Learning in Python
Summary
Title: Basic Statistics & Regression for Machine Learning in Python
Price: $54.99
Average Rating: 4.65
Number of Lectures: 66
Number of Published Lectures: 66
Number of Curriculum Items: 66
Number of Published Curriculum Objects: 66
Original Price: $109.99
Quality Status: approved
Status: Live
What You Will Learn
- Python Basics, Statistics and Regression behind Machine Learning in Python and also using Manual Calculations
Who Should Attend
- Beginner who wants to learn the mathematics for Machine Learning
Target Audiences
- Beginner who wants to learn the mathematics for Machine Learning
Hello and welcome to my new course Basic Statistics and Regression for Machine Learning
You know.. there are mainly two kinds of ML enthusiasts.
The first type fantasize about Machine Learning and Artificial Intelligence. Thinking that its a magical voodoo thing. Even if they are into coding, they will just import the library, use the class and its functions. And will rely on the function to do the magic in the background.
The second kind are curious people. They are interested to learn what’s actually happening behind the scenes of these functions of the class. Even though they don’t want to go deep with all those mathematical complexities, they are still interested to learn what’s going on behind the scenes at least in a shallow Layman’s perspective way.
In this course, we are focusing mainly on the second kind of learners.
That’s why this is a special kind of course. Here we discuss the basics of Machine learning and the Mathematics of Statistical Regression which powers almost all of the the Machine Learning Algorithms.
We will have exercises for regression in both manual plain mathematical calculations and then compare the results with the ones we got using ready-made python functions.
Here is the list of contents that are included in this course.
In the first session, we will set-up the computer for doing the basic machine learning python exercises in your computer. We will install anaconda, the python framework. Then we will discuss about the components included in it. For manual method, a spreadsheet program like MS Excel is enough.
Before we proceed for those who are new to python, we have included few sessions in which we will learn the very basics of python programming language. We will learn Assignment, Flow control, Lists and Tuples, Dictionaries and Functions in python. We will also have a quick peek of the Python library called Numpy which is used for doing matrix calculations which is very useful for machine learning and also we will have an overview of Matplotlib which is a plotting library in python used for drawing graphs.
In the third session, we will discuss the basics of machine learning and different types of data.
In the next session we will learn a statistics technique called Central Tendency Analysis which finds out a most suitable single central value that attempts to describe a set of data and its behaviour. In statistics, the three common measures of central tendency are the mean, median, and mode. We will find mean, median, and mode using both manual calculation method and also using python functions.
After that we will try the statistics techniques called variance and standard deviation. Variance of a dataset measures how far a set of numbers is spread out from their average or central value. The Standard Deviation is a measure of how much these spread out numbers are. We will at first try the variance and standard deviation manually using plain mathematical calculations. After that, we will implement a python program to find both these values for the same dataset and we will verify the results.
Then comes a simple yet very useful technique called percentile. In statistics, a percentile is a score below which a given percentage of scores in a distribution falls. For easy understanding, we will try an example with manual calculation of percentile using raw data set at first and later we will do it with the help of python functions. We will then double-check the results
After that we will learn about distributions. It describes the grouping or the density of the samples in a dataset. There are two types. Normal Distribution where probability of x is highest at centre and lowest in the ends whereas in Uniform Distribution probability of x is constant. We will try both these distributions using visualization of data. We will do the calculation using manual calculation methods and also using python language.
There is a value called z score or standard score in statistics which helps us to determine where the value lies in the distribution. For z score also at first we will try calculation using python functions. Later the z score will be calculated with manual methods and will compare the results.
Those were the case of a single valued dataset. That is the dataset containing only a single column of data. For multi-variable dataset, we have to calculate the regression or the relation between the columns of data. At first we will visualize the data, analyse its form and structure using a scatter plot graph.
Then as the first type of regression analysis we will start with an introduction to simple Linear Regression. At first we will manually find the co-efficient of correlation using manual calculation and will store the results. After that we will find the slope equation using the obtained results. And then using the slope equation, we will predict future values. This prediction is the basic and important feature of all Machine Learning Systems. Where we give the input variable and the system will predict the output variable value.
Then we will repeat all these using Python Numpy library Methods and will do the future value prediction and later will compare the results. We will also discuss the scenarios which we can consider as a strong Linear Regression or weak Linear Regression.
Then we will see another type of regression analysis technique called as Polynomial Linear Regression which is best suited for finding the relation between the independent variable x and the dependent variable y.
The regression line in the graph will be a straight line with slope for Simple Linear Regression and for Polynomial Linear Regression, it will be a curve.
In the coming sessions, we will have a brief introduction about polynomial linear regression and the visualization of the modified dataset with x and y values. Using python we will then find the polynomial regression co-efficient value, the r2 value and also we will do future value prediction using python numpy library.
Then we will repeat the same using the plain old manual calculation method. At first we have to manually find the Standard Deviation components. Then later we will substitute these SD components in the equation to find a, b and c values. using these a,b,c values we will then find the final polynomial regression equation. This equation will enable us to do a manual prediction for future values.
And after that, here comes the Multiple regression. Here in this regression we can consider multiple number of independent x variables and one independent y variable. We will have an introduction about this type of regression. We will make necessary changes to our dataset to match the multiple regression requirement.
Since our dataset is getting more complex by the introduction of multiple independent variable columns, it may not be able to be managed by using a plain array for the dataset. We will use a csv or comma separated values file to save the dataset. We will have an exercise to read data from a csv file and save the data in corresponding data-frames. Once we have the data imported to our python program, we will do a visualization using a new library called seaborn which is a derivative of the scikitlearn library.
Using the python numpy and scikitlearn library, multiple regression can be done very easily. Just use the method and pass in the required parameters. Rest will be done by the python library itself. We will find the regression object and then using that we can do prediction for future values.
But with manual calculation, things will start getting complex. Its a lengthy calculation which needs to be done in multiple steps. In the first step we will have an introduction about the equations that we are going to use in the manual method and also we will find the mean values. Then in the second step we will find the components that are required to find the a,b and c values. Then in the third step, we will find the a,b and c values. And in the final step, using a,b,c values we will find the multiple regression equation and using this equation we will do future value prediction of our dataset. We will also try to get the value of the co-efficient of regression.
That’s all about the popular regression methods that are included in the course. Now we can go ahead with a very important topic in data preparation for machine learning. Many machine learning algorithms love to have input values which are scaled to a standard range. We will learn a technique called data normalization or standardization in which all the different ranges of data values will be scaled down to fit within a range of 0 to 1. This will improve the performance of the algorithms very much compared to a non scaled dataset.
For normalization also, just like the regression examples, we will at first try the normalization using python code which will be very easy to generate values. Then later we will repeat this with plain old school type of mathematical calculations.
In the final session, we will discuss more resources which you can follow for going further from the point that we have already learned.
That’s all about the topics which are currently included in this quick course. The code, notepad and jupyter notebook files used in this course has been uploaded and shared in a folder. I will include the link to download them in the last session or the resource section of this course. You are free to use the code in your projects with no questions asked.
Also after completing this course, you will be provided with a course completion certificate which will add value to your portfolio.
So that’s all for now, see you soon in the class room. Happy learning and have a great time.
Course Curriculum
Chapter 1: Course Introduction and Table of Contents
Lecture 1: Course Introduction and Table of Contents
Chapter 2: Environment Setup: Preparing your Computer
Lecture 1: Environment Setup – Part 1
Lecture 2: Environment Setup – Part 2
Chapter 3: Essential Components Included in Anaconda
Lecture 1: Essential Components Included in Anaconda
Chapter 4: Python Basics – Assignment
Lecture 1: Python Basics – Assignment
Chapter 5: Python Basics – Flow Control
Lecture 1: Python Basics – Flow Control – Part 1
Lecture 2: Python Basics – Flow Control – Part 2
Chapter 6: Python Basics – List and Tuples
Lecture 1: Python Basics – List and Tuples
Chapter 7: Python Basics – Dictionary and Functions
Lecture 1: Python Basics – Dictionary and Functions – part 1
Lecture 2: Python Basics – Dictionary and Functions – part 2
Chapter 8: Numpy Basics
Lecture 1: Numpy Basics – Part 1
Lecture 2: Numpy Basics – Part 2
Chapter 9: Matplotlib Basics
Lecture 1: Matplotlib Basics – part 1
Lecture 2: Matplotlib Basics – part 2
Chapter 10: Basics of Data for Machine Learning
Lecture 1: Basics of Data for Machine Learning
Chapter 11: Central Data Tendency – Mean
Lecture 1: Central Data Tendency – Mean
Chapter 12: Central Data Tendency – Median and Mode
Lecture 1: Central Data Tendency – Median and Mode – Part 1
Lecture 2: Central Data Tendency – Median and Mode – Part 2
Lecture 3: 'keepdims' to True or False to avoid this warning.
Chapter 13: Variance and Standard Deviation Manual Calculation
Lecture 1: Variance and Standard Deviation Manual Calculation – Part 1
Lecture 2: Variance and Standard Deviation Manual Calculation – Part 2
Chapter 14: Variance and Standard Deviation using Python
Lecture 1: Variance and Standard Deviation using Python
Chapter 15: Percentile Manual Calculation
Lecture 1: Percentile Manual Calculation
Chapter 16: Percentile using Python
Lecture 1: Percentile using Python
Chapter 17: Uniform Distribution
Lecture 1: Uniform Distribution
Chapter 18: Normal Distribution
Lecture 1: Normal Distribution – Part 1
Lecture 2: Normal Distribution – Part 2
Chapter 19: Manual Z score calculation
Lecture 1: Manual Z score calculation
Chapter 20: Z score calculation using python
Lecture 1: Z score calculation using python
Chapter 21: Multi Variable Dataset Scatter Plot
Lecture 1: Multi Variable Dataset Scatter Plot
Chapter 22: Introduction to Linear Regression
Lecture 1: Introduction to Linear Regression
Chapter 23: Manually finding Linear Regression Correlation Coefficient
Lecture 1: Manually finding Linear Regression Correlation Coefficient – Part 1
Lecture 2: Manually finding Linear Regression Correlation Coefficient – Part 2
Chapter 24: Manually finding Linear Regression Slope Equation
Lecture 1: Manually finding Linear Regression Slope Equation – Part 1
Lecture 2: Manually finding Linear Regression Slope Equation – Part 2
Chapter 25: Manually Predicting the Future Value using Equation
Lecture 1: Manually Predicting the Future Value using Equation
Chapter 26: Linear Regression using Python Introduction
Lecture 1: Linear Regression using Python Introduction
Chapter 27: Linear Regression using Python
Lecture 1: Linear Regression using Python – Part 1
Lecture 2: Linear Regression using Python – Part 2
Chapter 28: Strong and Weak Linear Regression
Lecture 1: Strong and Weak Linear Regression
Chapter 29: Predicting Future value using Linear Regression in Python
Lecture 1: Predicting Future value using Linear Regression in Python
Chapter 30: Polynomial Regression Introduction
Lecture 1: Polynomial Regression Introduction
Chapter 31: Polynomial Regression Visualization
Lecture 1: Polynomial Regression Visualization
Chapter 32: Polynomial Regression Prediction and R2 value
Lecture 1: Polynomial Regression Prediction and R2 value
Chapter 33: Polynomial Regression Finding SD Components
Lecture 1: Polynomial Regression Finding SD Components
Chapter 34: Polynomial Regression Manual Method Equations
Lecture 1: Polynomial Regression Manual Method Equations
Chapter 35: Finding SD components for abc
Lecture 1: Finding SD components for abc
Chapter 36: Finding abc
Lecture 1: Finding abc
Chapter 37: Polynomial Regression Equation and Prediction
Lecture 1: Polynomial Regression Equation and Prediction
Chapter 38: Polynomial Regression coefficient
Lecture 1: Polynomial Regression coefficient
Chapter 39: Multiple Regression Introduction
Lecture 1: Multiple Regression Introduction
Chapter 40: Multiple Regression using Python – Part 1 – Data Import as CSV
Lecture 1: Multiple Regression using Python – Part 1 – Data Import as CSV
Chapter 41: Multiple Regression using Python – Part 2 – Data Visualization
Lecture 1: Multiple Regression using Python – Part 2 – Data Visualization
Chapter 42: Creating Multiple Regression Object and Prediction using Python
Lecture 1: Creating Multiple Regression Object and Prediction using Python
Chapter 43: Manual Multiple Regression – Intro and Finding Means
Lecture 1: Manual Multiple Regression – Intro and Finding Means
Chapter 44: Manual Multiple Regression – Finding Components
Lecture 1: Manual Multiple Regression – Finding Components – Part 1
Instructors
-
Abhilash Nelson
Computer Engineering Master & Senior Programmer at Dubai
Rating Distribution
- 1 stars: 0 votes
- 2 stars: 0 votes
- 3 stars: 1 votes
- 4 stars: 6 votes
- 5 stars: 7 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Language Learning Courses to Learn in November 2024
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024