Computer Vision: Python OCR & Object Detection Quick Starter
Computer Vision: Python OCR & Object Detection Quick Starter, available at $69.99, has an average rating of 4.4, with 55 lectures, based on 538 reviews, and has 6089 subscribers.
You will learn about Optical Character Recognition with Tesseract Library, Image Recognition using Keras, Object Recognition using MobileNet SSD, Mask R-CNN, YOLO, Tiny YOLO optical Character Recognition Image Recognition Object Recognition This course is ideal for individuals who are Beginners or who wants to start with Python based OCR, Image Recognition and Object Recognition It is particularly useful for Beginners or who wants to start with Python based OCR, Image Recognition and Object Recognition.
Enroll now: Computer Vision: Python OCR & Object Detection Quick Starter
Summary
Title: Computer Vision: Python OCR & Object Detection Quick Starter
Price: $69.99
Average Rating: 4.4
Number of Lectures: 55
Number of Published Lectures: 55
Number of Curriculum Items: 55
Number of Published Curriculum Objects: 55
Original Price: $109.99
Quality Status: approved
Status: Live
What You Will Learn
- Optical Character Recognition with Tesseract Library, Image Recognition using Keras, Object Recognition using MobileNet SSD, Mask R-CNN, YOLO, Tiny YOLO
- optical Character Recognition
- Image Recognition
- Object Recognition
Who Should Attend
- Beginners or who wants to start with Python based OCR, Image Recognition and Object Recognition
Target Audiences
- Beginners or who wants to start with Python based OCR, Image Recognition and Object Recognition
Hi There!
welcome to my new course ‘Optical Character Recognition and Object Recognition Quick Start with Python’. This is the third course from my Computer Vision series.
Image Recognition, Object Detection, Object Recognition and also Optical Character Recognition are among the most used applications of Computer Vision.
Using these techniques, the computer will be able to recognize and classify either the whole image, or multiple objects inside a single image predicting the class of the objects with the percentage accuracy score. Using OCR, it can also recognize and convert text in the images to machine readable format like text or a document.
Object Detection and Object Recognition is widely used in many simple applications and also complex ones like self driving cars.
This course will be a quick starter for people who wants to dive into Optical Character Recognition, Image Recognition and Object Detection using Python without having to deal with all the complexities and mathematics associated with typical Deep Learning process.
Let’s now see the list of interesting topics that are included in this course.
At first we will have an introductory theory session about Optical Character Recognition technology.
After that, we are ready to proceed with preparing our computer for python coding by downloading and installing the anaconda package and will check and see if everything is installed fine.
Most of you may not be coming from a python based programming background. The next few sessions and examples will help you get the basic python programming skill to proceed with the sessions included in this course. The topics include Python assignment, flow-control, functions and data structures.
Then we will install the dependencies and libraries that we require to do the Optical Character Recognition. We are using Tesseract Library to do the OCR. At first we will install the Library and then its python bindings. We will also install OpenCV, which is the Open Source Computer Vision library in Python.
We also will install the Pillow library, which is the Python Image Library. Then we will have an introduction to the steps involved in the Optical Character Recognition and later will proceed with coding and implementing the OCR program. We will use few example images to do a Character Recognition testing and will verify the results.
Then we will have an introduction to Convolutional Neural Networks , which we will be using to do the Image Recognition. Here we will be classifying a full image based on the single primary object in it.
We will then proceed with installing the Keras Library which we will be using to do the Image recognition. We will be using the built in , pre-trained Models that are included in Keras. The base code in python is also provided in the Keras documentation.
At first We will be using the popular pre-trained model architecture called the VGGNet. we will have an introductory session about the architecture of VGGNet. Then we will proceed with using the pre-trained VGGNet 16 Model included in keras to do Image Recognition and classification. We will try with few sample images to check the predictions. Then will move on to a deeper VGGNet 19 Model included in keras to do Image Recognition and classification.
Then we will try the ResNet pre-trained model included with the Keras library. We will include the model in the code and then we will try with few sample images to check the predictions.
And after that we will try the Inception pre-trained model. We will also include the model in the code and then we will try with few sample images to check the predictions. Then will go ahead with the Xception pre-trained model. Here also, we will include the model in the code and then we will try with few sample images.
And those were Image Recognition pre-trained models, which can only label and classify a complete image based on the primary object in it. Now we will proceed with Object Recognition in which we can detect and label multiple objects in a single image.
At first we will have an introduction to MobileNet-SSD Pre-trained Model, which is single shot detector that is capable of detecting multiple objects in a scene. We will be also be having a quick discussion about the dataset that is used to train this model.
Later we will be implementing the MobileNet-SSD Pre-trained Model in our code and will get the predictions and bounding box coordinates for every object detected. We will draw the bounding box around the objects in the image and write the label along with the confidence value.
Then we will go ahead with object detection from a live video. We will be streaming the real-time live video from the computer’s webcam and will try to detect objects from it. We will draw rectangle around each object detected in the live video along with the label and confidence.
In the next session, we will go ahead with object detection from a pre-saved video. We will be streaming the saved video from our folder and will try to detect objects from it. We will draw rectangle around each object detected along with the label and confidence.
Later we will be going ahead with the Mask-RCNN Pre-trained Model. In the previous model, we were only able to get a bounding box around the object, but in Mask-RCNN, we can get both the box co-ordinates as well the mask over the exact shape of object detected. We will have an introduction about this model and its details.
Later we will be implementing the Mask-RCNN Pre-trained Model in our code and as the first step we will get the predictions and bounding box coordinates for every object detected. We will draw the bounding box around the objects in the image and write the label along with the confidence value.
Later we will be getting the mask returned for each object predicted. We will process that data and use it to draw translucent multi coloured masks over each and every object detected and write the label along with the confidence value.
Then we will go ahead with object detection from a live video using Mask-RCNN. We will be streaming the real-time live video from the computer’s webcam and will try to detect objects from it. We will draw the mask over the perimeter of each object detected in the live video along with the label and confidence.
And like we did for our previous model, we will go ahead with object detection from a pre-saved video using Mask-RCNN. We will be streaming the saved video from our folder and will try to detect objects from it. We will draw coloured masks for object detected along with the label and confidence.
The Mask-RCNN is very accurate with vast class list but will be very slow in processing images using low power CPU based computers. MobileNet-SSD is fast but less accurate and low in number of classes. We need a perfect blend of speed and accuracy which will take us to Object Detection and Recognition using YOLO pre-trained model. we will have an overview about the yolo model in the next session and then we will implement yolo object detection from a single image.
And using that as the base, we will try the yolo model for object detection from a real time webcam video and we will check the performance. Later we will use it for object recognition from the pre-saved video file.
To further improve the speed of frames processed, we will use the model called Tiny YOLO which is a light weight version of the actual yolo model. We will use tiny yolo at first for the pre-saved video and will analyse the accuracy as well as speed and then we will try the same for a real-time video from webcam and see the difference in performance compared to actual yolo.
That’s all about the topics which are currently included in this quick course. The code, images and libraries used in this course has been uploaded and shared in a folder. I will include the link to download them in the last session or the resource section of this course. You are free to use the code in your projects with no questions asked.
Also after completing this course, you will be provided with a course completion certificate which will add value to your portfolio.
So that’s all for now, see you soon in the class room. Happy learning and have a great time.
Course Curriculum
Chapter 1: Course Introduction and Table of Contents
Lecture 1: Course Introduction and Table of Contents
Chapter 2: Introduction to OCR Concepts and Libraries
Lecture 1: Introduction to OCR Concepts and Libraries
Chapter 3: Setting up Environment – Anaconda
Lecture 1: Setting up Environment – Anaconda
Chapter 4: Python Basics (Optional)
Lecture 1: Python Basics – Part 1 – Assignment
Lecture 2: Python Basics – Part 2 – Flow Control
Lecture 3: Python Basics – Part 3 – Data Structures
Lecture 4: Python Basics – Part 4 – Functions
Chapter 5: Tesseract OCR Setup
Lecture 1: Tesseract OCR Setup – Part 1
Lecture 2: Tesseract Download Link
Lecture 3: Tesseract OCR Setup – Part 2
Chapter 6: OpenCV Setup
Lecture 1: OpenCV Setup
Chapter 7: Tesseract Image OCR Implementation
Lecture 1: Tesseract Image OCR Implementation – Part 1
Lecture 2: Tesseract Image OCR Implementation – Part 2
Chapter 8: Optional: cv2.imshow() Not Responding Issue Fix
Lecture 1: Optional: cv2.imshow() Not Responding Issue Fix
Lecture 2: Optional Fix 2: cv2.imshow() Not Responding
Chapter 9: Optional: OCR Text not printing in console
Lecture 1: Fix text not printing in console
Chapter 10: Introduction to CNN – Convolutional Neural Networks – Theory Session
Lecture 1: Introduction to CNN – Convolutional Neural Networks – Theory Session
Chapter 11: Installing Additional Dependencies for CNN
Lecture 1: Fix: if 'conda install' fails to resolve
Lecture 2: Installing Additional Dependencies for CNN
Lecture 3: IMPORTANT FIX: "ImportError: Keras requires TensorFlow 2.2 or higher' Windows OS
Chapter 12: Introduction to VGGNet Architecture
Lecture 1: Introduction to VGGNet Architecture
Chapter 13: Image Recognition using Pre-Trained VGGNet16 Model
Lecture 1: Image Recognition using Pre-Trained VGGNet16 Model – Part 1
Lecture 2: cannot import img_to_array, load_img
Lecture 3: Error: No module named keras.applications
Lecture 4: Keras Import Error OR Module Not Found Error
Lecture 5: Image Recognition using Pre-Trained VGGNet16 Model – Part 2
Lecture 6: TensorFlow "Module Not Found" Error Fix (Optional) – Do ONLY if you have error
Chapter 14: Image Recognition using Pre-Trained VGGNet19 Model
Lecture 1: Image Recognition using Pre-Trained VGGNet19 Model
Chapter 15: Image Recognition using Pre-Trained ResNet Model
Lecture 1: Image Recognition using Pre-Trained ResNet Model
Chapter 16: Image Recognition using Pre-Trained Inception Model
Lecture 1: Image Recognition using Pre-Trained Inception Model
Chapter 17: Image Recognition using Pre-Trained Xception Model
Lecture 1: Image Recognition using Pre-Trained Xception Model
Chapter 18: Introduction to MobileNet-SSD Pretrained Model
Lecture 1: Introduction to MobileNet-SSD Pretrained Model
Chapter 19: Mobilenet SSD Object Detection
Lecture 1: Mobilenet SSD Object Detection – Part 1
Lecture 2: Mobilenet SSD Object Detection – Part 2
Chapter 20: Mobilenet SSD Realtime Video
Lecture 1: Mobilenet SSD Realtime Video
Chapter 21: Mobilenet SSD Pre-saved Video
Lecture 1: Mobilenet SSD Pre-saved Video
Chapter 22: Mask RCNN Pre-trained model Introduction
Lecture 1: Mask RCNN Pre-trained model Introduction
Chapter 23: MaskRCNN Bounding Box Implementation
Lecture 1: MaskRCNN Bounding Box Implementation – Part 1
Lecture 2: MaskRCNN Bounding Box Implementation – Part 2
Chapter 24: MaskRCNN Object Mask Implementation
Lecture 1: MaskRCNN Object Mask Implementation – Part 1
Lecture 2: MaskRCNN Object Mask Implementation – Part 2
Chapter 25: MaskRCNN Realtime Video
Lecture 1: MaskRCNN Realtime Video – Part 1
Lecture 2: MaskRCNN Realtime Video – Part 2
Chapter 26: MaskRCNN Pre-saved Video
Lecture 1: MaskRCNN Pre-saved Video
Chapter 27: YOLO Pre-trained Model Introduction
Lecture 1: YOLO Pre-trained Model Introduction
Chapter 28: YOLO Implementation
Lecture 1: YOLO Implementation – Part 1
Lecture 2: YOLO Implementation – Part 2
Lecture 3: Error: invalid index to scalar variable
Chapter 29: YOLO Real-time Video
Lecture 1: YOLO Real-time Video
Chapter 30: YOLO Pre-saved Video
Lecture 1: YOLO Pre-saved Video
Chapter 31: Tiny YOLO Pre-saved Video
Lecture 1: Tiny YOLO Pre-saved Video
Chapter 32: Tiny YOLO Real-time Video
Lecture 1: Tiny YOLO Real-time Video
Chapter 33: YOLOv4 – Step 1 – Updating OpenCV version
Lecture 1: YOLOv4 – Step 1 – Updating OpenCV version
Chapter 34: YOLOv4 – Step 2 – Object Recognition Implementation
Lecture 1: YOLOv4 – Step 2 – Object Recognition Implementation
Chapter 35: SOURCE CODE AND FILES ATTACHED
Lecture 1: SOURCE CODE AND FILES ATTACHED
Instructors
-
Abhilash Nelson
Computer Engineering Master & Senior Programmer at Dubai
Rating Distribution
- 1 stars: 7 votes
- 2 stars: 9 votes
- 3 stars: 58 votes
- 4 stars: 215 votes
- 5 stars: 249 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Language Learning Courses to Learn in November 2024
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024