Optical Character Recognition (OCR) in Python
Optical Character Recognition (OCR) in Python, available at $79.99, has an average rating of 4.59, with 95 lectures, based on 241 reviews, and has 2489 subscribers.
You will learn about Use Tesseract, EAST and EasyOCR tools for text recognition in images and videos Understand the differences between OCR in controlled and natural environments Apply image pre-processing techniques to improve image quality, such as: thresholding, inversion, resizing, morphological operations and noise reduction Use EAST architecture and EasyOCR library for better performance in natural scenes Train an OCR from scratch using Deep Learning and Convolutional Neural Networks Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition) License plate reading This course is ideal for individuals who are Anyone interested in OCR (Optical Character Recognition) or Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision or Data Scientists who want to increase their knowledge in Computer Vision or Professionals interested in developing professional optical character recognition solutions or People interested in creating their own custom OCR It is particularly useful for Anyone interested in OCR (Optical Character Recognition) or Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision or Data Scientists who want to increase their knowledge in Computer Vision or Professionals interested in developing professional optical character recognition solutions or People interested in creating their own custom OCR.
Enroll now: Optical Character Recognition (OCR) in Python
Summary
Title: Optical Character Recognition (OCR) in Python
Price: $79.99
Average Rating: 4.59
Number of Lectures: 95
Number of Published Lectures: 95
Number of Curriculum Items: 95
Number of Published Curriculum Objects: 95
Original Price: $19.99
Quality Status: approved
Status: Live
What You Will Learn
- Use Tesseract, EAST and EasyOCR tools for text recognition in images and videos
- Understand the differences between OCR in controlled and natural environments
- Apply image pre-processing techniques to improve image quality, such as: thresholding, inversion, resizing, morphological operations and noise reduction
- Use EAST architecture and EasyOCR library for better performance in natural scenes
- Train an OCR from scratch using Deep Learning and Convolutional Neural Networks
- Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
- License plate reading
Who Should Attend
- Anyone interested in OCR (Optical Character Recognition)
- Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision
- Data Scientists who want to increase their knowledge in Computer Vision
- Professionals interested in developing professional optical character recognition solutions
- People interested in creating their own custom OCR
Target Audiences
- Anyone interested in OCR (Optical Character Recognition)
- Undergraduate students who are studying subjects related to Artificial Intelligence, Digital Image Processing or Computer Vision
- Data Scientists who want to increase their knowledge in Computer Vision
- Professionals interested in developing professional optical character recognition solutions
- People interested in creating their own custom OCR
Within the area of Computer Vision is the sub-area of Optical Character Recognition (OCR), which aims to transform images into texts. OCR can be described as converting images containing typed, handwritten or printed text into characters that a machine can understand. It is possible to convert scanned or photographed documents into texts that can be edited in any tool, such as the Microsoft Word. A common application is automatic form reading, in which you can send a photo of your credit card or your driver’s license, and the system can read all your data without the need to type them manually. A self-driving car can use OCR to read traffic signs and a parking lot can guarantee access by reading the license plate of the cars!
To take you to this area, in this course you will learn in practice how to use OCR libraries to recognize text in images and videos, all the code implemented step by step using the Python programming language! We are going to use Google Colab, so you do not have to worry about installing libraries on your machine, as everything will be developed online using Google’s GPUs! You will also learn how to build your own OCR from scratch using Deep Learning and Convolutional Neural Networks! Below you can check the main topics of the course:
-
Recognition of texts in images and videos using Tesseract, EasyOCR and EAST
-
Search for specific terms in images using regular expressions
-
Techniques for improving image quality, such as: thresholding, color inversion, grayscale, resizing, noise removal, morphological operations and perspective transformation
-
EAST architecture and EasyOCR library for better performance in natural scenes
-
Training an OCR from scratch using TensorFlow and modern Deep Learning techniques, such as Convolutional Neural Networks
-
Application of natural language processing techniques in the texts extracted by OCR (word cloud and named entity recognition)
-
License plate reading
These are just some of the main topics! By the end of the course, you will know everything you need to create your own text recognition projects using OCR!
Course Curriculum
Chapter 1: Introduction
Lecture 1: Course content
Lecture 2: Introduction to OCR
Lecture 3: Course materials
Chapter 2: OCR with Tesseract
Lecture 1: Introduction to Tesseract
Lecture 2: Preparing the environment
Lecture 3: First text recognition
Lecture 4: Support for other languages
Lecture 5: Page segmentation mode (PSM)
Lecture 6: Page orientation detection
Lecture 7: Selection of texts 1
Lecture 8: Selection of texts 2
Lecture 9: Selection of texts 3
Lecture 10: Search using regular expressions
Lecture 11: Detections in natural scenarios
Chapter 3: Techniques for image pre-processing
Lecture 1: Grayscale
Lecture 2: Thresholding – intuition
Lecture 3: Simple thresholding
Lecture 4: Thresholding with Otsu method
Lecture 5: Adaptive thresholding
Lecture 6: Gaussian adaptive thresholding
Lecture 7: Color inversion
Lecture 8: Resizing – intuition
Lecture 9: Resizing – implementation
Lecture 10: Morphological operations – intuition
Lecture 11: Morphological operations – implementation
Lecture 12: Noise removal – intuition
Lecture 13: Noise removal – implementation
Lecture 14: Text recognition with OCR
Lecture 15: HOMEWORK
Lecture 16: Homework solution
Chapter 4: OCR with EAST for natural scenes
Lecture 1: EAST – introduction
Lecture 2: Pre-processing the image
Lecture 3: Loading the neural network
Lecture 4: Decoding the image 1
Lecture 5: Decoding the image 2
Lecture 6: Text recognition
Chapter 5: Training a custom OCR
Lecture 1: Importing the libraries
Lecture 2: MNIST 0-9 dataset
Lecture 3: Kaggle A-Z dataset
Lecture 4: Joining the datasets
Lecture 5: Pre-processing the data
Lecture 6: Building the neural network
Lecture 7: Training the neural network
Lecture 8: Evaluating the neural network
Lecture 9: Saving the neural network
Lecture 10: Testing with images
Lecture 11: Preparing the environment
Lecture 12: Pre-processing the image
Lecture 13: Contour detection
Lecture 14: Processing the detections 1
Lecture 15: Processing the detections 2
Lecture 16: Character recognition
Lecture 17: Problems with 0 and O, 1 and l, 5 and S
Lecture 18: Problems with undetected texts
Chapter 6: Natural scenarios with EasyOCR
Lecture 1: Preparing the environment
Lecture 2: Text recognition
Lecture 3: Writing the results on the image
Lecture 4: Other languages – French and Chinese
Lecture 5: Text recognition (background)
Chapter 7: OCR in videos
Lecture 1: Preparing the environment
Lecture 2: Video settings
Lecture 3: Processing the video
Lecture 4: OCR with EAST and Tesseract
Lecture 5: OCR with EasyOCR
Chapter 8: Project 1: Searching for specific terms
Lecture 1: Preparing the environment
Lecture 2: Text recognition
Lecture 3: Searching for texts
Lecture 4: Word cloud
Lecture 5: Named entity recognition
Lecture 6: Search for texts in images
Lecture 7: Saving the results
Chapter 9: Project 2: Scanner + OCR
Lecture 1: Preparing the environment
Lecture 2: Contour detection
Lecture 3: Perspective transformation
Lecture 4: OCR with Tesseract
Lecture 5: Improving image quality
Lecture 6: Putting all together
Chapter 10: Project 3: License plate reading
Lecture 1: Pre-processing the image
Lecture 2: Text recognition
Lecture 3: Improving image quality
Chapter 11: Extra content 1: artificial neural networks
Lecture 1: Biological fundamentals
Lecture 2: Single layer perceptron
Lecture 3: Multilayer perceptron – sum and activation functions
Lecture 4: Multilayer perceptron – error calculation
Lecture 5: Gradient descent
Lecture 6: Delta parameter
Lecture 7: Updating weights with backpropagation
Lecture 8: Bias, error, stochastic gradient descent, and more parameters
Chapter 12: Extra content 2: convolutional neural networks
Instructors
-
Jones Granatyr
Professor -
Gabriel Alves
Developer -
AI Expert Academy
Instructor
Rating Distribution
- 1 stars: 3 votes
- 2 stars: 4 votes
- 3 stars: 18 votes
- 4 stars: 75 votes
- 5 stars: 141 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024
- Top 10 Gardening Courses to Learn in November 2024