Web Crawling with Nodejs (H&M, Amazon, LinkedIn, AliExpress)
Web Crawling with Nodejs (H&M, Amazon, LinkedIn, AliExpress), available at $59.99, has an average rating of 4.6, with 38 lectures, based on 117 reviews, and has 767 subscribers.
You will learn about Differences between web crawling and web scraping in Nodejs The 3 main methods to use in web crawling, and when to use what method! How to get data from sites like H&M and AliExpress easily and fast using their hidden API's How to build a web crawler for server rendered sites like Amazon to crawl all their products How to build a Puppeteer based web crawler for a site that requires JavaScript like Linkedin This course is ideal for individuals who are Students looking to learn web crawling with Nodejs or Students looking to learn web scraping with Nodejs It is particularly useful for Students looking to learn web crawling with Nodejs or Students looking to learn web scraping with Nodejs.
Enroll now: Web Crawling with Nodejs (H&M, Amazon, LinkedIn, AliExpress)
Summary
Title: Web Crawling with Nodejs (H&M, Amazon, LinkedIn, AliExpress)
Price: $59.99
Average Rating: 4.6
Number of Lectures: 38
Number of Published Lectures: 38
Number of Curriculum Items: 38
Number of Published Curriculum Objects: 38
Original Price: $199.99
Quality Status: approved
Status: Live
What You Will Learn
- Differences between web crawling and web scraping in Nodejs
- The 3 main methods to use in web crawling, and when to use what method!
- How to get data from sites like H&M and AliExpress easily and fast using their hidden API's
- How to build a web crawler for server rendered sites like Amazon to crawl all their products
- How to build a Puppeteer based web crawler for a site that requires JavaScript like Linkedin
Who Should Attend
- Students looking to learn web crawling with Nodejs
- Students looking to learn web scraping with Nodejs
Target Audiences
- Students looking to learn web crawling with Nodejs
- Students looking to learn web scraping with Nodejs
Do you want to build a webcrawler in Nodejs?
In this course you will learn how to build a webcrawler using the newest JavaScript syntax with popular sites like H&M, Amazon, LinkedIn and AliExpress!
You’ll learn how to find hidden API’s on sites like H&M and AliExpress and see how you can even avoid building a web crawler in the first place, you can save a lot of time this way!
Then I show how to build a web crawler for Amazon the test-driven way, by building out tests for the various product page layouts there is on Amazon.
After that we’ll take a look at how to automate login and scraping profiles from LinkedIn using Puppeteer, the automated Chromium browser!
Course Curriculum
Chapter 1: Intro to web crawling and web scraping
Lecture 1: What is the web crawling and web scraping and how is it different ?
Lecture 2: Legality of web scraping and web crawling
Lecture 3: Tools we will be using during development
Lecture 4: Methods of web crawling and web scraping
Lecture 5: Optional: note about deprecation of Request/Request-Promise
Chapter 2: Getting all products from H&M and saving it to mongodb (Method 1)
Lecture 1: Finding hidden API using Chrome Dev Tools
Lecture 2: Testing hidden API inside Postman, and finding other section API endpoints
Lecture 3: Initializing NPM + some info about Nodejs Request and Needle
Lecture 4: Creating our HTTP request with needle inside Nodejs
Lecture 5: Adding User-Agent header to get past denial in nodejs
Lecture 6: Creating MongoDB cluster for saving data
Lecture 7: Connecting to MongoDB cluster from Nodejs
Lecture 8: Saving data to MongoDB
Lecture 9: Getting all products in MongoDB using a loop with offset variable and pagesize
Chapter 3: AliExpress – getting lots of products and prices using Method 1 (hidden API)
Lecture 1: Finding hidden API using Chrome Dev Tools
Lecture 2: Making API request from Postman with correct headers
Lecture 3: Making API request from Nodejs using Fetch API
Lecture 4: Getting many items using a for loop and sleep function
Lecture 5: Saving AliExpress products to MongoDB
Chapter 4: Building a Amazon webcrawler in Nodejs (Method 2, HTTP Requests)
Lecture 1: Intro to project
Lecture 2: Why are we using HTTP requests and not Puppeteer?
Lecture 3: Initializing NPM + installing jest, cheerio and needle npm packages
Lecture 4: Writing our reuseable httpRequest module for our testing and crawling
Lecture 5: Creating our test HTML file (check resources for URL)
Lecture 6: Setting up testing and intro to testing
Lecture 7: Writing our first test for our HTML parser
Lecture 8: Getting title from product page and making our test pass
Lecture 9: Getting the price from product page
Lecture 10: Making our second test and getting product links from page
Lecture 11: Writing out our actual webcrawling in 6 minutes!
Lecture 12: Setup so we only crawl only unique product ID's
Lecture 13: Adding a new test case for different layout + outtro
Chapter 5: Puppeteer web crawling on LinkedIn
Lecture 1: Intro to project
Lecture 2: Initializing project with puppeteer and cheerio packages
Lecture 3: Opening Puppeteer browser and navigating to URL
Lecture 4: Login to Linkedin using Puppeteer
Lecture 5: Getting profile links on a LinkedIn profile
Lecture 6: Building web crawler loop for Puppeteer
Instructors
-
Stefan Hyltoft
B.Eng Software Engineer
Rating Distribution
- 1 stars: 2 votes
- 2 stars: 1 votes
- 3 stars: 7 votes
- 4 stars: 33 votes
- 5 stars: 74 votes
Frequently Asked Questions
How long do I have access to the course materials?
You can view and review the lecture materials indefinitely, like an on-demand channel.
Can I take my courses with me wherever I go?
Definitely! If you have an internet connection, courses on Udemy are available on any device at any time. If you don’t have an internet connection, some instructors also let their students download course lectures. That’s up to the instructor though, so make sure you get on their good side!
You may also like
- Top 10 Language Learning Courses to Learn in November 2024
- Top 10 Video Editing Courses to Learn in November 2024
- Top 10 Music Production Courses to Learn in November 2024
- Top 10 Animation Courses to Learn in November 2024
- Top 10 Digital Illustration Courses to Learn in November 2024
- Top 10 Renewable Energy Courses to Learn in November 2024
- Top 10 Sustainable Living Courses to Learn in November 2024
- Top 10 Ethical AI Courses to Learn in November 2024
- Top 10 Cybersecurity Fundamentals Courses to Learn in November 2024
- Top 10 Smart Home Technology Courses to Learn in November 2024
- Top 10 Holistic Health Courses to Learn in November 2024
- Top 10 Nutrition And Diet Planning Courses to Learn in November 2024
- Top 10 Yoga Instruction Courses to Learn in November 2024
- Top 10 Stress Management Courses to Learn in November 2024
- Top 10 Mindfulness Meditation Courses to Learn in November 2024
- Top 10 Life Coaching Courses to Learn in November 2024
- Top 10 Career Development Courses to Learn in November 2024
- Top 10 Relationship Building Courses to Learn in November 2024
- Top 10 Parenting Skills Courses to Learn in November 2024
- Top 10 Home Improvement Courses to Learn in November 2024