Web Scraping with Python

Free Online Course: Web Scraping with Python is an online course provided by LinkedIn Learning, with 1-2 hours of material. The course is taught in English by Ryan Mitchell and is free of charge. Upon completion, you can receive an e-certificate from LinkedIn Learning.

Overview
  • This course dives into the process, strategies, and best practices of web scraping. Learn how to use the Python framework, Scrapy, to practice key techniques.

Syllabus
  • Introduction
    • How to learn to stop worrying and love the bot
    • What you should know
  • 1. Basic Web Scraping
    • What is web scraping?
    • How the internet works: A brief summary
    • Hello world with Scrapy
    • Challenge: Scraping all data on a page
    • Solution: Scraping all data on a page
  • 2. Learning to Crawl
    • Crawling a website
    • Recording data
    • Scrapy settings file
    • Structuring your scrapers for extensibility/reusability
    • Challenge: Scraping news sites
    • Solution: Scraping news sites
  • 3. Advanced Techniques
    • Submitting a form
    • Finding and using hidden APIs
    • Sitemaps and robots.txt
    • Challenge: Using CNN's sitemap
    • Solution: Using CNN's sitemap
  • 4. Acting Human
    • Logging in
    • Browser automation with Selenium
    • Interacting with a page
  • Conclusion
    • Next steps
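
To give a feel for the "Sitemaps and robots.txt" topic listed in the syllabus, here is a minimal sketch of how a scraper might check a site's robots.txt rules before fetching a page. It is not taken from the course, which uses Scrapy; it relies only on Python's standard-library `urllib.robotparser`. The robots.txt content, the `example.com` URLs, the `example-bot` user agent, and the `build_parser` helper are all hypothetical and chosen for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, inlined so the sketch runs without network access.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Allow: /
Sitemap: https://example.com/sitemap.xml
"""


def build_parser(robots_txt: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser (illustrative helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Ask whether a hypothetical crawler named "example-bot" may fetch each URL.
allowed_public = parser.can_fetch("example-bot", "https://example.com/articles/1")
allowed_private = parser.can_fetch("example-bot", "https://example.com/private/data")

# Sitemap URLs declared in robots.txt (site_maps() requires Python 3.8+).
sitemaps = parser.site_maps()

print(allowed_public, allowed_private, sitemaps)
```

In a real crawler the rules would be fetched from the target site rather than inlined, and a framework such as Scrapy may apply them through its own settings instead of an explicit check like this one.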