Web Scraping With Python

Web Scraping With Python

Collecting Data From the Modern Web

Book - 2015 | First edition.
Average Rating:
Rate this:

Learn web scraping and crawling techniques to access unlimited data from any web source in any format. With this practical guide, you'll learn how to use Python scripts and web APIs to gather and process data from thousands--or even millions--of web pages at once.

Ideal for programmers, security professionals, and web administrators familiar with Python, this book not only teaches basic web scraping mechanics, but also delves into more advanced topics, such as analyzing raw data or using scrapers for frontend website testing. Code samples are available to help you understand the concepts in practice.

Learn how to parse complicated HTML pages Traverse multiple pages and sites Get a general overview of APIs and how they work Learn several methods for storing the data you scrape Download, read, and extract data from documents Use tools and techniques to clean badly formatted data Read and write natural languages Crawl through forms and logins Understand how to scrape JavaScript Learn image processing and text recognition
Publisher: Sebastopol, CA : O'Reilly Media, 2015.
Edition: First edition.
ISBN: 9781491910290
Branch Call Number: 005.133 PYTHON
Characteristics: xiii, 238 pages : illustrations, 24 cm


From the critics

Community Activity


Add a Summary
Apr 08, 2016

This book is a great introduction to Python. It also introduces several related web technologies. For web scraping, you will be learning more about parsing HTML, JavaScript, and JSON REST API’s. You will dip into Natural Language Processing and character encoding. There is also a quick chapter on OCR so you can get text from images. Naturally, there is some talk of MySQL to store the information. 200 well written pages, legendary O’Reilly quality.


Add a Comment

There are no comments for this title yet.


Add Age Suitability

There are no ages for this title yet.


Add Notices

There are no notices for this title yet.


Add a Quote

There are no quotes for this title yet.

Explore Further


Subject Headings

No similar edition of this title was found at SLPL.

Try searching for Web Scraping With Python to see if SLPL owns related versions of the work.

Suggest for Purchase

To Top