XKCD Comics Scraper Using Scrapy

Intro

A web scraper that collects data from XKCD Comics using Scrapy.

Scraped Contents

  • Title
  • Comic's URL
  • Comic image's URL
  • Transcript of the comic (if present)
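
As a rough sketch of what one scraped record could look like: the field names below (`title`, `url`, `image_url`, `transcript`) and the `ComicRecord` class are hypothetical and are not taken from the spider's actual code; check the example output for the real keys.

```python
import json
from dataclasses import dataclass, asdict
from typing import Optional


@dataclass
class ComicRecord:
    # Field names are illustrative assumptions, not the spider's real keys.
    title: str
    url: str
    image_url: str
    transcript: Optional[str] = None  # stays None when the comic has no transcript


def to_json(record: ComicRecord) -> str:
    """Serialize one record to a JSON object string."""
    return json.dumps(asdict(record))
```

Making the transcript optional mirrors the "(if present)" note above: a record without one still serializes, with the field set to `null`.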

Requirements

  • Python 3.x
  • Scrapy

Usage

scrapy crawl xkcd-spider > xkcd.json

Example output JSON and HTML source files are here.
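
Once `xkcd.json` exists, it could be inspected with a few lines of standard-library Python. This is a minimal sketch that assumes the file holds a single JSON array of objects and that the transcript is stored under a key named `transcript`; both are assumptions, and the helper names are made up for illustration.

```python
import json
from pathlib import Path


def load_comics(path):
    """Load scraped records, assuming the file holds a JSON array of objects."""
    data = json.loads(Path(path).read_text(encoding="utf-8"))
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of comic records")
    return data


def with_transcript(comics, key="transcript"):
    """Keep only records with a non-empty transcript (the key name is assumed)."""
    return [c for c in comics if c.get(key)]


# Example usage (hypothetical):
#   comics = load_comics("xkcd.json")
#   print(len(comics), "comics,", len(with_transcript(comics)), "with transcripts")
```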

Caveat

Example output has been beautified with JSON Lint for readability.

This scraper depends on the page source of XKCD Comics, so changes to the site's markup may break it.