Skip to content
@scrapy

Scrapy project

An open source and collaborative framework for extracting the data you need from websites. In a fast, simple, yet extensible way.

Pinned Loading

  1. scrapy scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    Python 53.5k 10.6k

  2. scrapy.org scrapy.org Public

    The scrapy.org website

    HTML 60 140

Repositories

Showing 10 of 27 repositories
  • scrapy Public

    Scrapy, a fast high-level web crawling & scraping framework for Python.

    scrapy/scrapy’s past year of commit activity
    Python 53,496 BSD-3-Clause 10,596 452 (19 issues need help) 222 Updated Dec 16, 2024
  • parsel Public

    Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors

    scrapy/parsel’s past year of commit activity
    Python 1,156 BSD-3-Clause 147 29 (1 issue needs help) 12 Updated Dec 16, 2024
  • itemadapter Public

    Common interface for data container classes

    scrapy/itemadapter’s past year of commit activity
    Python 62 BSD-3-Clause 13 6 4 Updated Dec 12, 2024
  • scrapy.org Public

    The scrapy.org website

    scrapy/scrapy.org’s past year of commit activity
    HTML 60 140 1 1 Updated Dec 9, 2024
  • protego Public

    A pure-Python robots.txt parser with support for modern conventions.

    scrapy/protego’s past year of commit activity
    DIGITAL Command Language 56 BSD-3-Clause 28 5 (1 issue needs help) 1 Updated Nov 15, 2024
  • scrapyd Public

    A service daemon to run Scrapy spiders

    scrapy/scrapyd’s past year of commit activity
    Python 2,978 BSD-3-Clause 569 7 0 Updated Nov 11, 2024
  • w3lib Public

    Python library of web-related functions

    scrapy/w3lib’s past year of commit activity
    Python 394 BSD-3-Clause 105 11 (1 issue needs help) 4 Updated Oct 16, 2024
  • itemloaders Public

    Library to populate items using XPath and CSS with a convenient API

    scrapy/itemloaders’s past year of commit activity
    Python 45 BSD-3-Clause 16 17 4 Updated Oct 16, 2024
  • cssselect Public

    CSS Selectors for Python

    scrapy/cssselect’s past year of commit activity
    Python 291 61 17 4 Updated Oct 16, 2024
  • queuelib Public

    Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python

    scrapy/queuelib’s past year of commit activity
    Python 271 BSD-3-Clause 54 3 2 Updated Oct 16, 2024