Main difference between v1 and v2 of the scrapers is the upgrade of SDK to v2,
which requires node v15.10+. SDK v2 uses http2 to do the requests with cheerio-scraper
,
and the http2 support in older node versions were too buggy, so we decided to
drop support for those. If you need to run on older node version, use SDK v1.
Please refer to the SDK 1.0 migration guide for more details about functional changes in the SDK. SDK v2 basically only changes the required node version and has no other breaking changes.
- deprecated
useRequestQueue
option has been removed- request queue will be always used
- deprecated
context.html
getter from thecheerio-scraper
has been removed- use
context.body
instead
- use
- deprecated
prepareRequestFunction
input option- use
pre/postNavigationHooks
instead
- use
- removed
puppeteerPool
/autoscaledPool
from thecrawlingContext
objectpuppeteerPool
was replaces bybrowserPool
autoscaledPool
andbrowserPool
and available on thecrawler
property ofcrawlingContext
object
- custom "Key-value store name" option in Advanced configuration is now fixed, previously the default store was always used