stereobooster/react-snap

Crawl only html documents

Open

#339 opened on Feb 28, 2019

View on GitHub
 (3 comments) (0 reactions) (0 assignees)JavaScript (4,982 stars) (403 forks)batch import
enhancementhelp wanted

Description

Feature Request

Is your feature request related to a problem? Please describe. Puppeteer does not handle PDF files. react-snap will crawl to them if links exist and the process will crash. There doesn't seem to be a way to ignore this.

Describe the solution you'd like I'd like the ability to be able to specify files/paths/globs to ignore as an option.

Describe alternatives you've considered I've looked at other snapshot libraries but prefer this one. I've looked at whether it's possible to pass an argument to puppeteer to handle this but it doesn't seem to be.

Teachability, Documentation, Adoption, Migration Strategy I'd imagine an option in package.json, like:

ignore: ["**.pdf"]

Contributor guide