dixudx/tumblr-crawler

Some suggestions

Open

#85 opened on Jun 5, 2018

3 comments, 0 reactions, 0 assignees. Python, 1,144 stars, 353 forks.
Labels: batch import, help wanted

Description

  1. The crawler should support the basic form of a content address, such as "https://tumblr.blahblah.com/blah". When access to a certain Tumblr blog over http:// is blocked by the ISP, it should try https:// or fall back to that basic form.

  2. There should be a way to avoid retrying a download that has already failed once, for example by saving a dummy file under the same file name.

  3. When the address has the form "https://www.tumblr.com/dashboard/blog/blah", the crawler skips the downloads.
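
To make suggestions 2 and 3 more concrete, here is a minimal sketch. It is not taken from the crawler's code; the helper names (`blog_name_from_address`, `should_skip`, `mark_failed`) and the `.failed` suffix are made up for illustration. It assumes a blog address is either a bare name, a `https://<name>.tumblr.com/...` URL, or a dashboard URL, and it records a failure in an empty sidecar file instead of a dummy file with the exact same name, so that a failed download can still be told apart from a finished one.

```python
import os
from urllib.parse import urlparse


def blog_name_from_address(address):
    """Extract a blog name from a bare name or a Tumblr URL (hypothetical helper)."""
    address = address.strip()
    if "://" not in address:
        return address  # already a bare blog name
    parsed = urlparse(address)
    host = parsed.netloc.lower()
    parts = [p for p in parsed.path.split("/") if p]
    # Dashboard form: https://www.tumblr.com/dashboard/blog/<name>
    if host == "www.tumblr.com":
        if len(parts) >= 3 and parts[:2] == ["dashboard", "blog"]:
            return parts[2]
        return None
    # Subdomain form: http(s)://<name>.tumblr.com/...
    suffix = ".tumblr.com"
    if host.endswith(suffix):
        return host[: -len(suffix)]
    return None  # unknown form, left to the caller


def failure_marker(path):
    """Path of the empty sidecar file that records a failed download (assumed naming)."""
    return path + ".failed"


def should_skip(path):
    """Skip when the file already exists or an earlier attempt was marked as failed."""
    return os.path.exists(path) or os.path.exists(failure_marker(path))


def mark_failed(path):
    """Create the empty marker so that later runs do not retry this file."""
    with open(failure_marker(path), "w"):
        pass
```

With something like this, the download loop could call `should_skip(target_path)` before each request and `mark_failed(target_path)` once the retries are used up. Deleting the `.failed` files would then force a fresh attempt.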
