Commit Graph

1261 Commits (c73c2cda50c20c51f8117b04ee76d3724ec9dff8)

Author SHA1 Message Date
Mike Fährmann eacebf41e4
fix typo in README
6 years ago
Leonardo Taccari 1e38f65996 [instagram] Add support for GraphSidecar media types (#201)
6 years ago
Mike Fährmann 6ba67b0537
[hypnohub] add extractors (closes #196)
6 years ago
Mike Fährmann fe27154a10
[komikcast] fix extraction
6 years ago
Mike Fährmann 5ec55ec4fc
[deviantart] improve URLs for non-downloadable deviations
6 years ago
Mike Fährmann c7a6b0ed90
[deviantart] add 'metadata' option (#189)
6 years ago
Mike Fährmann 8d96a8ce4c
[500px] add user-, gallery-, and image-extractors (#185)
6 years ago
Mike Fährmann d0f88c35be
[komikcast] fix extraction
6 years ago
Mike Fährmann 6277a739e4
[35photo] add user-, genre-, and image-extractors (#162)
6 years ago
Mike Fährmann fb14f80d62
[tumblr] fix avatar URLs for non-OAuth1.0 calls (closes #193)
6 years ago
Mike Fährmann 973a720a7a
[weibo] fix unit test URL patterns
6 years ago
Mike Fährmann a2af2d2965
adjust cache maxage values
6 years ago
Mike Fährmann f612284d24
cache cfclearance cookies
6 years ago
Mike Fährmann 591a07f20c
small code changes and cleanups
6 years ago
Mike Fährmann 6f57d44ec2
[seaotterscans] remove extractor
6 years ago
Mike Fährmann 6dae6bee37
automatically detect and bypass cloudflare challenge pages
6 years ago
Mike Fährmann 25aaf55514
[smugmug] improve format selection (closes #183)
6 years ago
Mike Fährmann 7c1cb923a4
[myportfolio] replace unit test
6 years ago
Mike Fährmann fffbfd3dce
[imgspice] fix extraction
6 years ago
Mike Fährmann 4ca4631bad
simplify auto-disabling certificate verification
6 years ago
Mike Fährmann 09d872a2b1
generalize extractor creation code
6 years ago
Mike Fährmann 8dc6be246b
[shopify] add custom retry logic for 430 status codes (#175)
6 years ago
Mike Fährmann 0887fb61f4
[komikcast] update test results
6 years ago
Mike Fährmann 976ccb267f
[myportfolio] combine gallery and user extractors
6 years ago
Mike Fährmann efd104e45e
[instagram] reject more non-user URLs (#180)
6 years ago
HRXN 56e0e92e0d [shopify] cosmetic changes in shopify.py (#181)
6 years ago
Mike Fährmann 9c0e2f294b
[shopify] add generic collection and product extractors (#175)
6 years ago
Mike Fährmann 26c4365baa
adjust metadata types for GalleryExtractors
6 years ago
Mike Fährmann 13e0f2a78f
[deviantart] add 'scraps' extractor (closes #168)
6 years ago
Mike Fährmann 3ea11f5d5e
[nhentai] rewrite
6 years ago
Mike Fährmann 3595cd582f
use GalleryExtractor as common base class
6 years ago
Mike Fährmann a138d5873d
[hentaifoundry] improve/fix extraction
6 years ago
Mike Fährmann 280531c8ff
[pururin] add gallery extractor (closes #174)
6 years ago
Mike Fährmann 3159dd79d5
[seiga] use HTTPS
6 years ago
Mike Fährmann f6734142ee
[komikcast] remove 'width' and 'height' info
6 years ago
Mike Fährmann d0059cab79
[tumblr] check for null URLs (closes #165)
6 years ago
Mike Fährmann e687a6095e
[luscious] raise exception if album is not available
6 years ago
Mike Fährmann 22d3a2fcc8
[artstation] add extractor for artwork listings (#80)
6 years ago
Mike Fährmann 937a802b49
[dynastyscans] add extractors for images and image searches
6 years ago
Mike Fährmann b09a8184ca
move TestJob into test module; test _extractor values
6 years ago
Mike Fährmann 19860655a3
[weibo] add 'user' and 'status' extractors
6 years ago
Mike Fährmann f8782c05f2
[paheal] rename "tags" to "search_tags"
6 years ago
Mike Fährmann c7b8421333
[deviantart] don't match 'www' as a potential username
6 years ago
Mike Fährmann 5530871b5a
change results of text.nameext_from_url()
6 years ago
Mike Fährmann 32edf4fc7b
add '_extractor' info to manga extractor results
6 years ago
Mike Fährmann 89ee8cd7e4
filter "private" kwdict entries
6 years ago
Mike Fährmann 61741d7333
provide type information for Queue messages
6 years ago
Mike Fährmann 2e516a1e3e
store the full original URL in Extractor.url
6 years ago
Mike Fährmann 580baef72c
change Chapter and MangaExtractor classes
6 years ago
Mike Fährmann 4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann ade86da7a1
[tsumino] replace test
6 years ago
Mike Fährmann 1f3422c28b
[mangahere] fix extraction
6 years ago
Mike Fährmann 84ae72b8d8
[ngomik] fix extraction
6 years ago
Mike Fährmann 02d733d219
[simplyhentai] fix and improve tag extraction
6 years ago
Mike Fährmann 3a0b4af744
[seiga] recognize /thumb/ URLs
6 years ago
Mike Fährmann 8fc6fbfa34
[artstation] recognize shortened project URLs
6 years ago
Mike Fährmann 9a9cd32461
implement alternative constructor for extractors
6 years ago
Mike Fährmann abbd45d0f4
update handling of extractor URL patterns
6 years ago
Mike Fährmann 6284731107
simplify extractor constants
6 years ago
Mike Fährmann 34bab080ae
rewrite URL patterns to use only 1 per extractor
6 years ago
Mike Fährmann 0e46db6f45
rename some base classes
6 years ago
Mike Fährmann 793b24e513
[imagehosts] fix and improve various extractors
6 years ago
Mike Fährmann bc0951d974
allow for simplified test data structures
6 years ago
Mike Fährmann 050bc1aa4a
[reactor] simplify tests
6 years ago
Mike Fährmann 2f3a021d72
[hentaicafe] restore functionality
6 years ago
Mike Fährmann 347398f692
fix various tests
6 years ago
Mike Fährmann 00dc37ccbf
replace AsynchronousMixin Extractor with a Mixin
6 years ago
Mike Fährmann 4d656a81ca
replace SharedConfigExtractor class with a Mixin
6 years ago
Mike Fährmann ccb95d0ba4
[mastodon] changes/improvements based on foolfuuka/-slide
6 years ago
Mike Fährmann 12ff750111
[foolfuuka] smaller code changes and updates
6 years ago
Mike Fährmann e1bf3b225e
[foolslide] dynamically generate extractor classes
6 years ago
Mike Fährmann 58a9eede38
[foolfuuka] dynamically generate extractor classes
6 years ago
Mike Fährmann 22d7a783d5
update extraction result tests
6 years ago
Mike Fährmann 197d0e99a4
[tsumino] more useful error message (#161)
6 years ago
Mike Fährmann d36ec51e5a
[tsumino] add extractor for search results (#161)
6 years ago
Mike Fährmann 1c1367ec5b
[behance] fix empty docstring
6 years ago
Mike Fährmann 45e529ab91
[behance] fix extraction
6 years ago
Mike Fährmann bfbbac4495
[tsumino] add login capabilities (#161)
6 years ago
Mike Fährmann dd358b4564
improve cookie handling during logins
6 years ago
Mike Fährmann 6126615698
update URLs for supportedsites.rst
6 years ago
Mike Fährmann 80a75a1ecf
[tsumino] add gallery extractor (#161)
6 years ago
Mike Fährmann 2d2953a5bf
add 'text.parse_float()' + cleanup in text.py
6 years ago
Mike Fährmann 0c32dc5858
[hentaifox] add extractor for search results (#160)
6 years ago
Mike Fährmann 580947bfce
[hentaifox] rename Chapter- to GalleryExtractor (#160)
6 years ago
Mike Fährmann 8095f5f81a
[mangapark] fix manga title extraction
6 years ago
Mike Fährmann 0156189468
[hentaifox] add chapter extractor (#160)
6 years ago
Mike Fährmann e4171d6baf
[luscious] add login capabilities (closes #159)
6 years ago
Mike Fährmann 4f49fdf065
[mastodon] various improvements and fixes (#144)
6 years ago
Mike Fährmann 3f608a84b7
[photobucket] don't crash if JSON data is missing
6 years ago
Mike Fährmann 134487ffb0
[exhentai] stop extraction if image limit is exceeded (#141)
6 years ago
Mike Fährmann e868fb4393
[exhentai] improve gallery extraction
6 years ago
Mike Fährmann a50e9faf0e
[newgrounds] recognize direct links
6 years ago
Mike Fährmann c5559fa07d
[photobucket] improve subalbum extraction (#117)
6 years ago
Mike Fährmann ecad69100a
[photobucket] add 'image' extractor (#117)
6 years ago
Mike Fährmann b50b30f1c9
[photobucket] download subalbums (#117)
6 years ago
Mike Fährmann d19bac71be
[photobucket] add 'album' extractor (#117)
6 years ago
Mike Fährmann 78b5f29a00
[sankaku] unescape tags
6 years ago
Mike Fährmann 9b8ac12eed
[behance] enable 'categorytransfer' for collections (#157)
6 years ago
Mike Fährmann 217a0687ef
[behance] add 'collection' extractor (closes #157)
6 years ago
Mike Fährmann b8fed34548
add generalized extractors for Mastodon instances (#144)
6 years ago
Mike Fährmann 66460337f1
[mangapark] fix extraction
6 years ago
Mike Fährmann 2ffc105887
[exhentai] extract tag metadata
6 years ago
Mike Fährmann 0fb98d1d79
[hbrowse] extract tag metadata
6 years ago
Mike Fährmann 9bbbadd93a
[hbrowse] use HTTPS
6 years ago
Mike Fährmann 2fbf072723
[newgrounds] ensure consistent tag order
6 years ago
Mike Fährmann d7a4739cf6
[hbrowse] print error message if site is down
6 years ago
Mike Fährmann 98c6520384
[pinterest] update root URL of API calls
6 years ago
Mike Fährmann 751e535948
[nhentai] fix extraction (closes #156)
6 years ago
Mike Fährmann 89df37a173
[artstation] use a separate dict for each asset (#154)
6 years ago
Mike Fährmann 1734a6c879
[reactor] detect "circular" redirects (#148)
6 years ago
Mike Fährmann e53cdfd6a8
update build_supportedsites.py
6 years ago
Mike Fährmann 1e4d351ad3
[danbooru] add authentication support (closes #151)
6 years ago
Mike Fährmann 06cbf5f9c4
implement 'chapter-reverse' option (#149)
6 years ago
Mike Fährmann e95b24f056
[reactor] add wait-min & -max options (#148)
6 years ago
Mike Fährmann 8e01cf0ef8
[reactor] generalize extractors (#148)
6 years ago
Mike Fährmann 1737d7f576
[joyreactor] fix and improve pagination (#148)
6 years ago
Mike Fährmann 8753627ef4
[joyreactor] improve error handling for faulty JSON (#148)
6 years ago
Mike Fährmann a36f52a730
[joyreactor] add extractor for search results (#148)
6 years ago
Mike Fährmann a303efb597
[mangadex] handle manga pages without chapters
6 years ago
Mike Fährmann 0afa913de4
[tumblr] add tests for hidden and private blogs (#145)
6 years ago
Mike Fährmann fa7fa2f8ff
[deviantart1 update tests]
6 years ago
Mike Fährmann b7b5456a32
[kissmanga] use HTTPS
6 years ago
Mike Fährmann 259123732f
[readcomiconline] improve comic-page parsing
6 years ago
Mike Fährmann 4ab0960083
[reddit] add metadata to extracted URLs
6 years ago
Mike Fährmann 2f4f60de33
[tumblr] add tests for each post type
6 years ago
Mike Fährmann 98314aa04c
[mangapark] detect non-existent chapters
6 years ago
Mike Fährmann 6c71e9cf5d
[deviantart] add separate 'sta.sh' extractor (#113)
6 years ago
Mike Fährmann f9ace0f4a3
[mangapark] fix manga extraction ... again
6 years ago
Mike Fährmann 28f9539551
[tumblr] change default values for post types and inline media
6 years ago
Mike Fährmann 5be95034ba
[tumblr] add option to download avatars (#137)
6 years ago
Mike Fährmann 7471933d5f
use extractor.request for all other API calls
6 years ago
Mike Fährmann 995844c915
[instagram] relax test pattern even more
6 years ago
Mike Fährmann 2e5f82e59e
[tumblr] don't follow 'external' Tumblr URLs (#139)
6 years ago
Mike Fährmann 0c9762f00e
[mangapark] fix extraction
6 years ago
Mike Fährmann c9ef5ed364
[luscious] ensure URLs have a scheme
6 years ago
Mike Fährmann 851ee9f89f
[sensescans] replace tests
6 years ago
Mike Fährmann 0be7ee3106
[hitomi] fix image subdomains (closes #142)
6 years ago
Mike Fährmann fe96835d25
[kissmanga] add fallback for chapter-string parsing (#20)
6 years ago
Mike Fährmann 4d73cc785d
update test results
6 years ago
Mike Fährmann 049a9575c4
[tumblr] fix inline extraction #2
6 years ago
Mike Fährmann f6bf66f72c
[pixiv] create directory for each "work" item (#136)
6 years ago
Mike Fährmann b7a9f6cc49
[tumblr] improve inline extraction (#137)
6 years ago
Mike Fährmann 010da8372a
[instagram] relax test pattern
6 years ago
Mike Fährmann 1c6b9ba322
[readcomiconline] use HTTPS
6 years ago
Leonardo Taccari 2655a2ea02 Add support for instagram.com user profiles and pages (#134)
6 years ago
HRXN e80ee77d71 tumblr.py: update regex for video (#133)
6 years ago
Mike Fährmann 9a98b6769d
use extractor.request for API calls (#130)
6 years ago
Mike Fährmann ad2cefda6b
[tumblr] in case of exception use filename as 'hash' (#129)
6 years ago
Mike Fährmann 95636418ad
[tumblr] catch exception for 'hash' extraction (fixes #129)
6 years ago
Mike Fährmann 40e30694f3
[pinterest] fix pin.it redirects
6 years ago