Commit Graph

1199 Commits (50187818984af68f7443b24034c102c2e881daaf)

Author SHA1 Message Date
Mike Fährmann f612284d24
cache cfclearance cookies
6 years ago
Mike Fährmann 591a07f20c
small code changes and cleanups
6 years ago
Mike Fährmann 6f57d44ec2
[seaotterscans] remove extractor
6 years ago
Mike Fährmann 6dae6bee37
automatically detect and bypass cloudflare challenge pages
6 years ago
Mike Fährmann 25aaf55514
[smugmug] improve format selection (closes #183)
6 years ago
Mike Fährmann 7c1cb923a4
[myportfolio] replace unit test
6 years ago
Mike Fährmann fffbfd3dce
[imgspice] fix extraction
6 years ago
Mike Fährmann 4ca4631bad
simplify auto-disabling certificate verification
6 years ago
Mike Fährmann 09d872a2b1
generalize extractor creation code
6 years ago
Mike Fährmann 8dc6be246b
[shopify] add custom retry logic for 430 status codes (#175)
6 years ago
Mike Fährmann 0887fb61f4
[komikcast] update test results
6 years ago
Mike Fährmann 976ccb267f
[myportfolio] combine gallery and user extractors
6 years ago
Mike Fährmann efd104e45e
[instagram] reject more non-user URLs (#180)
6 years ago
HRXN 56e0e92e0d [shopify] cosmetic changes in shopify.py (#181)
6 years ago
Mike Fährmann 9c0e2f294b
[shopify] add generic collection and product extractors (#175)
6 years ago
Mike Fährmann 26c4365baa
adjust metadata types for GalleryExtractors
6 years ago
Mike Fährmann 13e0f2a78f
[deviantart] add 'scraps' extractor (closes #168)
6 years ago
Mike Fährmann 3ea11f5d5e
[nhentai] rewrite
6 years ago
Mike Fährmann 3595cd582f
use GalleryExtractor as common base class
6 years ago
Mike Fährmann a138d5873d
[hentaifoundry] improve/fix extraction
6 years ago
Mike Fährmann 280531c8ff
[pururin] add gallery extractor (closes #174)
6 years ago
Mike Fährmann 3159dd79d5
[seiga] use HTTPS
6 years ago
Mike Fährmann f6734142ee
[komikcast] remove 'width' and 'height' info
6 years ago
Mike Fährmann d0059cab79
[tumblr] check for null URLs (closes #165)
6 years ago
Mike Fährmann e687a6095e
[luscious] raise exception if album is not available
6 years ago
Mike Fährmann 22d3a2fcc8
[artstation] add extractor for artwork listings (#80)
6 years ago
Mike Fährmann 937a802b49
[dynastyscans] add extractors for images and image searches
6 years ago
Mike Fährmann b09a8184ca
move TestJob into test module; test _extractor values
6 years ago
Mike Fährmann 19860655a3
[weibo] add 'user' and 'status' extractors
6 years ago
Mike Fährmann f8782c05f2
[paheal] rename "tags" to "search_tags"
6 years ago
Mike Fährmann c7b8421333
[deviantart] don't match 'www' as a potential username
6 years ago
Mike Fährmann 5530871b5a
change results of text.nameext_from_url()
6 years ago
Mike Fährmann 32edf4fc7b
add '_extractor' info to manga extractor results
6 years ago
Mike Fährmann 89ee8cd7e4
filter "private" kwdict entries
6 years ago
Mike Fährmann 61741d7333
provide type information for Queue messages
6 years ago
Mike Fährmann 2e516a1e3e
store the full original URL in Extractor.url
6 years ago
Mike Fährmann 580baef72c
change Chapter and MangaExtractor classes
6 years ago
Mike Fährmann 4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann ade86da7a1
[tsumino] replace test
6 years ago
Mike Fährmann 1f3422c28b
[mangahere] fix extraction
6 years ago
Mike Fährmann 84ae72b8d8
[ngomik] fix extraction
6 years ago
Mike Fährmann 02d733d219
[simplyhentai] fix and improve tag extraction
6 years ago
Mike Fährmann 3a0b4af744
[seiga] recognize /thumb/ URLs
6 years ago
Mike Fährmann 8fc6fbfa34
[artstation] recognize shortened project URLs
6 years ago
Mike Fährmann 9a9cd32461
implement alternative constructor for extractors
6 years ago
Mike Fährmann abbd45d0f4
update handling of extractor URL patterns
6 years ago
Mike Fährmann 6284731107
simplify extractor constants
6 years ago
Mike Fährmann 34bab080ae
rewrite URL patterns to use only 1 per extractor
6 years ago
Mike Fährmann 0e46db6f45
rename some base classes
6 years ago
Mike Fährmann 793b24e513
[imagehosts] fix and improve various extractors
6 years ago
Mike Fährmann bc0951d974
allow for simplified test data structures
6 years ago
Mike Fährmann 050bc1aa4a
[reactor] simplify tests
6 years ago
Mike Fährmann 2f3a021d72
[hentaicafe] restore functionality
6 years ago
Mike Fährmann 347398f692
fix various tests
6 years ago
Mike Fährmann 00dc37ccbf
replace AsynchronousMixin Extractor with a Mixin
6 years ago
Mike Fährmann 4d656a81ca
replace SharedConfigExtractor class with a Mixin
6 years ago
Mike Fährmann ccb95d0ba4
[mastodon] changes/improvements based on foolfuuka/-slide
6 years ago
Mike Fährmann 12ff750111
[foolfuuka] smaller code changes and updates
6 years ago
Mike Fährmann e1bf3b225e
[foolslide] dynamically generate extractor classes
6 years ago
Mike Fährmann 58a9eede38
[foolfuuka] dynamically generate extractor classes
6 years ago
Mike Fährmann 22d7a783d5
update extraction result tests
6 years ago
Mike Fährmann 197d0e99a4
[tsumino] more useful error message (#161)
6 years ago
Mike Fährmann d36ec51e5a
[tsumino] add extractor for search results (#161)
6 years ago
Mike Fährmann 1c1367ec5b
[behance] fix empty docstring
6 years ago
Mike Fährmann 45e529ab91
[behance] fix extraction
6 years ago
Mike Fährmann bfbbac4495
[tsumino] add login capabilities (#161)
6 years ago
Mike Fährmann dd358b4564
improve cookie handling during logins
6 years ago
Mike Fährmann 6126615698
update URLs for supportedsites.rst
6 years ago
Mike Fährmann 80a75a1ecf
[tsumino] add gallery extractor (#161)
6 years ago
Mike Fährmann 2d2953a5bf
add 'text.parse_float()' + cleanup in text.py
6 years ago
Mike Fährmann 0c32dc5858
[hentaifox] add extractor for search results (#160)
6 years ago
Mike Fährmann 580947bfce
[hentaifox] rename Chapter- to GalleryExtractor (#160)
6 years ago
Mike Fährmann 8095f5f81a
[mangapark] fix manga title extraction
6 years ago
Mike Fährmann 0156189468
[hentaifox] add chapter extractor (#160)
6 years ago
Mike Fährmann e4171d6baf
[luscious] add login capabilities (closes #159)
6 years ago
Mike Fährmann 4f49fdf065
[mastodon] various improvements and fixes (#144)
6 years ago
Mike Fährmann 3f608a84b7
[photobucket] don't crash if JSON data is missing
6 years ago
Mike Fährmann 134487ffb0
[exhentai] stop extraction if image limit is exceeded (#141)
6 years ago
Mike Fährmann e868fb4393
[exhentai] improve gallery extraction
6 years ago
Mike Fährmann a50e9faf0e
[newgrounds] recognize direct links
6 years ago
Mike Fährmann c5559fa07d
[photobucket] improve subalbum extraction (#117)
6 years ago
Mike Fährmann ecad69100a
[photobucket] add 'image' extractor (#117)
6 years ago
Mike Fährmann b50b30f1c9
[photobucket] download subalbums (#117)
6 years ago
Mike Fährmann d19bac71be
[photobucket] add 'album' extractor (#117)
6 years ago
Mike Fährmann 78b5f29a00
[sankaku] unescape tags
6 years ago
Mike Fährmann 9b8ac12eed
[behance] enable 'categorytransfer' for collections (#157)
6 years ago
Mike Fährmann 217a0687ef
[behance] add 'collection' extractor (closes #157)
6 years ago
Mike Fährmann b8fed34548
add generalized extractors for Mastodon instances (#144)
6 years ago
Mike Fährmann 66460337f1
[mangapark] fix extraction
6 years ago
Mike Fährmann 2ffc105887
[exhentai] extract tag metadata
6 years ago
Mike Fährmann 0fb98d1d79
[hbrowse] extract tag metadata
6 years ago
Mike Fährmann 9bbbadd93a
[hbrowse] use HTTPS
6 years ago
Mike Fährmann 2fbf072723
[newgrounds] ensure consistent tag order
6 years ago
Mike Fährmann d7a4739cf6
[hbrowse] print error message if site is down
6 years ago
Mike Fährmann 98c6520384
[pinterest] update root URL of API calls
6 years ago
Mike Fährmann 751e535948
[nhentai] fix extraction (closes #156)
6 years ago
Mike Fährmann 89df37a173
[artstation] use a separate dict for each asset (#154)
6 years ago
Mike Fährmann 1734a6c879
[reactor] detect "circular" redirects (#148)
6 years ago
Mike Fährmann e53cdfd6a8
update build_supportedsites.py
6 years ago
Mike Fährmann 1e4d351ad3
[danbooru] add authentication support (closes #151)
6 years ago