Commit Graph

2040 Commits (89ea1384fc6f79ce9147d14a1ea9b060adcaf3fb)

Author SHA1 Message Date
Mike Fährmann cc4ac80302
[weasyl] add 'favorite' extractor (#1032)
4 years ago
Mike Fährmann e9cc719497
[weasyl] update and simplify
4 years ago
Mike Fährmann 6514312126
[nijie] add 'include' option (closes #1018)
4 years ago
Mike Fährmann 0d43456323
[hentaifoundry] add 'include' option
4 years ago
Zanny ebb7737b9b
Weasyl Extractor (#977)
4 years ago
Mike Fährmann aeb0d32333
[twitter] improve twitpic extraction (fixes #1019)
4 years ago
Mike Fährmann 7cd383c0f9
update extractor test results
4 years ago
Mike Fährmann 1e313d5b84
implement 'sleep-request' option
4 years ago
Mike Fährmann c43b3894be
[myhentaigallery] update and fix extraction (#1001)
4 years ago
choeronline 05b9ac8d37
[myhentaigallery] add extractor (#1001)
4 years ago
Mike Fährmann 2626629117
[danbooru] handle posts without 'id' (fixes #1004)
4 years ago
Mike Fährmann cc1fb0b4ea
[500px] update query hash
4 years ago
Mike Fährmann da87a5fb7e
[exhentai] fix accessing config before main constructor
4 years ago
Mike Fährmann f5b7ae01c1
update extractor test results
4 years ago
Mike Fährmann 136df52d1f
[deviantart] support watchers-only/paid deviations (#995)
4 years ago
Mike Fährmann 055c32e0f7
precompute extractor config paths
4 years ago
Mike Fährmann 231dd4c800
accumulate postprocessor objects (#994)
4 years ago
Mike Fährmann 3108e85b89
[worldthree] remove extractors
4 years ago
Mike Fährmann 8fed3eb8cb
[jaiminisbox] remove extractors
4 years ago
Mike Fährmann dcf3ad7eef
[furaffinity] update download URL extraction (fixes #988)
4 years ago
Mike Fährmann 3918b69677
remove 'extractor.blacklist' context manager
4 years ago
Mike Fährmann 2b8d57f0ab
[twitter] support '/intent/user?user_id=…' URLs (#980)
4 years ago
Mike Fährmann a3b473bd2f
[twitter] support specifying users by ID (#980)
4 years ago
Mike Fährmann a0d916ed41
[exhentai] update wait time before original image download (#978)
4 years ago
Mike Fährmann f6fd449b59
reduce wait time growth rate from exponential to linear
4 years ago
Mike Fährmann bc48514d84
[aryion] get post ID via gallery-item (fixes #981, closes #982)
4 years ago
Mike Fährmann 799ca07fc8
[imgur] update
4 years ago
Mike Fährmann 7876a03ece
[tumblr] create directories for each post (fixes #965)
4 years ago
Mike Fährmann d50f3b333a
update extractor test results
4 years ago
Mike Fährmann 0f55b8e80a
[exhentai] fix type check from dbbbb21 (#940)
4 years ago
Mike Fährmann e33293fdd8
[hentaihand] update to new site layout
4 years ago
Mike Fährmann fda9e296dd
[gelbooru] fix extraction without API
4 years ago
Mike Fährmann 69e4871005
update extractor test results
4 years ago
Mike Fährmann ab1af66a97
[imgur] add 'search' extractor (#934)
4 years ago
Mike Fährmann e4bbc1fb5c
[imgur] add 'tag' extractor (#934)
4 years ago
Mike Fährmann deaacc70bb
[hitomi] update URL pattern for tag searches
4 years ago
ArtaxIsSleeping 0e941553ec
[aryion] Add username/password support (#960)
4 years ago
Mike Fährmann 84e04cc23b
[500px] fix extraction and update URL patterns (fixes #956)
4 years ago
Mike Fährmann d4ff767291
[reddit] improve gallery extraction (fixes #955)
4 years ago
Mike Fährmann 7140fe7e6d
[hitomi] fix redirect processing
4 years ago
Mike Fährmann a57b6b3c3a
[reddit] handle deleted galleries (fixes #953)
4 years ago
Mike Fährmann 063c71cd84
[furaffinity] add 'search' extractor (closes #915)
4 years ago
Mike Fährmann dbbbb21180
[exhentai] add ability to specify custom image limit (#940)
4 years ago
Mike Fährmann b2009ea39e
[aryion] update folder mime type list (fixes #945)
4 years ago
Mike Fährmann d06ad148c7
[shopify] use alternate regex for products on collection pages
4 years ago
Mike Fährmann 7619152988
[reactor] sort 'tags'
4 years ago
Mike Fährmann cd9de613a2
[exhentai] adjust image limit costs (#940)
4 years ago
Mike Fährmann 2e6f6ee1c1
[mangoxo] fix login
4 years ago
Mike Fährmann a6a080656c
[pixnet] detect password-protected albums (#177)
4 years ago
Mike Fährmann 67ac6667af
[mangareader] fix extraction
4 years ago
Mike Fährmann 2b88c90f6f
[blogger] add search extractor (#925)
4 years ago
Mike Fährmann d5067c51c5
[instagram] support '/reel/' URLs
4 years ago
Mike Fährmann 2c9766b29f
fix UnboundLocalError in Extractor.request()
4 years ago
Mike Fährmann aa64149583
[blogger] support searching posts by labels (closes #925)
4 years ago
Mike Fährmann 60ba3cb946
[reddit] support gallery posts (closes #920)
4 years ago
Mike Fährmann 0d84d3af55
[subscribestar] extract attached media files (#852)
4 years ago
Mike Fährmann 19bf76bcf8
update extractor test results
4 years ago
Mike Fährmann 0762d6b29c
[inkbunny] add 'num' field (#283)
4 years ago
Mike Fährmann fbc4278fe4
[instagram] wait before GraphQL requests (#901)
4 years ago
Mike Fährmann ec5870576d
[imgur] handle 403 overcapacity responses (closes #910)
4 years ago
Mike Fährmann d6a271d2c7
add 'response' objects to 'HttpError's
4 years ago
Mike Fährmann 72c5578a27
[hentainexus] improve/simplify code
4 years ago
Mike Fährmann 627d2141d3
[xhamster] fix extraction (closes #917)
4 years ago
Mike Fährmann 27e31f4a16
[myportfolio] raise 'NotFoundError' for deleted posts
4 years ago
Mike Fährmann f317a57c5e
[simplyhentai] fix 'gallery_id' extraction
4 years ago
Mike Fährmann daeef8a5e3
[vsco] handle missing 'description' fields
4 years ago
Mike Fährmann 26a967cbd4
[pinterest] match 'pinterest.co.uk' URLs (fixes #914)
4 years ago
Mike Fährmann c5aaa1de77
[inkbunny] simplify metadata structure (#283)
4 years ago
Mike Fährmann b921fee24d
[inkbunny] fix submission order (#283)
4 years ago
Mike Fährmann e50c75628c
[subscribestar] update 'date' parsing
4 years ago
Mike Fährmann c4ed9f4faa
[inkbunny] add 'metadata' option (#283)
4 years ago
Mike Fährmann 493cadb1e7
[inkbunny] add 'orderby' option (#283)
4 years ago
Mike Fährmann 336e682a7a
[inkbunny] handle gallery/scraps URLs (#283)
4 years ago
Mike Fährmann 8dbf827649
[bobx] remove module
4 years ago
Mike Fährmann 8f64585ff2
[twitter] handle 429 responses without x-rate-limit-reset header
4 years ago
Mike Fährmann d2e17e16bf
[inkbunny] update tests (#283)
4 years ago
Mike Fährmann 57f7d9b790
[inkbunny] improve error handling (#283)
4 years ago
Mike Fährmann baf5d0e3c1
[gfycat] skip malformed gfycat responses (closes #902)
4 years ago
Mike Fährmann 453f3bc519
[blogger] improve error messages for missing posts/blogs (#903)
4 years ago
Mike Fährmann 87202b8d74
[inkbunny] add 'user' and 'post' extractors (#283)
4 years ago
Mike Fährmann 2ecf1efb16
update extractor test results
4 years ago
Mike Fährmann d5fcffcced
[subscribestar] add login capabilities (#852)
4 years ago
Mike Fährmann ecaecc4064
[exhentai] add 'domain' option (#897)
4 years ago
Mike Fährmann 45c32213dc
[gfycat] retry 404'ed videos on redgifs (closes #874)
4 years ago
Mike Fährmann cf44571fe0
[gfycat] add 'user' and 'search' extractors
4 years ago
Mike Fährmann 11b744d971
[mangakakalot] improve/fix chapter extraction
4 years ago
Mike Fährmann 2da71cb561
[twitter] raise proper exception if user doesn't exist (#891)
4 years ago
Leonardo Taccari 86e5a05e29
[twitter] add support for nitter.net URLs in pattern (#890)
4 years ago
Mike Fährmann e17d4f44f6
[newgrounds] fix favorites extraction
4 years ago
Mike Fährmann c51fbd72ba
update extractor test results
4 years ago
Mike Fährmann 9cd1bc6907
[mangakakalot] update URL patterns, fix flake8 errors (#876)
4 years ago
jakem72360 7dfdcc3fbf
[mangakakalot] Added extractors for MangaKakalot (#876)
4 years ago
Mike Fährmann cb0132e441
[khinsider] add 'format' option (closes #840)
4 years ago
Mike Fährmann d594977ca1
[artstation] add 'following' extractor (closes #888)
4 years ago
Mike Fährmann 3855d0dd3c
[twitter] add debug messages for all skipped Tweets (#867)
4 years ago
Mike Fährmann 27d163afb3
[imgur] support all '/t/...' URLs (closes #880)
4 years ago
Mike Fährmann f5c9f1d066
[subscribestar] use current date instead of hard-coded '2020' (#852)
4 years ago
Mike Fährmann 5a6e750704
[reddit] fix AttributeError when using 'recursion' (fixes #879)
4 years ago
Mike Fährmann 94a08f0bcb
[reddit] limit title length in default filenames (#873)
4 years ago
Mike Fährmann 3424fb96c3
[redgifs] support gifsdeliverynetwork.com URLs (#874)
4 years ago
Mike Fährmann f1344fe552
[patreon] yield images and attachments before postfiles (#871)
4 years ago
Mike Fährmann 6e2af9a8d8
[twitter] improve error message formatting
4 years ago
Mike Fährmann c28db7a6ea
[8muses] support 'comics.8muses.com' URLs
4 years ago
Mike Fährmann d5bfb0b38c
set pseudo extension for Metadata messages (#865)
4 years ago
Mike Fährmann 821524e4ee
[subscribestar] add 'user' and 'post' extractors (#852)
4 years ago
Mike Fährmann e62ebb4643
update CHANGELOG before building sdist and wheel packages
4 years ago
Mike Fährmann f1ddbff0b5
[aryion] add 'recursive' option (fixes #832)
4 years ago
Mike Fährmann 699062b91f
Revert "[kissmanga] workaround for CAPTCHAs (#818)"
4 years ago
Mike Fährmann 0cac14c3bd
update extractor test results
4 years ago
Mike Fährmann 5e5be67c26
[tumblr] prevent KeyErrors when using reblogs=same-blog
4 years ago
Mike Fährmann 9da2bc67f8
[twitter] add option to filter media from quoted tweets (#854)
4 years ago
Mike Fährmann 56ab5fb8f4
[twitter] improve handling of quoted tweets (#854)
4 years ago
Mike Fährmann bd0e1ca1a5
[imgur] build directory path for each file (closes #842)
4 years ago
Mike Fährmann a8c2d997e8
[twitter] treat quoted tweets like retweets (#833)
4 years ago
Mike Fährmann aed1c63e51
[twitter] improve search results (fixes #847)
4 years ago
Mike Fährmann 0e714b9a0e
[pinterest] add 'section' extractor (#835)
4 years ago
Mike Fährmann 53cc498d9c
improve config lookup when there are multiple possible locations
4 years ago
Mike Fährmann d81a8e6544
[twitter] update tests
4 years ago
Mike Fährmann d39eedd9bb
[twitter] improve handling of deleted tweets (fixes #838)
4 years ago
Mike Fährmann 1ae1df0d27
update '--write-pages' (#737)
4 years ago
Mike Fährmann dc16f73965
[twitter] move '_guest_token()' into TwitterAPI class
4 years ago
Mike Fährmann 3561d1020a
[twitter] always provide an 'author' field (#831, #833)
4 years ago
Mike Fährmann 7158bdd7c7
[weibo] improve extractor logic (#829)
4 years ago
Mike Fährmann 0371fd54a1
[artstation] add 'date' metadata field (#839)
4 years ago
Mike Fährmann 8c857052d7
[mastodon] ignore toots without media attachments
4 years ago
Mike Fährmann de045d39b2
[mastodon] add 'date' metadata field (#839)
4 years ago
Mike Fährmann d5d90a0450
[weibo] add 'date' field to 'status' objects (#829)
4 years ago
Mike Fährmann 5ba90f72ca
[pinterest] add support for sections (closes #835)
4 years ago
Mike Fährmann c37a1c06c8
[twitter] add extractor for liked tweets (closes #837)
4 years ago
Mike Fährmann b94394104c
[twitter] don't download video previews (#833)
4 years ago
Mike Fährmann bb882b8cdb
improve output of '-K' for parent extractors (#825)
4 years ago
Mike Fährmann 4cf3d54718
[kissmanga] workaround for CAPTCHAs (fixes #818)
4 years ago
Mike Fährmann 7daef6ee70
update extractor test results
4 years ago
Mike Fährmann ffb6c5277a
[furaffinity] add 'artist_url' metadata field (closes #821)
4 years ago
Mike Fährmann be04e44e2c
[reddit] catch JSON decode errors (#765)
4 years ago
Mike Fährmann cf863f60b3
[redgifs] add 'user' and 'search' extractors (closes #724)
4 years ago
Mike Fährmann 998d1d3a5c
[webtoons] generalize and improve comic extraction (fixes #820)
4 years ago
Mike Fährmann 036a40943a
[twitter] don't cache results of 'user_by_screen_name()'
4 years ago
Mike Fährmann 4442dfe7b8
[twitter] add 'reply_to' metadata to replies
4 years ago
Mike Fährmann 83b7bd0413
[nhentai] fix extraction (closes #819)
4 years ago
Mike Fährmann d769bb4b80
[twitter] improve pagination
4 years ago
Mike Fährmann 5bc1097f9d
[twitter] metadata cleanup #2
4 years ago
Mike Fährmann c6c06c41f6
[deviantart] don't add journal text to description (#712)
4 years ago
Mike Fährmann 4aea5138dd
[sensescans] use https://
4 years ago
Mike Fährmann 3eed5f52d7
[twitter] small metadata cleanup
4 years ago
Mike Fährmann 655c98cbef
[twitter] skip unavailable tweets
4 years ago
Mike Fährmann 41d03160ff
[deviantart] also search journals for sta.sh links (#712)
4 years ago
Mike Fährmann 2132e5461a
[twitter] restore TwitPic support
4 years ago
Mike Fährmann bd0f21478a
[twitter] login using the mobile nojs login page
4 years ago
Mike Fährmann a10f31dde5
[twitter] rewrite; use new interface (#740, #806)
4 years ago
Mike Fährmann 3bad1579ee
update extractor test results
4 years ago
Mike Fährmann 864f4220d9
update output of 'oauth:…' (#616)
4 years ago
Mike Fährmann 0f459f340b
[instagram] fix and re-enable login with username&password
4 years ago
Mike Fährmann 3e0848a482
[instagram] disable login with username&password (#756)
4 years ago
Mike Fährmann a32aea41e1
[instagram] update 'query_hash' values
4 years ago
Mike Fährmann 2bff8dd465
[hentainexus] fix flake8 issues (#787)
4 years ago
Mike Fährmann a63682a9c0
[instagram] simplify code & complete tests (#743)
4 years ago
墨焓 a4e3d40672
hentainexus.py minor fix (#787)
4 years ago
Vrihub 62b65e59d0
Add instagram metadata: post_pageurl, post_tags (#743)
4 years ago
Mike Fährmann 275cceeb6a
[redgifs] fix extraction (#724)
4 years ago
Mike Fährmann 45baa13615
update extractor test results
4 years ago
Mike Fährmann dfcf2a2c91
write OAuth token to cache by default (#616)
4 years ago
Mike Fährmann 15c3d29062
move dump_response() into a separate function (#737)
4 years ago
Mike Fährmann a363da4b43
include redirects and headers in --write-pages dumps (#737)
4 years ago
Mike Fährmann 6bcdb264e0
[imgur] treat 't/unmuted' URLs as galleries
4 years ago
Mike Fährmann b6cee3e45b
[imgur] fix extraction of animated images without 'mp4' entry
4 years ago
Leonardo Taccari bcac31b7c7
[webtoons] make archive_fmt unique (#779)
4 years ago
Mike Fährmann e19f665a44
[danbooru] change default for 'ugoira' to 'false'
4 years ago
Mike Fährmann 3201fe3521
add global SENTINEL object
4 years ago
Mike Fährmann c8787647ed
add global WINDOWS bool
4 years ago
Mike Fährmann 6294e2c540
add 'text.ensure_http_scheme()'
4 years ago
Mike Fährmann 0378d079a5
[webtoons] fixes and simplifications (#593, #761)
4 years ago
Mike Fährmann ab11b1c896
[imagechest] simplify code (#750)
4 years ago
Mike Fährmann 846d3a2466
[sexcom] replace 404ed test
4 years ago
Mike Fährmann 9b4635917f
[gelbooru] simplify and fix pool extraction
4 years ago
Leonardo Taccari 39cd389679
[webtoons] Add a new extractor for webtoons.com (#761)
4 years ago
Bepis 7b5711ee04
[imagechest] Add new extractor for ImageChest (#750)
4 years ago
Mike Fährmann a1e739b96c
reuse connection adapters from parent extractors
4 years ago
Mike Fährmann f8f95e68a7
improve '--write-pages' (#737)
4 years ago
Mike Fährmann 09cc9dbec0
prevent flake8 errors from comments looking like type annotations
4 years ago
Mike Fährmann 2d6724180b
[hiperdex] update domain to hiperdex.info
4 years ago
Vrihub 4cc761c730
Implement --write-pages option (#736)
4 years ago
Mike Fährmann f557cac074
[redgifs] add image extractor (#724)
4 years ago
Mike Fährmann 65b1cb7acd
[deviantart] use private access tokens for Journals (fixes #738)
4 years ago
Mike Fährmann 0bf0146bfe
[reddit] don't send OAuth headers for file downloads (fixes #729)
4 years ago
Mike Fährmann d6a480682f
update test results
4 years ago
Leonardo Taccari b47cfc5ac9
[speakerdeck] Add a new extractor for speakerdeck.com (#726)
4 years ago
Mike Fährmann 90491ab606
[artstation] improve embed extraction (#720)
4 years ago
Mike Fährmann 999efec5cc
[deviantart] limit API wait times to 2**9=512 seconds (#721)
4 years ago
Mike Fährmann 504de79d8b
[vsco] fix extraction
4 years ago
Mike Fährmann 5e2974d699
[weibo] add 'videos' option
4 years ago
Mike Fährmann 9f638c2e01
[twitter] add 'replies' option (closes #705)
4 years ago
Mike Fährmann fc3e54275b
[patreon] respect filters and sort order in query params (#711)
4 years ago
Mike Fährmann 46b9a4d8ff
[patreon] improve hash extraction (#693, #713)
4 years ago
Mike Fährmann c56a751dae
[newgrounds] fix URLs produced by 'followng' extractors (#684)
4 years ago
Mike Fährmann a4fd620a25
[hiperdex] revert domain back to hiperdex.com
4 years ago
Mike Fährmann 233b6f93a2
[patreon] recognize URLs with creator IDs (#711)
4 years ago
Mike Fährmann 38b6bd66b0
[500px] match 'web.500px.com' subdomains
4 years ago
Mike Fährmann d3b3b30107
update test results
4 years ago
Mike Fährmann 5d7ca76885
retry Cloudflare challenges
4 years ago