3521 Commits (6a87c314af421277f451b284e4a627e675e544cf)

Author SHA1 Message Date
Mike Fährmann 243de697b9
merge #3976: [reddit] support cross-posted media (#887, #3586)
1 year ago
Mike Fährmann f8c4c5eef9
[reddit] simplify and add tests
1 year ago
thatfuckingbird 822a77d846
[danbooru] add support for booru.borvar.art instance
1 year ago
Mike Fährmann f3cca50b9e
[mangadex] update links to API docs
1 year ago
Mike Fährmann 65a9f4b124
merge #3950: [misskey] add 'favorite' extractor
1 year ago
Mike Fährmann c76f0f3a1b
[misskey] update
1 year ago
Mike Fährmann 3fca455b82
[pixiv] add 'embeds' option (#1241)
1 year ago
Mike Fährmann d1f2ef3b7b
[imagechest] update
1 year ago
Mike Fährmann 856f6c10cd
allow for GalleryExtractors to skip loading gallery_url
1 year ago
Mike Fährmann 4fc9675d48
[fanbox] skip 404ed or otherwise invalid posts (#4088)
1 year ago
Mike Fährmann 56b8b8cd36
[pixiv] support short novel URLs
1 year ago
Mike Fährmann e6f55d1555
[imagechest] add API support and 'access-token' option (#4065)
1 year ago
Mike Fährmann 77abcf5ab3
[gofile] automatically fetch 'website-token' by default
1 year ago
Mike Fährmann e3fed9bd17
[tcbscans] update domain to 'tcbscans.com' (#4080)
1 year ago
Mike Fährmann a83983c651
[instagram] add 'order-posts' option (#4017, #3993)
1 year ago
Mike Fährmann d680623db3
[instagram] add 'order-files' option (#4017, #3993)
1 year ago
Naatie f9b7a033e0
[misskey] refactor misskey extractor
1 year ago
Naatie 04dbfd994e
[misskey] add my favorites extractor
1 year ago
Mike Fährmann 82a12d6126
[nsfwalbum] detect placeholder images
1 year ago
Mike Fährmann 011e4607c3
[poipiku] extract full 'descriptions' (#4066)
1 year ago
Mike Fährmann 5037013e2b
[gofile] update 'website-token' (#4056)
1 year ago
Mike Fährmann 6b6bb4be73
[weibo] require numeric IDs to have length >= 10 (#4059)
1 year ago
Mike Fährmann 494acabd38
[danbooru] refactor pagination logic (#4002)
1 year ago
Mike Fährmann fd0e1ffd6e
[danbooru] improve 75666cf9 (#4002)
1 year ago
Mike Fährmann e41e45ff6b
[gofile] add basic password support (#4056)
1 year ago
Mike Fährmann 20dc13f832
[pixiv] initial 'novel' support (#1241, #4044)
1 year ago
Mike Fährmann c698c3de44
[newgrounds] add default delay between requests (#4046)
1 year ago
Mike Fährmann 708f478d15
[danbooru][e621] add 'date' metadata field (#4047)
1 year ago
Mike Fährmann 35c23a2fd8
merge #4031: [mangadex] add 'status' and 'tags' metadata
1 year ago
Mike Fährmann 2266fc8cc5
[mangadex] update and extend test results
1 year ago
Janne Alaranta 1ce5dc9e18
fix whitespaces
1 year ago
Janne Alaranta 13dedae09f
add status and tags info to mangadex extractor
1 year ago
Mike Fährmann be0fa94b2e
[imagechest] load all images when a 'Load More' button is present
1 year ago
Mike Fährmann 7eadcbea70
[4chanarchives] add end condition for 'board' extractor (#4012)
1 year ago
Mike Fährmann 1406f7125f
[4chanarchives] add 'thread' and 'board' extractors (#4012)
1 year ago
Mike Fährmann d12dd3813c
[imgur] fix internal image/album URLs
1 year ago
Mike Fährmann 8520de57f0
[imgur] add 'favorite-folder' extractor (#4016)
1 year ago
Mike Fährmann 3ca5dac8b6
extend 'cookies-update' functionality
1 year ago
Mike Fährmann bc6d65d203
implement 'Extractor.config_deprecated()'
1 year ago
Mike Fährmann 850df34c31
remove '&' from URL patterns part 2
1 year ago
Mike Fährmann 4d415376d1
[pinterest] fix 'pin.it' extractor
1 year ago
Mike Fährmann 657b6a9100
[pinterest] update endpoint for related board pins
1 year ago
Mike Fährmann 79f47f98dd
[nana] remove module
1 year ago
Mike Fährmann 0e74df1de8
[420chan] remove module
1 year ago
Mike Fährmann 7499fa7075
[exhentai] remove and update sad panda check
1 year ago
Mike Fährmann 076380e079
remove '*' indicating keyword-only arguments
1 year ago
Mike Fährmann 0c46758a93
[foolslide] remove 'sensescans.com'
1 year ago
Mike Fährmann a08fdfac6e
[foolfuuka] add 'archive.palanq.win'
1 year ago
Mike Fährmann 1870df8b23
[foolfuuka] remove 'tokyochronos.net'
1 year ago
Mike Fährmann ef4e2d8178
[foolfuuka] remove 'archive.alice.al'
1 year ago
Mike Fährmann b12dad8df5
[pixiv] fix 'pixivision' extraction
1 year ago
Mike Fährmann 5fb7107f2b
[imxto] fix 'gallery' extraction
1 year ago
Mike Fährmann 15d7c5a199
[behance] 'items()' -> 'values()'
1 year ago
Mike Fährmann 0fb580135d
[behance] fix extraction (#3980)
1 year ago
Alexandru Vasilescu d4f8b2fe22
fix: linter issues
1 year ago
Alexandru Vasilescu 1b918bd937
fix(extractor): fix extraction for cross-posted reddit videos and galleries
1 year ago
Mike Fährmann 215028a462
[manganelo] match more minor version separators (#3972)
1 year ago
thatfuckingbird 9f76783ac0
[pixiv] allow sorting by popularity (requires pixiv premium)
1 year ago
Mike Fährmann 7865067d19
[shimmie2] add generic extractors for Shimmie2 sites (#3734)
1 year ago
Mike Fährmann 28419bf45a
[itchio] add 'game' extractor (#3923)
1 year ago
Mike Fährmann 5297ee0cd9
[tumblr] add 'day' extractor (#3951)
1 year ago
Mike Fährmann de670bd7de
[tumblr] update pagination logic (#2191)
1 year ago
Mike Fährmann 98c9fdb414
[deviantart] revert e9353c63; retry downloads with private token
1 year ago
Mike Fährmann 5d7435e803
[nitter] extract user IDs from encoded banner URLs
1 year ago
Mike Fährmann 7f25cab56e
[sankaku] support post URLs with MD5 hashes (#3952)
1 year ago
Mike Fährmann a05120412a
[oauth] catch exception from 'webbrowser.get()' (#3947)
1 year ago
Mike Fährmann 3fc2223893
merge #3935: [reddit] match 'preview.redd.it' URLs
1 year ago
Mike Fährmann 1d505b39f8
[twitter] support 'profile-conversation' entries (#3938)
1 year ago
Mike Fährmann aaf58a1259
[imgur] document 'client-id' option (#3937)
1 year ago
Mike Fährmann 202f5d86a7
[reddit] ignore 'id-max' value "zik0zj"/2147483647
1 year ago
Mike Fährmann 8586ee81be
[nana] fix 'keyword' tests
1 year ago
ClosedPort22 cd4bfb0dd1
[reddit] match 'preview.redd.it' URLs
1 year ago
Mike Fährmann faca32a850
[sankaku] sanitize 'date:…' tags (#1790)
1 year ago
Mike Fährmann 6f1e34ec69
[vipergirls] add 'thread' and 'post' extractors
1 year ago
Mike Fährmann 81bd2af83e
[2chen] update domain to sturdychan.help
1 year ago
Mike Fährmann f500b45b5e
[twitter] improve 480bc34e
1 year ago
Mike Fährmann 5b635f2317
[imxto] add 'gallery' extractor (#1289)
1 year ago
Mike Fährmann 359e31e462
[nozomi] update file URLs (#3925)
1 year ago
Mike Fährmann 2dfd4a3de2
[imagefap] extract 'categories' metadata and fix empty 'tags'
1 year ago
Mike Fährmann 480bc34e54
[twitter] do not overwrite previously assigned users (#3922)
1 year ago
Mike Fährmann 02ec5bb8e5
[imagefap] extract 'description' metadata (#3905)
1 year ago
Mike Fährmann d253a3c542
merge #3841: [urlshortener] add support for bit.ly & t.co
1 year ago
Mike Fährmann 5e63942b37
[urlshortener] update
1 year ago
Mike Fährmann c45f09d2a8
[imagechest] fix extraction (#3914)
1 year ago
Mike Fährmann 2cd4411ff8
[nitter] extract videos from 'source' elements (#3912)
1 year ago
Mike Fährmann 9501579279
[sexcom] fix fetching HD videos
1 year ago
Mike Fährmann a2f7274eae
[sexcom] fix pagination (#3906)
1 year ago
Mike Fährmann e9353c63d6
[deviantart] keep using private access tokens
1 year ago
Mike Fährmann e70af6a550
[hentaifoundry] do not update filters when cookies are provided
1 year ago
Mike Fährmann 9c29c904c7
[mastodon] try to get account IDs without access token
1 year ago
Mike Fährmann 1614c5c4bf
[generic] write regular expressions without 'x' flags
1 year ago
Mike Fährmann d84a617273
[hentaifoundry] fix setting content filters (#3887)
1 year ago
ClosedPort22 875485313f
[urlshortener] force HTTPS
1 year ago
Mike Fährmann 0a7eee3ee0
[deviantart] add 'public' option
1 year ago
Mike Fährmann f5a59c4170
[twitter] add 'date_bookmarked' metadata (#3816)
1 year ago
Mike Fährmann 1c1f6fdc80
[twitter] fix regression from 160335ad
1 year ago
Mike Fährmann 160335ad44
[twitter] add 'date_liked' metadata for liked Tweets (#3816)
1 year ago
Mike Fährmann 6d850ce629
[twitter] calculate 'date' from Tweet IDs
1 year ago
Mike Fährmann 25949bd767
merge #3871: [hotleak] Fix downloading of creators whose name starts with a category name
1 year ago
Mike Fährmann dbe06cdba1
[twitter] warn about 'withheld' Tweets and users (#3864)
1 year ago
Mike Fährmann 3cc1dd1572
[twitter] update query hashes
1 year ago
Mike Fährmann 3846ce0de5
[twitter] update to bookmark timeline v2 (#3859)
1 year ago
Mike Fährmann 34699fbf64
[deviantart:search] detect login redirects (#3860)
1 year ago
Mike Fährmann e6cb92864a
[twitter] allow setting custom features per API endpoint
1 year ago
Balgden 4b141cce66
Fix indentation
1 year ago
Balgden bbc5977121
Fix line length
1 year ago
Balgden ffd30abcb3
[hotleak] Fix downloading of creators whose name starts with a category name
1 year ago
Mike Fährmann 5ca9d55595
merge #3870: [blogger] update 'sub' regex to get the highest resolution url
1 year ago
Mike Fährmann fd7ce4c081
merge #3868: [shopify] fix 'collection' extractor
1 year ago
Mike Fährmann 135ac9c302
merge #3854: [twitter] fix: graphql_timeline_v2_bookmark_timeline cannot be null
1 year ago
enduser420 bbb1e34c34
[blogger] update sub regex
1 year ago
enduser420 96e3dd2128
[shopify] fix 'collection' extractor
1 year ago
Mike Fährmann ac97aca99c
[realbooru] fix extraction
1 year ago
Mike Fährmann 75666cf9c3
[danbooru] reduce API requests for fetching extended 'metadata'
1 year ago
Amer Jazaerli bebbff6578
fix: graphql_timeline_v2_bookmark_timeline cannot be null
1 year ago
ClosedPort22 71b26adb9b
[urlshortener] add tinyurl.com as an example
2 years ago
Mike Fährmann 421db26aff
[bunkr] update domain to 'bunkr.la'
2 years ago
ClosedPort22 9e2a945013
[urlshortener] add support for bit.ly & t.co
2 years ago
Mike Fährmann 9b5e7ce8b9
[hiperdex] fix extraction
2 years ago
Mike Fährmann 89a67c45e0
[nitter] support nitter.it (#3819)
2 years ago
Mike Fährmann 88f29a751d
[nitter] skip broadcasts
2 years ago
Mike Fährmann 1e013eba5a
[nitter] fix extraction for instances without user banners
2 years ago
Mike Fährmann d94aa1ee02
[gelbooru] fix --range for favorites (#3704)
2 years ago
Mike Fährmann 1f82b00b8f
[gelbooru] fix and improve --range for pools
2 years ago
Mike Fährmann 197882cf12
[twitter] add 'hashtag' extractor (#3783)
2 years ago
Mike Fährmann 9789ebac52
[naverwebtoon] fix extraction (#3729)
2 years ago
Mike Fährmann 72f1f16eb2
[weibo] support 'mix_media_info' entries (#3793)
2 years ago
ClosedPort22 d4fb4ff47f
[twitter] extract TwitPic URLs in text (#3792)
2 years ago
Mike Fährmann 2bb937014f
[twitter] fall back to legacy /media endpoint when not logged in
2 years ago
Mike Fährmann b68094d326
[twitter] support 'note_tweet's
2 years ago
Mike Fährmann 3dcabc97ed
[twitter] update API endpoints and parameters
2 years ago
Mike Fährmann dcb8af659a
[gelbooru] extract favorites without needing cookies (#3704)
2 years ago
Mike Fährmann b756dc13aa
[gelbooru] warn about missing cookies for favorites (#3704)
2 years ago
Mike Fährmann 17bd053d94
[hiperdex] fix extraction (#3768)
2 years ago
Mike Fährmann 817fc0fbd1
[nitter] remove nitter.pussthecat.org
2 years ago
Mike Fährmann 67ec91cdbd
[downloader:http] change '_http_retry' to accept a Python function
2 years ago
Mike Fährmann 175822e065
merge #3738: [generic] add tests
2 years ago
Mike Fährmann 4883420e67
[generic] revert pattern change
2 years ago
Mike Fährmann 9037128315
[twitter] fix some 'original' retweets not downloading (#3744)
2 years ago
Mike Fährmann ea3d95e7e8
merge #3740: [deviantart] add support for fxdeviantart.com URLs
2 years ago
Mike Fährmann 9abcb2b6e5
update headers and ciphers for '"browser": "chrome"'
2 years ago
ClosedPort22 c489aecb3e
[deviantart] add support for fxdeviantart.com URLs
2 years ago
ClosedPort22 34a7fab0e2
[generic] add support for IDNs
2 years ago
Mike Fährmann c9a7345228
[newgrounds] prevent archive ID overlap (#3681)
2 years ago
Mike Fährmann da9840a39d
[reddit] update 'videos' option (#3712)
2 years ago
Mike Fährmann baf41d7437
[misskey] update (#3717)
2 years ago
Mike Fährmann 6762d99515
merge #3717: [misskey] add misskey extractors
2 years ago
Mike Fährmann b8a702929d
[oauth] import extractor modules on demand
2 years ago
Mike Fährmann dd88740ec7
replace remaining instances of base64 with binascii
2 years ago
enduser420 e1867cf5eb
[misskey] add 'renotes' and 'replies' options
2 years ago
enduser420 a95b5e0d8e
[misskey] add misskey extractors
2 years ago
Mike Fährmann 0d142e403c
[szurubooru] add 'tag' and 'post' extractors (#3583, #3713)
2 years ago
Mike Fährmann b14f8d5817
[gelbooru] add 'favorite' extractor (#3704)
2 years ago
Mike Fährmann a70a3e5da6
[mangasee] extract 'author' and 'genre' metadata (#3703)
2 years ago
Mike Fährmann 6b03506655
[deviantart] allow searching when not logged in
2 years ago
Mike Fährmann 511a051705
[fanbox] fix crash with missing images (#3673)
2 years ago
Mike Fährmann 3fa456d989
[deviantart] remove mature scraps warning (#3691)
2 years ago
Mike Fährmann 51301e0c31
replace remaining time.sleep() calls
2 years ago
Mike Fährmann 6ed4309aba
[deviantart] add 'gallery-search' extractor (#1695)
2 years ago
Mike Fährmann 3d8777fbc1
move user agent string to util.py
2 years ago
Mike Fährmann e1df7f73b1
[deviantart] add 'search' extractor
2 years ago
Mike Fährmann 4f029ab38b
[pornpics] support '/pornstar' and '/channels' listings
2 years ago
Mike Fährmann cbe4769246
[danbooru] use gallery-dl UA (#3665)
2 years ago
Mike Fährmann 253ac08203
pre-define and use 'gallery-dö/<version>' UA string
2 years ago
Mike Fährmann b4899c266f
merge #3656: [deviantart] fix crash when handling deleted deviations in status updates
2 years ago
Mike Fährmann bb11c2a576
merge #3662: [redgifs] add 'collection' extractors
2 years ago
Mike Fährmann 884f1848d6
[redgifs] fix syntax for older Python versions
2 years ago
Mike Fährmann 725baedad3
[deviantart] use '/collections/all' endpoint for favorites
2 years ago
Mike Fährmann 2bd8f2f4bd
[pornpics] add 'search' and 'tag' extractors
2 years ago
Mike Fährmann 79bc82884c
[pornpics] add 'gallery' extractor (#263, #3544, #3654)
2 years ago
Mike Fährmann 7bdc1d6d3d
[manganelo] update and fix metadata extraction
2 years ago
Mike Fährmann 363bb76dff
[manganelo] simplify URL pattern
2 years ago
enduser420 b28bd9789e
[redgifs] add 'collection' extractors
2 years ago
ClosedPort22 f4e211356d
[deviantart] slight refactor
2 years ago
Mike Fährmann bd5d08abbc
[catbox] add 'file' extractor (#3570)
2 years ago
Mike Fährmann 8e1e8a5bea
[soundgasm] rewrite (#3578)
2 years ago
Mike Fährmann 0b93420a81
[pinterest] unescape search terms (#3621)
2 years ago
Mike Fährmann ad96e70546
[bunkr] fix extraction (#3636, #3655)
2 years ago
Mike Fährmann 9335d55bbc
[manganelo] support mobile-only chapters
2 years ago
ClosedPort22 a74114ef7a
[deviantart] fix crash when handling deleted deviations
2 years ago
Mike Fährmann 75570ad3f1
[oauth] remove stray 'exit()' (#3628)
2 years ago
Mike Fährmann 8fb043e8ff
[tumblr] raise more detailed errors for dashboard-only blogs
2 years ago
Mike Fährmann ce996dd21b
[poipiku] warn about incorrect passwords (#3646)
2 years ago
Mike Fährmann 70ce45d965
[oauth] use default name for browsers without 'name' attribute
2 years ago
Mike Fährmann 2a53e6445c
[bunkr] update domain (#3636)
2 years ago
Mike Fährmann 5503ac4d5e
replace json.dumps with direct calls to JSONEncoder.encode
2 years ago
Mike Fährmann dd884b02ee
replace json.loads with direct calls to JSONDecoder.decode
2 years ago
Mike Fährmann 8805bd38ab
merge #3622: [imagetwist] add phun.imagetwist.com and imagehaha.com support
2 years ago
Mike Fährmann 706ec70e89
[imagetwist] simplify pattern and add tests
2 years ago
Mike Fährmann f2e91732ae
[instagram] add 'user' metadata field (#3107)
2 years ago
Prinz23 29f0830b53
[imagetwist] add phun.imagetwist.com and imagehaha.com alias to imagetwist extractor
2 years ago
Mike Fährmann bbf0911a46
[e621] implement 'notes' and 'pools' metadata extraction
2 years ago
Mike Fährmann 925b467496
split e621 from danbooru module (#3425)
2 years ago
Mike Fährmann 1ae48a54f8
[twitter] add 'transform' option
2 years ago
Mike Fährmann 489c51cecc
[telegraph] fix extraction when images not in <figure> (#3590)
2 years ago
Mike Fährmann 0f7e6c422a
merge #3596: [shopify] support ohpolly.com
2 years ago
enduser420 fcf7030b85
[shopify] support ohpolly.com
2 years ago
Mike Fährmann a6a631f992
merge #3589: [redgifs] support v3 URLs
2 years ago
Mike Fährmann 137a395ae0
[imagefap] fix infinite pagination loop (#3594)
2 years ago
Mike Fährmann 3c708ade8f
[imagefap] fix metadata extraction
2 years ago