Commit Graph

3286 Commits (202f5d86a74ec82b6ea2e63ee647c97d6d1d59a0)

Author SHA1 Message Date
Mike Fährmann d63af4f3d3
merge #3555: [generic] fix regex for non-src image URLs
2 years ago
Mike Fährmann 8993b10751
[mastodon] add 'num' and 'count' metadata fields (#3517)
2 years ago
Mike Fährmann d817d23ccb
[instagram] update csrf token handling
2 years ago
Mike Fährmann 00b94946b3
[instagram] show -o cursor=… after every error (#3440)
2 years ago
ClosedPort22 674c719646
[deviantart] refactor base36 conversion
2 years ago
ClosedPort22 293abb8921
[deviantart] add support for /deviation/ URLs
2 years ago
thatfuckingbird 8cfeed78b1 [generic] fix regex for non-src image URLs
2 years ago
Mike Fährmann fc6ea8ee5c
[instagram] update API domain and headers
2 years ago
ClosedPort22 597b89245e
[deviantart] misc improvements to status extractor
2 years ago
Mike Fährmann 137de090dd
merge #3549: [twitter] fix search (#3536)
2 years ago
Mike Fährmann 02e314c1b6
merge #3537: [wikifeet/wikifeetx] add 'gallery' extractor
2 years ago
Mike Fährmann 568112dfbb
[oauth] improve output
2 years ago
ClosedPort22 ab58c375b4
[twitter] fix search (#3536)
2 years ago
Mike Fährmann df91ebb945
[oauth] simplify OAuth 1.0a init
2 years ago
ClosedPort22 013733c9e9
[deviantart] fix index fields for embedded/shared images
2 years ago
ClosedPort22 c4aeca7a5a
[deviantart] improve handling of statuses
2 years ago
ClosedPort22 3b32671fbd
[deviantart] add extractor for status updates
2 years ago
Mike Fährmann 107c60c973
[sankaku] update URL pattern (#3523)
2 years ago
enduser420 5cb263fdd2 [wikifeet/wikifeetx] add 'gallery' extractor
2 years ago
Mike Fährmann 35a30498bc
merge #3531: [kemonoparty] improve hash extraction
2 years ago
Mike Fährmann 9683d79bb7
[twitter] "fix" search pagination (#3536, #3534)
2 years ago
Mike Fährmann 4fec848858
[twitter] use "browser": "firefox" by default (#3522)
2 years ago
Mike Fährmann 78937564fd
[twitter] fix login after 32b03433
2 years ago
ClosedPort22 20d6194ffa
[kemonoparty] improve hash extraction
2 years ago
Mike Fährmann 80a2ff2d38
support setting 'write-pages' to "ALL"
2 years ago
Mike Fährmann c881548a27
add 'extractor.retry-codes' option (#3313)
2 years ago
Mike Fährmann e30e8aeef7
[mastodon] rename '_check_move' -> '_check_moved'
2 years ago
Mike Fährmann 32b0343334
[twitter] refresh guest tokens (#3445, #3458)
2 years ago
Mike Fährmann 512abeb4ae
[booru] add 'url' option
2 years ago
Mike Fährmann c87bd1a752
[danbooru] extend 'metadata' option
2 years ago
Mike Fährmann 26c3292538
[twitter] disable TLS 1.2 ciphers by default (#3522)
2 years ago
Mike Fährmann 18fe4b334d
[twitter] remove 'tweet_search_mode' from search parameters (#3522)
2 years ago
Mike Fährmann 85bd1cbc89
[kemonoparty] fix regression from 473bd380 (#3519)
2 years ago
Mike Fährmann 473bd380c8
[kemonoparty] reject invalid/empty files (#3510)
2 years ago
Mike Fährmann 4833ec323e
[imagefap] add 'folder' extractor (#3504)
2 years ago
Mike Fährmann 362cd6991b
[pixiv] implement 'metadata-bookmark' option (#3417)
2 years ago
Mike Fährmann 2142b9c7ae
merge #3503: [myhentaigallery] handle whitespace before title tag
2 years ago
Mike Fährmann 3a0450adbf
[behance] use default delay between requests (#2507)
2 years ago
Mike Fährmann 2cae4567ba
[telegraph] fix file URLs (#3506)
2 years ago
Mike Fährmann cbaeee9533
[imagefap] warn about redirects to '/human-verification' (#1140)
2 years ago
Mike Fährmann 435de1329a
[imagefap] use default delay between requests (#1140)
2 years ago
Erik Rimskog a8a982359e [myhentaigallery] handle whitespace before the title tag
2 years ago
Mike Fährmann d1dd52349a
merge #3189: [tcbscans] add 'chapter' and 'manga' extractors
2 years ago
Mike Fährmann 2f31d21509
merge #3455: [twitter] apply tweet type checks before uniqueness check
2 years ago
enduser420 e8541a131d [tcbscans] add 'chapter' and 'manga' extractors
2 years ago
Mike Fährmann 9695c4e88d
emit debug logging message when loading cookies from file
2 years ago
Mike Fährmann 30a31836e7
merge #3449: [twitter] force HTTPS for TwitPic URLs
2 years ago
Mike Fährmann e18482e9ae
[twitter] improve 'http' -> 'https' replacement
2 years ago
Mike Fährmann 4fd6da474f
merge #3473: [twitter] fix crash when using 'expand' and 'syndication'
2 years ago
Mike Fährmann a918ce29b5
run tests on ubuntu-20.04
2 years ago
Mike Fährmann 6514828d4e
emit debug logging message when loading cookies from file
2 years ago
Mike Fährmann 3a238fd490
[poipiku] warn about login requirements
2 years ago
Mike Fährmann f29ba089ff
merge #3474: [fanleaks] add 'post' and 'model' extractors
2 years ago
Mike Fährmann 6933727b45
merge #3483: [twitter] implement 'syndication=extended'
2 years ago
Mike Fährmann 07ed3a1fbf
merge #3460: [poipiku] fix extraction for a different warning button style
2 years ago
Mike Fährmann 9116398c1c
[pinterest] add 'domain' option (#3484)
2 years ago
blankie 2f985bcddb
[poipiku] fix extraction for a different warning button style
2 years ago
Mike Fährmann 294108c90a
[pinterest] support 'All Pins' boards (#2855, #3484)
2 years ago
Mike Fährmann 77df8d3116
[deviantart] implement username&password login for scraps (#1029)
2 years ago
Mike Fährmann ed2d715019
fix 'keywords' in extractor tests (#3491)
2 years ago
ClosedPort22 6853b14be3
[twitter] apply suggestions from code review
2 years ago
Mike Fährmann 4611237f8c
merge #3457: [danbooru] extract uploader metadata (if option is set)
2 years ago
Mike Fährmann e7522482bb
merge #3463: [lynxchan] support 'bbw-chan.nl'
2 years ago
Mike Fährmann 7d6c846176
[fanbox] return 'imageMap' files in order (#2718)
2 years ago
Mike Fährmann dc8e7ff54e
[bunkr] fix URLs returned by API (#3481)
2 years ago
enduser420 5fedef3a1a [fanleaks] update 'model' URL pattern
2 years ago
enduser420 5a740ef78b [fanleaks] add 'post' and 'model' extractors
2 years ago
ClosedPort22 7c8eab8d52
[twitter] implement 'syndication=extended'
2 years ago
ClosedPort22 be3286206a
[twitter] assume 'conversation_id' when using syndication
2 years ago
ClosedPort22 ce8dbb1ccc
[twitter] fix crash when using 'expand' and 'syndication'
2 years ago
ClosedPort22 38786a9593
[twitter] refactor extraction of TwitPic URLs
2 years ago
enduser420 527bb2c4ab [lynxchan/bbw-chan] add 'thread' and 'board' extractors
2 years ago
blankie f82ee93676
[danbooru] extract uploader metadata (if metadata is set)
2 years ago
ClosedPort22 250d35107c
[twitter] prioritize tweet type checks (#3439)
2 years ago
ClosedPort22 3eb352fcb0
[twitter] force HTTPS for TwitPic URLs
2 years ago
Mike Fährmann bee354c264
Merge pull request #3415 from enduser420/extractor/fapello
2 years ago
Mike Fährmann 8d7585534e
Merge pull request #3367 from the-blank-x/deviantart-view
2 years ago
blankie 6614d94b08
[deviantart] add /view URL support
2 years ago
Mike Fährmann dd6eeb4336
Merge pull request #3366 from ClosedPort22/da-extra-stash
2 years ago
Mike Fährmann f36cbb3911
Merge pull request #3413 from ClosedPort22/e621-manual-pagination
2 years ago
ClosedPort22 dd4a4a3fa6
[e621] softcode the pagination threshold
2 years ago
ClosedPort22 9faa4ed738
[e621] refactor pagination control
2 years ago
Mike Fährmann 7851a2c520
[seiga] raise error when redirected to login page (#3401)
2 years ago
Mike Fährmann 68ce5f965d
[instagram] remove unused code
2 years ago
Mike Fährmann 4063563cd7
[zerochan] update for layout v3
2 years ago
Mike Fährmann 1e6407ca98
Merge pull request #3414 from pubak42/master
2 years ago
ClosedPort22 bf1649dadb
[imgur] add support for imgur.io URLs
2 years ago
enduser420 7e08e2d982 [fapello] set 'filename_fmt'
2 years ago
enduser420 e5076ba056 [fapello] add 'post', 'user' and 'path' extractors
2 years ago
pubak42 e7326cdf1d
[sex.com] Download videos from cdn (#3408)
2 years ago
ClosedPort22 d0ad6d0e67
[e621] implement manual pagination mode
2 years ago
Mike Fährmann 6f0735568c
[2chen] fix file URLs
2 years ago
enduser420 a2be06d873
[2chen] add '.club' support (#3406)
2 years ago
Mike Fährmann a6d4733e11
[pixiv] extract 'date_url' metadata (#3405)
2 years ago
Mike Fährmann 1317625ec4
[webmshare] add 'video' extractor (#2410)
2 years ago
Mike Fährmann 90a9c0790f
[twitter] update 'search' pagination (#544)
2 years ago
Mike Fährmann 1cbc234819
[mangafox] extract more metadata (#3167)
2 years ago
Mike Fährmann 3082544fff
misc fixes
2 years ago
enduser420 41bf236d36
[lynxchan] add generic extractors for lynxchan imageboards (#3394)
2 years ago
Mike Fährmann 3c75c3bbc4
[soundgasm] add 'user' extractor (#3384)
2 years ago
Mike Fährmann 2952add4a8
[reddit] increase 'id-max' default value (#3397)
2 years ago
Mike Fährmann a001c9c06f
[instagram] prevent post 'date' overwriting file 'date' (#3392)
2 years ago
Mike Fährmann 6b6f886dcf
[bunkr] update domain (#3391)
2 years ago
ClosedPort22 bf3fd5951a
Merge branch 'master' into da-extra-stash
2 years ago
Mike Fährmann eb94568e1f
[soundgasm] add 'audio' extractor (#3384)
2 years ago
Mike Fährmann cd931e1139
update extractor test results
2 years ago
Mike Fährmann 989ec9fc79
[khinsider] fix metadata extraction
2 years ago
Mike Fährmann 1c25cc7a3e
[warosu] fix and update
2 years ago
Mike Fährmann 79e52f3539
[imgth] rewrite
2 years ago
Mike Fährmann 202c1210d5
[exhentai] fix pagination
2 years ago
Mike Fährmann 4a3a1f4c87
[komikcast] update domain and fix extraction
2 years ago
ClosedPort22 13d825731e
[deviantart] fix test for sta.sh URL extraction
2 years ago
ClosedPort22 6356c9be96
[deviantart] extract sta.sh URLs from 'text_content'
2 years ago
Mike Fährmann 5f57a27ba6
[imagetwist] fix extraction
2 years ago
Mike Fährmann a42ba25ca1
[foolslide] remove 'kireicake'
2 years ago
Mike Fährmann 86f0597c95
[kissgoddess] remove module
2 years ago
Mike Fährmann 20e12b5d7c
[nitter] support '/i/user/' URLs (#3310)
2 years ago
Mike Fährmann fceaee3c4f
[lolisafe] remove zz.ht
2 years ago
Mike Fährmann 4554c43d5f
[bunkr] use 'media-files' servers for more file types
2 years ago
enduser420 4bc756dfe0
[2chen] fix extraction (#3356)
2 years ago
enduser420 54844944ab
[pixhost] add 'gallery' support (#3353)
2 years ago
enduser420 213676c785
[fapachi] add 'post' and 'user' extractors (#3347)
2 years ago
Mike Fährmann a18511e346
[nitter] retry downloads on 404 (#3313)
2 years ago
Mike Fährmann 88610c3478
[patreon] update API query parameters
2 years ago
Mike Fährmann c19b1f03b9
[patreon] fix '403 Forbidden' errors
2 years ago
Mike Fährmann fc34f76cc5
[bunkr] fix video downloads (#3326)
2 years ago
Mike Fährmann 86a396e086
[bcy] fix JSONDecodeError (#3321)
2 years ago
Mike Fährmann 5b9a22af7f
[patreon] improve 'campaign_id' extraction (#3235)
2 years ago
Mike Fährmann 1bdd0e4338
[nitter] support '/i/web/' Tweet URLs (#3310)
2 years ago
Mike Fährmann 7e277d0f7d
[weibo] add 'count' metadata field (#3305)
2 years ago
Mike Fährmann 4287a93202
[nitter] handle base64-encoded filenames
2 years ago
sudo a6305d031c
[hitomi] apply format check for every image (#3030) (#3280)
2 years ago
Steven Docherty a7c7953107
[reddit] use 'dash_url' for videos (#3258) (#3306)
2 years ago
Mike Fährmann 0e75358af8
[twitter] fix using user IDs for suspended accounts
2 years ago
Mike Fährmann c25905641e
[weibo] fix bug with empty 'playback_list' (#3301)
2 years ago
Mike Fährmann 6cb12f513b
[nitter] support quoted Tweets
2 years ago
Mike Fährmann aabfa7cf34
[nitter] fix direct Tweet links
2 years ago
Mike Fährmann a41d093bb1
[nitter] add 'retweets' option (#3278)
2 years ago
Mike Fährmann 3d6489a4c0
[nitter] update 'user' and 'author'
2 years ago
Mike Fährmann e99ce99284
[danbooru] remove stray 'print()'
2 years ago
Mike Fährmann ed49e63d95
[nitter] set 'hlsPlayback' cookie
2 years ago
Mike Fährmann e081b1fac4
[nitter] sanitize filenames (#3294)
2 years ago
Mike Fährmann e31d12139c
[nitter] add 'videos' option (#3279)
2 years ago
enduser420 8c4e21b110
[itaku] remove 'Extreme' rating (#3287)
2 years ago
Mike Fährmann 72c5d26e85
[hotleak] fix UnboundLocalError (#3288, #3293)
2 years ago
Mike Fährmann 501d9bccfe
[artstation] add 'max-posts' option (#3270)
2 years ago
Mike Fährmann b1ad6f2289
[artstation] add 'pro-first' option (#3273)
2 years ago
Mike Fährmann 5a17e15b76
[pixiv] preserve 'tags' order (#3266)
2 years ago
Mike Fährmann 1392b44bfe
[inkbunny] provide additional metadata (#3274)
2 years ago
Mike Fährmann a24dcbe802
[twitter] fix login (#3220)
2 years ago
Mike Fährmann 53a5d95b7d
[instagram] skip private check for avatars (#3255)
2 years ago
Mike Fährmann 08fd1ff835
[twitter] add 'avatar' and 'background' extractors (#349, #3023)
2 years ago
Mike Fährmann 6379157543
[instagram] use REST API by default
2 years ago
enduser420 7897f68225
[wallhaven] update 'user' extractor (#3226)
2 years ago
enduser420 5a68b5cb3c
[wallhaven] add 'user' extractor (#3213)
2 years ago
enduser420 442b03f7c3
[khinsider] fix song extraction (#3219)
2 years ago
Mike Fährmann eaae4d9b65
[pixiv] stop with error for invalid search/ranking parameters
2 years ago
Mike Fährmann 368f156378
[pixiv] rankings: add support for the new daily AI and daily AI R18
2 years ago
Mike Fährmann 6c153750fa
[nitter] add extractors for Nitter instances (#2696)
2 years ago
Mike Fährmann 9f06e79868
implement '"user-agent": "browser"' (#2636)
2 years ago
Mike Fährmann 70c7fbe89a
[instagram] add 'guide' extractor (#3192)
2 years ago
enduser420 93ea8ca8e3
[imxto] extract additional metadata (#3175)
2 years ago
Mike Fährmann e3abab8629
[weibo] send 'Referer' headers (#3188)
2 years ago
Mike Fährmann 6423f990de
[realbooru] fix 'tags' extraction (#2530)
2 years ago
Mike Fährmann ecad02cf3f
[realbooru] fix download URLs (#2530)
2 years ago
Mike Fährmann 15cd114c9c
[twitter] update bookmarks pagination (#3172)
2 years ago
Mike Fährmann 20fbba9d7c
[exhentai] add metadata to search results (#3181)
2 years ago
Mike Fährmann 6a0c5e34f4
[exhentai] fix pagination (#3181)
2 years ago
Mike Fährmann 171262c1b6
[instagram] remove login support
2 years ago
Mike Fährmann 93e6bd6847
[uploadir] use utf-8 filenames (#3162)
2 years ago
Mike Fährmann b7a83ac726
[uploadir] update (#3162)
2 years ago
Mike Fährmann ccb80f1b8b
[uploadir] add support for 'uploadir.com' (#3162)
2 years ago
Mike Fährmann b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2 years ago
Mike Fährmann 4fd3c893fa
[booru] adjust/match '_tags' and '_notes' code
2 years ago
Mike Fährmann 88954aa2e4
[gelbooru_v02] implement 'notes' extraction
2 years ago
ClosedPort22 4e80d3210e
[tumblr] Fallback to `gifv` when possible (#3095) (#3159)
2 years ago
thatfuckingbird 9d3f86dbcd
[twitter] update URL for syndication API (#3160)
2 years ago
enduser420 c01cad599a
[lolisafe] add support for xbunkr (#3156)
2 years ago
Allen 9fc142d27b
[mastodon] add "remote_instance" field (#3119)
2 years ago
Mike Fährmann 2a1cb403ee
Revert "[Deviantart] [#1776] Remove the "you need session cookies to download mature scraps" warning (#1777)"
2 years ago
Mike Fährmann 86790da2d5
update Cloudflare IUAM detection
2 years ago
Mike Fährmann 775895f44b
[booru] refactor 'tags' and 'notes' extraction
2 years ago
Luc Ritchie 0f9dfb7e62
[instagram] Fix AttributeError on user stories extraction (#3123)
2 years ago
Mike Fährmann f81dd5297a
[skeb] fix extraction (#3112)
2 years ago
enduser420 fb2dbb04e2
[moebooru] extract 'notes' (#3094)
2 years ago
Mike Fährmann 4e26bf98f5
[aibooru] support 'safe' subdomain (#3110)
2 years ago
Mike Fährmann 5c31791b3c
[mastodon] support '/web/' URLs (#3109)
2 years ago
Mike Fährmann 9a2cfd4421
[mastodon] support cross-instance user references (#3109)
2 years ago
Mike Fährmann 58d97188b4
[mastodon] add 'bookmark' extractor (#3109)
2 years ago
Mike Fährmann 46b64251eb
[bcy] fix extraction (#3103)
2 years ago
Mike Fährmann 77173694d5
[kemonoparty] fix 'dms' extraction (#3106)
2 years ago
Mike Fährmann f168ec9572
[instagram] extract 'coauthors' metadata (#3107)
2 years ago
Mike Fährmann 7c6af27eb8
[tumblr] add 'fallback-*' options (#2957)
2 years ago
Mike Fährmann 4aa56d500b
[hentaihere] fix test results
2 years ago
Mike Fährmann 75d707fd92
[hentaihere] update
2 years ago
Mike Fährmann d2fc73f20b
[hentai2read] fix manga metadata extraction
2 years ago
Mike Fährmann f4d06e5180
[manganelo] update domain to 'chapmanganato.com' (#3097)
2 years ago
Mike Fährmann 769e6754dc
[pixiv] use 'exact_match_for_tags' as default search mode (#3092)
2 years ago
Mike Fährmann a90e5cb354
[instagram] support 'instagram.com/s/' highlight URLs (#3076)
2 years ago
enduser420 fd19c4b228
[hentai2read] recognize '.' in chapter (#3089)
2 years ago