Commit Graph

3316 Commits (bc6d65d203200c32aec94c5f81778decde0352ee)

Author SHA1 Message Date
Mike Fährmann bbf0911a46
[e621] implement 'notes' and 'pools' metadata extraction
2 years ago
Mike Fährmann 925b467496
split e621 from danbooru module (#3425)
2 years ago
Mike Fährmann 1ae48a54f8
[twitter] add 'transform' option
2 years ago
Mike Fährmann 489c51cecc
[telegraph] fix extraction when images not in <figure> (#3590)
2 years ago
Mike Fährmann 0f7e6c422a
merge #3596: [shopify] support ohpolly.com
2 years ago
enduser420 fcf7030b85 [shopify] support ohpolly.com
2 years ago
Mike Fährmann a6a631f992
merge #3589: [redgifs] support v3 URLs
2 years ago
Mike Fährmann 137a395ae0
[imagefap] fix infinite pagination loop (#3594)
2 years ago
Mike Fährmann 3c708ade8f
[imagefap] fix metadata extraction
2 years ago
Mike Fährmann 17e24eacf0
[imagefap] update 'gallery' URLs (#3595)
2 years ago
Mike Fährmann c2bc70593e
implement ability to load external extractor classes
2 years ago
enduser420 a18f627bfc [redgifs] support v3 URLs
2 years ago
Mike Fährmann 13a90969c7
merge #3575: [nudecollect] add 'image' and 'album' extractors
2 years ago
Mike Fährmann aacd27e4ef
merge #3581: [hotleak] fix video URLs
2 years ago
Mike Fährmann abc3619feb
[lexica] add 'search' extractor (#3567)
2 years ago
Mike Fährmann 7c9b1ec830
[hotleak] optimize decoding video URLs
2 years ago
nifnat f14dbfe079 Make decode_video_url static (used in both post and creator extractor).
2 years ago
nifnat bd23a701f3 Tidy up code.
2 years ago
nifnat 7f34f99a26 Reverse engineered obfuscated JS function and reimplemented in python.
2 years ago
Mike Fährmann 0d818d3540
[fantia] send 'X-CSRF-Token' headers (#3576)
2 years ago
enduser420 2a5903dc16 [nudecollect] add 'image' and 'album' extractors
2 years ago
Mike Fährmann c8fdd5096e
merge #3571: [bunkr] Fix extracting mkv and ts files
2 years ago
Mike Fährmann 58c008e30a
[hiperdex] update domain (#3572)
2 years ago
Luc Ritchie 842064e597
[bunkr] Fix extracting ts files
2 years ago
Luc Ritchie 99ca0437e4
[bunkr] Fix extracting mkv files
2 years ago
Mike Fährmann 76b01b64cf
[kemonoparty] remove MD5 hash extraction (#3531)
2 years ago
Mike Fährmann 09fb212414
[philomena] match URLs with www subdomain
2 years ago
Mike Fährmann 7e2fd2e573
merge #3560: [deviantart] add support for /deviation/ and fav.me URLs
2 years ago
Mike Fährmann caae8fefe1
merge #3541: [deviantart] add extractor for status updates
2 years ago
ClosedPort22 c90b4ea8d9
[deviantart] add support for fav.me URLs
2 years ago
Mike Fährmann d63af4f3d3
merge #3555: [generic] fix regex for non-src image URLs
2 years ago
Mike Fährmann 8993b10751
[mastodon] add 'num' and 'count' metadata fields (#3517)
2 years ago
Mike Fährmann d817d23ccb
[instagram] update csrf token handling
2 years ago
Mike Fährmann 00b94946b3
[instagram] show -o cursor=… after every error (#3440)
2 years ago
ClosedPort22 674c719646
[deviantart] refactor base36 conversion
2 years ago
ClosedPort22 293abb8921
[deviantart] add support for /deviation/ URLs
2 years ago
thatfuckingbird 8cfeed78b1 [generic] fix regex for non-src image URLs
2 years ago
Mike Fährmann fc6ea8ee5c
[instagram] update API domain and headers
2 years ago
ClosedPort22 597b89245e
[deviantart] misc improvements to status extractor
2 years ago
Mike Fährmann 137de090dd
merge #3549: [twitter] fix search (#3536)
2 years ago
Mike Fährmann 02e314c1b6
merge #3537: [wikifeet/wikifeetx] add 'gallery' extractor
2 years ago
Mike Fährmann 568112dfbb
[oauth] improve output
2 years ago
ClosedPort22 ab58c375b4
[twitter] fix search (#3536)
2 years ago
Mike Fährmann df91ebb945
[oauth] simplify OAuth 1.0a init
2 years ago
ClosedPort22 013733c9e9
[deviantart] fix index fields for embedded/shared images
2 years ago
ClosedPort22 c4aeca7a5a
[deviantart] improve handling of statuses
2 years ago
ClosedPort22 3b32671fbd
[deviantart] add extractor for status updates
2 years ago
Mike Fährmann 107c60c973
[sankaku] update URL pattern (#3523)
2 years ago
enduser420 5cb263fdd2 [wikifeet/wikifeetx] add 'gallery' extractor
2 years ago
Mike Fährmann 35a30498bc
merge #3531: [kemonoparty] improve hash extraction
2 years ago
Mike Fährmann 9683d79bb7
[twitter] "fix" search pagination (#3536, #3534)
2 years ago
Mike Fährmann 4fec848858
[twitter] use "browser": "firefox" by default (#3522)
2 years ago
Mike Fährmann 78937564fd
[twitter] fix login after 32b03433
2 years ago
ClosedPort22 20d6194ffa
[kemonoparty] improve hash extraction
2 years ago
Mike Fährmann 80a2ff2d38
support setting 'write-pages' to "ALL"
2 years ago
Mike Fährmann c881548a27
add 'extractor.retry-codes' option (#3313)
2 years ago
Mike Fährmann e30e8aeef7
[mastodon] rename '_check_move' -> '_check_moved'
2 years ago
Mike Fährmann 32b0343334
[twitter] refresh guest tokens (#3445, #3458)
2 years ago
Mike Fährmann 512abeb4ae
[booru] add 'url' option
2 years ago
Mike Fährmann c87bd1a752
[danbooru] extend 'metadata' option
2 years ago
Mike Fährmann 26c3292538
[twitter] disable TLS 1.2 ciphers by default (#3522)
2 years ago
Mike Fährmann 18fe4b334d
[twitter] remove 'tweet_search_mode' from search parameters (#3522)
2 years ago
Mike Fährmann 85bd1cbc89
[kemonoparty] fix regression from 473bd380 (#3519)
2 years ago
Mike Fährmann 473bd380c8
[kemonoparty] reject invalid/empty files (#3510)
2 years ago
Mike Fährmann 4833ec323e
[imagefap] add 'folder' extractor (#3504)
2 years ago
Mike Fährmann 362cd6991b
[pixiv] implement 'metadata-bookmark' option (#3417)
2 years ago
Mike Fährmann 2142b9c7ae
merge #3503: [myhentaigallery] handle whitespace before title tag
2 years ago
Mike Fährmann 3a0450adbf
[behance] use default delay between requests (#2507)
2 years ago
Mike Fährmann 2cae4567ba
[telegraph] fix file URLs (#3506)
2 years ago
Mike Fährmann cbaeee9533
[imagefap] warn about redirects to '/human-verification' (#1140)
2 years ago
Mike Fährmann 435de1329a
[imagefap] use default delay between requests (#1140)
2 years ago
Erik Rimskog a8a982359e [myhentaigallery] handle whitespace before the title tag
2 years ago
Mike Fährmann d1dd52349a
merge #3189: [tcbscans] add 'chapter' and 'manga' extractors
2 years ago
Mike Fährmann 2f31d21509
merge #3455: [twitter] apply tweet type checks before uniqueness check
2 years ago
enduser420 e8541a131d [tcbscans] add 'chapter' and 'manga' extractors
2 years ago
Mike Fährmann 9695c4e88d
emit debug logging message when loading cookies from file
2 years ago
Mike Fährmann 30a31836e7
merge #3449: [twitter] force HTTPS for TwitPic URLs
2 years ago
Mike Fährmann e18482e9ae
[twitter] improve 'http' -> 'https' replacement
2 years ago
Mike Fährmann 4fd6da474f
merge #3473: [twitter] fix crash when using 'expand' and 'syndication'
2 years ago
Mike Fährmann a918ce29b5
run tests on ubuntu-20.04
2 years ago
Mike Fährmann 6514828d4e
emit debug logging message when loading cookies from file
2 years ago
Mike Fährmann 3a238fd490
[poipiku] warn about login requirements
2 years ago
Mike Fährmann f29ba089ff
merge #3474: [fanleaks] add 'post' and 'model' extractors
2 years ago
Mike Fährmann 6933727b45
merge #3483: [twitter] implement 'syndication=extended'
2 years ago
Mike Fährmann 07ed3a1fbf
merge #3460: [poipiku] fix extraction for a different warning button style
2 years ago
Mike Fährmann 9116398c1c
[pinterest] add 'domain' option (#3484)
2 years ago
blankie 2f985bcddb
[poipiku] fix extraction for a different warning button style
2 years ago
Mike Fährmann 294108c90a
[pinterest] support 'All Pins' boards (#2855, #3484)
2 years ago
Mike Fährmann 77df8d3116
[deviantart] implement username&password login for scraps (#1029)
2 years ago
Mike Fährmann ed2d715019
fix 'keywords' in extractor tests (#3491)
2 years ago
ClosedPort22 6853b14be3
[twitter] apply suggestions from code review
2 years ago
Mike Fährmann 4611237f8c
merge #3457: [danbooru] extract uploader metadata (if option is set)
2 years ago
Mike Fährmann e7522482bb
merge #3463: [lynxchan] support 'bbw-chan.nl'
2 years ago
Mike Fährmann 7d6c846176
[fanbox] return 'imageMap' files in order (#2718)
2 years ago
Mike Fährmann dc8e7ff54e
[bunkr] fix URLs returned by API (#3481)
2 years ago
enduser420 5fedef3a1a [fanleaks] update 'model' URL pattern
2 years ago
enduser420 5a740ef78b [fanleaks] add 'post' and 'model' extractors
2 years ago
ClosedPort22 7c8eab8d52
[twitter] implement 'syndication=extended'
2 years ago
ClosedPort22 be3286206a
[twitter] assume 'conversation_id' when using syndication
2 years ago
ClosedPort22 ce8dbb1ccc
[twitter] fix crash when using 'expand' and 'syndication'
2 years ago
ClosedPort22 38786a9593
[twitter] refactor extraction of TwitPic URLs
2 years ago
enduser420 527bb2c4ab [lynxchan/bbw-chan] add 'thread' and 'board' extractors
2 years ago
blankie f82ee93676
[danbooru] extract uploader metadata (if metadata is set)
2 years ago
ClosedPort22 250d35107c
[twitter] prioritize tweet type checks (#3439)
2 years ago
ClosedPort22 3eb352fcb0
[twitter] force HTTPS for TwitPic URLs
2 years ago
Mike Fährmann bee354c264
Merge pull request #3415 from enduser420/extractor/fapello
2 years ago
Mike Fährmann 8d7585534e
Merge pull request #3367 from the-blank-x/deviantart-view
2 years ago
blankie 6614d94b08
[deviantart] add /view URL support
2 years ago
Mike Fährmann dd6eeb4336
Merge pull request #3366 from ClosedPort22/da-extra-stash
2 years ago
Mike Fährmann f36cbb3911
Merge pull request #3413 from ClosedPort22/e621-manual-pagination
2 years ago
ClosedPort22 dd4a4a3fa6
[e621] softcode the pagination threshold
2 years ago
ClosedPort22 9faa4ed738
[e621] refactor pagination control
2 years ago
Mike Fährmann 7851a2c520
[seiga] raise error when redirected to login page (#3401)
2 years ago
Mike Fährmann 68ce5f965d
[instagram] remove unused code
2 years ago
Mike Fährmann 4063563cd7
[zerochan] update for layout v3
2 years ago
Mike Fährmann 1e6407ca98
Merge pull request #3414 from pubak42/master
2 years ago
ClosedPort22 bf1649dadb
[imgur] add support for imgur.io URLs
2 years ago
enduser420 7e08e2d982 [fapello] set 'filename_fmt'
2 years ago
enduser420 e5076ba056 [fapello] add 'post', 'user' and 'path' extractors
2 years ago
pubak42 e7326cdf1d
[sex.com] Download videos from cdn (#3408)
2 years ago
ClosedPort22 d0ad6d0e67
[e621] implement manual pagination mode
2 years ago
Mike Fährmann 6f0735568c
[2chen] fix file URLs
2 years ago
enduser420 a2be06d873
[2chen] add '.club' support (#3406)
2 years ago
Mike Fährmann a6d4733e11
[pixiv] extract 'date_url' metadata (#3405)
2 years ago
Mike Fährmann 1317625ec4
[webmshare] add 'video' extractor (#2410)
2 years ago
Mike Fährmann 90a9c0790f
[twitter] update 'search' pagination (#544)
2 years ago
Mike Fährmann 1cbc234819
[mangafox] extract more metadata (#3167)
2 years ago
Mike Fährmann 3082544fff
misc fixes
2 years ago
enduser420 41bf236d36
[lynxchan] add generic extractors for lynxchan imageboards (#3394)
2 years ago
Mike Fährmann 3c75c3bbc4
[soundgasm] add 'user' extractor (#3384)
2 years ago
Mike Fährmann 2952add4a8
[reddit] increase 'id-max' default value (#3397)
2 years ago
Mike Fährmann a001c9c06f
[instagram] prevent post 'date' overwriting file 'date' (#3392)
2 years ago
Mike Fährmann 6b6f886dcf
[bunkr] update domain (#3391)
2 years ago
ClosedPort22 bf3fd5951a
Merge branch 'master' into da-extra-stash
2 years ago
Mike Fährmann eb94568e1f
[soundgasm] add 'audio' extractor (#3384)
2 years ago
Mike Fährmann cd931e1139
update extractor test results
2 years ago
Mike Fährmann 989ec9fc79
[khinsider] fix metadata extraction
2 years ago
Mike Fährmann 1c25cc7a3e
[warosu] fix and update
2 years ago
Mike Fährmann 79e52f3539
[imgth] rewrite
2 years ago
Mike Fährmann 202c1210d5
[exhentai] fix pagination
2 years ago
Mike Fährmann 4a3a1f4c87
[komikcast] update domain and fix extraction
2 years ago
ClosedPort22 13d825731e
[deviantart] fix test for sta.sh URL extraction
2 years ago
ClosedPort22 6356c9be96
[deviantart] extract sta.sh URLs from 'text_content'
2 years ago
Mike Fährmann 5f57a27ba6
[imagetwist] fix extraction
2 years ago
Mike Fährmann a42ba25ca1
[foolslide] remove 'kireicake'
2 years ago
Mike Fährmann 86f0597c95
[kissgoddess] remove module
2 years ago
Mike Fährmann 20e12b5d7c
[nitter] support '/i/user/' URLs (#3310)
2 years ago
Mike Fährmann fceaee3c4f
[lolisafe] remove zz.ht
2 years ago
Mike Fährmann 4554c43d5f
[bunkr] use 'media-files' servers for more file types
2 years ago
enduser420 4bc756dfe0
[2chen] fix extraction (#3356)
2 years ago
enduser420 54844944ab
[pixhost] add 'gallery' support (#3353)
2 years ago
enduser420 213676c785
[fapachi] add 'post' and 'user' extractors (#3347)
2 years ago
Mike Fährmann a18511e346
[nitter] retry downloads on 404 (#3313)
2 years ago
Mike Fährmann 88610c3478
[patreon] update API query parameters
2 years ago
Mike Fährmann c19b1f03b9
[patreon] fix '403 Forbidden' errors
2 years ago
Mike Fährmann fc34f76cc5
[bunkr] fix video downloads (#3326)
2 years ago
Mike Fährmann 86a396e086
[bcy] fix JSONDecodeError (#3321)
2 years ago
Mike Fährmann 5b9a22af7f
[patreon] improve 'campaign_id' extraction (#3235)
2 years ago
Mike Fährmann 1bdd0e4338
[nitter] support '/i/web/' Tweet URLs (#3310)
2 years ago
Mike Fährmann 7e277d0f7d
[weibo] add 'count' metadata field (#3305)
2 years ago
Mike Fährmann 4287a93202
[nitter] handle base64-encoded filenames
2 years ago
sudo a6305d031c
[hitomi] apply format check for every image (#3030) (#3280)
2 years ago
Steven Docherty a7c7953107
[reddit] use 'dash_url' for videos (#3258) (#3306)
2 years ago
Mike Fährmann 0e75358af8
[twitter] fix using user IDs for suspended accounts
2 years ago
Mike Fährmann c25905641e
[weibo] fix bug with empty 'playback_list' (#3301)
2 years ago
Mike Fährmann 6cb12f513b
[nitter] support quoted Tweets
2 years ago
Mike Fährmann aabfa7cf34
[nitter] fix direct Tweet links
2 years ago
Mike Fährmann a41d093bb1
[nitter] add 'retweets' option (#3278)
2 years ago
Mike Fährmann 3d6489a4c0
[nitter] update 'user' and 'author'
2 years ago
Mike Fährmann e99ce99284
[danbooru] remove stray 'print()'
2 years ago
Mike Fährmann ed49e63d95
[nitter] set 'hlsPlayback' cookie
2 years ago
Mike Fährmann e081b1fac4
[nitter] sanitize filenames (#3294)
2 years ago
Mike Fährmann e31d12139c
[nitter] add 'videos' option (#3279)
2 years ago
enduser420 8c4e21b110
[itaku] remove 'Extreme' rating (#3287)
2 years ago
Mike Fährmann 72c5d26e85
[hotleak] fix UnboundLocalError (#3288, #3293)
2 years ago
Mike Fährmann 501d9bccfe
[artstation] add 'max-posts' option (#3270)
2 years ago
Mike Fährmann b1ad6f2289
[artstation] add 'pro-first' option (#3273)
2 years ago
Mike Fährmann 5a17e15b76
[pixiv] preserve 'tags' order (#3266)
2 years ago
Mike Fährmann 1392b44bfe
[inkbunny] provide additional metadata (#3274)
2 years ago
Mike Fährmann a24dcbe802
[twitter] fix login (#3220)
2 years ago
Mike Fährmann 53a5d95b7d
[instagram] skip private check for avatars (#3255)
2 years ago
Mike Fährmann 08fd1ff835
[twitter] add 'avatar' and 'background' extractors (#349, #3023)
2 years ago
Mike Fährmann 6379157543
[instagram] use REST API by default
2 years ago
enduser420 7897f68225
[wallhaven] update 'user' extractor (#3226)
2 years ago
enduser420 5a68b5cb3c
[wallhaven] add 'user' extractor (#3213)
2 years ago
enduser420 442b03f7c3
[khinsider] fix song extraction (#3219)
2 years ago
Mike Fährmann eaae4d9b65
[pixiv] stop with error for invalid search/ranking parameters
2 years ago
Mike Fährmann 368f156378
[pixiv] rankings: add support for the new daily AI and daily AI R18
2 years ago
Mike Fährmann 6c153750fa
[nitter] add extractors for Nitter instances (#2696)
2 years ago
Mike Fährmann 9f06e79868
implement '"user-agent": "browser"' (#2636)
2 years ago
Mike Fährmann 70c7fbe89a
[instagram] add 'guide' extractor (#3192)
2 years ago
enduser420 93ea8ca8e3
[imxto] extract additional metadata (#3175)
2 years ago
Mike Fährmann e3abab8629
[weibo] send 'Referer' headers (#3188)
2 years ago
Mike Fährmann 6423f990de
[realbooru] fix 'tags' extraction (#2530)
2 years ago
Mike Fährmann ecad02cf3f
[realbooru] fix download URLs (#2530)
2 years ago
Mike Fährmann 15cd114c9c
[twitter] update bookmarks pagination (#3172)
2 years ago
Mike Fährmann 20fbba9d7c
[exhentai] add metadata to search results (#3181)
2 years ago
Mike Fährmann 6a0c5e34f4
[exhentai] fix pagination (#3181)
2 years ago
Mike Fährmann 171262c1b6
[instagram] remove login support
2 years ago
Mike Fährmann 93e6bd6847
[uploadir] use utf-8 filenames (#3162)
2 years ago