Mike Fährmann
368f156378
[pixiv] rankings: add support for the new daily AI and daily AI R18 ( #3214 , #3221 )
In remembrance of @thatfuckingbird
2 years ago
Mike Fährmann
6c153750fa
[nitter] add extractors for Nitter instances ( #2696 )
2 years ago
Mike Fährmann
9f06e79868
implement '"user-agent": "browser"' ( #2636 )
2 years ago
Mike Fährmann
70c7fbe89a
[instagram] add 'guide' extractor ( #3192 )
2 years ago
enduser420
93ea8ca8e3
[imxto] extract additional metadata ( #3175 )
2 years ago
Mike Fährmann
e3abab8629
[weibo] send 'Referer' headers ( #3188 )
2 years ago
Mike Fährmann
6423f990de
[realbooru] fix 'tags' extraction ( #2530 )
2 years ago
Mike Fährmann
ecad02cf3f
[realbooru] fix download URLs ( #2530 )
2 years ago
Mike Fährmann
15cd114c9c
[twitter] update bookmarks pagination ( #3172 )
...
Do not stop when there aren't any tweets in a batch,
but only when the same cursor value appears twice in a row.
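The stop condition described above can be sketched roughly as follows. This is an illustration only, not the extractor's actual code: `paginate` and `fetch_page` are hypothetical names, and ending on a missing cursor is an added assumption.

```python
from typing import Callable, Iterator, List, Optional, Tuple

# Hypothetical page shape: (items in this batch, cursor for the next batch)
Page = Tuple[List[str], Optional[str]]


def paginate(fetch_page: Callable[[Optional[str]], Page]) -> Iterator[str]:
    """Yield items until the same cursor value is returned twice in a row."""
    cursor: Optional[str] = None
    while True:
        items, next_cursor = fetch_page(cursor)
        # An empty batch does not end pagination by itself.
        yield from items
        # Stop when the cursor did not advance (or, as an assumption,
        # when no cursor was returned at all).
        if not next_cursor or next_cursor == cursor:
            break
        cursor = next_cursor
```

With this shape, an empty batch in the middle of a bookmark list is skipped rather than treated as the end of the results.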
2 years ago
Mike Fährmann
20fbba9d7c
[exhentai] add metadata to search results ( #3181 )
...
'gallery_id' and 'gallery_token'
2 years ago
Mike Fährmann
6a0c5e34f4
[exhentai] fix pagination ( #3181 )
2 years ago
Mike Fährmann
171262c1b6
[instagram] remove login support
...
broken feature that I cannot get to work anymore
2 years ago
Mike Fährmann
93e6bd6847
[uploadir] use utf-8 filenames ( #3162 )
2 years ago
Mike Fährmann
b7a83ac726
[uploadir] update ( #3162 )
...
- prevent extra HTTP request from redirects
- add 'id' metadata field
- set 'filename_fmt' and 'archive_fmt'
2 years ago
Mike Fährmann
ccb80f1b8b
[uploadir] add support for 'uploadir.com' ( #3162 )
2 years ago
Mike Fährmann
b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2 years ago
Mike Fährmann
4fd3c893fa
[booru] adjust/match '_tags' and '_notes' code
2 years ago
Mike Fährmann
88954aa2e4
[gelbooru_v02] implement 'notes' extraction
...
same code as for 'moebooru' works here as well
2 years ago
ClosedPort22
4e80d3210e
[tumblr] Fallback to `gifv` when possible ( #3095 ) ( #3159 )
2 years ago
thatfuckingbird
9d3f86dbcd
[twitter] update URL for syndication API ( #3160 )
...
Twitter changed the URL format to access tweet data through their syndication API.
2 years ago
enduser420
c01cad599a
[lolisafe] add support for xbunkr ( #3156 )
2 years ago
Allen
9fc142d27b
[mastodon] add "remote_instance" field ( #3119 )
...
Example Usage:
If the URL is "mastodon:https://mastodon.example.org/@VoteChess@botsin.space", the "remote_instance" will be "botsin.space".
...
"directory": ["mastodon", "{remote_instance|instance}", "{account[username]!l}"]
...
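A rough sketch of how the example resolves, assuming the handle has the form '@user@host'. The helper names `remote_instance` and `directory` are hypothetical and do not mirror the extractor's code; the `or` fallback stands in for the `{remote_instance|instance}` alternative and `.lower()` for the `!l` conversion.

```python
from typing import List, Optional


def remote_instance(handle: str) -> Optional[str]:
    """Return the host part of '@user@host', or None for a local '@user'."""
    _user, sep, host = handle.lstrip("@").partition("@")
    return host if sep else None


def directory(handle: str, instance: str) -> List[str]:
    """Build the directory segments from the example format strings."""
    user = handle.lstrip("@").partition("@")[0]
    # Prefer the remote instance; fall back to the instance being queried.
    return ["mastodon", remote_instance(handle) or instance, user.lower()]
```

For "@VoteChess@botsin.space" queried through mastodon.example.org, this gives the segments "mastodon", "botsin.space", and "votechess".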
2 years ago
Mike Fährmann
2a1cb403ee
Revert "[Deviantart] [ #1776 ] Remove the "you need session cookies to download mature scraps" warning ( #1777 )"
...
This reverts commit 1f02878351.
Mature scraps do yet again require cookies.
2 years ago
Mike Fährmann
86790da2d5
update Cloudflare IUAM detection
...
again
2 years ago
Mike Fährmann
775895f44b
[booru] refactor 'tags' and 'notes' extraction
...
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
since it only works there
- clean up some code
2 years ago
Luc Ritchie
0f9dfb7e62
[instagram] Fix AttributeError on user stories extraction ( #3123 )
2 years ago
Mike Fährmann
f81dd5297a
[skeb] fix extraction ( #3112 )
...
'completed_at' is no longer included in API responses
2 years ago
enduser420
fb2dbb04e2
[moebooru] extract 'notes' ( #3094 )
2 years ago
Mike Fährmann
4e26bf98f5
[aibooru] support 'safe' subdomain ( #3110 )
2 years ago
Mike Fährmann
5c31791b3c
[mastodon] support '/web/' URLs ( #3109 )
2 years ago
Mike Fährmann
9a2cfd4421
[mastodon] support cross-instance user references ( #3109 )
2 years ago
Mike Fährmann
58d97188b4
[mastodon] add 'bookmark' extractor ( #3109 )
2 years ago
Mike Fährmann
46b64251eb
[bcy] fix extraction ( #3103 )
...
- fix regex for non-watermarked images
- fetch data from '/item/detail' pages for all other posts,
since '/apiv3/user/selfPosts' only has incomplete data
2 years ago
Mike Fährmann
77173694d5
[kemonoparty] fix 'dms' extraction ( #3106 )
2 years ago
Mike Fährmann
f168ec9572
[instagram] extract 'coauthors' metadata ( #3107 )
2 years ago
Mike Fährmann
7c6af27eb8
[tumblr] add 'fallback-*' options ( #2957 )
...
specifically 'fallback-delay' and 'fallback-retries'
and change default number of retries to 2 (down from 3)
2 years ago
Mike Fährmann
4aa56d500b
[hentaihere] fix test results
2 years ago
Mike Fährmann
75d707fd92
[hentaihere] update
...
- support minor versions in chapter URLs
- fix manga metadata extraction
- update tests
2 years ago
Mike Fährmann
d2fc73f20b
[hentai2read] fix manga metadata extraction
...
and update tests
2 years ago
Mike Fährmann
f4d06e5180
[manganelo] update domain to 'chapmanganato.com' ( #3097 )
2 years ago
Mike Fährmann
769e6754dc
[pixiv] use 'exact_match_for_tags' as default search mode ( #3092 )
2 years ago
Mike Fährmann
a90e5cb354
[instagram] support 'instagram.com/s/' highlight URLs ( #3076 )
2 years ago
enduser420
fd19c4b228
[hentai2read] recognize '.' in chapter ( #3089 )
2 years ago
enduser420
2ff1897421
[vichan] recognize board url w/o trailing slash ( #3087 )
2 years ago
enduser420
ac6111e693
[mangasee] add support for 'mangalife' ( #3086 )
2 years ago
KJ16609
300bc03deb
[gelbooru] allow alternate parameter order in post URLs ( #2821 )
2 years ago
Mike Fährmann
a7d23f1484
[vichan] add generic extractors for vichan imageboards
...
includes 8kun.top, smuglo.li, and wikieat.club
2 years ago
Mike Fährmann
04d3ebdfb4
[redgifs] fix 'token' extraction ( #3080 , #3081 )
2 years ago
thatfuckingbird
062ef238a6
add support for aibooru (using danbooru extractor) ( #3075 )
2 years ago
enduser420
0163ca86f7
[smugloli] add smugloli extractors ( #3060 )
2 years ago
Mike Fährmann
cf86f68864
[instagram] add 'avatar' extractor ( #929 , #1097 , #2992 )
2 years ago
Mike Fährmann
ea8113ff36
[reactor] match 'best', 'new', 'all' URLs ( #3073 )
2 years ago
Mike Fährmann
618c81afdf
[ngomik] remove module
...
"Access denied"
2 years ago
Mike Fährmann
94a2dfe205
[kemonoparty] update pagination offset
2 years ago
Mike Fährmann
52d1eb928d
[pixiv] extend 'metadata' option ( #3057 )
...
make it usable for all 'pixiv' extractors
2 years ago
Mike Fährmann
0714274f1f
[instagram] remove 'channel' extractor
2 years ago
Mike Fährmann
d0d4ce1a13
[danbooru] fix ugoira metadata extraction ( #3056 )
2 years ago
Mike Fährmann
096b8f2cfc
[instagram] prevent request for private '/tagged' feeds ( #3045 )
2 years ago
Mike Fährmann
3b369ce3d1
[nijie] add 'followed' extractor ( #3048 )
2 years ago
Mike Fährmann
c4a62a48ae
[nijie] add 'feed' extractor ( #3048 )
2 years ago
Mike Fährmann
d1314df6e6
[nozomi] fix extraction ( #3051 )
2 years ago
Mike Fährmann
277be410a7
[2chen] update 'archive_fmt'
2 years ago
Mike Fährmann
ed55bd3a5c
[redgifs] extract Bearer token ( #3037 )
2 years ago
Mike Fährmann
e974c75083
[redgifs] fix extraction ( #3037 )
...
send public Bearer token as 'authorization' header
2 years ago
Mike Fährmann
68466a7d61
[tumblr] support 'https://www.tumblr.com/BLOGNAME' URLs ( #3034 )
2 years ago
Mike Fährmann
b6a68f5a4b
[fanbox] extend 'content' test result ( #3020 )
2 years ago
Mike Fährmann
f1f89b2436
[tumblr] add 'offset' option
2 years ago
Mike Fährmann
827ab0a62d
[instagram] fix login
...
- use mobile user agent header
- update general headers
- skip /data/shared_data/ step
2 years ago
Mike Fährmann
1ca6be8619
[fanbox] add 'content' metadata field ( #3020 )
2 years ago
Mike Fährmann
e5d229c524
[tumblr] sleep between fallback retries ( #2957 )
2 years ago
Mike Fährmann
b2b0b1c455
[hitomi] fall back to webp when format not available ( #3030 )
2 years ago
Mike Fährmann
1696f68a68
[8chan] add 'thread' and 'board' extractors ( #2938 )
2 years ago
Mike Fährmann
560f7b41d8
[vk] add 'tagged' extractor ( #2997 )
2 years ago
Mike Fährmann
122e1a467a
[vk] unescape error messages
2 years ago
Mike Fährmann
bc9d291c13
[imagefap] fix and improve folder extraction ( #3013 )
2 years ago
Mike Fährmann
55fca5fe4b
[imagefap] fix and improve gallery pagination ( #3013 )
2 years ago
Mike Fährmann
8b1fe0bcf1
emit debug logging messages before calling time.sleep() ( #2982 )
2 years ago
Mike Fährmann
14717f3fc9
[deviantart] add 'group' option ( #3018 )
...
disabling this option allows for better downloads from deleted accounts
2 years ago
Mike Fährmann
220a04a74a
[artstation] skip missing projects ( #3016 )
2 years ago
Mike Fährmann
a12ce2bb41
[deviantart] fix 'deviation' extraction ( #2981 )
2 years ago
Mike Fährmann
36afb519b3
[instagram] prevent crash on empty user profile
2 years ago
enduser420
f0321f423d
[2chen] Add 2chen.moe extractor ( #2707 )
...
* [2chen] Add 2chen.moe extractor
* change "==" to is
* fix for "test_unique_pattern_matches"
* fix regex pattern and group matching
* fix regex again
* [2chen] add 'reply_no' and 'hash' metadata and change 'filename_fmt'
also made an entry in supportedsites.md
* [2chen] unescape 'title'
* [2chen] partition() -> rpartition()
* [2chen] extract 'date' and 'name' metadata
* [2chen] remove 'offset' argument
* [2chen] do some changes
* [2chen] do some more changes
* [2chen] unescape 'name' and 'filename'
2 years ago
enduser420
f7ba19a1c0
[nana] add 'nana' extractors ( #2967 )
2 years ago
Mike Fährmann
fce6642699
[instagram] restore warnings for private profiles ( #3004 )
2 years ago
Mike Fährmann
3e65645cfa
[instagram] restore 'cursor' functionality ( #2991 )
2 years ago
Mike Fährmann
b8d268f57e
allow '/' and '?' in URL queries
2 years ago
Mike Fährmann
7b5dad075d
[fappic] fix extraction
2 years ago
Mike Fährmann
78694a61bb
[kemonoparty] restore 'favorites' API endpoints ( #2994 )
2 years ago
Mike Fährmann
5fd4374036
[sankaku] improve 429 and tag limit handling
2 years ago
Mike Fährmann
b84982b2f9
[kemonoparty] send Referer headers ( #2989 , #2990 )
2 years ago
blankie
98f67ae333
[instagram] add 'count' metadata field ( #2979 )
2 years ago
Mike Fährmann
4089bceddd
[sankaku] implement 'refresh' option ( #2958 )
2 years ago
Mike Fährmann
779e75c6f8
[kemonoparty] fix attachment IDs overwriting post IDs ( #2984 )
...
regression from 09a5cc61
2 years ago
Mike Fährmann
e1d714943b
[tumblr] catch exception when updating image token ( #2957 )
2 years ago
Mike Fährmann
e3a03f335c
[instagram] fix GraphQL bugs
2 years ago
Mike Fährmann
6c76b5f90c
[deviantart] fix extraction ( #2981 , #2983 )
...
send a 'csrf_token' with every Eclipse API request
2 years ago
Mike Fährmann
f728b5ca06
[tumblr] add fallback for failed higher-resolution images ( #2957 )
2 years ago
Mike Fährmann
6992d01e19
[artstation] support search filters ( #2970 )
2 years ago
Mike Fährmann
194803f3a7
[plurk] fix extraction ( #2977 )
2 years ago
Mike Fährmann
63e0924927
[pixiv] add 'series' extractor ( #2964 )
2 years ago
Mike Fährmann
aafea0c4f8
[artstation] fix searches ( #2970 )
2 years ago
Mike Fährmann
2c67bee5c4
[instagram] update
...
- reorder some functions and extractors
- add missing GraphQL endpoints
- fix some GraphQL bugs
2 years ago
Mike Fährmann
aa49bf13d2
[instagram] add 'api' option
2 years ago
Mike Fährmann
6f77193a24
[instagram] move API related code into separate classes
...
may contain bugs and is probably incomplete for the GraphQL variant
2 years ago
Mike Fährmann
ac45ed2764
[skeb] implement 'filters' option ( #2945 )
2 years ago
Mike Fährmann
32c30754d1
[tumblr] warn when unable to fetch higher-resolution images ( #2957 )
...
and download the smaller version
instead of failing with a 404 error
2 years ago
Mike Fährmann
ff532d6c3c
[newgrounds] extract 'type' metadata
2 years ago
Mike Fährmann
0393e59535
[newgrounds] add 'games' extractor ( #2955 )
2 years ago
Mike Fährmann
68f11e02a9
[skeb] add 'search_tags' metadata to search results ( #2945 )
2 years ago
Mike Fährmann
1378cbb8dd
[myportfolio] use fallback when no images are found ( #2959 )
2 years ago
Mike Fährmann
850608551c
[sankaku] detect expired links ( #2958 )
2 years ago
Mike Fährmann
09a5cc6103
[kemonoparty] add 'count' metadata field ( #2952 )
2 years ago
Mike Fährmann
89610a49dc
[instagram] use REST API endpoint for user feeds ( #2666 )
...
With this change, everything is using the newer REST API endpoints
providing higher-quality photos except the now obsolete '/channel' feed.
2 years ago
Mike Fährmann
6737499dbd
[instagram] use REST API endpoint for saved posts ( #2911 )
...
provides 'username' and 'fullname'
as well as higher-quality images
2 years ago
Mike Fährmann
50e3179c56
[instagram] update _user_by_screen_name()
...
use REST API
2 years ago
Mike Fährmann
3dacfb3c56
[instagram] update API headers
2 years ago
Mike Fährmann
4b2a006871
[skeb] add 'search' extractor ( #2945 )
2 years ago
Mike Fährmann
94b34f460e
[exhentai] add slash to the end of gallery URLs ( #2947 )
2 years ago
Mike Fährmann
2787c8511a
[mastodon] warn about moved accounts ( #2939 )
2 years ago
Mike Fährmann
d699310fdf
[blogger] add 'label' or 'query' metadata fields ( #2930 )
...
for '/search/label/…' or '/search?q=…' URLs
2 years ago
Mike Fährmann
eef50c1f28
[blogger] split 'search' extractor ( #2930 )
2 years ago
Mike Fährmann
d29fb94098
[bunkr] use 'media-files' servers for m4v and mov files ( #2925 )
2 years ago
enduser420
bd846abba0
[hotleak] add hotleak extractor ( #2909 ) ( #2890 )
2 years ago
Mike Fährmann
e99a9b2aff
[twitter] improve 'cards-blacklist' ( #2875 )
...
allow blacklisting domains and 'name:domain',
where 'domain' depends on a card's 'vanity_url' value
2 years ago
Mike Fährmann
aaf6992bae
[twitter] fix new-style '/card_img/' URLs
2 years ago
Mike Fährmann
40baa77630
[twitter] provide proper 'date' for syndication results ( #2920 )
2 years ago
Mike Fährmann
46fe469c53
[tumblr] implement 'ratelimit' option ( #2919 )
2 years ago
Mike Fährmann
d0b73fec14
[flickr] add support for secure.flickr.com ( #2910 )
2 years ago
Mike Fährmann
35eddaa94e
[reddit] prevent exception with empty submission URLs ( #2913 )
2 years ago
Mike Fährmann
464ea90d14
[exhentai] guess extension for original files ( #2842 )
...
makes it possible to sometimes, when guessed correctly ('.jpg'),
skip an original file download without costing image limit points
2 years ago
Mike Fährmann
551fdf7ad7
[exhentai] move 509 check into its own function
2 years ago
Mike Fährmann
7a799df17f
[tumblr] pre-compile regular expressions
2 years ago
Mike Fährmann
73a52a95b0
update Cloudflare IUAM detection
2 years ago
Mike Fährmann
673b6f1218
[poipiku] use 'img-org.poipiku.com' as image domain ( #2796 )
2 years ago
Mike Fährmann
4ca1a6e5f3
[bunkr] fix extraction ( #2903 )
2 years ago
Mike Fährmann
8b76149521
[exhentai] improve 509.gif detection ( #2901 )
2 years ago
Mike Fährmann
2ed58029f9
[paheal] add proper support for videos ( #2892 )
2 years ago
Mike Fährmann
444dfb4aa6
[instagram] add 'highlight_title' and 'date' metadata
...
to highlight posts (#2879 )
2 years ago
Mike Fährmann
7f764ebee6
[redgifs] "fix" download URLs ( #2884 )
2 years ago
Mike Fährmann
3cb8327c60
[zerochan] add 'metadata' option ( #2861 )
2 years ago
blankie
9745b48830
[tumblr] attempt to fetch high-quality inline images ( #2877 )
...
* [tumblr] attempt to fetch high-quality images (again)
Fixes #1846 , and fixes #1344
* slight refactor
* update configuration.rst entry
2 years ago
Mike Fährmann
daef91c925
[smugmug] update default API credentials ( #2881 )
...
The old key lacked v2 access and I'm unable to accept
the new terms of service since my old account got deleted
2 years ago
Mike Fährmann
4d78ca89db
[twitter] add 'cards-blacklist' option ( #2875 )
2 years ago
Mike Fährmann
4d7cb0bf56
[twitter] general support for unified cards ( #2875 )
...
just removing the 'type' check seems to work
2 years ago
Mike Fährmann
7ddfff957c
[twitter] support "image_website" unified cards ( #2875 )
2 years ago
Mike Fährmann
2eb0ddd083
[hitomi] fix error when number of tag results is multiple of 25 ( #2870 )
2 years ago
Mike Fährmann
3cebf787c4
[slideshare] fix metadata extraction
2 years ago
Mike Fährmann
da11fb32d0
update extractor test results
2 years ago
Mike Fährmann
636d03df95
[nijie] reduce cache maxage to 90 days
2 years ago
Mike Fährmann
f375ec0ffa
[vsco] fix 'collection' extraction
2 years ago