Mike Fährmann
bb6a130942
automatically set required DDoS-GUARD cookies ( #1779 )
...
for kemono.party and seiso.party
3 years ago
Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
...
only leave them in oauth.py as noop results
3 years ago
Mike Fährmann
bdfdabf498
show warning if extractor doesn't yield any results ( #1759 )
3 years ago
Mike Fährmann
d320ee6251
implement a 'fallback' option ( closes #1770 )
3 years ago
Mike Fährmann
c866fcba48
[twitter] fix 'logout' ( #1719 )
...
delete 'auth_token' cookie and cookies.txt path
3 years ago
Mike Fährmann
9cb5ea5eda
update default User-Agent headers
3 years ago
Mike Fährmann
1e0f59d902
release version 1.18.3
3 years ago
Mike Fährmann
52984f7e22
[twitter] add option to log out when blocked ( #1719 )
3 years ago
Mike Fährmann
ed4b3c48cb
fix flake8 and other tests
3 years ago
enormous-muscles
975e1ac6e2
Add Wikieat extractor ( #1699 )
...
* Add Wikieat extractor
* Add Wikieat extractor to extractor list
3 years ago
Nyasume
fa6af46756
Added ability to download GIFs instead of mp4 from Luscious and Reactor ( #1701 )
3 years ago
Ryu JuHeon
9429eaa0a3
[hitomi]: fix image URLs ( #1765 )
3 years ago
Mike Fährmann
c34dbc86bb
[kemonoparty] update file server domain ( #1764 )
3 years ago
Mike Fährmann
e5a93e113f
[twitter] extend 'replies' option ( #1254 )
...
Allow setting 'replies to '"self"' to only download from self-replies.
3 years ago
Mike Fährmann
f9096584ab
[behance] fix 'collection' extraction
3 years ago
Mike Fährmann
229498b8aa
[twitter] warn about suspended accounts etc ( closes #1759 )
3 years ago
Mike Fährmann
a5de2244d4
[furaffinity] fix using 'category-tranfer' ( #1274 )
3 years ago
Mike Fährmann
cadfad4eea
[danbooru] add 'external' option ( closes #1747 )
3 years ago
Mike Fährmann
5b1c62bfa9
[furaffinity] add 'external' option ( closes #1492 )
3 years ago
Mike Fährmann
b5b1cf22b7
[downloader:http] reorder HTTP header sources
...
so that any header can be overwritten by a user, except Range
3 years ago
Mike Fährmann
5d5ab669fa
[instagram] use custom User-Agent header for video downloads
...
(#1682 , #1623 , #1580 )
3 years ago
Mike Fährmann
7b029dfe85
[instagram] increase default delay between HTTP requests to 8s
...
(closes #1732 )
3 years ago
Mike Fährmann
5eca3781be
[kemonoparty] fix username extraction ( #1750 )
3 years ago
Mike Fährmann
fe970fc87f
[vk] prevent exception for empty/private profiles (fixes 1742)
3 years ago
Mike Fährmann
ac91a84543
[bbc] provide fallback URLs ( #1706 )
3 years ago
Mike Fährmann
a316e44f8e
[bbc] add 'width' option ( #1706 )
3 years ago
Mike Fährmann
c37c2818fb
[nsfwalbum] retry all requests when extracting image URLs
...
(#1733 , fixes #1271 )
3 years ago
Mike Fährmann
220cfe244e
[deviantart] get original files for GIF previews ( #1731 )
3 years ago
Mike Fährmann
7a0da4f93f
[newgrounds] add 'format' option ( closes #1729 )
3 years ago
Mike Fährmann
223a4e79cd
[newgrounds] fix using 'category-tranfer' ( #1274 )
3 years ago
Mike Fährmann
4e95cef6d2
[nsfwalbum] retry backend requests ( fixes #1733 )
3 years ago
Mike Fährmann
6c11105587
[bbc] improve image dimensions ( #1706 )
...
download the 1920xN versions instead of 976x549
3 years ago
Mike Fährmann
57c1a86082
[bbc] support multi-page gallery listings ( closes #1730 )
3 years ago
Mike Fährmann
2792ed6e4b
implement 'util.format_value()'
3 years ago
Mike Fährmann
552032d4fb
release version 1.18.2
3 years ago
Mike Fährmann
486474800f
[kemonoparty] skip duplicated patreon files ( closes #1689 )
...
this behavior can be disabled with the 'patreon-skip-file' option
3 years ago
Mike Fährmann
da7297c0b9
[comicvine] add extractor ( closes #1712 )
3 years ago
Mike Fährmann
e4788fa663
[bbc] add 'gallery' and 'programme' extractors ( closes #1706 )
3 years ago
Mike Fährmann
13d4045a8a
add 'archive-prefix' option ( #1711 )
3 years ago
Mike Fährmann
c3b5c88b04
update extractor test results
3 years ago
Mike Fährmann
9e42cd58ea
replace ChainPredicate class with 'functools.partial'
3 years ago
Mike Fährmann
1b2f9050fb
rename all instances of 'kwds' to 'kwdict'
3 years ago
Mike Fährmann
3868ec02d1
[pururin] update domain and fix extraction
3 years ago
Mike Fährmann
b89a44090f
[naverwebtoon] fix comic extraction
3 years ago
Mike Fährmann
4048eabeab
allow 'extractor.modules' to be a string
...
with comma-separated module names
3 years ago
Mike Fährmann
c8e678a5b4
[instagram] fix extraction of '/explore/tags/' posts
...
(closes #1666 )
3 years ago
Mike Fährmann
a6a51f207d
[moebooru] fix 'tags' ending with a '+' when logged in ( #1702 )
3 years ago
Mike Fährmann
f5b097165e
[ytdl] transfer YoutubeDL objects to downloader ( #1680 )
...
allows specifying downloader-specific options per subcategory
but overwrites all downloader.ytdl settings
3 years ago
Mike Fährmann
06e69ea79a
[ytdl] actually set options for YoutubeDL objects ( #1680 )
...
I somehow managed to remove the options parameter for
the YoutubeDL constructor in 9a849cdf
without noticing ...
3 years ago
Mike Fährmann
fc19010808
[downloader:ytdl] fix 'outtmpl' setting for yt_dlp ( #1680 )
...
yt_dlp supports multiple outtmpl settings for different file types and
uses its 'outtmpl_dict' for that.
3 years ago
Mike Fährmann
dfe1f490e9
[mangadex] use custom User-Agent header ( #1535 )
3 years ago
Mike Fährmann
36a2aff363
[vk] improve metadata extraction and URL pattern ( fixes #1691 )
...
- always fetch all user metadata
- use 'user[name]' for directory names if available
3 years ago
Mike Fährmann
b9783403d9
add 'url-metadata' option ( #1659 , #1073 )
3 years ago
Mike Fährmann
e622e004f0
[ytdl] improve module imports ( #1680 )
...
Apply 'extractor.ytdl.module' for every URL, not just the first.
3 years ago
Mike Fährmann
e95f99882f
extend 'parent-metadata' functionality ( #1687 , #1651 , #1364 )
3 years ago
Mike Fährmann
193401ce3b
[ytdl] "fix" cookie transfer between session and ytdl ( #1680 )
...
requests' CookieJar class is not quite compatible with the standard
http.cookiejar.CookieJar used by youtube_dl
3 years ago
Mike Fährmann
9a849cdf61
[ytdl] allow setting 'module' for subcategories ( #1680 )
3 years ago
Mike Fährmann
dff0da60f9
[ytdl] add 'generic' option ( #1680 )
3 years ago
Mike Fährmann
d3da96142a
[ytdl] support cookies + username&password ( #1680 )
3 years ago
Mike Fährmann
36ac2197db
[ytdl] add extractor for sites supported by youtube-dl
...
(#1680 , #878 )
Can be used by prefixing any URL with 'ytdl:',
or by setting 'extractor,ytdl.enabled' to 'true'.
3 years ago
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
...
(closes #1677 )
3 years ago
Mike Fährmann
d287d2eb88
[kemonoparty] parse 'o' query parameters ( #1674 )
3 years ago
Mike Fährmann
8b036778e3
[kemonoparty] add 'max-posts' option ( #1674 )
3 years ago
Mike Fährmann
5612ca31c2
[hitomi] fix image URLs ( closes #1679 )
3 years ago
Mike Fährmann
8ecca3af58
[pixiv] add extractor for 'pixivision' articles ( #1672 )
3 years ago
Mike Fährmann
312a28e78a
[mastodon] add 'replies' option ( #1669 )
3 years ago
Mike Fährmann
513c491cea
[mastodon] reset 'params' after first pagination iteration
...
otherwise query parameters in 'params' get specified twice the second
time around - once from the 'links["next"]' URL and once from 'params'
itself.
3 years ago
Mike Fährmann
a1f5b78039
[mastodon] add 'reblogs' option ( #1669 )
3 years ago
Mike Fährmann
317ecc8180
use HTML tables in docs/formatting.md
3 years ago
Mike Fährmann
5f1b13d1a5
release version 1.18.1
3 years ago
Mike Fährmann
21c2da454f
update extractor test results
3 years ago
Mike Fährmann
7f591c78cb
[mangafox] cleanup
3 years ago
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox ( #1633 )
3 years ago
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL
3 years ago
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option ( #1569 )
...
similar to Twitter's 'text-tweets'
3 years ago
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets ( #1481 )
3 years ago
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string ( #1665 )
3 years ago
Mike Fährmann
9ee45f3617
[kemonoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
344aab3fb7
[seisoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
035562bd11
[twitter] remove old-style URLs from image fallback lists
3 years ago
Mike Fährmann
daf821b8b6
[seisoparty] use user names instead of IDs by default ( #1635 )
3 years ago
Mike Fährmann
e4db1bad14
[seisoparty] also extract files hosted on 'cdn-2' servers ( #1635 )
3 years ago
Mike Fährmann
267bbf5996
[mangasee] add 'chapter' and 'manga' extractors
3 years ago
Mike Fährmann
fad4918208
[deviantart] use UUIDs in internal folder/collection URLs
3 years ago
Mike Fährmann
64986f9435
fix depth counter in UrlJob
...
regression from adf4d661
It would either stop at the first level (-g) or go infinitely deep (-G)
Going down to for example level 3 with -ggg didn't work.
3 years ago
Mike Fährmann
0179581340
add 'T' format string conversion ( #1646 )
...
to convert 'date'/datetime to timestamp
3 years ago
Mike Fährmann
f74cf52e2b
[seisoparty] add 'user' and 'post' extractors ( #1635 )
3 years ago
Mike Fährmann
759735fb02
[kemonoparty] fix 'username' extraction ( fixes #1652 )
...
The site's <title> content changed from
<title>NAME | Kemono</title>
to
<title>
NAME | Kemono
</title>
3 years ago
Mike Fährmann
befe635022
cache parsed Formatter functions
3 years ago
Mike Fährmann
a416e54765
[directlink] manually encode Referer URLs ( fixes #1647 )
...
Trying to send a non-latin-1-encodable header raises an exception,
so we encode the Referer value ourselves with 'errors=ignore'.
3 years ago
Mike Fährmann
8bdeb2a6dd
[webtoons] match arbitrary language codes ( closes #1643 )
3 years ago
Mike Fährmann
79b7ee2712
use 'functools.partial' in '_build_cleanfunc' when possible
...
makes calls to the returned function a slight bit faster (~10%)
3 years ago
Mike Fährmann
ceaf7fd989
optimize 'base-directory' initialization and usage
...
apply 'clean_path()' only once
3 years ago
Mike Fährmann
2ca011dfa8
add 'kwdict' argument to PathFormat.build_filename()
3 years ago
Mike Fährmann
fd00d47116
implement conditional directories ( #1394 )
...
They work the same way as conditional filenames (84d2e640
), e.g.
"directory": {
"score >= 20": ["high score"],
"score >= 5" : ["mid score"],
"" : ["{category}", "default"]
}
3 years ago
Mike Fährmann
ee1064a2b2
release version 1.18.0
3 years ago
Mike Fährmann
4adc44df69
[furaffinity] improve metadata extraction ( fixes #1630 )
...
Fetch 'title' and 'artist' metadata from a different location,
since for posts with an empty title the <title> element is
completely empty and does not contain the artist's name.
3 years ago
Mike Fährmann
e98fa01c44
[hitomi] update image URL code ( fixes #1637 )
3 years ago
Mike Fährmann
e9ab97396f
[kemonoparty] update default filenames and archive IDs ( #1514 )
...
Add an enumeration index so that attachments and regular files with the
same filename still get downloaded and not counted as duplicate files
(even though for patreon posts they usually are)
This invalidates all previously generated archive IDs.
To keep using old names and IDs, set
'filename' to "{id}_{title}_{filename}.{extension}" and
'archive-format' to "{service}_{user}_{id}_{filename}.{extension}".
3 years ago
Mike Fährmann
fb4b4725ba
[hiperdex] match 'hiperdex2.com' URLs
...
still doesn't properly work due to Cloudflare CAPTCHA and IUAM page
3 years ago