Mike Fährmann
1b2f9050fb
rename all instances of 'kwds' to 'kwdict'
3 years ago
Mike Fährmann
3868ec02d1
[pururin] update domain and fix extraction
3 years ago
Mike Fährmann
b89a44090f
[naverwebtoon] fix comic extraction
3 years ago
Mike Fährmann
4048eabeab
allow 'extractor.modules' to be a string
with comma-separated module names
3 years ago
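A minimal sketch of how a comma-separated string could be normalized into a list of module names, as the commit above describes. The helper name `normalize_modules` is hypothetical and not taken from the project's code.

```python
def normalize_modules(value):
    """Return a list of module names from a list or a comma-separated string."""
    if isinstance(value, str):
        # "pixiv, twitter" -> ["pixiv", "twitter"]; empty items are dropped
        return [name.strip() for name in value.split(",") if name.strip()]
    # Any other iterable (e.g. a list from the config file) is copied as-is
    return list(value)
```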
Mike Fährmann
c8e678a5b4
[instagram] fix extraction of '/explore/tags/' posts
(closes #1666)
3 years ago
Mike Fährmann
a6a51f207d
[moebooru] fix 'tags' ending with a '+' when logged in (#1702)
3 years ago
Mike Fährmann
f5b097165e
[ytdl] transfer YoutubeDL objects to downloader (#1680)
allows specifying downloader-specific options per subcategory
but overwrites all downloader.ytdl settings
3 years ago
Mike Fährmann
06e69ea79a
[ytdl] actually set options for YoutubeDL objects (#1680)
I somehow managed to remove the options parameter for
the YoutubeDL constructor in 9a849cdf
without noticing ...
3 years ago
Mike Fährmann
fc19010808
[downloader:ytdl] fix 'outtmpl' setting for yt_dlp (#1680)
yt_dlp supports multiple outtmpl settings for different file types and
uses its 'outtmpl_dict' for that.
3 years ago
Mike Fährmann
dfe1f490e9
[mangadex] use custom User-Agent header (#1535)
3 years ago
Mike Fährmann
36a2aff363
[vk] improve metadata extraction and URL pattern (fixes #1691)
- always fetch all user metadata
- use 'user[name]' for directory names if available
3 years ago
Mike Fährmann
b9783403d9
add 'url-metadata' option (#1659, #1073)
3 years ago
Mike Fährmann
e622e004f0
[ytdl] improve module imports (#1680)
Apply 'extractor.ytdl.module' for every URL, not just the first.
3 years ago
Mike Fährmann
e95f99882f
extend 'parent-metadata' functionality (#1687, #1651, #1364)
3 years ago
Mike Fährmann
193401ce3b
[ytdl] "fix" cookie transfer between session and ytdl (#1680)
requests' CookieJar class is not quite compatible with the standard
http.cookiejar.CookieJar used by youtube_dl
3 years ago
Mike Fährmann
9a849cdf61
[ytdl] allow setting 'module' for subcategories (#1680)
3 years ago
Mike Fährmann
dff0da60f9
[ytdl] add 'generic' option (#1680)
3 years ago
Mike Fährmann
d3da96142a
[ytdl] support cookies + username&password (#1680)
3 years ago
Mike Fährmann
36ac2197db
[ytdl] add extractor for sites supported by youtube-dl
(#1680, #878)
Can be used by prefixing any URL with 'ytdl:',
or by setting 'extractor.ytdl.enabled' to 'true'.
3 years ago
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
(closes #1677)
3 years ago
Mike Fährmann
d287d2eb88
[kemonoparty] parse 'o' query parameters (#1674)
3 years ago
Mike Fährmann
8b036778e3
[kemonoparty] add 'max-posts' option (#1674)
3 years ago
Mike Fährmann
5612ca31c2
[hitomi] fix image URLs (closes #1679)
3 years ago
Mike Fährmann
8ecca3af58
[pixiv] add extractor for 'pixivision' articles (#1672)
3 years ago
Mike Fährmann
312a28e78a
[mastodon] add 'replies' option (#1669)
3 years ago
Mike Fährmann
513c491cea
[mastodon] reset 'params' after first pagination iteration
otherwise query parameters in 'params' get specified twice the second
time around - once from the 'links["next"]' URL and once from 'params'
itself.
3 years ago
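The pagination fix described above can be sketched as follows. This is an illustration under assumptions, not the extractor's actual code: `paginate` and the `fetch` callable (returning a page's items and the 'next' URL) are hypothetical names.

```python
def paginate(fetch, url, params):
    """Yield items from every page, following the 'next' link.

    'fetch' is a hypothetical callable returning (items, next_url).
    """
    while url:
        items, url = fetch(url, params)
        yield from items
        # The 'next' URL already carries the query string, so sending
        # 'params' again would specify every parameter twice.
        params = None
```

Only the first request sends `params`; every later request relies on the query string embedded in the 'next' URL.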
Mike Fährmann
a1f5b78039
[mastodon] add 'reblogs' option (#1669)
3 years ago
Mike Fährmann
317ecc8180
use HTML tables in docs/formatting.md
3 years ago
Mike Fährmann
5f1b13d1a5
release version 1.18.1
3 years ago
Mike Fährmann
21c2da454f
update extractor test results
3 years ago
Mike Fährmann
7f591c78cb
[mangafox] cleanup
3 years ago
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox (#1633)
3 years ago
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL
3 years ago
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option (#1569)
similar to Twitter's 'text-tweets'
3 years ago
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets (#1481)
3 years ago
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string (#1665)
3 years ago
Mike Fährmann
963d177a68
document format string syntax
or at least attempt to
3 years ago
Mike Fährmann
9ee45f3617
[kemonoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
344aab3fb7
[seisoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
035562bd11
[twitter] remove old-style URLs from image fallback lists
3 years ago
Mike Fährmann
daf821b8b6
[seisoparty] use user names instead of IDs by default (#1635)
3 years ago
Mike Fährmann
e4db1bad14
[seisoparty] also extract files hosted on 'cdn-2' servers (#1635)
3 years ago
Mike Fährmann
267bbf5996
[mangasee] add 'chapter' and 'manga' extractors
3 years ago
Mike Fährmann
fad4918208
[deviantart] use UUIDs in internal folder/collection URLs
3 years ago
Mike Fährmann
64986f9435
fix depth counter in UrlJob
regression from adf4d661
It would either stop at the first level (-g) or go infinitely deep (-G)
Going down to for example level 3 with -ggg didn't work.
3 years ago
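A rough sketch of the intended depth behaviour, assuming each '-g' adds one level and '-G' means unlimited depth. The function `resolve` and the `children` mapping are hypothetical and only illustrate a per-level counter; they are not UrlJob's implementation.

```python
def resolve(url, depth, children):
    """Return the URLs found after following child URLs 'depth' levels deep.

    'children' is a hypothetical mapping from a URL to the URLs it yields.
    """
    if depth <= 0 or url not in children:
        return [url]
    result = []
    for child in children[url]:
        # Decrement per level, so depth=3 stops exactly three levels down
        result.extend(resolve(child, depth - 1, children))
    return result
```

With a chain a -> b -> c -> d -> e, a depth of 1 yields b, a depth of 3 yields d, and an infinite depth yields e.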
Mike Fährmann
0179581340
add 'T' format string conversion (#1646)
to convert 'date'/datetime to timestamp
3 years ago
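A minimal sketch of a datetime-to-timestamp conversion like the one this commit describes. It assumes the datetime is naive and represents UTC; `to_timestamp` is a hypothetical name, and the project's own conversion may differ.

```python
import calendar
from datetime import datetime


def to_timestamp(dt):
    """Convert a naive datetime, assumed to be UTC, to a Unix timestamp."""
    # timegm() interprets the time tuple as UTC, unlike time.mktime()
    return calendar.timegm(dt.utctimetuple())
```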
Mike Fährmann
f74cf52e2b
[seisoparty] add 'user' and 'post' extractors (#1635)
3 years ago
Mike Fährmann
759735fb02
[kemonoparty] fix 'username' extraction (fixes #1652)
The site's <title> content changed from
<title>NAME | Kemono</title>
to
<title>
NAME | Kemono
</title>
3 years ago
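One way to tolerate both <title> layouts shown above is a whitespace-insensitive pattern. This is a sketch only: the `TITLE` pattern and `extract_username` are hypothetical and not the extractor's actual code.

```python
import re

# \s* absorbs the newlines and spaces the site now places around the text
TITLE = re.compile(r"<title>\s*(.*?)\s*\|\s*Kemono\s*</title>", re.S)


def extract_username(page):
    """Return the name in front of '| Kemono', or None if no title matches."""
    match = TITLE.search(page)
    return match.group(1) if match else None
```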
Mike Fährmann
befe635022
cache parsed Formatter functions
3 years ago
Mike Fährmann
a416e54765
[directlink] manually encode Referer URLs (fixes #1647)
Trying to send a non-latin-1-encodable header raises an exception,
so we encode the Referer value ourselves with 'errors=ignore'.
3 years ago
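The encoding step described above can be illustrated with plain `str.encode`. The helper name `encode_referer` is hypothetical; the sketch only shows the effect of 'errors=ignore' on characters outside latin-1.

```python
def encode_referer(url):
    """Return 'url' as latin-1 bytes, dropping characters that cannot be encoded."""
    # "ignore" silently removes non-latin-1 characters instead of raising
    # UnicodeEncodeError
    return url.encode("latin-1", "ignore")
```

Characters that latin-1 can represent are kept, while others (for example CJK characters) are dropped from the header value.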