Mike Fährmann
5f1b13d1a5
release version 1.18.1
3 years ago
Mike Fährmann
21c2da454f
update extractor test results
3 years ago
Mike Fährmann
7f591c78cb
[mangafox] cleanup
3 years ago
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox ( #1633 )
3 years ago
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL
3 years ago
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option ( #1569 )
...
similar to Twitter's 'text-tweets'
3 years ago
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets ( #1481 )
3 years ago
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string ( #1665 )
3 years ago
Mike Fährmann
963d177a68
document format string syntax
...
or at least attempt to
3 years ago
Mike Fährmann
9ee45f3617
[kemonoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
344aab3fb7
[seisoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
035562bd11
[twitter] remove old-style URLs from image fallback lists
3 years ago
Mike Fährmann
daf821b8b6
[seisoparty] use user names instead of IDs by default ( #1635 )
3 years ago
Mike Fährmann
e4db1bad14
[seisoparty] also extract files hosted on 'cdn-2' servers ( #1635 )
3 years ago
Mike Fährmann
267bbf5996
[mangasee] add 'chapter' and 'manga' extractors
3 years ago
Mike Fährmann
fad4918208
[deviantart] use UUIDs in internal folder/collection URLs
3 years ago
Mike Fährmann
64986f9435
fix depth counter in UrlJob
...
regression from adf4d661
It would either stop at the first level (-g) or go infinitely deep (-G).
Going down to, for example, level 3 with -ggg didn't work.
3 years ago
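The depth behaviour described in 64986f9435 can be illustrated with a minimal sketch. This is not the UrlJob code: `collect_urls`, the `children` mapping, and the convention that a negative `maxdepth` means "no limit" are all hypothetical names and assumptions made for illustration.

```python
def collect_urls(url, children, maxdepth, depth=1):
    """Return 'url' plus URLs found by descending at most 'maxdepth' levels.

    'children' is a hypothetical mapping of URL -> list of child URLs.
    A negative 'maxdepth' is assumed to mean "no limit".
    """
    found = [url]
    # Stop descending once the requested level has been reached
    if maxdepth >= 0 and depth >= maxdepth:
        return found
    for child in children.get(url, ()):
        found.extend(collect_urls(child, children, maxdepth, depth + 1))
    return found


tree = {"a": ["b"], "b": ["c"], "c": ["d"]}
level_1 = collect_urls("a", tree, 1)    # first level only
level_3 = collect_urls("a", tree, 3)    # three levels deep
no_limit = collect_urls("a", tree, -1)  # follow everything
```

Passing the current depth down with each call, rather than comparing against a fixed value, is what lets intermediate levels such as 3 behave differently from both "first level only" and "unlimited".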
Mike Fährmann
0179581340
add 'T' format string conversion ( #1646 )
...
to convert 'date'/datetime to timestamp
3 years ago
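As a rough illustration of what a datetime-to-timestamp conversion like the one in 0179581340 involves, here is a minimal sketch. The helper name `to_timestamp` is hypothetical, and treating naive datetime values as UTC is an assumption, not something the commit message states.

```python
import calendar
from datetime import datetime


def to_timestamp(dt):
    """Convert a naive datetime, interpreted as UTC, to an integer Unix timestamp."""
    # timegm() is the UTC counterpart of time.mktime() and ignores the local timezone
    return calendar.timegm(dt.utctimetuple())


epoch = to_timestamp(datetime(1970, 1, 1))
new_year_2010 = to_timestamp(datetime(2010, 1, 1))
```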
Mike Fährmann
f74cf52e2b
[seisoparty] add 'user' and 'post' extractors ( #1635 )
3 years ago
Mike Fährmann
759735fb02
[kemonoparty] fix 'username' extraction ( fixes #1652 )
...
The site's <title> content changed from
<title>NAME | Kemono</title>
to
<title>
NAME | Kemono
</title>
3 years ago
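One way to extract the name from both <title> layouts mentioned in 759735fb02 is to tolerate whitespace around the content. The sketch below is an assumption about how this could be done, not the extractor's actual fix; `extract_username` is a hypothetical helper.

```python
import re

# Allow arbitrary whitespace (including newlines) around the title text
TITLE_PATTERN = re.compile(r"<title>\s*(.*?)\s*\|\s*Kemono\s*</title>", re.DOTALL)


def extract_username(page):
    """Return the NAME part of '<title>NAME | Kemono</title>', or None."""
    match = TITLE_PATTERN.search(page)
    return match.group(1) if match else None


old_layout = "<title>NAME | Kemono</title>"
new_layout = "<title>\nNAME | Kemono\n</title>"
```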
Mike Fährmann
befe635022
cache parsed Formatter functions
3 years ago
Mike Fährmann
a416e54765
[directlink] manually encode Referer URLs ( fixes #1647 )
...
Trying to send a non-latin-1-encodable header raises an exception,
so we encode the Referer value ourselves with 'errors=ignore'.
3 years ago
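The encoding step described in a416e54765 can be sketched as follows. `encode_referer` is a hypothetical helper name; only the use of latin-1 with the 'ignore' error handler comes from the commit message.

```python
def encode_referer(url):
    """Return 'url' as latin-1 bytes, dropping characters that cannot be encoded."""
    return url.encode("latin-1", "ignore")


plain = encode_referer("https://example.org/a.jpg")
latin = encode_referer("https://example.org/caf\u00e9.jpg")
dropped = encode_referer("https://example.org/\u65e5\u672c\u8a9e/a.jpg")
```

Characters outside latin-1 are silently removed, so the resulting header value may differ from the original URL, but sending it no longer raises an exception.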
Mike Fährmann
8bdeb2a6dd
[webtoons] match arbitrary language codes ( closes #1643 )
3 years ago
Mike Fährmann
79b7ee2712
use 'functools.partial' in '_build_cleanfunc' when possible
...
makes calls to the returned function a slight bit faster (~10%)
3 years ago
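The idea behind 79b7ee2712 can be sketched like this. It is not the actual '_build_cleanfunc' implementation: the function name `build_cleanfunc`, its parameters, and the regex-based approach are assumptions chosen to show where 'functools.partial' can replace a wrapper function.

```python
import functools
import re


def build_cleanfunc(chars, repl):
    """Return a function that replaces every character in 'chars' with 'repl'."""
    if not chars:
        return lambda text: text  # nothing to clean

    pattern = re.compile("[" + re.escape(chars) + "]")
    # partial() binds 'repl' once, so each call goes straight to pattern.sub()
    # instead of passing through an extra Python-level wrapper function
    return functools.partial(pattern.sub, repl)


clean = build_cleanfunc('<>:"|?*', "_")
cleaned = clean("a<b>:c?")
```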
Mike Fährmann
e661607e8b
[mangadex] document 'metadata' and 'lang' options ( #1535 )
3 years ago
Mike Fährmann
ceaf7fd989
optimize 'base-directory' initialization and usage
...
apply 'clean_path()' only once
3 years ago
Mike Fährmann
2ca011dfa8
add 'kwdict' argument to PathFormat.build_filename()
3 years ago
Mike Fährmann
fd00d47116
implement conditional directories ( #1394 )
...
They work the same way as conditional filenames (84d2e640), e.g.
"directory": {
    "score >= 20": ["high score"],
    "score >= 5" : ["mid score"],
    ""           : ["{category}", "default"]
}
3 years ago
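The selection logic implied by the conditional-directory example in fd00d47116 can be sketched as below. How the conditions are really evaluated is not stated in the commit message; `select_directory` and the use of `eval()` are assumptions for illustration, and `eval()` should never be applied to untrusted input.

```python
def select_directory(conditions, kwdict):
    """Return the first directory format whose condition holds for 'kwdict'.

    The entry with an empty condition string is assumed to be the default.
    """
    default = None
    for expr, directory in conditions.items():
        if not expr:
            default = directory
        # Evaluate the condition with the metadata fields as local variables
        elif eval(expr, {"__builtins__": {}}, kwdict):
            return directory
    return default


conditions = {
    "score >= 20": ["high score"],
    "score >= 5": ["mid score"],
    "": ["{category}", "default"],
}
high = select_directory(conditions, {"score": 25, "category": "example"})
mid = select_directory(conditions, {"score": 7, "category": "example"})
low = select_directory(conditions, {"score": 1, "category": "example"})
```

Conditions are checked in the order they are written, so the more specific "score >= 20" has to come before "score >= 5".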
Mike Fährmann
ee1064a2b2
release version 1.18.0
3 years ago
Mike Fährmann
4adc44df69
[furaffinity] improve metadata extraction ( fixes #1630 )
...
Fetch 'title' and 'artist' metadata from a different location,
since for posts with an empty title the <title> element is
completely empty and does not contain the artist's name.
3 years ago
Mike Fährmann
e98fa01c44
[hitomi] update image URL code ( fixes #1637 )
3 years ago
Mike Fährmann
e9ab97396f
[kemonoparty] update default filenames and archive IDs ( #1514 )
...
Add an enumeration index so that attachments and regular files with the
same filename still get downloaded and are not counted as duplicate files
(even though for patreon posts they usually are).
This invalidates all previously generated archive IDs.
To keep using old names and IDs, set
'filename' to "{id}_{title}_{filename}.{extension}" and
'archive-format' to "{service}_{user}_{id}_{filename}.{extension}".
3 years ago
Mike Fährmann
fb4b4725ba
[hiperdex] match 'hiperdex2.com' URLs
...
still doesn't work properly because of the Cloudflare CAPTCHA and IUAM page
3 years ago
Mike Fährmann
95bc1139e0
[instagram] update query hashes
3 years ago
Mike Fährmann
23018a46f6
[instagram] fix login ( fixes #1631 )
3 years ago
Mike Fährmann
cac0110d8b
[redgifs] update API server address ( fixes #1632 )
...
napi.redgifs.com -> api.redgifs.com
3 years ago
Mike Fährmann
0d2961ae81
[500px] remove last query hash entry
...
forgot to include this in b56e2450
3 years ago
Mike Fährmann
7273cf8536
[pixiv] support fetching privately followed users ( fixes #1628 )
3 years ago
Mike Fährmann
e60962f7e5
[philomena] improve tag escapes handling ( fixes #1629 )
3 years ago
Mike Fährmann
d8908ca577
[unsplash] update collections URL pattern ( fixes #1627 )
3 years ago
Mike Fährmann
9ed13703cc
[sankaku] handle empty tags ( fixes #1617 )
3 years ago
Mike Fährmann
b56e245094
[500px] update GraphQL queries
...
500px changed its method from query hashes to sending the entire query
string for every request.
3 years ago
Mike Fährmann
e9993451c3
remove Python version checks
3 years ago
Mike Fährmann
40eb8025cc
require an argument for --clear-cache
3 years ago
Mike Fährmann
a751afdfb3
[twitter] change some defaults
...
- 'retweets' option: true -> false
- 'quoted' option : true -> false
  i.e. disable downloading tweets from other users' timelines by default
- search directory:
  '["{category}", "Search", "{search}"]' ->
  '["{category}", "{user[name]}"]'
  i.e. change it to the same as other twitter extractors (#1308)
3 years ago
Mike Fährmann
4e4ca3c330
[deviantart] pin API version ( #1611 )
...
'/gallery/folders' in the newest version doesn't include subfolders.
It probably only needs the right query parameter to do so, but that
doesn't seem to be documented anywhere.
3 years ago
Mike Fährmann
d09bc5bd34
[subscribestar] improve attachment filenames ( #1609 )
3 years ago
Mike Fährmann
2986bf63bf
[mangafox] update URL pattern ( fixes #1608 )
...
also accept non-numeric volume labels, e.g. vTBD
3 years ago
Mike Fährmann
53dab5c289
[mangadex] revert chapter handling ( #1535 )
...
Spawn a new ChapterExtractor for each individual chapter
instead of handling them directly with a MangaExtractor.
Doing it that way broke too many features like
--chapter-filter, --chapter-range, --zip, etc.
3 years ago
Mike Fährmann
def0148582
restructure code in PathFormat constructor
3 years ago