Mike Fährmann
f5b097165e
[ytdl] transfer YoutubeDL objects to downloader ( #1680 )
...
allows specifying downloader-specific options per subcategory
but overwrites all downloader.ytdl settings
3 years ago
Mike Fährmann
06e69ea79a
[ytdl] actually set options for YoutubeDL objects ( #1680 )
...
I somehow managed to remove the options parameter for
the YoutubeDL constructor in 9a849cdf
without noticing ...
3 years ago
Mike Fährmann
fc19010808
[downloader:ytdl] fix 'outtmpl' setting for yt_dlp ( #1680 )
...
yt_dlp supports multiple outtmpl settings for different file types and
uses its 'outtmpl_dict' for that.
3 years ago
Mike Fährmann
dfe1f490e9
[mangadex] use custom User-Agent header ( #1535 )
3 years ago
Mike Fährmann
36a2aff363
[vk] improve metadata extraction and URL pattern ( fixes #1691 )
...
- always fetch all user metadata
- use 'user[name]' for directory names if available
3 years ago
Mike Fährmann
b9783403d9
add 'url-metadata' option ( #1659 , #1073 )
3 years ago
Mike Fährmann
e622e004f0
[ytdl] improve module imports ( #1680 )
...
Apply 'extractor.ytdl.module' for every URL, not just the first.
3 years ago
Mike Fährmann
e95f99882f
extend 'parent-metadata' functionality ( #1687 , #1651 , #1364 )
3 years ago
Mike Fährmann
193401ce3b
[ytdl] "fix" cookie transfer between session and ytdl ( #1680 )
...
requests' CookieJar class is not quite compatible with the standard
http.cookiejar.CookieJar used by youtube_dl
3 years ago
Mike Fährmann
9a849cdf61
[ytdl] allow setting 'module' for subcategories ( #1680 )
3 years ago
Mike Fährmann
dff0da60f9
[ytdl] add 'generic' option ( #1680 )
3 years ago
Mike Fährmann
d3da96142a
[ytdl] support cookies + username&password ( #1680 )
3 years ago
Mike Fährmann
36ac2197db
[ytdl] add extractor for sites supported by youtube-dl
...
(#1680 , #878 )
Can be used by prefixing any URL with 'ytdl:',
or by setting 'extractor,ytdl.enabled' to 'true'.
3 years ago
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
...
(closes #1677 )
3 years ago
Mike Fährmann
d287d2eb88
[kemonoparty] parse 'o' query parameters ( #1674 )
3 years ago
Mike Fährmann
8b036778e3
[kemonoparty] add 'max-posts' option ( #1674 )
3 years ago
Mike Fährmann
5612ca31c2
[hitomi] fix image URLs ( closes #1679 )
3 years ago
Mike Fährmann
8ecca3af58
[pixiv] add extractor for 'pixivision' articles ( #1672 )
3 years ago
Mike Fährmann
312a28e78a
[mastodon] add 'replies' option ( #1669 )
3 years ago
Mike Fährmann
513c491cea
[mastodon] reset 'params' after first pagination iteration
...
otherwise query parameters in 'params' get specified twice the second
time around - once from the 'links["next"]' URL and once from 'params'
itself.
3 years ago
Mike Fährmann
a1f5b78039
[mastodon] add 'reblogs' option ( #1669 )
3 years ago
Mike Fährmann
317ecc8180
use HTML tables in docs/formatting.md
3 years ago
Mike Fährmann
5f1b13d1a5
release version 1.18.1
3 years ago
Mike Fährmann
21c2da454f
update extractor test results
3 years ago
Mike Fährmann
7f591c78cb
[mangafox] cleanup
3 years ago
FollieHiyuki
4763bc1e4e
Add MangaExtractor for mangafox ( #1633 )
3 years ago
Mike Fährmann
b519bf567c
[hiperdex] use domain from input URL
3 years ago
Mike Fährmann
93d356712c
[mastodon] implement 'text-posts' option ( #1569 )
...
similar to Twitter's 'text-tweets'
3 years ago
Mike Fährmann
414bdc95a3
[twitter] set 'retweet_id' for original retweets ( #1481 )
3 years ago
Mike Fährmann
5323c1c73a
[twitter] ensure guest tokens are returned as string ( #1665 )
3 years ago
Mike Fährmann
9ee45f3617
[kemonoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
344aab3fb7
[seisoparty] warn about missing DDoS-GUARD cookies
3 years ago
Mike Fährmann
035562bd11
[twitter] remove old-style URLs from image fallback lists
3 years ago
Mike Fährmann
daf821b8b6
[seisoparty] use user names instead of IDs by default ( #1635 )
3 years ago
Mike Fährmann
e4db1bad14
[seisoparty] also extract files hosted on 'cdn-2' servers ( #1635 )
3 years ago
Mike Fährmann
267bbf5996
[mangasee] add 'chapter' and 'manga' extractors
3 years ago
Mike Fährmann
fad4918208
[deviantart] use UUIDs in internal folder/collection URLs
3 years ago
Mike Fährmann
64986f9435
fix depth counter in UrlJob
...
regression from adf4d661
It would either stop at the first level (-g) or go infinitely deep (-G)
Going down to for example level 3 with -ggg didn't work.
3 years ago
Mike Fährmann
0179581340
add 'T' format string conversion ( #1646 )
...
to convert 'date'/datetime to timestamp
3 years ago
Mike Fährmann
f74cf52e2b
[seisoparty] add 'user' and 'post' extractors ( #1635 )
3 years ago
Mike Fährmann
759735fb02
[kemonoparty] fix 'username' extraction ( fixes #1652 )
...
The site's <title> content changed from
<title>NAME | Kemono</title>
to
<title>
NAME | Kemono
</title>
3 years ago
Mike Fährmann
befe635022
cache parsed Formatter functions
3 years ago
Mike Fährmann
a416e54765
[directlink] manually encode Referer URLs ( fixes #1647 )
...
Trying to send a non-latin-1-encodable header raises an exception,
so we encode the Referer value ourselves with 'errors=ignore'.
3 years ago
Mike Fährmann
8bdeb2a6dd
[webtoons] match arbitrary language codes ( closes #1643 )
3 years ago
Mike Fährmann
79b7ee2712
use 'functools.partial' in '_build_cleanfunc' when possible
...
makes calls to the returned function a slight bit faster (~10%)
3 years ago
Mike Fährmann
ceaf7fd989
optimize 'base-directory' initialization and usage
...
apply 'clean_path()' only once
3 years ago
Mike Fährmann
2ca011dfa8
add 'kwdict' argument to PathFormat.build_filename()
3 years ago
Mike Fährmann
fd00d47116
implement conditional directories ( #1394 )
...
They work the same way as conditional filenames (84d2e640
), e.g.
"directory": {
"score >= 20": ["high score"],
"score >= 5" : ["mid score"],
"" : ["{category}", "default"]
}
3 years ago
Mike Fährmann
ee1064a2b2
release version 1.18.0
3 years ago
Mike Fährmann
4adc44df69
[furaffinity] improve metadata extraction ( fixes #1630 )
...
Fetch 'title' and 'artist' metadata from a different location,
since for posts with an empty title the <title> element is
completely empty and does not contain the artist's name.
3 years ago
Mike Fährmann
e98fa01c44
[hitomi] update image URL code ( fixes #1637 )
3 years ago
Mike Fährmann
e9ab97396f
[kemonoparty] update default filenames and archive IDs ( #1514 )
...
Add an enumeration index so that attachments and regular files with the
same filename still get downloaded and not counted as duplicate files
(even though for patreon posts they usually are)
This invalidates all previously generated archive IDs.
To keep using old names and IDs, set
'filename' to "{id}_{title}_{filename}.{extension}" and
'archive-format' to "{service}_{user}_{id}_{filename}.{extension}".
3 years ago
Mike Fährmann
fb4b4725ba
[hiperdex] match 'hiperdex2.com' URLs
...
still doesn't properly work due to Cloudflare CAPTCHA and IUAM page
3 years ago
Mike Fährmann
95bc1139e0
[instagram] update query hashes
3 years ago
Mike Fährmann
23018a46f6
[instagram] fix login ( fixes #1631 )
3 years ago
Mike Fährmann
cac0110d8b
[redgifs] update API server address ( fixes #1632 )
...
napi.redgifs.com -> api.redgifs.com
3 years ago
Mike Fährmann
0d2961ae81
[500px] remove last query hash entry
...
forgot to include this in b56e2450
3 years ago
Mike Fährmann
7273cf8536
[pixiv] support fetching privately followed users ( fixes #1628 )
3 years ago
Mike Fährmann
e60962f7e5
[philomena] improve tag escapes handling ( fixes #1629 )
3 years ago
Mike Fährmann
d8908ca577
[unsplash] update collections URL pattern ( fixes #1627 )
3 years ago
Mike Fährmann
9ed13703cc
[sankaku] handle empty tags ( fixes #1617 )
3 years ago
Mike Fährmann
b56e245094
[500px] update GraphQL queries
...
500px changed its method from query hashes to sending the entire query
string for every request.
3 years ago
Mike Fährmann
e9993451c3
remove Python version checks
3 years ago
Mike Fährmann
40eb8025cc
require an argument for --clear-cache
3 years ago
Mike Fährmann
a751afdfb3
[twitter] change some defaults
...
- 'retweets' option: true -> false
- 'quoted' option : true -> false
i.e. disable downloading tweets from other user's timelines by default
- search directory:
'["{category}", "Search", "{search}"]' ->
'["{category}", "{user[name]}"]'
i.e. change it to the same as other twitter extractors (#1308 )
3 years ago
Mike Fährmann
4e4ca3c330
[deviantart] pin API version ( #1611 )
...
'/gallery/folders' in the newest version doesn't include subfolders.
It probably only needs the right query parameter to do so, but that
doesn't seem to be documented anywhere.
3 years ago
Mike Fährmann
d09bc5bd34
[subscribestar] improve attachment filenames ( #1609 )
3 years ago
Mike Fährmann
2986bf63bf
[mangafox] update URL pattern ( fixes #1608 )
...
also accept non-numeric volume labels, e.g. vTBD
3 years ago
Mike Fährmann
53dab5c289
[mangadex] revert chapter handling ( #1535 )
...
Spawn a new ChapterExtractor for each individual chapter
instead of handling them directly with a MangaExtractor.
Doing it that way broke too many features like
--chapter-filter, --chapter-range, --zip, etc.
3 years ago
Mike Fährmann
def0148582
restructure code in PathFormat constructor
3 years ago
Mike Fährmann
84d2e64024
combine conditional filenames into `filename` option ( #1394 )
3 years ago
Mike Fährmann
1197ee2c20
[mangadex] add extractor for a user's followed feed ( #1535 )
3 years ago
Mike Fährmann
07c8adbd8b
[mangadex] implement login with username & password ( #1535 )
3 years ago
Mike Fährmann
b93cbe6720
[postprocessor:zip] run 'finalize()' regardless of 'filter' ( #1460 )
3 years ago
Mike Fährmann
3e332eaf53
[mangadex] update to API v5 ( #1535 )
3 years ago
Mike Fährmann
ff8d0d7280
[postprocessor:metadata] ensure reliable tags order ( fixes #1598 )
3 years ago
Mike Fährmann
04f4f9badb
[oauth] prevent exceptions when reporting errors ( #1603 )
3 years ago
Mike Fährmann
dfe1e09df0
add '-T/--terminate' command-line option ( #1399 )
3 years ago
Mike Fährmann
6ecb710a7f
add '-P/--postprocessor' command-line option
3 years ago
Mike Fährmann
83fc4c1098
update post processor config capabilities
...
This change makes it possible to specify just the name of a post processor
in the "postprocessors" list instead of a dict with all of its options.
The options for it will then be taken from inside the "postprocessor"
block similar to "extractor", "downloader", or "output" blocks.
This makes it possible to for example override the default settings for
--write-metadata by specifying a custom "metadata" block, or to set a
custom post processor block ("cbz") and then use it by referencing just
its name in "postprocessors" lists.
{
"postprocessor":
{
"metadata": {
"name": "metadata",
"event": "post",
"filename": "{tweet_id|post_id|id}.json"
},
"cbz": {
"name" : "zip",
"compression": "store",
"extension" : "cbz"
}
}
}
3 years ago
Mike Fährmann
4a747a31a3
[postprocessor:metadata] handle dicts in mode;tags ( fixes #1598 )
3 years ago
Mike Fährmann
a3bf878329
[idolcomplex] improve and fix pagination ( #1601 )
...
always rely on the 'next-page-url' value and its query parameters
3 years ago
Mike Fährmann
3cbbefd4ed
support 'filter' option for post processors ( #1460 )
3 years ago
Mike Fährmann
4cf40434d7
initial support for conditional filenames ( #1394 )
3 years ago
Mike Fährmann
0abad8bc12
implement 'compile_expression()'
3 years ago
Mike Fährmann
e39c4633ba
[cyberdrop] b64decode -> a2b_base64
3 years ago
Mike Fährmann
407627ec86
[foolfuuka] support 'archive.wakarimasen.moe' ( closes #1595 )
3 years ago
Mike Fährmann
78f89d2e61
[idolcomplex] fix pagination ( closes #1594 )
3 years ago
Mike Fährmann
f14a36cfc9
release version 1.17.5
3 years ago
Mike Fährmann
52052a0e1a
[manganelo] update domain to 'manganato.com'
3 years ago
Mike Fährmann
c80b18a477
[weibo] extend 'retweets' option ( closes #1542 )
...
Setting 'retweets' to "original" will use metadata from the
original posts, and not from the retweeted ones.
3 years ago
Mike Fährmann
c0fa5058da
[kemonoparty] actually add a 'type' metadata field ( #1556 )
3 years ago
Mike Fährmann
bb50e85795
[postprocessor:ugoira] optimize writing ffconcat files
...
collect all content in-memory first and
write everything with a single 'write()'
3 years ago
Mike Fährmann
5c18db5fab
[postprocessor:ugoira] use 'concat' demuxer by default ( #1550 )
...
'image2' with nanasecond mtime timestamps doesn't work on Windows
3 years ago
Mike Fährmann
19a11fa11d
[postprocessor:ugoira] use frame extension from filenames ( #1550 )
3 years ago
Mike Fährmann
74d97e6c6b
[postprocessor:ugoira] add 'repeat-last-frame' option ( #1550 )
3 years ago
Mike Fährmann
be9547a5f3
[postprocessor:ugoira] use 'image2' demuxer ( #1550 )
...
Ensures exact frame timecodes with no duplicate frames.
Possible issues are the duration the last frame in an Ugoira with variable
frame durations is shown and insufficient timestamp precision of the
underlying file system (e.g. FAT32, ext3; works on ext4, tmpfs, NTFS).
3 years ago
thatfuckingbird
264beb8556
recognize v2.mangapark URLs ( #1578 )
...
* recognize v2.mangapark URLs
* update mangapark root url to use the v2 subdomain
3 years ago
thatfuckingbird
e6811c7450
[pixiv] implement 'max-posts' option ( #1558 )
...
* implement max-rank for pixiv
* rename to max-posts and make more generic
3 years ago
Mike Fährmann
8a909e478d
[imagebam] fix extraction of NSFW images ( #1534 )
3 years ago
Mike Fährmann
b5affc62aa
[twitter] rename 'text-only' to 'text-tweets' ( #570 )
3 years ago
Mike Fährmann
724ca61f36
[twitter] add 'text-only' option ( #570 )
3 years ago
Mike Fährmann
8fd8126117
fix ISO 639-1 code for Japanese
...
"jp" -> "ja"
3 years ago
Mike Fährmann
2c60c7d798
[reactor] skip deleted/empty posts
3 years ago
Mike Fährmann
532ac79fb0
update extractor test results
3 years ago
Mike Fährmann
d7bc4a2b8b
[500px] update query hashes
3 years ago
Mike Fährmann
0f35aca728
[aryion] minor code updates
3 years ago
Mike Fährmann
2eb46452ad
[aryion] update 'needle' to not skip text posts ( fixes #1568 )
...
on "Latest Updates" pages
"class='thumb scrollthumb' href='/g4/view/" and
"class='thumb' href='/g4/view/" both end with
"thumb' href='/g4/view/"
3 years ago
Mike Fährmann
adf4d661b3
use '_extractor' info in UrlJobs
3 years ago
Mike Fährmann
4fc9668922
[imgur] update URL patterns ( #1561 )
3 years ago
Mike Fährmann
1eabfa5c7a
[pillowfort] implement login with username & password ( #846 )
3 years ago
Mike Fährmann
24dd10ac3c
[patreon] extract user defined 'tags' ( #1539 , closes #1540 )
3 years ago
Mike Fährmann
a7e4917ee1
[pillowfort] add 'inline' option ( #846 )
...
to support images present in a post's 'content',
but not listed in 'media'.
also separates the file hash present at the beginning
of each 'filename' into its own field.
3 years ago
Mike Fährmann
efa6cc8ec3
[pillowfort] add 'external' option ( #846 )
...
for links to external Twitter posts etc.
3 years ago
Mike Fährmann
394fbb5f56
[twitter] strip useless t.co links ( #1532 )
...
The 'full_text' of Tweets with media content usually ends with a t.co
link to itself. This commit removes those.
3 years ago
Mike Fährmann
3a7c3ff138
support XDG_CONFIG_HOME ( closes #1545 )
...
This will only load either
${XDG_CONFIG_HOME}/gallery-dl/config.json or
${HOME}/.config/gallery-dl/config.json
if XDG_CONFIG_HOME is empty, never both.
3 years ago
Mike Fährmann
41457dbb1b
[twitter] resolve t.co URLs in 'content' ( #1532 )
3 years ago
Mike Fährmann
2b5d80862e
[kemonoparty] add 'type' metadata field ( #1556 )
...
'file', 'attachment', or 'inline'
3 years ago
Mike Fährmann
17b0ccb071
[twitter] add missing retweet media entities ( fixes #1555 )
...
from the original tweets
3 years ago
Mike Fährmann
5eeaaee01d
[pixiv] add 'metadata' option ( #1551 )
3 years ago
Mike Fährmann
0717456b4e
[kemonoparty] add 'metadata' option ( closes #1548 )
...
to fetch creator names with an additional HTTP request
3 years ago
Mike Fährmann
b50b8e6cf4
refactor applying 'parent-…' options
3 years ago
Mike Fährmann
7ab8374385
add 'parent-skip' option ( #1399 )
3 years ago
Mike Fährmann
c693db5b1a
add '"skip": "terminate"' option
...
Stops not only the current extractor/job,
but all parent extractors/jobs as well.
3 years ago
Mike Fährmann
4835888acc
release version 1.17.4
3 years ago
Mike Fährmann
36ed1efcfb
[pixiv] rename "noop" value for 'tags' option to "original"
...
(#1507 )
3 years ago
Mike Fährmann
14f983eab6
[deviantart] use default ID when 'client-id' is None
3 years ago
Mike Fährmann
3e4ffb0821
[gelbooru] add extractor for '/redirect.php' URLs ( #1530 )
3 years ago
Mike Fährmann
5e54105ae4
[instagram] update query hashes
3 years ago
Mike Fährmann
b3ee10a7fb
[500px] update query hashes
3 years ago
Mike Fährmann
15b0241bbc
[imagebam] fix extraction
3 years ago
Mike Fährmann
38ae61edd4
[inkbunny] add 'favorite' extractor ( #1521 )
3 years ago
Mike Fährmann
577fffad5f
[nozomi] update 'archive_fmt' values for tag and search extractors
...
… so they actually work for posts with more than 1 file.
(fixes #1523 )
3 years ago
Mike Fährmann
e300da1424
add 'output.skip' option
3 years ago
Mike Fährmann
c5ca7905ce
add 'noop()' and 'identity()' functions
3 years ago
Mike Fährmann
755164b36a
improve --clear-cache ( #1230 )
...
Allow for an optional argument to only delete cached entries from
a specific module.
delete all cache entries
$ gallery-dl --clear-cache
or
$ gallery-dl --clear-cache all
only delete entries for instagram
$ gallery-dl --clear-cache instagram
3 years ago
HRXN
e13cae182b
[nozomi] Extend default archive-fmt for Tag and Search Extractor ( #1529 )
...
Closes #1523
3 years ago
Mike Fährmann
bc868e7bb8
consider apparently long extensions as part of the filename
...
(#1516 )
3 years ago
Mike Fährmann
2133f1d77f
[readcomiconline] change domain to 'readcomiconline.li'
...
(closes #1517 )
3 years ago
Mike Fährmann
66f28e471c
[kemonoparty] update file URLs directly linking to kemono.party
...
(#1514 )
3 years ago
Mike Fährmann
6fa20d456b
[sankaku] update invalid-token detection ( fixes #1515 )
3 years ago
Mike Fährmann
4b65ebf652
[kemonoparty] fix file URLs ( #1514 )
...
files are now hosted on https://data.kemono.party/
3 years ago
Mike Fährmann
fa519f9202
[pixiv] change 'translated-tags' option ( #1507 )
...
- rename to 'tags'
- use string-values: "japanese", "translated", "noop"
- remove duplicate entries for "translated" tags
3 years ago
Mike Fährmann
5b4da4b4bf
reorder config access in Job constructor
...
(#1111 )
3 years ago
Mike Fährmann
221015e586
[downloader:http] disable filename extension changes for ugoira
...
(#1507 )
3 years ago
Mike Fährmann
e5123f56c9
fix crash when using --no-download with --ugoira-conv ( #1507 )
3 years ago
Mike Fährmann
07b6661a87
release version 1.17.3
3 years ago
thatfuckingbird
e47952ac14
add extractors for fantia and fanbox ( #1459 )
...
* add extractors for fantia and fanbox
* appease linter
* make docstrings unique
* [fantia] refactor post extraction
* [fantia] capitalize
* [fantia] improve regex pattern
* code style
* capitalize
* [fanbox] use BASE_PATTERN for url regexes
* [fanbox] refactor metadata and post extraction
* [fanbox] improve url base pattern
* [fanbox] accept creator page links ending with /posts
* [fanbox] more tests
* [fantia] improved pagination
* [fanbox] misc. code logic improvements
* [fantia] finish restructuring pagination code
* [fanbox] avoid making a request for each individual post when processing a creator page
* [fanbox] support embedded videos
* [fanbox] fix errors
* [fanbox] document extractor.fanbox.videos
* [fanbox] handle "article" and "entry" post types, all embeds
* [fanbox] fix downloading of embedded fanbox posts
3 years ago
Mike Fährmann
d900edfcfb
[simplyhentai] fix extraction
3 years ago
Mike Fährmann
ba8180b5e6
[bcy] don't crash with deleted posts
3 years ago
Mike Fährmann
d108421461
[myportfolio] fix extraction
3 years ago
Mike Fährmann
8b22d4e667
[mangapark] use '"browser": "firefox"' by default
...
to get rid of Cloudflare CAPTCHA resonses
3 years ago
Mike Fährmann
9514cb8c12
[exhentai] update 'limits' check ( #1487 )
...
Only use 'limits' to set a custom upper bound.
Checking if the actual maximum gets exceeded is not necessary.
3 years ago
thatfuckingbird
141ca4ac0a
[pixiv] also save untranslated tags when translated-tags is enabled ( #1501 )
3 years ago
Renan Vedovato Traba
9322c5e43b
[exhentai] restore limit config ( #1487 )
...
This partially reverts commit e9ec91c8
3 years ago
Mike Fährmann
cb86bb9cc9
[hentaicosplays] add 'slug' metadata field ( closes #1483 )
3 years ago
Mike Fährmann
b4ed7cb961
fix 'category-transfer' ( #1111 )
...
broken since commit 055c32e0
3 years ago
Mike Fährmann
dddda7d0e7
[hentaicosplays] use GalleryExtractor ( #1473 )
3 years ago
Mike Fährmann
d88e34f17e
[webtoons] use GalleryExtractor
3 years ago
Mike Fährmann
c4210b5371
[webtoons] update agegate/GDPR cookies
3 years ago
Mike Fährmann
d89eb7536b
[naverwebtoon] use GalleryExtractor
3 years ago
Mike Fährmann
9b52eb9bf1
[naverwebtoon] ignore non-comic images
3 years ago
Mike Fährmann
bdfcc9c4b1
update extractor test results
3 years ago
Hans Christian Gunawan
334d690687
[hentaicosplays] Add extractor ( #1473 )
3 years ago
Mike Fährmann
82c32d25af
[500px] update query hashes
3 years ago
Mike Fährmann
de14b7ad7a
[slideshare] fix extraction
3 years ago
Mike Fährmann
bef3105121
[komikcast] fix extraction
3 years ago
Mike Fährmann
086925e685
[shopify] support omgmiamiswimwear.com ( closes #1280 )
3 years ago
thatfuckingbird
224b883ff4
[danbooru] add option for extended metadata extraction ( #1458 )
...
* [danbooru] add option for extended metadata extraction
* appease linter
* [danbooru] update docs/configuration.rst
* [danbooru] rename extended-metadata -> metadata
3 years ago
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) ( #1457 )
...
* [booru] add an option to extract notes (currently implemented only for gelbooru)
* appease linter
* [gelbooru] rename "text" to "body" in note extraction
* add a code comment about reusing return value of _extended_tags
3 years ago
Mike Fährmann
78d7ee3ef4
[yuki] remove module for yuki.la
3 years ago
Mike Fährmann
a86ffb04bb
add 'output.fallback' option
...
to enable/disable fallback URLs for -g/--get-urls
3 years ago
Mike Fährmann
5a98bcec3a
[deviantart] improve folder name matching ( fixes #1451 )
3 years ago
thatfuckingbird
918b0441fb
[gelbooru] fix tag category extraction ( #1455 )
3 years ago
Mike Fährmann
fe6ce5b8f8
[erome] skip deleted albums ( fixes #1447 )
3 years ago
Mike Fährmann
457abf0e71
[deviantart] fix pagination for Eclipse results ( fixes #1444 )
...
- don't crash on missing keys
- use fallback for invalid 'nextOffset' values
3 years ago
Mike Fährmann
dee540050f
[8muses] fix JSON unobfuscation
...
limit the characters that get modified,
leave non-ASCII characters alone
4 years ago
Mike Fährmann
b869b3a9eb
[instagram] fetch media for incomplete GraphSidecar posts
...
GraphSidecar results from /tagged pages don't contain
all media elements, only the first one.
(#1439 )
4 years ago
Mike Fährmann
b0686d2174
[instagram] update query hashes
4 years ago
Mike Fährmann
e8e3717b71
[instagram] add extractor for /tagged posts ( #1439 )
4 years ago
Mike Fährmann
abafe71e04
[exhentai] fix image limit detection ( closes #1437 )
...
check for image limit message when downloading original files
4 years ago
Mike Fährmann
a75e485461
add archive format to InfoJob output ( #875 )
4 years ago
Mike Fährmann
52a7913abe
[artstation] download /4k/ images ( #1422 )
4 years ago
Mike Fährmann
37940193a6
build executables with SOCKS proxy support ( closes #1424 )
4 years ago
Christian Paul
41fbc20020
[webtoons]: Add cookie rstagGDPR_DE=true ( #1431 )
4 years ago
Mike Fährmann
583bee7725
release version 1.17.2
4 years ago
FollieHiyuki
e3b9f88540
Add manganelo extractor ( #1415 )
4 years ago
Mike Fährmann
fd858eed7b
[twitter] add 'user_likes' metadata field for liked tweets
...
i.e. the 'screen_name' of the user whose liked tweets get extracted.
Ideally this would replace 'user' or at least be in the same format,
but that would break backwards compatibility or be impossible/too
complicated thanks to API result differences.
(#1421 )
4 years ago
Mike Fährmann
8d124a3766
[twitter] rename variables
4 years ago
Mike Fährmann
105f3c9666
[twitter] add extractor for direct image links ( closes #1417 )
4 years ago
Mike Fährmann
ec3d5d58a8
[vk] improve extractor ( #474 )
...
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
4 years ago
Mike Fährmann
ebd142e2a8
[twitter] don't use youtube-dl for cards when videos are disabled
...
(#1416 )
4 years ago
Mike Fährmann
d5aad999dc
[tapas] implement login with username & password ( #692 )
4 years ago
Mike Fährmann
e9ec91c811
[exhentai] improve image limits check
...
- check if current image is the '509 Bandwidth Exceeded' notification
(https://ehgt.org/g/509.gif or https://exhentai.org/img/509.gif )
- remove 'limits' option
4 years ago
Mike Fährmann
387fe415d5
unescape items in text.split_html()
4 years ago
Mike Fährmann
36291176bc
[pinterest] add 'search' extractor ( #1411 )
4 years ago
Mike Fährmann
058cc47e9b
[bcy] improve pagination
4 years ago
Mike Fährmann
ddd48ceee5
update extractor test results
4 years ago
Mike Fährmann
1a540fbe00
[komikcast] fix extraction
4 years ago
Mike Fährmann
78fd63b8f0
remove 'text.clean_xml()'
...
was not used anywhere
4 years ago