Mike Fährmann
69995d789b
Revert "[twitter] use '{author[name]}' in default directory names"
...
This reverts commit 9ad3cdc5d8.
2 years ago
Mike Fährmann
946643c23c
[hitomi] use maxage for gg.js cache ( #2863 )
...
cached values become invalid after 1-2 hours
2 years ago
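The 'maxage' idea from 946643c23c can be illustrated with a small time-limited memoizer. This is only a sketch, not gallery-dl's actual cache module: the names `ttl_cache` and `fetch_gg_values`, the injectable `clock` parameter, and the 3600-second limit are all assumptions made for the example.

```python
import time
from functools import wraps


def ttl_cache(maxage, clock=time.monotonic):
    """Memoize a function, discarding entries older than `maxage` seconds.

    Hypothetical helper for illustration; `clock` is injectable so the
    expiry logic can be exercised without waiting.
    """
    def decorator(func):
        store = {}  # maps argument tuple -> (timestamp, value)

        @wraps(func)
        def wrapper(*args):
            now = clock()
            hit = store.get(args)
            if hit is not None and now - hit[0] < maxage:
                return hit[1]  # entry is still fresh
            value = func(*args)  # missing or expired: compute again
            store[args] = (now, value)
            return value
        return wrapper
    return decorator


@ttl_cache(maxage=3600)  # assumed limit; the commit only says 1-2 hours
def fetch_gg_values(url):
    # Stand-in for downloading and parsing a remote script.
    return {"source": url, "fetched_at": time.time()}
```

Without an age limit, a value cached once would keep being returned after it has become invalid on the server side.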
Mike Fährmann
d508b2c049
[gelbooru] implement 'pool' pagination ( #2853 )
2 years ago
Mike Fährmann
67a2efb885
[rule34] implement 'pool' pagination ( #2853 )
2 years ago
Mike Fährmann
70dc4ce911
[skeb] ignore article images with empty URL
...
8cf5981ded (commitcomment-81980633)
2 years ago
Mike Fährmann
f362d4a3c7
[e621] fix 'popular' extraction
2 years ago
Mike Fährmann
7e385ed63e
[foolfuuka] update domains
...
- remove nyafuu
- add rozenarcana (https://archive.alice.al/ )
- add tokyochronos (https://www.tokyochronos.net )
2 years ago
Mike Fährmann
6ba72b6bc6
[twitter] ignore invalid user entries ( #2850 )
2 years ago
blankie
e4cff67aaa
[tumblr] add count metadata field ( #2804 )
...
Fixes #2778
2 years ago
enduser420
574e38a287
[kemonoparty] add 'favorites' option ( #2826 ) ( #2831 )
...
* [kemonoparty] add 'favorites' option (#2826 )
* [kemonoparty] add regex for the url parameter and fall back on the config option
* [kemonoparty] simplify
2 years ago
Mike Fährmann
a799fae2df
[catbox] add 'album' extractor ( #2410 )
...
adapted from https://github.com/mikf/gallery-dl/pull/2805
- rewrite using GalleryExtractor
- extract more metadata
- match lolisafe names
- add test
2 years ago
Mike Fährmann
264f1336ad
[twitter] unescape '+' in search queries ( #2226 )
...
... and do not raise exception if searched user does not exist
2 years ago
Mike Fährmann
21ff77fea0
[zerochan] extract more metadata for single posts
...
Neither HTML pages nor RSS feed entries have *all* metadata.
It might be necessary to do 1-2 extra HTTP requests to grab everything.
2 years ago
Mike Fährmann
391aecf219
[instagram] provide 'date' for directories ( #2830 )
2 years ago
Martin Juhasz
9ea521bec5
[oauth] add host config option ( #2806 )
2 years ago
Mike Fährmann
9ad3cdc5d8
[twitter] use '{author[name]}' in default directory names
...
with the changes to 'user' (749802c7),
'{user[name]}' with enabled retweets / quote tweets
would put a lot of them in a wrong directory
2 years ago
Mike Fährmann
1c89ccb27d
[weibo] prevent errors when paginating over album entries ( #2817 )
2 years ago
Mike Fährmann
a27b17481f
[tumblr] restrict condition for calling _original_image
2 years ago
Mike Fährmann
df1c643dda
[tumblr] attempt to extract full-resolution photos
...
- for photos with apparent width == 2048 or height == 3072
- can be disabled with 'original' option
2 years ago
Mike Fährmann
a69f198ac9
[wallhaven] implement 'metadata' option ( #2803 )
2 years ago
Mike Fährmann
033faf67ec
[poipiku] update filter for static images ( #2796 )
2 years ago
Mike Fährmann
b9a14b5758
[poipiku] fix posts with more than one image ( #2796 )
2 years ago
Mike Fährmann
0f5826e884
[weibo] prevent exception for missing 'playback_list' ( #2792 )
2 years ago
Mike Fährmann
81a37d21d3
[twitter] simplify 'user' assignment
2 years ago
Mike Fährmann
8a70b94245
[twitter] implement constant 'user' for tweet URLs
2 years ago
Mike Fährmann
8cf5981ded
[skeb] add option to download 'article' images ( #1031 )
2 years ago
Mike Fährmann
43ec315a7f
[deviantart] use public access token for journals ( #2702 )
...
and retry with a private token if needed
2 years ago
Mike Fährmann
3f08a91131
[bunkr] fix extraction ( #2788 )
...
... again
2 years ago
Mike Fährmann
5038893cdd
[blogger] emit metadata for posts without files ( #2789 )
2 years ago
Mike Fährmann
98af5a0409
[zerochan] implement login with username & password ( #1434 )
2 years ago
Mike Fährmann
3a8addfe45
[zerochan] add 'tag' and 'image' extractors ( #1434 )
2 years ago
Mike Fährmann
e660e48a60
[vk] prevent exceptions for broken/invalid photos ( #2774 )
2 years ago
Mike Fährmann
f559943d77
[instagram] fix empty 'params' in '_pagination_api()'
2 years ago
Mike Fährmann
1540d0e695
[twitter] use filter:links ( #2766 )
2 years ago
Mike Fährmann
8d0801ad8e
[twitter] fall back to unfiltered search ( #2766 )
2 years ago
Marius Kaufmann
0aa8345a13
[mastodon] allow downloading without access token ( #2782 )
...
Most mastodon instances allow accessing /api/v1/accounts/XXXX/statuses and /api/v1/statuses/XXXX without an API access token.
This commit allows users to download at least some links from such a mastodon instance that does not already have access tokens hard-coded into the extractor.
User extractor only works on links that include the user id such as https://mastodon.tld/@id:12345 . Status links work as-is.
2 years ago
thatfuckingbird
ea5ffb19a6
fanbox: download cover images in original size ( #2784 )
2 years ago
Chew Shee Yang
977d53b640
[Instagram] Add support for user's saved collection ( #2769 )
...
* [Instagram] Add support for user's saved collection
* [Instagram] Run formatter
* [Instagram] Simplify collection_id retrieval and add metadata
* [Instagram] Fix bug when params is not passed to _pagination_api
2 years ago
blankie
5b63df46c0
[tumblr] attempt to get higher-quality images ( #2761 )
2 years ago
blankie
59b16b3f70
[artstation] add 'num' and 'count' metadata fields ( #2764 )
2 years ago
Mike Fährmann
eb68d45544
add global 'warnings' option ( #2762 )
2 years ago
Mike Fährmann
f225247670
[gelbooru] add support for `api_key` and `user_id` ( #2767 )
2 years ago
Mike Fährmann
77bdd8fe0f
[twitter] implement constant 'user' for 'from:…' searches
2 years ago
Mike Fährmann
a267a05a3f
[twitter] update 'quote_id' and 'quote_by'
...
- 'quote_id' is now non-null for quoted Tweets and has the ID of the
quoting Tweet, instead of the other way round as before
- 'quote_by' is now the 'screen_name' of the quoting user
(it previously held what the new 'quote_id' holds now)
2 years ago
Mike Fährmann
749802c7bd
[twitter] update 'user' and 'author' fields
...
- 'author' is always the user who authored a tweet
- 'user' is always the user specified in the input URL
or equal to 'author' when the former is not given
2 years ago
Mike Fährmann
a566e63cdf
[tumblr] support '/blog/view' URLs ( #2760 )
2 years ago
Mike Fährmann
46f11a3118
[bunkr] fix extraction ( #2732 )
...
move bunkr.is code to its own module
2 years ago
Mike Fährmann
baf3815ebd
[nozomi] small code optimizations
2 years ago
blankie
836402bf58
[twitter] unescape content ( #2756 ) ( #2757 )
...
Fixes #2756
2 years ago
Mike Fährmann
62cc47755b
[nozomi] reduce memory consumption during searches ( #2754 )
...
only load and use the entire 'index.nozomi' database
if there are only negative search terms
2 years ago
Mike Fährmann
467a2a4d35
[instagram] add 'pinned' metadata field ( #2752 )
...
'pinned' is a list of user IDs for which a post is pinned
and empty if not pinned anywhere.
2 years ago
Mike Fährmann
fe2b3d57d4
[komikcast] update domain
2 years ago
Mike Fährmann
4e11ca737e
[hentaifoundry] fix metadata extraction
2 years ago
Mike Fährmann
f2e59cc906
[slideshare] fix 'description' extraction
2 years ago
Mike Fährmann
31e868fca1
[khinsider] extract 'platform' metadata
2 years ago
Mike Fährmann
c6a9bab019
update extractor test results
2 years ago
Mike Fährmann
539e3bbed9
[weibo] handle invalid/broken status objects
2 years ago
Mike Fährmann
32c75d12e8
[sankaku] rewrite URLs to s.sankakucomplex.com ( #2746 )
2 years ago
Mike Fährmann
d5ded11aa8
[pixiv] fix default filenames for backgrounds
2 years ago
Mike Fährmann
e1f501ed14
[mangakakalot] update domain
2 years ago
Mike Fährmann
2dc57637cf
[foolfuuka] remove archive.wakarimasen.moe
2 years ago
Mike Fährmann
98744977cf
[itaku] fix 'date' parsing
2 years ago
Mike Fährmann
b590774f67
[twitter] add 'count' metadata field ( #2741 )
2 years ago
Mike Fährmann
7c0505868c
[kemonoparty] ensure all files have an 'extension' ( #2740 )
2 years ago
Mike Fährmann
e4f48cc810
make it easier to disable default 'browser' settings
...
Previously it was necessary to set 'browser' to a non-empty, non-string
value to disable any default 'browser' value.
Now '-o browser=' or '-o browser=false' is enough.
2 years ago
Mike Fährmann
92b75bcdce
limit path length for --write-pages output on Windows ( #2733 )
2 years ago
Mike Fährmann
311e9383af
[pinterest] handle section pins with separate extractors ( #2684 )
2 years ago
Mike Fährmann
1d14928bd9
[twitter] ignore previously seen Tweets ( #2712 )
...
occurs primarily for /with_replies results when logged in
2 years ago
Mike Fährmann
4b2a0a0eda
[twitter] implement 'strategy' option ( #2712 )
...
to be able to better control what Tweets get used and returned
for twitter.com/USER URLs.
2 years ago
Mike Fährmann
c794777600
[newgrounds] prevent exception on empty results ( #2727 )
2 years ago
Mike Fährmann
36ead45546
[itaku] fix caching bug ( #1842 )
...
ItakuApi.user() would always return the first user it was called with,
regardless of its 'username' argument.
2 years ago
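The symptom described in 36ead45546 is typical of a cache that does not take the call arguments into account. The sketch below reproduces that behaviour with two hypothetical decorators; it is not the code that was changed in gallery-dl, and `cache_ignoring_args`, `cache_by_args`, and `lookup_user` are names invented for this illustration.

```python
from functools import wraps


def cache_ignoring_args(func):
    """Buggy variant: stores a single result and returns it for every call."""
    result = []

    @wraps(func)
    def wrapper(*args):
        if not result:
            result.append(func(*args))
        return result[0]  # same value, whatever the arguments are
    return wrapper


def cache_by_args(func):
    """Fixed variant: stores one result per distinct argument tuple."""
    results = {}

    @wraps(func)
    def wrapper(*args):
        if args not in results:
            results[args] = func(*args)
        return results[args]
    return wrapper


def lookup_user(username):
    # Stand-in for an API request that returns a user object.
    return {"username": username}


buggy = cache_ignoring_args(lookup_user)
fixed = cache_by_args(lookup_user)
```

Calling `buggy("alice")` and then `buggy("bob")` returns the "alice" object both times, while `fixed` returns a separate object for each username.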
Mike Fährmann
127a190c94
[itaku] categorize sections by group ( #1842 )
2 years ago
Mike Fährmann
de20cadc68
add 'brotli' as optional dependency ( #2716 )
...
only send 'Accept-Encoding: br' if supported
2 years ago
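The rule from de20cadc68, advertising 'br' only when it can be decoded, might look roughly like the following. This is a sketch under assumptions: the function names are hypothetical, the module names `brotli` and `brotlicffi` are the two common Python bindings rather than something the commit message names, and the base encodings "gzip, deflate" are chosen for the example.

```python
import importlib.util


def brotli_available():
    """Return True if a Brotli decoder module appears to be installed."""
    # Assumed module names; the commit message does not list them.
    return any(
        importlib.util.find_spec(name) is not None
        for name in ("brotli", "brotlicffi")
    )


def accept_encoding(have_brotli=None):
    """Build an Accept-Encoding value, adding 'br' only when supported."""
    if have_brotli is None:
        have_brotli = brotli_available()
    encodings = ["gzip", "deflate"]
    if have_brotli:
        encodings.append("br")
    return ", ".join(encodings)


headers = {"Accept-Encoding": accept_encoding()}
```

Advertising an encoding that the client cannot decode would let a server reply with a body that cannot be read, which is why the header depends on the optional dependency being present.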
Mike Fährmann
37453a9528
[newgrounds] only login if necessary ( #2715 )
2 years ago
Mike Fährmann
7b073bf9ef
Revert "[twitter] improve strategy for user URLs ( #2665 )"
...
'user_tweets_and_replies' was a mistake
2 years ago
Mike Fährmann
3a5d5c3a91
update default User-Agent header to Firefox 102 ESR
...
and update headers and ciphers for "browser": "firefox"
2 years ago
Mike Fährmann
f8cfc3b08a
[skeb] add 'following' extractor ( #2698 )
2 years ago
Mike Fährmann
367a491128
[vk] get URLs from *_src entries ( #2535 )
...
https://github.com/mikf/gallery-dl/issues/2535#issuecomment-1166566986
2 years ago
Mike Fährmann
241e82e18d
[horne] add support for horne.red ( #2700 )
2 years ago
Mike Fährmann
7af4d2047b
[instagram] improve metadata generated by _parse_post_api()
...
(#2695 )
2 years ago
Mike Fährmann
3f50e2fb5f
[poipiku] add simple password support ( #1602 )
2 years ago
Mike Fährmann
9d8e99af80
[itaku] support videos ( #1842 )
2 years ago
Mike Fährmann
c8ec2c4e85
[itaku] add 'title' to default filenames ( #1842 )
2 years ago
Mike Fährmann
e0c60a1206
[itaku] metadata cleanup ( #1842 )
...
- parse 'date_added' as 'date'
- simplify 'tags', 'categorized_tags', and 'sections'
2 years ago
Mike Fährmann
27e8078fb7
[poipiku] add 'user' and 'post' extractors ( #1602 )
2 years ago
Mike Fährmann
fa902cd54d
[itaku] add 'gallery' and 'image' extractors ( #1842 )
2 years ago
Mike Fährmann
d6c6c8a4a0
[twitter] improve '"replies": "self"' ( #2665 )
...
If a username is given in the input URL,
only download from replies by that user.
2 years ago
Mike Fährmann
9c8d895d19
[twitter] implement 'csrf' option ( #2676 )
2 years ago
Mike Fährmann
08db8435f1
[twitter] fix pagination for conversation tweets
...
a relic from the switch to GraphQL API
2 years ago
Mike Fährmann
78d83345d3
[cyberdrop] add fallback URLs ( #2668 )
2 years ago
Mike Fährmann
834e900037
[unsplash] add collection_title and …_id metadata fields ( #2670 )
2 years ago
Mike Fährmann
6db77d4656
[weibo] support '?tabtype=video' listings ( #2601 )
2 years ago
Mike Fährmann
1da3ccf608
[twitter] implement 'expand' option ( #2665 )
2 years ago
Mike Fährmann
0add1fc090
[twitter] improve strategy for user URLs ( #2665 )
...
- use '/with_replies' when appropriate
- consider 'text-tweets'
- build search query as necessary
2 years ago
Mike Fährmann
45c980daf0
[weibo] fix retweets ( #2601 )
2 years ago
Mike Fährmann
ae1b24aa6a
[instagram] automatically invalidate expired login sessions
2 years ago
Mike Fährmann
47a92c8c7e
[instagram] provide 'date' for 'carousel_media' files ( #2660 )
2 years ago
Mike Fährmann
2064f20e11
[instagram] fix 'tag' extractor ( #2659 )
2 years ago
Mike Fährmann
6c0fa2f258
[readcomiconline] update
2 years ago
Mike Fährmann
61cbf8318c
[weibo] fix URLs generated by 'user' extractor ( #2601 )
2 years ago