Mike Fährmann
1aafa1d12d
add '--ugoira-conv-copy' command-line option ( #1550 )
3 years ago
Mike Fährmann
a164f4b91e
[postprocessor:ugoira] put '=' before mkvmerge input filename
3 years ago
Mike Fährmann
5c2d953766
[postprocessor:ugoira] auto-select demuxer by default
3 years ago
Mike Fährmann
e45792a5c4
[postprocessor:ugoira] insert extra frame
...
into files generated with the 'image2' demuxer to compensate for the
last frame not being shown for as long as it should.
This only happens for ugoira with non-uniform delays between frames and
only when 'repeat-last-frame' is enabled.
3 years ago
Mike Fährmann
37d584a9b2
[hitomi] update metadata extraction ( fixes #2444 )
...
remove 'hitomi.metadata' option, as it is no longer necessary
to make additional HTTP requests to fetch all metadata.
3 years ago
Mike Fährmann
e718dd7b32
[postprocessor:ugoira] support setting timecodes with mkvmerge
...
by selecting the "mkvmerge" demuxer
(#1550 )
3 years ago
Mike Fährmann
71bba774da
respect 'output.private' in '-K/--list-keywords' output
3 years ago
Mike Fährmann
b03ca7f10c
[aryion] provide correct 'date' independent of dst
3 years ago
Mike Fährmann
ba69fb669d
[kemonoparty] add 'duplicates' option ( closes #2440 )
3 years ago
Mike Fährmann
e7b30866d0
[postprocessor:mtime] fix timestamps from datetime objects ( #2307 )
...
'datetime.timestamp()', which got used to convert datetime objects to
POSIX timestamps, assumes naive datetimes represent LOCAL time, while
datetimes in 'date' metadata fields represent UTC time.
Ref: https://docs.python.org/3/library/datetime.html#datetime.datetime.timestamp
> Naive datetime instances are assumed to represent local time
> you can obtain the POSIX timestamp by … calculating the timestamp directly
3 years ago
Mike Fährmann
29db716a63
implement 'datetime_to_timestamp()'
...
and rename 'to_timestamp()'
to the more descriptive 'datetime_to_timestamp_string()'
3 years ago
Mike Fährmann
c0c1277c5f
[downloader:http] support sending POST data ( #2433 )
...
by setting the '_http_data' metadata field for a file
needed in addition to be3492776b
to download files with POST requests
3 years ago
Mike Fährmann
9313d4dc10
[pinterest] do not force 'm3u8_native' for video downloads ( #2436 )
3 years ago
Mike Fährmann
42f2fd2ed7
[twibooru] fix posts without 'name' ( fixes #2434 )
3 years ago
Mike Fährmann
40ce505420
[postprocessor:ugoira] add 'mtime' option ( #2307 )
3 years ago
Mike Fährmann
9bd27b1b8d
[postprocessor:metadata] implement archive options ( #2421 )
...
'archive', 'archive-format', and 'archive-prefix'
3 years ago
Mike Fährmann
be3492776b
[downloader:http] support using a different method than GET ( #2433 )
...
by setting the '_http_method' metadata field for a file
3 years ago
chinggg
6f1d5e8ab9
[unsplash] replace dash with space in search API queries ( #2429 )
3 years ago
Mike Fährmann
f8230dde43
[instagram] add 'previews' option ( #2135 )
3 years ago
Mike Fährmann
8295bc6d97
fix loading/storing cookies without domain
3 years ago
Mike Fährmann
500a479026
fix a third(!) bug in _check_cookies() ( #2372 )
...
turns out tests are worthless if you get em wrong ...
3 years ago
Mike Fährmann
c4cc387f7d
[furaffinity] fix search result pagination ( fixes #2402 )
3 years ago
Mike Fährmann
cf44aba333
[formatter] allow evaluating f-string literals
...
by starting a format string with '\fF'.
This was technically already possible with '\fE',
but this makes it a bit more convenient.
3 years ago
Mike Fährmann
d04e07d93b
build executables on windows-2019
...
windows-2016 gets removed on 2022.03.15
3 years ago
Mike Fährmann
3acfe683ea
release version 1.21.0
3 years ago
Mike Fährmann
281a5b3b28
[newgrounds] fix video descriptions ( #2328 )
3 years ago
Mike Fährmann
b1b15d6cef
[imagebam] add support for /view/ paths ( closes #2378 )
3 years ago
Mike Fährmann
e64c2b85d0
[fantia] apply patch ( #2381 )
...
from @thatfuckingbird with small adjustments
https://github.com/mikf/gallery-dl/issues/2381#issuecomment-1063208696
3 years ago
Mike Fährmann
f31ab0d2ec
[fanbox] fetch data for each individual post ( fixes #2388 )
...
Posts from 'https://api.fanbox.cc/post.listCreator '
do not contain a 'body' with all images anymore.
https://github.com/mikf/gallery-dl/pull/1459#discussion_r614322881
3 years ago
Mike Fährmann
fc277fa45f
[seiga] require authentication with 'user_session' cookie ( #2372 )
...
Login with username & password would now require entering a 2FA token.
see also 7b009cc893
3 years ago
Mike Fährmann
47cf05c4ab
refactor proxy handling code ( #2357 )
...
- allow gallery-dl proxy settings to overwrite environment proxies
- allow specifying different proxies for data extraction and download
- add 'downloader.proxy' option
- '-o extractor.proxy=–PROXY_URL -o downloader.proxy=null'
now has the same effect as youtube-dl's '--geo-verification-proxy'
3 years ago
Mike Fährmann
d50a1ec2cc
[subscribestar] unescape attachment URLs ( fixes #2370 )
3 years ago
Mike Fährmann
3ddc620ef6
[skeb] fix post extractor ( #2330 )
3 years ago
Orkun Koçyiğit
eb2bb7d998
[fantia] add 'num' enumeration index ( #2377 )
...
* Adding numerical ordering to fantia
* Fixed line to fit PEP8 line size limit
3 years ago
Mike Fährmann
fac8047899
[kemonoparty] limit default filename length ( #2373 )
3 years ago
Mike Fährmann
bfa5e61900
[patreon] add explicit 'image_large' file type ( #2257 )
...
to allow more control over when and if to download 'large_url' images
4fee3a0e52
forced them to be downloaded
instead of regular images, even though 'large_url' images are most likely
an upscaled version of the original.
3 years ago
Mike Fährmann
6ea3ff5173
[tumblr] notify users about registering an oauth application
...
if they hit the daily rate limit and are using default API credentials
3 years ago
Mike Fährmann
b5236656d5
[deviantart] notify users about registering an oauth application
...
if they get repeated 429 errors and are using default API credentials
3 years ago
Mike Fährmann
2aa47e8382
[twitter] handle Tweets with "softIntervention" entries
...
or other such things where the actual Tweet data is one level deeper
than usual
3 years ago
Mike Fährmann
64bbc7969d
[twitter] warn about age-restricted Tweets ( #2354 )
3 years ago
Mike Fährmann
e778be52bc
[twitter] update query hashes
3 years ago
Mike Fährmann
bddcec49f1
implement 'text.root_from_url()'
...
use domain from input URL for kemono
3 years ago
Mike Fährmann
92c492dc09
[kemonoparty] match beta.kemono.party URLs ( #2348 )
3 years ago
Mike Fährmann
4ea9157d51
[mangadex] fix chapters without 'translatedLanguage' ( #2352 )
3 years ago
Alice
f1cab23724
[skeb] add 'sent-requests' option ( #2322 ) ( #2330 )
...
* Update skeb.py
* Update configuration.rst
* flake8
3 years ago
dragobit
781fdfa212
[hentaicosplays] add Referer to headers ( #2317 )
3 years ago
Mike Fährmann
4385a34e05
[twitter] fix handling of 429 responses ( fixes #2339 )
...
Twitter doesn't return a valid JSON response for 429 errors anymore.
3 years ago
Mike Fährmann
5a50569360
[toyhouse] support 'art' listings ( #1546 , #2331 )
3 years ago
Mike Fährmann
1c79044433
[imagebam] set 'nsfw_inter' cookie ( fixes #2334 )
3 years ago
Mike Fährmann
d71c173150
[newgrounds] strip incomplete HTML tag from '_comment' ( #2328 )
3 years ago
Mike Fährmann
00fbc942e5
add 'timedelta' class to the global namespace in filter expressions
...
(https://github.com/mikf/gallery-dl/issues/2308#issuecomment-1046135924 )
3 years ago
Mike Fährmann
cf58048bd4
[newgrounds] add 'post_url' metadata field ( #2328 )
3 years ago
Mike Fährmann
59749555b1
[postprocessor:mtime] add 'event' option ( #2307 )
3 years ago
Mike Fährmann
7958995398
[postprocessor:metadata] implement 'mtime' option ( #2307 )
3 years ago
Mike Fährmann
7aa2e2cd84
[slideshare] fix extraction
3 years ago
Mike Fährmann
fdfdc1b614
[kissgoddess] add 'gallery' and 'model' extractors
...
(closes #1052 , #2304 )
3 years ago
Mike Fährmann
79a461a2c1
[mememuseum] add 'tag' and 'post' extractors ( closes #2264 )
3 years ago
Mike Fährmann
e5f6af6e32
[oauth:pixiv] add note about 'code' expiring in 30 seconds ( #2306 )
3 years ago
Mike Fährmann
bbc4190017
[bunkr] fix .mp4 downloads ( #2239 )
...
again ...
3 years ago
Mike Fährmann
254a5b26e0
[twibooru] add extractors for searches, galleries, and posts
...
(#2219 )
3 years ago
Mike Fährmann
9ebc20e290
[booru] call nameext_from_url() before update() and _prepare()
...
to be able to overwrite filename and extension in _prepare()
3 years ago
Mike Fährmann
4fee3a0e52
[patreon] download 'large_url' images if available ( #2257 )
3 years ago
Mike Fährmann
f5b2b9333f
fix another bug in _check:cookies ( #2160 )
...
regression introduced in ed317bfc
Added a couple of tests to hopefully catch such bugs
before they land in a release.
3 years ago
Mike Fährmann
c8414c9d95
build Linux executable in its own virtualenv
3 years ago
Mike Fährmann
6fdcfa941c
release version 1.20.5
3 years ago
Ailothaen
203a04a4a3
[reddit] Support of standalone submissions on personal pages of users ( #2301 )
...
* [reddit] Support of submissions on personal pages of users
* [reddit] Design improvement for user submissions
* [reddit] Removed functions declared twice
3 years ago
Mike Fährmann
f5b2a9fcc6
add 'signals-ignore' option ( #2296 )
3 years ago
Mike Fährmann
806bc62379
[redgifs] support 'i.redgifs.com' URLs ( closes #2300 )
3 years ago
Mike Fährmann
655b2de5d9
[vk] fix infinite pagination loops ( fixes #2297 )
3 years ago
Mike Fährmann
cc5b1ce91a
[inkbunny] rename search parameters to their API equivalents
...
(fixes #2292 )
3 years ago
Mike Fährmann
ed317bfcf1
warn about cookies expiring in less than 24 hours
...
requires an expiration timestamp,
so this only works with cookies from a cookies.txt file
3 years ago
David Hoppenbrouwers
b17e2dcf93
[wallpapercave] add extractor for images ( #2205 )
3 years ago
v-delta
c661737f36
[Imgbox] Fix ImgboxExtractor ( #2281 )
3 years ago
Thomas Jost
a7de819aca
[lightroom] add Lightroom gallery extractor ( #2263 )
3 years ago
Mike Fährmann
563bd0ecf4
[danbooru] inherit from BaseExtractor
...
- merge danbooru and e621 code
- support booru.allthefallen.moe (closes #2283 )
- remove support for old e621 tag search URLs
3 years ago
Mike Fährmann
bc0e853d30
combine KeyError & IndexError to common base class LookupError
3 years ago
Mike Fährmann
f1c853c6ef
[furaffinity] add 'layout' option ( #2277 )
...
to be able to force gallery-dl to parse according to a specific layout
in case its auto-detect fails
3 years ago
Mike Fährmann
b4f8e15a1f
allow BaseExtractors to use the domain pf the matched URL
3 years ago
Mike Fährmann
c0fddcefc5
[downloader:ytdl] make ImportErrors non-fatal ( #2273 )
3 years ago
Mike Fährmann
a57a44f510
[kemonoparty] handle files without 'name' ( fixes #2276 )
3 years ago
Mike Fährmann
4efe56f419
[furaffinity] improve new/old layout detection ( fixes #2277 )
3 years ago
Mike Fährmann
0f1e7ff319
[twitter] fix extraction ( #2275 )
3 years ago
Mike Fährmann
f351746483
release version 1.20.4
3 years ago
Mike Fährmann
dee0d22561
update extractor test results
3 years ago
Mike Fährmann
d7b8e04b50
[kemonoparty] use 'Accept-Encoding: identity' for all downloads
...
(#2267 )
fixes issues when data send with 'Content-Encoding: gzip' or other
encodings is larger than the actual file
3 years ago
enormous-muscles
55326377d8
Add Kohlchan extractor ( #2251 )
3 years ago
Mike Fährmann
cc7dce5755
[sexcom] add 'pins' extractor ( closes #2265 )
3 years ago
Mike Fährmann
02e18f56be
[e621] add 'favorite' extractor ( closes #2250 )
3 years ago
Mike Fährmann
70e6e1549e
[twitter] provide fallback URLs for card images
...
f2e8aedd74 (commitcomment-64057751)
3 years ago
Mike Fährmann
86fa412b47
[hitomi] add 'format' option ( #2260 )
...
default is 'webp' since downloading original files is no longer allowed
3 years ago
Mike Fährmann
492436f936
[twitter] add 'warnings' option ( #2258 )
...
disable reporting any non-fatal errors by default
3 years ago
Mike Fährmann
a5163e4c70
[twitter] restore 'logout' functionality ( #1719 )
3 years ago
Mike Fährmann
f58364f6a8
update Firefox cipher list
3 years ago
Mike Fährmann
7e6981dda6
rename 'disabletls12' to 'tls12'
...
and let config options override any default settings
3 years ago
Mike Fährmann
bb3e182562
overhaul session initialization
...
- share adapter & connection pool across sessions with the same
ssl options, ssl ciphers, and source address
- simplify browser emulation to just a list of headers and ciphers
3 years ago
Mike Fährmann
e670dc518e
[weibo] update pagination code ( fixes #2244 )
...
- send proper headers and query parameters
- use 'since_id' instead of page numbers
- set a 1-2 second delay between requests
3 years ago
Robert Pendell
4c651f6252
[patreon] Disable TLS 1.2 by default ( #2249 )
...
Disables TLS 1.2 on Patreon by default.
3 years ago
Robert Pendell
392cf079f7
Add ability to disable TLS 1.2 ( #2243 )
...
Fix for Patreon Cloudflare issues by having only TLS v1.3 or higher establish HTTPS connections
This now allows you to disable it on a per-host or global basis. Add disabletls12 as a config option either under extractor.(host) or just under extractor. Option is false by default.
Example:
"patreon":
{
"disabletls12": true,
"cookies": {
"session_id": "X"
}
}
3 years ago
Mike Fährmann
d33227fc38
[twitter] restore errors for protected timelines etc ( fixes #2237 )
3 years ago
Mike Fährmann
ebd3d5c1cc
[bunkr] fix .mp4 downloads ( closes #2239 )
3 years ago
Mike Fährmann
e2be199124
[gelbooru] improve and fix pagination ( #2230 , #2232 )
...
Use 'id:<POSTID' as a tag instead of going through pages with 'pid'.
Something similar was already implemented in 93cef784
,
but that got broken again in 3085aac4
.
3 years ago
Mike Fährmann
806badbeec
release version 1.20.3
3 years ago
Mike Fährmann
8230f31800
[twitter] update query hashes
3 years ago
Mike Fährmann
c180806cec
[twitter] fix deleted/invalid retweets ( #2225 )
3 years ago
Mike Fährmann
a2eecc6aa8
[kemonoparty] fix DMs extraction ( #2008 )
3 years ago
Mike Fährmann
2bf554a896
[twitter] fix several errors ( #2212 , #2216 , #2225 )
...
- fix Tweets with deleted quotes
- fix suspended Tweets without 'legacy' entry
- fix unified_cards without 'type'
3 years ago
Mike Fährmann
fbd17547f5
release version 1.20.2
3 years ago
Mike Fährmann
e5242b83bf
[twitter] define directory format for events ( #2109 )
3 years ago
Mike Fährmann
efb3e65a6a
[sexcom] extend URL pattern ( fixes #2220 )
3 years ago
vsyx
3f2b6335d7
[instagram] fix highlights extraction ( #2197 )
...
* [instagram] fix highlights extraction
* [instagram] improve highlights extraction
- 'yield' individual reels instead of collecting them in a list
and returning them all at once
- reduce 'chunk_size' to an even saver value
(instagram.com also uses 5)
3 years ago
Mike Fährmann
5ed26e1773
[twitter] fix pinned tweets ( #2216 )
...
caused by the changes in dffa440ede
3 years ago
Mike Fährmann
a9f78e6527
[twitter] improve error handling
...
- handle accounts without 'rest_id'
- handle timelines with empty 'instructions'
3 years ago
Mike Fährmann
729b07c1f5
[twitter] simplify
...
- use dict with common GraphQL variables
- reduce 'variables' size with custom JSON encoder instance
- centralise TwitterAPI() creation
3 years ago
Mike Fährmann
7cb29224f0
[philomena] fix search parameter escaping ( #2215 )
...
The pluses from search terms in /tags/ URLs need to be
replaced with spaces to get accepted by Philomena.
3 years ago
Mike Fährmann
9ca8bb2dc0
[twitter] improve error handling
3 years ago
Mike Fährmann
9a221494c3
[twitter] add 'event' extractor ( closes #2109 )
3 years ago
Mike Fährmann
14867dad6b
[twitter] fix unified cards from search results
3 years ago
Mike Fährmann
dffa440ede
[twitter] improve handling of deleted tweets ( #2212 )
3 years ago
Mike Fährmann
54ef874ba4
[twitter] fix retweet filter ( #2212 )
3 years ago
Mike Fährmann
cb43f7731b
[twitter] update to GraphQL API ( #2212 )
...
The old REST API endpoints, which were not used by Twitter since
summer 2021, are going to finally be phased out it seems, with
'/2/timeline/profile/USERID.json' being the first one.
Only Twitter's search doesn't have a GraphQL interface yet.
3 years ago
Mike Fährmann
de754590e0
add --source-address command-line option ( closes #2206 )
3 years ago
Mike Fährmann
698f35215e
[blogger] support new image domain ( fixes #2204 )
3 years ago
Mike Fährmann
c587b678d0
[mangadex] re-enable warning for external chapters ( #2193 )
3 years ago
Mike Fährmann
f2e8aedd74
[twitter] changes to 'cards' option
...
- change default value to 'true'
- only invoke youtube-dl for cards unsupported by gallery
when 'cards' is set to "ytdl"
"cards": true --> only download card images
"cards": "ytdl" --> download card images and
use youtube_dl on otherwise unsupported cards
3 years ago
Mike Fährmann
2d34d8ff8b
[reddit] allow downloading from quarantined subreddits ( #2180 )
3 years ago
Mike Fährmann
17c9c47ca0
[hitomi] fix 'tag' extraction ( fixes #2189 )
3 years ago
Mike Fährmann
df2f0c09bb
[twitter] support "image_carousel_website" unified cards
3 years ago
Mike Fährmann
cdc96e1217
[gelbooru] improve video file detection ( fixes #2188 )
...
not all files from 'https://video-cdnN.gelbooru.com ' are videos
3 years ago
Mike Fährmann
4acc31bd9f
[newgrounds] set suitabilities filter before starting a search
3 years ago
Mike Fährmann
58a7921b5c
release version 1.20.1
3 years ago
Mike Fährmann
170711af7e
[mangadex] fix extraction ( closes #2177 )
3 years ago
Mike Fährmann
199e7616a7
[rule34] use https://api.rule34.xxx for API requests
3 years ago
Mike Fährmann
6e0a6c484f
apply SPECIAL_EXTRACTORS only for blacklist settings
...
as was the case before 010d65dc
3 years ago
Mike Fährmann
37beb1298e
[newgrounds] add 'search' extractor ( closes #2161 )
3 years ago
Mike Fährmann
8b910dd8ae
[hitomi] fix image URLs
...
again and again ...
3 years ago
Mike Fährmann
dcfe08838d
restore -d/--dest functionality
...
change short option for --directory from -d to -D
3 years ago
Mike Fährmann
3085aac4d8
[gelbooru] handle changed API response format ( #2157 )
3 years ago
Mike Fährmann
38e2af29d6
[hitomi] fix image URLs
...
update '_parse_gg()' yet again
3 years ago
Mike Fährmann
6f2e0c9c3d
fix cookie checks for patreon, fanbox, fantia
...
The changes in 9a255344
caused a warning about missing cookies to be
displayed even if those cookies were present, because _check_cookies()
did not account for an empty cookiedomain.
3 years ago
Mike Fährmann
1e0278702d
[hitomi] update '_parse_gg()'
3 years ago
Mike Fährmann
3b7c7daa76
improve UNC path handling ( #2126 )
...
always call 'abspath()' on the directory path to handle cases when the
current working directory is UNC and 'base-directory' is relative.
3 years ago
Mike Fährmann
47eae4c393
release version 1.20.0
3 years ago
Mike Fährmann
becc7f85a6
[hitomi] fix image URLs
3 years ago
Mike Fährmann
6af8d71da6
[kemonoparty] use service as subcategory ( closes #2147 )
3 years ago
Vrihub
96fcff182c
generic extractor ( #735 )
...
* Generic extractor, see issue #683
* Fix failed test_names test, no subcategory needed
* Prefix directory_fmt with "generic"
* Relax regex (would break some urls)
* Flake8 compliance
* pattern: don't require a scheme
This fixes a bug when we force the generic extractor on urls without a
scheme (that are allowed by all other extractors).
* Fix using g: and r: on urls without http(s) scheme
Almost all extractors accept urls without an initial http(s) scheme.
Many extractors also allow for generic subdomains in their "pattern"
variable; some of them implement this with the regex character class
"[^.]+" (everything but a dot).
This leads to a problem when the extractor is given a url starting
with g: or r: (to force using the generic or recursive extractor)
and without the http(s) scheme: e.g. with "r:foobar.tumblr.com"
the "r:" is wrongly considered part of the subdomain.
This commit fixes the bug, replacing the too generic "[^.]+" with the
more specific "[\w-]+" (letters, digits and "-", the only characters
allowed in domain names), which is already used by some extractors.
* Relax imageurl_pattern_ext: allow relative urls
* First round of small suggested changes
* Support image urls starting with "//"
* self.baseurl: remove trailing slash
* Relax regexp (didn't catch some image urls)
* Some fixes and cleanup
* Fix domain pattern; option to enable extractor
Fixed the domain section for "pattern", to pass "test_add" and
"test_add_module" tests.
Added the "enabled" configuration option (default False) to enable the
generic extractor. Using "g(eneric):URL" forces using the extractor.
3 years ago
Mike Fährmann
4376b39a2b
[sexcom] fix and improve embed extraction ( fixes #2145 )
3 years ago
Mike Fährmann
6d190834ee
[instagram] fix error when PostPage data is not in GraphQL format
...
(#2037 )
3 years ago
Mike Fährmann
4edf43891c
add -d/--directory and -f/--filename command-line arguments
3 years ago
Mike Fährmann
dd67e24aa9
[lolisafe] include file ID in filenames
...
More precisely, it now splits the full 'filename' into 'name' and 'id'
instead of overwriting 'filename'. The format string stays the same as
before. Use '{name}.{extension}' to restore the old behavior.
before:
- filename: foobar
- id : 12345
now:
- filename: foobar-12345
- name : foobar
- id : 12345
3 years ago
Mike Fährmann
f3d61de18d
[artstation] create directories per asset ( closes #2136 )
3 years ago