Mike Fährmann
c3f01dc4e6
implement 'util.unique()'
4 years ago
Mike Fährmann
558cde139c
[paheal] fix extraction ( fixes #1088 )
4 years ago
Mike Fährmann
0211af7ca8
[hentaifoundry] update 'YII_CSRF_TOKEN' cookie handling
...
(fixes #1083 )
4 years ago
Mike Fährmann
d83b95fd28
[postprocessor:metadata] accept a string-list for 'content-format'
...
(closes #1080 )
4 years ago
Mike Fährmann
198c33ec36
also collect post processors from 'basecategory' entries
...
(fixes #1084 )
4 years ago
Mike Fährmann
350b1afe1c
speed up _list_classes() after iterating over all modules once
4 years ago
Mike Fährmann
5bcf28de93
add a 'extractor.modules' option
4 years ago
Mike Fährmann
18213dc5ba
release version 1.15.2
4 years ago
Mike Fährmann
de4a1e45c9
improve 'generate_csrf_token()'
...
no need to use hashlib.md5()
4 years ago
Mike Fährmann
b788712844
[fallenangels] fix extraction of '.5' chapters
4 years ago
Mike Fährmann
28d8541cb3
[mangafox] ensure download URLs have a scheme
4 years ago
Mike Fährmann
8e3a324c91
[mangakakalot] ignore "Go Home" buttons in chapter pages
4 years ago
Mike Fährmann
c14c5d82d6
[newgrounds] use generator for fallback URLs
4 years ago
Mike Fährmann
a09f42f6b3
improve filename_from_url() performance
...
Manually extracting the part between the last '/' and '?' instead of
relying on the standard libraries' 'urllib.parse.urlsplit()' increases
performance by ~400%.
urlsplit() : 3.64 secs per 1.000.000 iterations
partition(): 0.87 secs per 1.000.000 iterations
4 years ago
Mike Fährmann
968d3e8465
remove '&' from URL patterns
...
'/?&#' -> '/?#' and '?&#' -> '?#'
According to https://www.ietf.org/rfc/rfc3986.txt , URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
4 years ago
Mike Fährmann
1686dc1757
[twitter] support media from Cards ( #1005 , #937 )
...
Can be enabled with 'extractor.twitter.cards', but for now disabled by
default because cards can redirect to rather large videos from YouTube
or Twitch.
4 years ago
Mike Fährmann
ffd38215a4
[hitomi] fix image URLs and URL pattern
...
- non-webp files are now hosted on [a-c]b.hitomi.la
- removed ampersand from invalid slug characters
4 years ago
Mike Fährmann
bac8af69e8
update configuration.rst
...
- add some lines to better explain post processor usage
- syntax highlighting for JSON blocks and other smaller stuff
4 years ago
Mike Fährmann
05d7009cc6
rename 'Authentication' entries in supportedsites.rst
...
- change 'Optional' to 'Supported'
- use 'OAuth' and 'Cookies' in their own
- add link to weasyl API key option
4 years ago
Mike Fährmann
286718950c
[mangahere] ensure download URLs have a scheme ( fixes #1070 )
4 years ago
Mike Fährmann
76dfa11a65
[reddit] add 'date' metadata field ( closes #1068 )
4 years ago
Mike Fährmann
3f2ba629ea
[newgrounds] provide fallback URLs for video downloads ( #1042 )
4 years ago
Mike Fährmann
a3ca2f6080
update fallback URL handling
...
remove Message.Urllist and use a '_fallback' field inside a kwdict
4 years ago
Mike Fährmann
43dab3a228
[mangadex] unescape more metadata fields ( fixes #1066 )
...
like 'manga', 'author', 'artist', etc.
4 years ago
Mike Fährmann
ec61696316
add 't' format string conversion ( closes #1065 )
...
to Trim whitespace from the beginning and end of strings.
Example: '{field!t}' becomes 'foo' for 'field' == " \nfoo\t\r"
4 years ago
Mike Fährmann
5565025221
[xhamster] fix user profile extraction
4 years ago
Mike Fährmann
07432d6262
[seiga] fix flake8 and cookie test ( #1063 )
4 years ago
Mike Fährmann
d1c6d78477
fix rST markup in configuration.rst
4 years ago
Mike Fährmann
b8daabc3ca
[pinterest] implement login support ( closes #1055 )
...
being logged allows access to secret/protected boards
4 years ago
Mike Fährmann
1b1cf01d0d
add a general 'generate_csrf_token()' function
4 years ago
Mike Fährmann
7a0ba370d1
[gelbooru] rewrite mp4 video URLs ( fixes #1048 )
4 years ago
Mike Fährmann
6491db3eaf
[blogger] handle URLs with specified width/height ( closes #1061 )
...
get highest quality for images with
/wXXX-hXXX/ instead of the usual /sXXX/
4 years ago
Mike Fährmann
783e0af26d
[hentaifoundry] update and simplify
4 years ago
Mike Fährmann
5b844a72b7
[newgrounds] handle embeds without scheme ( #1033 )
4 years ago
kurumigi
7e0e872f4f
[seiga] Add metadata for single image downloads ( #1063 )
...
* [seiga] Support image metadata.
* [seiga] Update test data.
* [seiga] Fix cookie check.
* [test_cookies] [seiga] Fit test_cookies.py to the last commit.
4 years ago
Zanny
3ec60e894a
[weasyl] api-key authentication ( #1057 )
...
* [weasyl] support api keys
* [weasyl] document api-key authentication
* [weasyl] usernames can contain ~
4 years ago
Mike Fährmann
35056a07d1
release version 1.15.1
4 years ago
Mike Fährmann
844793847c
update extractor test results
4 years ago
Mike Fährmann
ddd6840509
[behance] fix 'collection' extraction
4 years ago
Mike Fährmann
c5e3971b18
[newgrounds] extract image embeds ( closes #1033 )
4 years ago
dawidsowa
43b156fb40
[reactor] match URLs without subdomain ( #1053 )
4 years ago
林博仁(Buo-ren, Lin)
9349fc7036
snap: Fix selective-checkout unable to build development snapshots ( #1046 )
...
Due to recent `snap info` command output changes selective-checkout no
longer can match the stable channel version, which, making it always
building stable releases even when it is being promoted to the stable
channel.
This patch fixes the problem by upgrading the part to the new version,
which now call the Snap Store API for the info instead of scraping the
`snap info gallery-dl` command's output, which should ensure stability
in the future.
Refer-to: Selective-checkout: Check out the tagged release revision if it isn't promoted to the stable channel - doc - snapcraft.io <https://forum.snapcraft.io/t/selective-checkout-check-out-the-tagged-release-revision-if-it-isnt-promoted-to-the-stable-channel/10617 >
Signed-off-by: 林博仁(Buo-ren, Lin) <Buo.Ren.Lin@gmail.com>
4 years ago
Aster Wang
faeea597bb
Add some hints for scoop users ( #1038 )
...
* Add some hints for scoop users
As not everyone is willing to use complicated chocolatey
* gallery-dl is in main bucket!
4 years ago
Mike Fährmann
fd20093c96
allow blacklist/whitelist to be empty lists/strings ( #1051 )
4 years ago
Mike Fährmann
3ebb174f2c
add missing extractor info when spawning new ones ( fixes #1051 )
...
Not having this information causes the blacklist/whitelist logic to
trigger and prevents things from functioning as intended when using
default settings.
Fixes issues for 8muses, deviantart, exhentai, and mangoxo.
4 years ago
Mike Fährmann
f9c1684af7
[newgrounds] restore original video URLs ( #1042 )
4 years ago
Mike Fährmann
73373c06ec
[weibo] handle posts with more than 9 images ( closes #926 )
...
Responses from '/api/container/getIndex' don't list more than
9 images per 'status' object, but the embedded JSON from a
'/detail/<ID>' page does.
4 years ago
Mike Fährmann
dd1e545597
[hentaifoundry] rename GalleryExtractor to PicturesExtractor
4 years ago
Mike Fährmann
c874071f5a
[kissmanga] remove module
4 years ago
Mike Fährmann
93e04bf9a9
[500px] update query hashes
4 years ago