Mike Fährmann
1a540fbe00
[komikcast] fix extraction
4 years ago
Mike Fährmann
78fd63b8f0
remove 'text.clean_xml()'
...
was not used anywhere
4 years ago
Mike Fährmann
8553b218d9
replace calls to 'os.path.splitext()' with 'str.rpartition()'
...
Makes functions who used it more than twice as fast
and we can get rid of an import as well.
4 years ago
Mike Fährmann
0a9af56e3c
build executables on GitHub Actions with Python 3.8
...
Python 3.9 is incompatible with Windows 7, so using a lower
Python version maybe allows those files to run on Windows 7.
4 years ago
Mike Fährmann
5aa30c3669
[tapas] add 'series' and 'episode' extractors ( #692 )
4 years ago
Mike Fährmann
ccfa5a8694
[twitter] better error message when logging in with 2FA ( #1409 )
4 years ago
Mike Fährmann
214ecf62ce
[deviantart] fix arguments for search/popular results ( #1408 )
4 years ago
Magnus Boman
522d0a834c
[aryion] Unescape paths too ( #1414 )
...
Without this you'll get paths like this:
- Starcross - Ch. 2 "The Ins and Outs of Sarah"
This commit changes it to:
- Starcross - Ch. 2 "The Ins and Outs of Sarah"
4 years ago
beesdotjson
5ad615f0db
fix PixivFavoriteExtractor regex ( #1405 )
...
* fix PixivFavoriteExtractor regex
* do not use lookbehind
4 years ago
Mike Fährmann
62cfee4d28
[vk] initial support for albums ( #474 )
4 years ago
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags ( #1388 )
...
normalize 'tags' and 'artist_tags' to a string-list
4 years ago
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field ( #1388 )
...
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
4 years ago
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books ( #1388 )
4 years ago
Mike Fährmann
139fb84108
[deviantart] fix username for 'watch' results ( #794 )
...
before it'd use "/" as username
4 years ago
Mike Fährmann
91c2e15da9
[deviantart] add support for posts from watched users ( #794 )
4 years ago
Mike Fährmann
03c20d8c8e
[deviantart] update 'watch' URL pattern ( #794 )
4 years ago
Mike Fährmann
2846235669
[twitter] allow specifying a custom format for user results
...
(#1337 )
4 years ago
Mike Fährmann
bf241811dd
allow '_extractor' fields to be None or empty
4 years ago
Mike Fährmann
dc23cfd684
[deviantart] use fallback for /intermediary/ URLs
...
instead of checking availability with HEAD requests
4 years ago
Mike Fährmann
15daa62842
release version 1.17.1
4 years ago
Mike Fährmann
b0438c8f99
Revert "[deviantart] extend 'extra' option"
...
This reverts commit
5ad2b9c82b
,
5c32a7bf58
, and
83f465faca
.
(#1387 , #1356 )
4 years ago
Mike Fährmann
58b93635ee
[architizer] add 'firm' extractor ( #1369 )
4 years ago
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
...
The TLS certificate for main.imgclick.net is invalid.
4 years ago
Mike Fährmann
0725cfde4f
[tests] pin Ubuntu version to still be able to use Python 3.4
4 years ago
Mike Fährmann
0b55f5ad84
[imgur] fix/improve rate limit handling ( #1386 )
...
- also wait-and-retry on 429 status codes
- use infinite loop instead of recursive calls
- 'extractor.sleep()' -> 'extractor.wait()'
4 years ago
Mike Fährmann
69ca4e29f1
[deviantart] add 'watch' extractor ( #794 )
4 years ago
Mike Fährmann
fcdda6128c
[mangastream] remove module
4 years ago
Mike Fährmann
c677ea19dd
[mangareader] remove module
4 years ago
Mike Fährmann
71523aaab6
[architizer] add 'project' extractor ( #1369 )
4 years ago
Mike Fährmann
3378b39719
[twitter] implement 'users' option ( #1337 )
4 years ago
Mike Fährmann
847e9b0ed7
[philomena] support post URLs without '/images/'
...
e.g. 'derpibooru.org/1'
4 years ago
Mike Fährmann
466966bf83
[hentaicafe] remove module
4 years ago
Mike Fährmann
97641cd151
[hentainexus] remove module
4 years ago
Mike Fährmann
23641742a3
improve 'parent-directory' ( #1364 )
...
Allow forwarding metadata from the top-level extractor to all children
if 'parent-directory' is enabled for all extractors along the way.
For example 'reddit' -> 'gfycat' -> 'redgifs'
4 years ago
Mike Fährmann
c485d0a956
[philomena] add generalized extractors for philomena sites
...
(closes #1379 )
4 years ago
Mike Fährmann
6be7df53da
[hentaifox] improve metadata extraction ( fixes #1378 )
4 years ago
Mike Fährmann
72fe9ac0f3
[gelbooru_v01] support some more boorus by default
...
- https://drawfriends.booru.org/
- https://vidyart.booru.org/
- https://tlb.booru.org/
4 years ago
tux93
10c279f285
Weasyl: Drop the `&feature=submit` part of the favourite extractor URL ( #1374 )
...
It's optional and requiring it forces users to escape those URLs because
of the ampersand
4 years ago
Mike Fährmann
ec98b2c56f
categorize sites in supportedsites.md by basecategory
4 years ago
Mike Fährmann
a67e002f40
update docs/supportedsites
...
- use Markdown with inline HTML instead of reStructuredText
- move file from docs/supportedsites.rst to docs/supportedsites.md
- update Makefile, README, etc
4 years ago
Mike Fährmann
df94182e11
implement 'parent-metadata' option ( #1364 )
...
experimental, might not work as expected, etc.
4 years ago
Mike Fährmann
4be27ff0fe
[nozomi] support '/index-N.html' URLs ( closes #1365 )
...
and '/index-Popular-N.html'
4 years ago
Mike Fährmann
780bac4c8a
[gelbooru] update video server ( fixes #1368 )
...
from 'https://img2.gelbooru.com ' to 'https://img3.gelbooru.com '
and provide fallback URLs
4 years ago
Mike Fährmann
f8441e851a
[hentaifox] improve image extraction ( fixes #1366 )
...
build image URLs from embedded JSON data
instead 0f rewriting thumbnail URLs
4 years ago
Mike Fährmann
c7c3fef0bc
[exhentai] support '/tag/' URLs ( closes #1363 )
4 years ago
Mike Fährmann
90830daf85
[exhentai] improve 'favorites' extraction ( closes #1360 )
...
add special cases for when the favorite count is 0 (Never) or 1 (Once)
4 years ago
Mike Fährmann
b6719becf1
ensure '-s/--simulate' always prints filenames ( #1360 )
...
by assuming a potentially wrong filename extension in cases where the
correct one would only get known after a download started
4 years ago
Mike Fährmann
83f465faca
[deviantart] refactor 'extra' ( #1356 )
...
- change its expected type to string
- let users specify a list of sources (stash, posts) or 'all'
4 years ago
Mike Fährmann
5c32a7bf58
[deviantart] allow selecting source for 'extra' ( #1356 )
...
Setting 'extra' to "stash" or "deviations" will only download embedded
sta.sh content or deviations. 'true' still downloads both.
4 years ago
Mike Fährmann
a677123abb
[instagram] recognize 'reels' as option for 'include' ( #1329 )
4 years ago