Mike Fährmann
ec3d5d58a8
[vk] improve extractor ( #474 )
...
- fetch all photos
- add 'metadata' option
- fix extracting photos without '?' in URL
4 years ago
Mike Fährmann
ebd142e2a8
[twitter] don't use youtube-dl for cards when videos are disabled
...
(#1416 )
4 years ago
Mike Fährmann
d5aad999dc
[tapas] implement login with username & password ( #692 )
4 years ago
Mike Fährmann
e9ec91c811
[exhentai] improve image limits check
...
- check if current image is the '509 Bandwidth Exceeded' notification
(https://ehgt.org/g/509.gif or https://exhentai.org/img/509.gif )
- remove 'limits' option
4 years ago
Mike Fährmann
387fe415d5
unescape items in text.split_html()
4 years ago
Mike Fährmann
36291176bc
[pinterest] add 'search' extractor ( #1411 )
4 years ago
Mike Fährmann
058cc47e9b
[bcy] improve pagination
4 years ago
Mike Fährmann
ddd48ceee5
update extractor test results
4 years ago
Mike Fährmann
1a540fbe00
[komikcast] fix extraction
4 years ago
Mike Fährmann
78fd63b8f0
remove 'text.clean_xml()'
...
was not used anywhere
4 years ago
Mike Fährmann
8553b218d9
replace calls to 'os.path.splitext()' with 'str.rpartition()'
...
Makes functions who used it more than twice as fast
and we can get rid of an import as well.
4 years ago
Mike Fährmann
0a9af56e3c
build executables on GitHub Actions with Python 3.8
...
Python 3.9 is incompatible with Windows 7, so using a lower
Python version maybe allows those files to run on Windows 7.
4 years ago
Mike Fährmann
5aa30c3669
[tapas] add 'series' and 'episode' extractors ( #692 )
4 years ago
Mike Fährmann
ccfa5a8694
[twitter] better error message when logging in with 2FA ( #1409 )
4 years ago
Mike Fährmann
214ecf62ce
[deviantart] fix arguments for search/popular results ( #1408 )
4 years ago
Magnus Boman
522d0a834c
[aryion] Unescape paths too ( #1414 )
...
Without this you'll get paths like this:
- Starcross - Ch. 2 "The Ins and Outs of Sarah"
This commit changes it to:
- Starcross - Ch. 2 "The Ins and Outs of Sarah"
4 years ago
beesdotjson
5ad615f0db
fix PixivFavoriteExtractor regex ( #1405 )
...
* fix PixivFavoriteExtractor regex
* do not use lookbehind
4 years ago
Mike Fährmann
62cfee4d28
[vk] initial support for albums ( #474 )
4 years ago
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags ( #1388 )
...
normalize 'tags' and 'artist_tags' to a string-list
4 years ago
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field ( #1388 )
...
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
4 years ago
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books ( #1388 )
4 years ago
Mike Fährmann
139fb84108
[deviantart] fix username for 'watch' results ( #794 )
...
before it'd use "/" as username
4 years ago
Mike Fährmann
91c2e15da9
[deviantart] add support for posts from watched users ( #794 )
4 years ago
Mike Fährmann
03c20d8c8e
[deviantart] update 'watch' URL pattern ( #794 )
4 years ago
Mike Fährmann
2846235669
[twitter] allow specifying a custom format for user results
...
(#1337 )
4 years ago
Mike Fährmann
bf241811dd
allow '_extractor' fields to be None or empty
4 years ago
Mike Fährmann
dc23cfd684
[deviantart] use fallback for /intermediary/ URLs
...
instead of checking availability with HEAD requests
4 years ago
Mike Fährmann
15daa62842
release version 1.17.1
4 years ago
Mike Fährmann
b0438c8f99
Revert "[deviantart] extend 'extra' option"
...
This reverts commit
5ad2b9c82b
,
5c32a7bf58
, and
83f465faca
.
(#1387 , #1356 )
4 years ago
Mike Fährmann
58b93635ee
[architizer] add 'firm' extractor ( #1369 )
4 years ago
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
...
The TLS certificate for main.imgclick.net is invalid.
4 years ago
Mike Fährmann
0725cfde4f
[tests] pin Ubuntu version to still be able to use Python 3.4
4 years ago
Mike Fährmann
0b55f5ad84
[imgur] fix/improve rate limit handling ( #1386 )
...
- also wait-and-retry on 429 status codes
- use infinite loop instead of recursive calls
- 'extractor.sleep()' -> 'extractor.wait()'
4 years ago
Mike Fährmann
69ca4e29f1
[deviantart] add 'watch' extractor ( #794 )
4 years ago
Mike Fährmann
fcdda6128c
[mangastream] remove module
4 years ago
Mike Fährmann
c677ea19dd
[mangareader] remove module
4 years ago
Mike Fährmann
71523aaab6
[architizer] add 'project' extractor ( #1369 )
4 years ago
Mike Fährmann
3378b39719
[twitter] implement 'users' option ( #1337 )
4 years ago
Mike Fährmann
847e9b0ed7
[philomena] support post URLs without '/images/'
...
e.g. 'derpibooru.org/1'
4 years ago
Mike Fährmann
466966bf83
[hentaicafe] remove module
4 years ago
Mike Fährmann
97641cd151
[hentainexus] remove module
4 years ago
Mike Fährmann
23641742a3
improve 'parent-directory' ( #1364 )
...
Allow forwarding metadata from the top-level extractor to all children
if 'parent-directory' is enabled for all extractors along the way.
For example 'reddit' -> 'gfycat' -> 'redgifs'
4 years ago
Mike Fährmann
c485d0a956
[philomena] add generalized extractors for philomena sites
...
(closes #1379 )
4 years ago
Mike Fährmann
6be7df53da
[hentaifox] improve metadata extraction ( fixes #1378 )
4 years ago
Mike Fährmann
72fe9ac0f3
[gelbooru_v01] support some more boorus by default
...
- https://drawfriends.booru.org/
- https://vidyart.booru.org/
- https://tlb.booru.org/
4 years ago
tux93
10c279f285
Weasyl: Drop the `&feature=submit` part of the favourite extractor URL ( #1374 )
...
It's optional and requiring it forces users to escape those URLs because
of the ampersand
4 years ago
Mike Fährmann
ec98b2c56f
categorize sites in supportedsites.md by basecategory
4 years ago
Mike Fährmann
a67e002f40
update docs/supportedsites
...
- use Markdown with inline HTML instead of reStructuredText
- move file from docs/supportedsites.rst to docs/supportedsites.md
- update Makefile, README, etc
4 years ago
Mike Fährmann
df94182e11
implement 'parent-metadata' option ( #1364 )
...
experimental, might not work as expected, etc.
4 years ago
Mike Fährmann
4be27ff0fe
[nozomi] support '/index-N.html' URLs ( closes #1365 )
...
and '/index-Popular-N.html'
4 years ago