Mike Fährmann
5fd4374036
[sankaku] improve 429 and tag limit handling
2 years ago
Mike Fährmann
4089bceddd
[sankaku] implement 'refresh' option ( #2958 )
2 years ago
Mike Fährmann
850608551c
[sankaku] detect expired links ( #2958 )
2 years ago
Mike Fährmann
32c75d12e8
[sankaku] rewrite URLs to s.sankakucomplex.com ( #2746 )
2 years ago
Mike Fährmann
05d4a0215a
[sankaku] extend URL patterns ( fixes #2647 )
...
- support URLs with ISO 639-1 language codes
- support black.… and white.… subdomains
2 years ago
Mike Fährmann
211de95dd0
update extractor test results
3 years ago
Mike Fährmann
9ed13703cc
[sankaku] handle empty tags ( fixes #1617 )
3 years ago
Mike Fährmann
c5ca7905ce
add 'noop()' and 'identity()' functions
3 years ago
Mike Fährmann
6fa20d456b
[sankaku] update invalid-token detection ( fixes #1515 )
3 years ago
Mike Fährmann
bdfcc9c4b1
update extractor test results
3 years ago
Mike Fährmann
0e601de67b
[sankaku] simplify 'pool' tags ( #1388 )
...
normalize 'tags' and 'artist_tags' to a string-list
4 years ago
Mike Fährmann
d085ade9d5
[sankaku] add 'tag_string' metadata field ( #1388 )
...
The 'join()'ed version of 'tags'.
Handling lists in format strings isn't properly supported yet.
4 years ago
Mike Fährmann
2dffd231b7
[sankaku] add enumeration index for books ( #1388 )
4 years ago
Mike Fährmann
96a51ff169
[sankaku] update invalid-token detection ( fixes #1309 )
4 years ago
Mike Fährmann
2da9068ea8
[sankaku] simplify login process
4 years ago
Mike Fährmann
b0beed7a06
[sankaku] add support for book searches ( closes #1204 )
4 years ago
Mike Fährmann
47a7a51944
[sankaku] fix 'invalid_token' detection
4 years ago
Mike Fährmann
e41e2be2f9
[booru] split '_prepare_post()'
4 years ago
Mike Fährmann
b233531aaa
[sankaku] use '/posts' endpoint for single posts
4 years ago
Mike Fährmann
459a0af4f8
[sankaku] add support for sankaku.app URLs ( closes #1193 )
4 years ago
Mike Fährmann
537742c0ee
[sankaku] normalize 'created_at' metadata ( closes #1190 )
4 years ago
Mike Fährmann
465015f75a
[sankaku] reimplement login support ( #1176 , #1182 )
4 years ago
Mike Fährmann
8d2e4e5f13
[booru] improve error handling
...
e.g. for posts without a valid 'file_url' (#1176 )
4 years ago
Mike Fährmann
b2c55f0a72
[sankaku] remove login support
...
The old login method for 'https://chan.sankakucomplex.com/user/login '
and the cookies it produces have no effect on the results from
'beta.sankakucomplex.com'.
4 years ago
Mike Fährmann
ecdea799dd
[sankaku] use 'beta.sankakucomplex.com' API endpoints
4 years ago
Mike Fährmann
1e3dd7330e
merge SharedConfigMixin functionality into Extractor
4 years ago
Mike Fährmann
844793847c
update extractor test results
4 years ago
Mike Fährmann
4409d00141
embed error messages in StopExtraction exceptions
5 years ago
Mike Fährmann
7a5e78741c
[booru] build directory path for each file ( #385 )
5 years ago
Mike Fährmann
40637556fa
[ngomik] fix extraction
5 years ago
Mike Fährmann
7a99e85943
[kissmanga] fix download URLs and file extensions
...
The current Blogspot image URLs hosted on Kissmanga end with an
"invalid" query parameter (/000.png&upx=...), which doesn't get
recognized by 'spliturl()' and 'parseurl()' as such and gets therefore
included in the 'extension' field from 'text.nameext_from_url()'.
5 years ago
Mike Fährmann
74c2415138
[sankakucomplex] move article extractor to its own module ( #258 )
5 years ago
Mike Fährmann
1e3e15c4f3
[sankaku] add article extractor ( #258 )
5 years ago
Mike Fährmann
efa805c5d7
[sankaku] update pagination end condition ( fixes #265 )
...
Pagination over popular listings (`date:...+order:popular") never
terminates, not even on the site itself, and at some point returns the
same results over and over again.
5 years ago
Mike Fährmann
0b4be57a10
[sankaku] fix error when no tags available ( closes #259 )
...
[ci skip]
5 years ago
Mike Fährmann
aa8e366b90
[luscious] fix tag extraction
5 years ago
Mike Fährmann
a2af2d2965
adjust cache maxage values
6 years ago
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann
6284731107
simplify extractor constants
...
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
6 years ago
Mike Fährmann
4d656a81ca
replace SharedConfigExtractor class with a Mixin
6 years ago
Mike Fährmann
dd358b4564
improve cookie handling during logins
6 years ago
Mike Fährmann
2d2953a5bf
add 'text.parse_float()' + cleanup in text.py
6 years ago
Mike Fährmann
78b5f29a00
[sankaku] unescape tags
6 years ago
Mike Fährmann
2be4c9ffe3
[sankaku] small code improvements
6 years ago
Mike Fährmann
99137f1bee
[sankaku] send login info as formdata
...
Previously they were erroneously send as URL parameters.
6 years ago
Mike Fährmann
fa64c38d5b
[sankaku] fix pagination for user favorites ( #106 )
6 years ago
Mike Fährmann
b164231bca
[sankaku] increase default values for 'wait-min/-max'
6 years ago
Mike Fährmann
269dc2bbd5
[sankaku] add 'tags' option ( #94 )
6 years ago
Mike Fährmann
cc36f88586
rename safe_int to parse_int; move parse_* to text module
7 years ago
Mike Fährmann
5008e105ee
update archive IDs
...
... to behave in a more straightforward way when dealing with
bookmarks/favourites/etc.
specific IDs are now grouped by their owner, album-id, ... to
allow for duplicates when it would be expected.
7 years ago