Mike Fährmann
ac97aca99c
[realbooru] fix extraction
...
get file URLs from HTML pages
1 year ago
Mike Fährmann
cd931e1139
update extractor test results
2 years ago
Mike Fährmann
6423f990de
[realbooru] fix 'tags' extraction ( #2530 )
2 years ago
Mike Fährmann
ecad02cf3f
[realbooru] fix download URLs ( #2530 )
2 years ago
Mike Fährmann
b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2 years ago
Mike Fährmann
4fd3c893fa
[booru] adjust/match '_tags' and '_notes' code
2 years ago
Mike Fährmann
88954aa2e4
[gelbooru_v02] implement 'notes' extraction
...
same code as for 'moebooru' works here as well
2 years ago
Mike Fährmann
775895f44b
[booru] refactor 'tags' and 'notes' extraction
...
- move HTML request for post pages into its own function
- move gelbooru_v02.py notes extraction to gelbooru.py
since it only works there
- clean up some code
2 years ago
Mike Fährmann
67a2efb885
[rule34] implement 'pool' pagination ( #2853 )
2 years ago
Mike Fährmann
f225247670
[gelbooru] add support for `api_key` and `user_id` ( #2767 )
2 years ago
Mike Fährmann
c6a9bab019
update extractor test results
2 years ago
Mike Fährmann
ff5e10a86d
[hypnohub] move to gelbooru_v02 instances ( #2631 )
2 years ago
Mike Fährmann
d26da3b9e5
add pre-generated 'pattern' for supported BaseExtractor sites
2 years ago
Mike Fährmann
3e926bd465
[realbooru] fix extraction ( fixes #2530 )
2 years ago
Mike Fährmann
dee0d22561
update extractor test results
3 years ago
Mike Fährmann
199e7616a7
[rule34] use https://api.rule34.xxx for API requests
3 years ago
Mike Fährmann
93cef78450
[gelbooru] workaround pagination limits
...
Gelbooru only allows to retrieve the latest 20k posts for a tag search.
Add 'id:<N' to the search tags to work around that limitation, where N
is the ID of the last retrieved post.
http://gelbooru.me/index.php?page=forum&s=view&id=1467
3 years ago
Mike Fährmann
7bbb1f92d7
[gelbooru_v02] add 'favorite' extractor ( closes #1834 )
3 years ago
thatfuckingbird
dff03a6605
[booru] add an option to extract notes (only gelbooru for now) ( #1457 )
...
* [booru] add an option to extract notes (currently implemented only for gelbooru)
* appease linter
* [gelbooru] rename "text" to "body" in note extraction
* add a code comment about reusing return value of _extended_tags
3 years ago
thatfuckingbird
918b0441fb
[gelbooru] fix tag category extraction ( #1455 )
3 years ago
Mike Fährmann
3df527ee2c
update extractor test results
4 years ago
Mike Fährmann
59fd740b47
[tbib] add support for https://tbib.org/ ( #473 , closes #1082 )
4 years ago
Mike Fährmann
08d7934c6e
move extractors from booru.py into their own gelbooru_v02 module
4 years ago