Mike Fährmann
|
adc3aa0b77
|
[zerochan] fix metadata extraction
author, path, tags
|
10 months ago |
Mike Fährmann
|
a453335a9f
|
remove test results in extractor modules
and add generic example URLs
|
1 year ago |
Mike Fährmann
|
d97b8c2fba
|
consistent cookie-related names
- rename every cookie variable or method to 'cookies_*'
- simplify '.session.cookies' to just '.cookies'
- more consistent 'login()' structure
|
1 year ago |
enduser420
|
d52ed2bc5a
|
[zerochan] fix 'tags' extraction
|
1 year ago |
Mike Fährmann
|
ed2d715019
|
fix 'keywords' in extractor tests (#3491)
|
2 years ago |
Mike Fährmann
|
4063563cd7
|
[zerochan] update for layout v3
- remove cookie disabling v3
- fix and improve metadata extraction
|
2 years ago |
Mike Fährmann
|
b0cb4a1b9c
|
replace 'text.extract()' with 'text.extr()' where possible
|
2 years ago |
Mike Fährmann
|
3cb8327c60
|
[zerochan] add 'metadata' option (#2861)
|
2 years ago |
Mike Fährmann
|
21ff77fea0
|
[zerochan] extract more metadata for single posts
Neither HTML pages nor RSS feed entries have *all* metadata.
It might be necessary to do 1-2 extra HTTP requests to grab everything.
|
2 years ago |
Mike Fährmann
|
98af5a0409
|
[zerochan] implement login with username & password (#1434)
|
2 years ago |
Mike Fährmann
|
3a8addfe45
|
[zerochan] add 'tag' and 'image' extractors (#1434)
|
2 years ago |