Commit Graph

150 Commits (299bd2f1f51ca7758ec6060f46e4a856a41e64f9)

Author SHA1 Message Date
Mike Fährmann dad2875a3e
fix calculating retry sleep times (fixes #1990)
3 years ago
Mike Fährmann e69ee41f25
implement 'page-reverse' option (#1854)
3 years ago
Mike Fährmann c9e6693530
allow specifying a minimum/maximum for 'sleep-*' options (#1835)
3 years ago
Mike Fährmann 2ff2974353
[common] update default argument handling in Extractor.request()
3 years ago
Mike Fährmann d79bcb6236
allow extractors to register a 'finalize()' method
3 years ago
Mike Fährmann bb6a130942
automatically set required DDoS-GUARD cookies (#1779)
3 years ago
Mike Fährmann bd08ee2859
remove most 'yield Message.Version' statements
3 years ago
Mike Fährmann 9cb5ea5eda
update default User-Agent headers
3 years ago
Mike Fährmann 0179581340
add 'T' format string conversion (#1646)
3 years ago
Mike Fährmann 94faf8c85a
add type check before applying 'browser' option (fixes #1358)
4 years ago
Mike Fährmann 6cfc9613fe
update some code in Extractor constructor
4 years ago
Mike Fährmann 29ea54dc41
[patreon] use '"browser": "firefox"' by default (#1117)
4 years ago
Mike Fährmann cf5fa75d4c
add 'browser' option (#1117)
4 years ago
Mike Fährmann e1a12761d7
strip '/' from instance root URLs
4 years ago
Mike Fährmann d656892670
remove cloudflare.py
4 years ago
Mike Fährmann 88fae99811
remove 'generate_extractors()'
4 years ago
Mike Fährmann 745a114c61
[common] implement BaseExtractor class
4 years ago
Mike Fährmann 0d406c8daf
[common] restrict values used in 'generate_extractors()'
4 years ago
Mike Fährmann 8ca7f54750
rename '_request_…' variables
4 years ago
Mike Fährmann c57a918f4a
[e621] implement delay via '_request_interval_min'
4 years ago
Mike Fährmann 1e3dd7330e
merge SharedConfigMixin functionality into Extractor
4 years ago
Mike Fährmann 198c33ec36
also collect post processors from 'basecategory' entries
4 years ago
Mike Fährmann 1e313d5b84
implement 'sleep-request' option
4 years ago
Mike Fährmann 055c32e0f7
precompute extractor config paths
4 years ago
Mike Fährmann 231dd4c800
accumulate postprocessor objects (#994)
4 years ago
Mike Fährmann f6fd449b59
reduce wait time growth rate from exponential to linear
4 years ago
Mike Fährmann 2c9766b29f
fix UnboundLocalError in Extractor.request()
4 years ago
Mike Fährmann d6a271d2c7
add 'response' objects to 'HttpError's
4 years ago
Mike Fährmann 53cc498d9c
improve config lookup when there are multiple possible locations
4 years ago
Mike Fährmann 1ae1df0d27
update '--write-pages' (#737)
4 years ago
Mike Fährmann 15c3d29062
move dump_response() into a separate function (#737)
4 years ago
Mike Fährmann a363da4b43
include redirects and headers in --write-pages dumps (#737)
4 years ago
Mike Fährmann 3201fe3521
add global SENTINEL object
4 years ago
Mike Fährmann f8f95e68a7
improve '--write-pages' (#737)
4 years ago
Vrihub 4cc761c730
Implement --write-pages option (#736)
4 years ago
Mike Fährmann 5d7ca76885
retry Cloudflare challenges
4 years ago
Mike Fährmann d02f7c1118
improve Extractor.wait()
5 years ago
Mike Fährmann 2a4f227e08
warn about expired cookies
5 years ago
Mike Fährmann 56f1c96168
implement 'parent-directory' option (#551)
5 years ago
Mike Fährmann 2a9be48511
improve util.load/save_cookiestxt() and add tests
5 years ago
Mike Fährmann c1a6862863
implement functions to load/save cookies.txt files (closes #586)
5 years ago
Mike Fährmann bd5ce9855c
allow GalleryExtractors to set URL-independent extensions
5 years ago
Mike Fährmann 3811fd8a25
fix time formatting for Python 3.4 and 3.5
5 years ago
Mike Fährmann 569747a78d
implement extractor.wait()
5 years ago
Mike Fährmann ce54b8c04c
let extractors opt-out of cookie option usage
5 years ago
Mike Fährmann d3e44e899d
raise NotFoundErrors for 404 responses in GalleryExtractors
5 years ago
Mike Fährmann a4dd8b3dab
improve _check_cookies()
5 years ago
Mike Fährmann 15f9bb3d14
add option to disable pyOpenSSL usage (#508)
5 years ago
Mike Fährmann e17907ee2a
change default value of 'cookies-update' to 'true'
5 years ago
Mike Fährmann e2710702d4
fix Cloudflare bypss
5 years ago