Commit Graph

229 Commits (aa6d00613f33041dec657988fbc3ed33bb1d5967)

Author SHA1 Message Date
Mike Fährmann 6514828d4e
emit debug logging message when loading cookies from file
2 years ago
Mike Fährmann 9f06e79868
implement '"user-agent": "browser"' (#2636)
2 years ago
Mike Fährmann 86790da2d5
update Cloudflare IUAM detection
2 years ago
Mike Fährmann 8b1fe0bcf1
emit debug logging messages before calling time.sleep() (#2982)
2 years ago
Mike Fährmann 73a52a95b0
update Cloudflare IUAM detection
2 years ago
Mike Fährmann eb68d45544
add global 'warnings' option (#2762)
2 years ago
Mike Fährmann e4f48cc810
make it easier to disable default 'browser' settings
2 years ago
Mike Fährmann 92b75bcdce
limit path length for --write-pages output on Windows (#2733)
2 years ago
Mike Fährmann de20cadc68
add 'brotli' as optional dependency (#2716)
2 years ago
Mike Fährmann 3a5d5c3a91
update default User-Agent header to Firefox 102 ESR
2 years ago
Mike Fährmann 535cbcb185
cache extracted browser cookies
2 years ago
Mike Fährmann 6742f3bc1e
implement --cookies-from-browser (#1606)
2 years ago
Mike Fährmann c4b9f7bab8
update functions working with cookies.txt files
2 years ago
Mike Fährmann 3f02e483c6
[e621] fix applying request_interval_min (#2533)
2 years ago
Mike Fährmann 29db716a63
implement 'datetime_to_timestamp()'
3 years ago
Mike Fährmann 500a479026
fix a third(!) bug in _check_cookies() (#2372)
3 years ago
Mike Fährmann 47cf05c4ab
refactor proxy handling code (#2357)
3 years ago
Mike Fährmann bddcec49f1
implement 'text.root_from_url()'
3 years ago
Mike Fährmann f5b2b9333f
fix another bug in _check:cookies (#2160)
3 years ago
Mike Fährmann ed317bfcf1
warn about cookies expiring in less than 24 hours
3 years ago
Mike Fährmann b4f8e15a1f
allow BaseExtractors to use the domain pf the matched URL
3 years ago
Mike Fährmann f58364f6a8
update Firefox cipher list
3 years ago
Mike Fährmann 7e6981dda6
rename 'disabletls12' to 'tls12'
3 years ago
Mike Fährmann bb3e182562
overhaul session initialization
3 years ago
Robert Pendell 4c651f6252
[patreon] Disable TLS 1.2 by default (#2249)
3 years ago
Robert Pendell 392cf079f7
Add ability to disable TLS 1.2 (#2243)
3 years ago
Mike Fährmann de754590e0
add --source-address command-line option (closes #2206)
3 years ago
Mike Fährmann 6f2e0c9c3d
fix cookie checks for patreon, fanbox, fantia
3 years ago
Mike Fährmann ad30653b17
allow running a BaseExtractor for any URL
3 years ago
Mike Fährmann dad2875a3e
fix calculating retry sleep times (fixes #1990)
3 years ago
Mike Fährmann e69ee41f25
implement 'page-reverse' option (#1854)
3 years ago
Mike Fährmann c9e6693530
allow specifying a minimum/maximum for 'sleep-*' options (#1835)
3 years ago
Mike Fährmann 2ff2974353
[common] update default argument handling in Extractor.request()
3 years ago
Mike Fährmann d79bcb6236
allow extractors to register a 'finalize()' method
3 years ago
Mike Fährmann bb6a130942
automatically set required DDoS-GUARD cookies (#1779)
3 years ago
Mike Fährmann bd08ee2859
remove most 'yield Message.Version' statements
3 years ago
Mike Fährmann 9cb5ea5eda
update default User-Agent headers
3 years ago
Mike Fährmann 0179581340
add 'T' format string conversion (#1646)
3 years ago
Mike Fährmann 94faf8c85a
add type check before applying 'browser' option (fixes #1358)
4 years ago
Mike Fährmann 6cfc9613fe
update some code in Extractor constructor
4 years ago
Mike Fährmann 29ea54dc41
[patreon] use '"browser": "firefox"' by default (#1117)
4 years ago
Mike Fährmann cf5fa75d4c
add 'browser' option (#1117)
4 years ago
Mike Fährmann e1a12761d7
strip '/' from instance root URLs
4 years ago
Mike Fährmann d656892670
remove cloudflare.py
4 years ago
Mike Fährmann 88fae99811
remove 'generate_extractors()'
4 years ago
Mike Fährmann 745a114c61
[common] implement BaseExtractor class
4 years ago
Mike Fährmann 0d406c8daf
[common] restrict values used in 'generate_extractors()'
4 years ago
Mike Fährmann 8ca7f54750
rename '_request_…' variables
4 years ago
Mike Fährmann c57a918f4a
[e621] implement delay via '_request_interval_min'
4 years ago
Mike Fährmann 1e3dd7330e
merge SharedConfigMixin functionality into Extractor
4 years ago
Mike Fährmann 198c33ec36
also collect post processors from 'basecategory' entries
4 years ago
Mike Fährmann 1e313d5b84
implement 'sleep-request' option
4 years ago
Mike Fährmann 055c32e0f7
precompute extractor config paths
4 years ago
Mike Fährmann 231dd4c800
accumulate postprocessor objects (#994)
4 years ago
Mike Fährmann f6fd449b59
reduce wait time growth rate from exponential to linear
4 years ago
Mike Fährmann 2c9766b29f
fix UnboundLocalError in Extractor.request()
4 years ago
Mike Fährmann d6a271d2c7
add 'response' objects to 'HttpError's
4 years ago
Mike Fährmann 53cc498d9c
improve config lookup when there are multiple possible locations
4 years ago
Mike Fährmann 1ae1df0d27
update '--write-pages' (#737)
4 years ago
Mike Fährmann 15c3d29062
move dump_response() into a separate function (#737)
4 years ago
Mike Fährmann a363da4b43
include redirects and headers in --write-pages dumps (#737)
4 years ago
Mike Fährmann 3201fe3521
add global SENTINEL object
4 years ago
Mike Fährmann f8f95e68a7
improve '--write-pages' (#737)
4 years ago
Vrihub 4cc761c730
Implement --write-pages option (#736)
4 years ago
Mike Fährmann 5d7ca76885
retry Cloudflare challenges
4 years ago
Mike Fährmann d02f7c1118
improve Extractor.wait()
5 years ago
Mike Fährmann 2a4f227e08
warn about expired cookies
5 years ago
Mike Fährmann 56f1c96168
implement 'parent-directory' option (#551)
5 years ago
Mike Fährmann 2a9be48511
improve util.load/save_cookiestxt() and add tests
5 years ago
Mike Fährmann c1a6862863
implement functions to load/save cookies.txt files (closes #586)
5 years ago
Mike Fährmann bd5ce9855c
allow GalleryExtractors to set URL-independent extensions
5 years ago
Mike Fährmann 3811fd8a25
fix time formatting for Python 3.4 and 3.5
5 years ago
Mike Fährmann 569747a78d
implement extractor.wait()
5 years ago
Mike Fährmann ce54b8c04c
let extractors opt-out of cookie option usage
5 years ago
Mike Fährmann d3e44e899d
raise NotFoundErrors for 404 responses in GalleryExtractors
5 years ago
Mike Fährmann a4dd8b3dab
improve _check_cookies()
5 years ago
Mike Fährmann 15f9bb3d14
add option to disable pyOpenSSL usage (#508)
5 years ago
Mike Fährmann e17907ee2a
change default value of 'cookies-update' to 'true'
5 years ago
Mike Fährmann e2710702d4
fix Cloudflare bypss
5 years ago
Mike Fährmann ae09f87602
improve SharedConfigMixin config lookups
5 years ago
Mike Fährmann f5604492c3
update interface of config functions
5 years ago
Mike Fährmann d45fabb79d
match user profile handling on deviantart and newgrounds
5 years ago
Mike Fährmann 1a197d2195
store the original cookiejar as Extractor._cookiejar
5 years ago
Mike Fährmann de83ae4576
make 'method' argument of Extractor.request keyword-only
5 years ago
Mike Fährmann d44f790e81
adjust output for HTTP status related errors
5 years ago
Mike Fährmann 389d2d7e38
implement 'cookies-update' option (#445)
5 years ago
Mike Fährmann 1693d97bd3
update extractor class hierarchies
5 years ago
Mike Fährmann f4bc75e854
fix rate limit handling for OAuth APIs (#368)
5 years ago
Mike Fährmann 21991acc49
add 'ciphers' option; update default User-Agent
5 years ago
Mike Fährmann 84f4d3bc0b
replace urllib3's default cipher list with Firefox's (#342)
5 years ago
Mike Fährmann 09f37fde39
[reddit] move date-min/-max handling into Extractor class
5 years ago
Mike Fährmann 56c7a66a4a
detect Cloudflare CAPTCHAs and update cipher list
5 years ago
Mike Fährmann fdec59f8e2
replace extractor.request() 'expect' argument
5 years ago
Mike Fährmann 69205df68d
allow '-1' for infinite retries (#300)
5 years ago
Mike Fährmann f7b5c4c3e7
use values of 'retries' options correctly
5 years ago
Mike Fährmann 399e8e965a
also update urllib3's cipher list for versions >= 1.25
5 years ago
Mike Fährmann c02f12ce2f
avoid Cloudflare CAPTCHAs for OpenSSL < 1.1.1
5 years ago
Mike Fährmann 5fd94c6b83
import urllib3 from requests.packages
5 years ago
Mike Fährmann 35f343206c
update default SSL cipher list in urllib3 < 1.25
5 years ago
Mike Fährmann e25ebc4bff
don't disable certificate checks anymore
6 years ago