Commit Graph

221 Commits (fd734b92223a02c0c392e4eece6bf82ba0da1fc8)

Author SHA1 Message Date
Mike Fährmann 2403c405e3
Merge branch 'postprocessor'
6 years ago
Mike Fährmann baccf8a958
improve postprocessor handling
6 years ago
Mike Fährmann 7646bdbcfd
improve postprocessor initialization code
6 years ago
Mike Fährmann 821535b458
adjust PathFormat class
6 years ago
Mike Fährmann 2df1a15fb8
add '-s/--simulate' to run data extraction without download
6 years ago
Mike Fährmann 76c32d58e5
[postprocessor] initial code
6 years ago
Mike Fährmann 8bf3cdd82b
implement logging options
6 years ago
Mike Fährmann 9fb82e6b43
apply expand_path() to archive paths
7 years ago
Mike Fährmann f970a8f13c
fix adding keys to download archive when using skip=false
7 years ago
Mike Fährmann be3ea4425d
test archive-id creation and uniqueness
7 years ago
Mike Fährmann 3cec533c28
Merge branch 'archive'
7 years ago
Mike Fährmann 4d2fadfb6f
restore skip actions with download archive
7 years ago
Mike Fährmann 7f7c16ae37
add option to specify additional key-value pairs
7 years ago
Mike Fährmann 8c3b713362
rework DownloadJob.handle_url(); include archive functionality
7 years ago
Mike Fährmann db7f04dd97
emit log messages on download failure
7 years ago
Mike Fährmann 27fce6f600
fix UrlJob behavior
7 years ago
Mike Fährmann b837420291
fix minor urllist issues
7 years ago
Mike Fährmann 9d69401391
initial support for multiple URLs per image
7 years ago
Mike Fährmann 6174a5c4ef
[download] adjust filename extension on filetype mismatch
7 years ago
Mike Fährmann 1a70857a12
update extractor-unittest capabilities
7 years ago
Mike Fährmann 88bb0798fd
delay initialization of PathFormat objects
7 years ago
Mike Fährmann 9d73ed4772
fix issue with using 'skip()' when a filter is present
7 years ago
Mike Fährmann 291369eab2
various smaller changes/additions
7 years ago
Mike Fährmann 4fb6803fa6
add option to sleep before each download
7 years ago
Mike Fährmann 6c9da67581
apply selection options (filter, range) when using '-j'
7 years ago
Mike Fährmann 27c026543f
re-enable download unit tests
7 years ago
Mike Fährmann 2e982f56af
use 'Content-Length' to determine incomplete downloads (#29)
7 years ago
Mike Fährmann 2ef3c35c98
smaller textual changes
7 years ago
Mike Fährmann 0386503c80
fix (sub)category-transfer for DownloadJob instances (#41)
7 years ago
Mike Fährmann b319f4bab3
smaller code and text changes
7 years ago
Mike Fährmann 26a866e7d8
implement (sub)category-transfer between extractors (#41)
7 years ago
Mike Fährmann 9c138dfc1f
[common] detect empty HTTP response bodies
7 years ago
Mike Fährmann 0dedbe759c
enable '--chapter-filter'
7 years ago
Mike Fährmann 5704c709fa
apply filter before range
7 years ago
Mike Fährmann 9b21d3f13c
add '--filter' command-line option
7 years ago
Mike Fährmann 268cfa3cfe
filter duplicate URLs (#36)
7 years ago
Mike Fährmann 47bcf53ec1
implement support for additional unit test result types
7 years ago
Mike Fährmann ae2d61e5b3
handle format string exceptions separately
7 years ago
Mike Fährmann 3c9f190757
extend output of --list-keywords
7 years ago
Mike Fährmann cfa479fab5
update error message for unspecified exceptions
7 years ago
Mike Fährmann 915a0137de
improve 'extractor.request'
7 years ago
Mike Fährmann 58e95a7487
share extractor and downloader sessions
7 years ago
Mike Fährmann c921b4f32a
code cleanup and fixing tests
7 years ago
Mike Fährmann 25bcdc8aa9
add `--write-unsupported` option (#15)
7 years ago
Mike Fährmann 99b72130ee
[reddit] enable recursion (#15)
7 years ago
Mike Fährmann ae686c4c08
run queue items immediately
7 years ago
Mike Fährmann 30eef527d8
update output logic on error
7 years ago
Mike Fährmann e425243b1e
[reddit] some small fixes
7 years ago
Mike Fährmann a90c6acc9c
code cleanup + fixes
7 years ago
Mike Fährmann 4c88c0d496
rework the output format for --list-keywords
7 years ago
Mike Fährmann 13dc5d72bc
update some extractors to use https
8 years ago
Mike Fährmann 5af35ea150
add -v/--verbose option and reduce error verbosity
8 years ago
Mike Fährmann b43cd88101
add '-j/--dump-json' option
8 years ago
Mike Fährmann 841fd50242
move code into util.py
8 years ago
Mike Fährmann ed94d9b92d
fix/improve various things
8 years ago
Mike Fährmann 27ae152f57
use logging to report errors
8 years ago
Mike Fährmann 7a9d66fbce
implement basic way to tell extractors to skip ahead
8 years ago
Mike Fährmann 2fa575b273
restore exception-testing to its old form
8 years ago
Mike Fährmann 40be4933b8
fix exception based tests
8 years ago
Mike Fährmann 24f41e13b3
move some exception handling code
8 years ago
Mike Fährmann 6208d9dd79
implement '--images' and '--chapters' options
8 years ago
Mike Fährmann 2a32b12043
add '--items' option
8 years ago
Mike Fährmann 3bca866185
rework the '-g' cmdline option
8 years ago
Mike Fährmann 4f123b8513
code adjustments according to pep8
8 years ago
Mike Fährmann 29692c5784
get extension from Content-Type header if not provided
8 years ago
Mike Fährmann 1134339c1f
Merge branch 'category'
8 years ago
Mike Fährmann f32cf28758
enable long pathnames on windows (#4)
8 years ago
Mike Fährmann 581daebc4b
remove trailing spaces from path segments (#4)
8 years ago
Mike Fährmann a347d50ef5
add (sub)category keyword automatically
8 years ago
Mike Fährmann 406add217c
print urls recursively
8 years ago
Mike Fährmann 6f7f29d684
rename a few files
8 years ago