Commit Graph

1359 Commits (67cc0ac8734d3b74c8669d983442ebbde8dd6e9f)

Author SHA1 Message Date
Mike Fährmann 87853538b4
[yandere] add option to split tags by type (#92)
6 years ago
Mike Fährmann a699787d01
[deviantart] update URL patterns to new format
6 years ago
Mike Fährmann 9e3415886c
[senmanga] fix/update tests
6 years ago
Mike Fährmann 973cf98e88
fix download skip for files without extension
6 years ago
Mike Fährmann b8c97d2295
use 'extractor.request()' for more HTTP requests
6 years ago
Mike Fährmann cc15c6105c
release version 1.4.1
6 years ago
Mike Fährmann 150a6b9064
[xvideos] fix metadata extraction
6 years ago
Mike Fährmann 7a98cc9798
[smugmug] update tests
6 years ago
Mike Fährmann 4eb94aca17
[postprocessor:ugoira] pass '-f' if not present
6 years ago
Mike Fährmann 0c1c4557dd
[postprocessor:ugoira] add option for two-pass encoding
6 years ago
Mike Fährmann a9e276bc37
reset delete-flag
6 years ago
Mike Fährmann 91340d9d27
[pixiv] fix ugoira test
6 years ago
Mike Fährmann 709c5d466d
add '--zip' and '--ugoira-conv' command-line options
6 years ago
Mike Fährmann eb7a1f3b98
[pixiv] rework ugoira handling
6 years ago
Mike Fährmann 017188d268
improve extractor.request()
6 years ago
Mike Fährmann 613b692275
[postprocessor:ugoira] add a few options
6 years ago
Mike Fährmann a444755979
[postprocessor] add 'ugoira' to convert pixiv animations to webm
6 years ago
Mike Fährmann f10bd5cdbe
[4chan] unescape filenames
6 years ago
Mike Fährmann eec081dd3e
[postprocessor:zip] delete directory (#85)
6 years ago
Mike Fährmann 2d1a104739
[mangadex] unescape manga names and chapter titles
6 years ago
Mike Fährmann 3bcce77f6d
release version 1.4.0
6 years ago
Mike Fährmann 6ac403c5d3
add postprocessor config example
6 years ago
Mike Fährmann 2403c405e3
Merge branch 'postprocessor'
6 years ago
Mike Fährmann baccf8a958
improve postprocessor handling
6 years ago
Mike Fährmann 2628911ba0
[pp:exec] add 'async' option
6 years ago
Mike Fährmann 7646bdbcfd
improve postprocessor initialization code
6 years ago
Mike Fährmann 37d97ff02c
[pp:classify] use temppath
6 years ago
Mike Fährmann 97189e50cd
[pp:zip] use temppath; add options
6 years ago
Mike Fährmann 821535b458
adjust PathFormat class
6 years ago
Mike Fährmann a47c6136cd
[simplyhentai] avoid redirects for all-pages.json (#89)
6 years ago
Mike Fährmann ad14de19c6
[imgur] support "unmuted" URLs
6 years ago
Mike Fährmann 72e66f0aac
[simplyhentai] improve URL pattern
6 years ago
Mike Fährmann cdcc3427a0
[simplyhentai] add video extractor (#89)
6 years ago
Mike Fährmann f9a6a19658
[simplyhentai] add image extractor (#89)
6 years ago
Mike Fährmann ebf596b399
[pawoo] restore metadata fields + smaller improvements
6 years ago
Mike Fährmann f7e7306e5a
[komikcast] update URL pattern and unescape image URLs
6 years ago
Mike Fährmann 70f3617d88
[mangafox] fix URL extraction
6 years ago
Mike Fährmann a62bd81e9b
[pixiv] fix filter for 'type=all'
6 years ago
Mike Fährmann 12797e3b1f
update configuration.rst
6 years ago
Mike Fährmann 55b0913412
[simplyhentai] add gallery extractor (#89)
6 years ago
Mike Fährmann ae9a37a528
implement text.split_html()
6 years ago
Mike Fährmann b08d95ebe4
add an 'encoding' option for logging files (default 'utf-8')
6 years ago
Mike Fährmann 513d807632
explicitly open config files as utf-8
6 years ago
Mike Fährmann 2df1a15fb8
add '-s/--simulate' to run data extraction without download
6 years ago
Mike Fährmann 15cce22d82
[mangadex] fix parsing of unusual chapter strings
6 years ago
Mike Fährmann ecdc3475b8
[pixhost] support .to TLDs
6 years ago
Mike Fährmann f3d770d4e2
Merge branch '1.4-dev'
6 years ago
Mike Fährmann d0ae3ed52c
[postprocessor] add 'zip' to write files to a ZIP archive
6 years ago
Mike Fährmann ca4008e1c1
[postprocessor] add 'classify' to sort downloads by fileext
6 years ago
Mike Fährmann d378c0a323
[postprocessor] add 'exec' to execute user-defined processes
6 years ago
Mike Fährmann 76c32d58e5
[postprocessor] initial code
6 years ago
Mike Fährmann 1ff626db97
[pixiv] improve bookmark extraction
6 years ago
Mike Fährmann 0a1863fce3
[pixiv] respect more query parameters for user URLs
6 years ago
Mike Fährmann f43d446692
[mangahere] extract chapter titles
6 years ago
Mike Fährmann b8e53b8c6b
[pixiv] move query parsing out of constructor
6 years ago
Mike Fährmann 909d105ae6
[pixiv] add extractor for illusts from followed users
6 years ago
Mike Fährmann 7f899bd5d8
Merge branch 'master' into 1.4-dev
6 years ago
Mike Fährmann fe69d01083
[pixiv] add extractor for search results
6 years ago
Mike Fährmann 247f785af1
[pixiv] use App API
6 years ago
Mike Fährmann 92fc199b07
[reddit] allow arbitrary subdomains
6 years ago
Mike Fährmann 4cea886177
[imgur] allow longer album hashes
6 years ago
Mike Fährmann e1e23165a0
[pinterest] catch JSON decode errors
6 years ago
Mike Fährmann 789608c107
[imagebam] fix extraction for certain galleries
6 years ago
Mike Fährmann 7a58151566
fix util.parse_bytes invocations
6 years ago
Mike Fährmann 1c1e086d01
use common base class for OAuth1.0 based API interfaces
6 years ago
Mike Fährmann f3483a2b7c
[smugmug] add OAuth support
6 years ago
Mike Fährmann 6a31ada9e3
re-implement OAuth1.0 code
6 years ago
Mike Fährmann ec158776ed
[deviantart] add extractor for popular listings
6 years ago
Mike Fährmann 0e3883303f
[pixiv] implement AppAPI wrapper
6 years ago
Mike Fährmann e2157f594e
[mangadex] fix manga extraction (closes #84)
6 years ago
Mike Fährmann 69a5e6ddb3
Merge branch 'master' into 1.4-dev
6 years ago
Mike Fährmann 82c50fa609
release version 1.3.5
6 years ago
Mike Fährmann 3ce5296313
[smugmug] code cleanup
6 years ago
Mike Fährmann 42ed7667b8
[smugmug] support user- and general album URLs
6 years ago
Mike Fährmann 8bf3cdd82b
implement logging options
6 years ago
Mike Fährmann 2ea0d1da42
[smugmug] improve API code; use data expansions
6 years ago
Mike Fährmann 16e014baaa
[smugmug] added image and album extractor
6 years ago
Mike Fährmann d96b3474e5
[puremashiro] remove module
6 years ago
Mike Fährmann b44a296404
[gomanga] remove module
6 years ago
Mike Fährmann 95392554ee
use text.urljoin()
6 years ago
Mike Fährmann 2395d870dd
[pinterest] unquote board and user names, better errors
6 years ago
Mike Fährmann 8b79eaafea
[tumblr] log actual time of rate limit resets
6 years ago
Mike Fährmann 0f1e07f627
[pinterest] scrap OAuth implementation; code improvements
6 years ago
Mike Fährmann 55d4d23860
[pinterest] use Pinterest's "Web" API (#83)
7 years ago
Mike Fährmann 2721417dd8
Merge branch 'master' into 1.4-dev
7 years ago
Mike Fährmann c6d5154fc3
fix flake8 errors, ignore W504
7 years ago
Mike Fährmann 2d17a9e07f
improve extractor.request()
7 years ago
Mike Fährmann 80521ae1f6
[deviantart] improve API error handling
7 years ago
Mike Fährmann e54b43be08
[mangadex] add title info for chapter extractors
7 years ago
Mike Fährmann f471161920
Merge branch 'master' into 1.4-dev
7 years ago
Mike Fährmann a2020c736e
release version 1.3.4
7 years ago
Mike Fährmann eb37fbf0e8
[hentaifoundry] improve extractor
7 years ago
Mike Fährmann 80bead739d
[oauth] require custom client-* values for pinterest
7 years ago
Mike Fährmann cc36f88586
rename safe_int to parse_int; move parse_* to text module
7 years ago
Mike Fährmann ff643793bd
improve and document cloudflare bypass code
7 years ago
Mike Fährmann 10cc59f3b5
fix extractor names
7 years ago
Mike Fährmann b1325d4d2c
fix extractor docstrings
7 years ago
Mike Fährmann df7e18399e
[luscious] fix image order
7 years ago
Mike Fährmann d10579edb5
[pinterest] improve PinterestAPI code; remove OAuth mentions
7 years ago
Mike Fährmann 4bd182c107
[pinterest] implement `oauth:pinterest` (#83)
7 years ago
Mike Fährmann 9651f3fce0
[pinterest] improve error messages (#83)
7 years ago
Mike Fährmann dbe250f7e5
[pinterest] update access_token (#83)
7 years ago
Mike Fährmann dd49127408
[spectrumnexus] remove module
7 years ago
Mike Fährmann 5c487300ee
improve 'parse_query()' and add tests
7 years ago
Mike Fährmann 728c64a3fb
[tumblr] rename 'offset' to 'num' and adjust formats
7 years ago
Mike Fährmann 4ffa94f634
remove 'shorten_path()' and 'shorten_filename()'
7 years ago
Mike Fährmann 27eab4e467
rewrite text tests and improve functions
7 years ago
Mike Fährmann e3f2bd4087
add tests for 'text.clean_xml()' and improve it
7 years ago
Mike Fährmann 6d8b191ea7
improve 'parse_query()' and add tests
7 years ago
Mike Fährmann 51ea699083
add 'abort()' as function to filter expressions
7 years ago
Mike Fährmann 6bd857a319
[tumblr] handle rate limits / 429 errors
7 years ago
Mike Fährmann 7073ab7707
[komikcast] update regex to only match manga pages
7 years ago
Mike Fährmann a1fa4b43b0
Revert "[tumblr] add option to sort photosets by upload order"
7 years ago
Mike Fährmann 48a83a89e9
[loveisover] remove module
7 years ago
Mike Fährmann 564e12ca8f
replace 'imgyt' with 'imxto'
7 years ago
Mike Fährmann 1b80fa82a9
[imgur] update URL pattern and tests
7 years ago
Mike Fährmann 4a26ae32df
[tumblr] add option to sort photosets by upload order
7 years ago
Mike Fährmann 6b72be8ee6
[tumblr] add 'hash' keyword
7 years ago
Mike Fährmann ffc0c67701
release version 1.3.3
7 years ago
Mike Fährmann d11fcf4804
smaller changes and fixes
7 years ago
Mike Fährmann f6c95dccf9
[cloudflare] fix bypass procedure
7 years ago
Mike Fährmann 759ba26fb0
[luscious] proper image order for picture albums
7 years ago
Mike Fährmann 68e9fbee16
[tumblr] check all 4 keys/secrets before using OAuth
7 years ago
Mike Fährmann 4810d446bb
remove the obsolete safeprint() and error() functions
7 years ago
Mike Fährmann 0381ae5318
replace error handlers for stdout and co.
7 years ago
Mike Fährmann f8168c693e
[tumblr] avoid calls to '/blog/.../info'
7 years ago
Mike Fährmann 64d7c85b55
[exhentai] improve metadata
7 years ago
Mike Fährmann 64b22e0fc1
[pawoo] update URL pattern
7 years ago
Mike Fährmann 7b562907c3
[nijie] add favorites extractor
7 years ago
Mike Fährmann 445db75955
[nijie] improve extraction and metadata
7 years ago
Mike Fährmann a112e3f2a0
[nijie] add doujin extractor
7 years ago
Mike Fährmann f39153b6e9
[nhentai] add extractor for search results
7 years ago
Mike Fährmann 52d41c41e7
[exhentai] add extractor for favorited galleries
7 years ago
Mike Fährmann 63cc2599c4
[exhentai] add extractor for search results
7 years ago
Mike Fährmann d1c91a1f2b
[mangadex] fix manga-page extraction
7 years ago
Mike Fährmann 299ae24996
[test] add a few downloader tests
7 years ago
Mike Fährmann dd314279fb
[test] add unit tests for extractor module functions
7 years ago
Mike Fährmann a993d0ea90
release version 1.3.2
7 years ago
Mike Fährmann e7525b1b0e
[artstation] add challenge extractor (#80)
7 years ago
Mike Fährmann 3f2dd6b6f8
avoid double path-separators
7 years ago
Mike Fährmann f5c6a2d7f5
[nhentai] use API to get gallery info
7 years ago
Mike Fährmann b2ba2b821d
[hitomi] fix image URLs and improve metadata
7 years ago
Mike Fährmann 3905474805
[booru] call update_page() with correct dict (closes #82)
7 years ago
Mike Fährmann 44c267e362
[artstation] add search extractor (#80)
7 years ago
Mike Fährmann 40ca562d7b
[artstation] add album extractor (#80)
7 years ago
Mike Fährmann 7121eeae8b
check supportedsites.rst in release script
7 years ago
Mike Fährmann c59f9b71f1
release version 1.3.1
7 years ago
Mike Fährmann f367d5c281
[deviantart] move delay-increase after expect_error check
7 years ago
Mike Fährmann 557cb94f81
[deviantart] use proper exponential backoff on API errors
7 years ago
Mike Fährmann 723cc66bb1
[artstation] add user-, image- and likes-extractors
7 years ago
Mike Fährmann b69cc94f0e
[util] implement bencode()
7 years ago
Mike Fährmann 4d74749496
[tests] rework filters for extractor tests
7 years ago
Mike Fährmann d6ef52897c
[imgchili] remove module
7 years ago
Mike Fährmann 7847ab1d5a
[imagehosts] remove even more dead sites
7 years ago
Mike Fährmann 5f37d40a3e
[komikcast] bypass cloudflare challenge
7 years ago
Mike Fährmann f9884e2338
[pixiv] update URL pattern
7 years ago
Mike Fährmann 85ed023c2e
[mangadex] remove the trailing ' - MangaDex' in a better way
7 years ago
Mike Fährmann 9fb82e6b43
apply expand_path() to archive paths
7 years ago
Mike Fährmann 32bbd12f08
update extractor tests
7 years ago
Mike Fährmann ca326bd275
[deviantart] fix folder and collection archive IDs
7 years ago
Mike Fährmann e32fe1cdf1
[pinterest] cast IDs to int
7 years ago
Mike Fährmann 179ecee965
[turboimagehost] fix extraction
7 years ago
Mike Fährmann 1400868f53
[mangadex] general improvements
7 years ago
Mike Fährmann 749fbbfa6c
[mangadex] add chapter- and manga-extractor
7 years ago
Mike Fährmann b58449fd88
release version 1.3.0
7 years ago
Mike Fährmann 6e38cf5aab
[mangareader] use 'https://'
7 years ago
Mike Fährmann 1d71123f91
[pixiv] update archive IDs and add metadata-fields
7 years ago
Mike Fährmann 858fdbdb22
[tumblr] improve 'inline' extraction
7 years ago
Mike Fährmann 1d54a8e07d
fix logging output during downloads
7 years ago
Mike Fährmann 5008e105ee
update archive IDs
7 years ago
Mike Fährmann 829ddf4ac1
[sankaku] general improvements
7 years ago
Jad 49463f76bb
support multi-page URL (#79)
7 years ago
Mike Fährmann 19aefdfde3
[directlink] update test results
7 years ago
Mike Fährmann 74029c50bb
[directlink] unquote metadata fields
7 years ago
Mike Fährmann 2fad0b1f1b
add 'U' conversion for format strings to unquote their content
7 years ago
Mike Fährmann 8cdce21dcb
make archive keys user-configurable
7 years ago
Mike Fährmann 8f338347b6
[imagehosts] cleanup
7 years ago
Mike Fährmann edfd3d9fc9
[yeet] remove module
7 years ago
Mike Fährmann e1e0668ca8
add option to set default replacement field value
7 years ago
Mike Fährmann ac3da8115e
[util] don't add text: URLs to list of downloaded URLs
7 years ago
Mike Fährmann 8704d850bf
add explicit proxy support (#76)
7 years ago
Mike Fährmann 367b963d37
[pixiv] fix ugoira extraction ... again (#78)
7 years ago
Mike Fährmann b79f1f2ca7
[pixiv] fix ugoira extraction (closes #78)
7 years ago
Mike Fährmann 731ffd4986
improve text.filename_from_url() performance
7 years ago
Mike Fährmann d122203be1
[mangastream] fix extraction
7 years ago
Mike Fährmann 8809b32aed
release version 1.2.0
7 years ago
Mike Fährmann b50bdbf3d7
change config specifiers in input file format
7 years ago
Mike Fährmann f970a8f13c
fix adding keys to download archive when using skip=false
7 years ago
Mike Fährmann 179bcdd349
adjust archive-ids
7 years ago
Mike Fährmann be3ea4425d
test archive-id creation and uniqueness
7 years ago
Mike Fährmann 3cec533c28
Merge branch 'archive'
7 years ago
Mike Fährmann 20af86b2ea
add more extractor tests
7 years ago
Mike Fährmann b73b8b4f50
add OAuth unittests
7 years ago
Mike Fährmann 4d2fadfb6f
restore skip actions with download archive
7 years ago
Mike Fährmann 65773263fc
[util] implement OAuthSession.urlencode() (closes #75)
7 years ago
Mike Fährmann 7e0207bcf4
[imgur] strip trailing '?1' from 'ext'
7 years ago
Mike Fährmann cf147dfee9
[hentai2read] fix manga extraction
7 years ago
Mike Fährmann f5f2d29f56
[nijie] fix dojin extraction
7 years ago
Mike Fährmann 7f7c16ae37
add option to specify additional key-value pairs
7 years ago
Mike Fährmann d38bf2f54c
[tumblr] recognize /image/... URLs
7 years ago
Mike Fährmann 057668e17e
extend input-file format with per-URL config and comments
7 years ago
Mike Fährmann 5b3c34aa96
use generic chapter-extractor in more modules
7 years ago
Mike Fährmann 347baf7ac5
improve util.parse_range() performance
7 years ago
Mike Fährmann 7b5ba69951
[hentaihere] ensure consistent extraction results
7 years ago
Mike Fährmann 377b78b3c9
[hentai2read] fix manga name extraction
7 years ago
Mike Fährmann 54c36a8a34
[subapics] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 2dd3aeeeae
[komikcast] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 7a412f5c32
implement generic manga-chapter extractor
7 years ago
Mike Fährmann aa38eab2be
allow not-defined fields in format strings
7 years ago
Mike Fährmann 6a07e38366
implement extractor.add() and .add_module()
7 years ago
Mike Fährmann c0dd922c13
add '--download-archive' cmdline option
7 years ago
Mike Fährmann 8c3b713362
rework DownloadJob.handle_url(); include archive functionality
7 years ago
Mike Fährmann 34873dbd90
set 'archive_fmt' values
7 years ago
Mike Fährmann a34cebc253
[luscious] jump to first image if cover does not link to it
7 years ago
Mike Fährmann 84a52a9256
add DownloadArchive class
7 years ago
Mike Fährmann 915807dd77
log HTTP errors as warnings
7 years ago
Mike Fährmann db7f04dd97
emit log messages on download failure
7 years ago
Mike Fährmann d951f13e37
add config option for unsupported-URL file
7 years ago
Mike Fährmann 619387cbb1
update extractor unittest results
7 years ago
Mike Fährmann 364e335440
smaller adjustments and improvements
7 years ago
Mike Fährmann c9a9664a65
change --write-log behaviour
7 years ago
Mike Fährmann 97f4f15ec0
add option to write logging output to a file
7 years ago
Mike Fährmann f94e3706a8
use logging module for error messages during downloads
7 years ago
Mike Fährmann db91cf871c
document message identifiers
7 years ago
Mike Fährmann 0dd48d644f
update test results
7 years ago
Mike Fährmann 1e93955170
[batoto] remove module
7 years ago
Mike Fährmann 27fce6f600
fix UrlJob behavior
7 years ago
Mike Fährmann 76509a6d3c
[imgur] update test results
7 years ago
Mike Fährmann 9fccd7b783
[tumblr] provide fallback URLs (#64)
7 years ago
Mike Fährmann b837420291
fix minor urllist issues
7 years ago
Mike Fährmann 9d69401391
initial support for multiple URLs per image
7 years ago
Mike Fährmann 6174a5c4ef
[download] adjust filename extension on filetype mismatch
7 years ago
Mike Fährmann 91ed147cef
[oauth] use custom key/secret values during oauth:…
7 years ago
Mike Fährmann 421a9740a3
[tumblr] add 'tumblr:' to force Tumblr extractor (#71)
7 years ago
Mike Fährmann 40d35c87bc
[paheal] add tag- and post-extractors (closes #69)
7 years ago
Mike Fährmann cc0c2cca57
[reddit] add extractor for reddit-hosted images (closes #68)
7 years ago
Mike Fährmann f10ffc0839
update extractor blacklist to also allow classes
7 years ago
Mike Fährmann b6797032e3
release version 1.1.2
7 years ago
Mike Fährmann 35e09869d1
[mangapark] fix image URLs and use HTTPS
7 years ago
Mike Fährmann 9a049bdf51
[tumblr] add 'likes' extractor (#65)
7 years ago
Mike Fährmann 67d4462d26
[batoto] rudimentary Cloudflare bypass
7 years ago
Mike Fährmann 29d75fc3fa
[tumblr] add support for OAuth authentication (#65)
7 years ago
Mike Fährmann 4edb25346e
[slideshare] support mobile URLs (closes #67)
7 years ago
Mike Fährmann e420a28bbc
fix cookie tests
7 years ago
Mike Fährmann b33efc99a4
[idolcomplex] add support for idol.sankakucomplex.com
7 years ago
Mike Fährmann 75b2e84b6d
[tumblr] use s3.amazonaws.com for image URLs (#64)
7 years ago
Mike Fährmann 5b094328b5
[puremashiro] add chapter- and manga-extractor (closes #66)
7 years ago
Mike Fährmann 974e73bdbb
[booru] smaller code adjustments
7 years ago
Mike Fährmann 03b8a548cb
[tumblr] change `reblogs` default value to `true` (#61)
7 years ago
Mike Fährmann d235f68f59
[tumblr] add option to filter reblogged posts (#61)
7 years ago