Commit Graph

1061 Commits (4b441c162e4515d54a6aca543e548e129dae63dd)

Author SHA1 Message Date
Mike Fährmann ecdc3475b8
[pixhost] support .to TLDs
6 years ago
Mike Fährmann f3d770d4e2
Merge branch '1.4-dev'
6 years ago
Mike Fährmann 1ff626db97
[pixiv] improve bookmark extraction
6 years ago
Mike Fährmann 0a1863fce3
[pixiv] respect more query parameters for user URLs
6 years ago
Mike Fährmann f43d446692
[mangahere] extract chapter titles
6 years ago
Mike Fährmann b8e53b8c6b
[pixiv] move query parsing out of constructor
6 years ago
Mike Fährmann 909d105ae6
[pixiv] add extractor for illusts from followed users
6 years ago
Mike Fährmann 7f899bd5d8
Merge branch 'master' into 1.4-dev
6 years ago
Mike Fährmann fe69d01083
[pixiv] add extractor for search results
6 years ago
Mike Fährmann 247f785af1
[pixiv] use App API
6 years ago
Mike Fährmann 92fc199b07
[reddit] allow arbitrary subdomains
6 years ago
Mike Fährmann 4cea886177
[imgur] allow longer album hashes
6 years ago
Mike Fährmann e1e23165a0
[pinterest] catch JSON decode errors
6 years ago
Mike Fährmann 789608c107
[imagebam] fix extraction for certain galleries
6 years ago
Mike Fährmann 7a58151566
fix util.parse_bytes invocations
6 years ago
Mike Fährmann 1c1e086d01
use common base class for OAuth1.0 based API interfaces
6 years ago
Mike Fährmann f3483a2b7c
[smugmug] add OAuth support
6 years ago
Mike Fährmann 6a31ada9e3
re-implement OAuth1.0 code
6 years ago
Mike Fährmann ec158776ed
[deviantart] add extractor for popular listings
6 years ago
Mike Fährmann 0e3883303f
[pixiv] implement AppAPI wrapper
6 years ago
Mike Fährmann e2157f594e
[mangadex] fix manga extraction (closes #84)
6 years ago
Mike Fährmann 69a5e6ddb3
Merge branch 'master' into 1.4-dev
6 years ago
Mike Fährmann 3ce5296313
[smugmug] code cleanup
6 years ago
Mike Fährmann 42ed7667b8
[smugmug] support user- and general album URLs
6 years ago
Mike Fährmann 2ea0d1da42
[smugmug] improve API code; use data expansions
6 years ago
Mike Fährmann 16e014baaa
[smugmug] added image and album extractor
6 years ago
Mike Fährmann d96b3474e5
[puremashiro] remove module
6 years ago
Mike Fährmann b44a296404
[gomanga] remove module
6 years ago
Mike Fährmann 95392554ee
use text.urljoin()
6 years ago
Mike Fährmann 2395d870dd
[pinterest] unquote board and user names, better errors
6 years ago
Mike Fährmann 8b79eaafea
[tumblr] log actual time of rate limit resets
6 years ago
Mike Fährmann 0f1e07f627
[pinterest] scrap OAuth implementation; code improvements
6 years ago
Mike Fährmann 55d4d23860
[pinterest] use Pinterest's "Web" API (#83)
7 years ago
Mike Fährmann 2721417dd8
Merge branch 'master' into 1.4-dev
7 years ago
Mike Fährmann c6d5154fc3
fix flake8 errors, ignore W504
7 years ago
Mike Fährmann 2d17a9e07f
improve extractor.request()
7 years ago
Mike Fährmann 80521ae1f6
[deviantart] improve API error handling
7 years ago
Mike Fährmann e54b43be08
[mangadex] add title info for chapter extractors
7 years ago
Mike Fährmann f471161920
Merge branch 'master' into 1.4-dev
7 years ago
Mike Fährmann eb37fbf0e8
[hentaifoundry] improve extractor
7 years ago
Mike Fährmann 80bead739d
[oauth] require custom client-* values for pinterest
7 years ago
Mike Fährmann cc36f88586
rename safe_int to parse_int; move parse_* to text module
7 years ago
Mike Fährmann 10cc59f3b5
fix extractor names
7 years ago
Mike Fährmann b1325d4d2c
fix extractor docstrings
7 years ago
Mike Fährmann df7e18399e
[luscious] fix image order
7 years ago
Mike Fährmann d10579edb5
[pinterest] improve PinterestAPI code; remove OAuth mentions
7 years ago
Mike Fährmann 4bd182c107
[pinterest] implement `oauth:pinterest` (#83)
7 years ago
Mike Fährmann 9651f3fce0
[pinterest] improve error messages (#83)
7 years ago
Mike Fährmann dbe250f7e5
[pinterest] update access_token (#83)
7 years ago
Mike Fährmann dd49127408
[spectrumnexus] remove module
7 years ago
Mike Fährmann 5c487300ee
improve 'parse_query()' and add tests
7 years ago
Mike Fährmann 728c64a3fb
[tumblr] rename 'offset' to 'num' and adjust formats
7 years ago
Mike Fährmann 6bd857a319
[tumblr] handle rate limits / 429 errors
7 years ago
Mike Fährmann 7073ab7707
[komikcast] update regex to only match manga pages
7 years ago
Mike Fährmann a1fa4b43b0
Revert "[tumblr] add option to sort photosets by upload order"
7 years ago
Mike Fährmann 48a83a89e9
[loveisover] remove module
7 years ago
Mike Fährmann 564e12ca8f
replace 'imgyt' with 'imxto'
7 years ago
Mike Fährmann 1b80fa82a9
[imgur] update URL pattern and tests
7 years ago
Mike Fährmann 4a26ae32df
[tumblr] add option to sort photosets by upload order
7 years ago
Mike Fährmann 6b72be8ee6
[tumblr] add 'hash' keyword
7 years ago
Mike Fährmann d11fcf4804
smaller changes and fixes
7 years ago
Mike Fährmann 759ba26fb0
[luscious] proper image order for picture albums
7 years ago
Mike Fährmann 68e9fbee16
[tumblr] check all 4 keys/secrets before using OAuth
7 years ago
Mike Fährmann f8168c693e
[tumblr] avoid calls to '/blog/.../info'
7 years ago
Mike Fährmann 64d7c85b55
[exhentai] improve metadata
7 years ago
Mike Fährmann 64b22e0fc1
[pawoo] update URL pattern
7 years ago
Mike Fährmann 7b562907c3
[nijie] add favorites extractor
7 years ago
Mike Fährmann 445db75955
[nijie] improve extraction and metadata
7 years ago
Mike Fährmann a112e3f2a0
[nijie] add doujin extractor
7 years ago
Mike Fährmann f39153b6e9
[nhentai] add extractor for search results
7 years ago
Mike Fährmann 52d41c41e7
[exhentai] add extractor for favorited galleries
7 years ago
Mike Fährmann 63cc2599c4
[exhentai] add extractor for search results
7 years ago
Mike Fährmann d1c91a1f2b
[mangadex] fix manga-page extraction
7 years ago
Mike Fährmann 299ae24996
[test] add a few downloader tests
7 years ago
Mike Fährmann dd314279fb
[test] add unit tests for extractor module functions
7 years ago
Mike Fährmann e7525b1b0e
[artstation] add challenge extractor (#80)
7 years ago
Mike Fährmann f5c6a2d7f5
[nhentai] use API to get gallery info
7 years ago
Mike Fährmann b2ba2b821d
[hitomi] fix image URLs and improve metadata
7 years ago
Mike Fährmann 3905474805
[booru] call update_page() with correct dict (closes #82)
7 years ago
Mike Fährmann 44c267e362
[artstation] add search extractor (#80)
7 years ago
Mike Fährmann 40ca562d7b
[artstation] add album extractor (#80)
7 years ago
Mike Fährmann f367d5c281
[deviantart] move delay-increase after expect_error check
7 years ago
Mike Fährmann 557cb94f81
[deviantart] use proper exponential backoff on API errors
7 years ago
Mike Fährmann 723cc66bb1
[artstation] add user-, image- and likes-extractors
7 years ago
Mike Fährmann 4d74749496
[tests] rework filters for extractor tests
7 years ago
Mike Fährmann d6ef52897c
[imgchili] remove module
7 years ago
Mike Fährmann 7847ab1d5a
[imagehosts] remove even more dead sites
7 years ago
Mike Fährmann 5f37d40a3e
[komikcast] bypass cloudflare challenge
7 years ago
Mike Fährmann f9884e2338
[pixiv] update URL pattern
7 years ago
Mike Fährmann 85ed023c2e
[mangadex] remove the trailing ' - MangaDex' in a better way
7 years ago
Mike Fährmann 32bbd12f08
update extractor tests
7 years ago
Mike Fährmann ca326bd275
[deviantart] fix folder and collection archive IDs
7 years ago
Mike Fährmann e32fe1cdf1
[pinterest] cast IDs to int
7 years ago
Mike Fährmann 179ecee965
[turboimagehost] fix extraction
7 years ago
Mike Fährmann 1400868f53
[mangadex] general improvements
7 years ago
Mike Fährmann 749fbbfa6c
[mangadex] add chapter- and manga-extractor
7 years ago
Mike Fährmann 6e38cf5aab
[mangareader] use 'https://'
7 years ago
Mike Fährmann 1d71123f91
[pixiv] update archive IDs and add metadata-fields
7 years ago
Mike Fährmann 858fdbdb22
[tumblr] improve 'inline' extraction
7 years ago
Mike Fährmann 5008e105ee
update archive IDs
7 years ago
Mike Fährmann 829ddf4ac1
[sankaku] general improvements
7 years ago
Jad 49463f76bb
support multi-page URL (#79)
7 years ago
Mike Fährmann 19aefdfde3
[directlink] update test results
7 years ago
Mike Fährmann 74029c50bb
[directlink] unquote metadata fields
7 years ago
Mike Fährmann 8f338347b6
[imagehosts] cleanup
7 years ago
Mike Fährmann edfd3d9fc9
[yeet] remove module
7 years ago
Mike Fährmann 8704d850bf
add explicit proxy support (#76)
7 years ago
Mike Fährmann 367b963d37
[pixiv] fix ugoira extraction ... again (#78)
7 years ago
Mike Fährmann b79f1f2ca7
[pixiv] fix ugoira extraction (closes #78)
7 years ago
Mike Fährmann d122203be1
[mangastream] fix extraction
7 years ago
Mike Fährmann 179bcdd349
adjust archive-ids
7 years ago
Mike Fährmann 3cec533c28
Merge branch 'archive'
7 years ago
Mike Fährmann 20af86b2ea
add more extractor tests
7 years ago
Mike Fährmann 7e0207bcf4
[imgur] strip trailing '?1' from 'ext'
7 years ago
Mike Fährmann cf147dfee9
[hentai2read] fix manga extraction
7 years ago
Mike Fährmann f5f2d29f56
[nijie] fix dojin extraction
7 years ago
Mike Fährmann d38bf2f54c
[tumblr] recognize /image/... URLs
7 years ago
Mike Fährmann 5b3c34aa96
use generic chapter-extractor in more modules
7 years ago
Mike Fährmann 7b5ba69951
[hentaihere] ensure consistent extraction results
7 years ago
Mike Fährmann 377b78b3c9
[hentai2read] fix manga name extraction
7 years ago
Mike Fährmann 54c36a8a34
[subapics] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 2dd3aeeeae
[komikcast] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 7a412f5c32
implement generic manga-chapter extractor
7 years ago
Mike Fährmann 6a07e38366
implement extractor.add() and .add_module()
7 years ago
Mike Fährmann 34873dbd90
set 'archive_fmt' values
7 years ago
Mike Fährmann a34cebc253
[luscious] jump to first image if cover does not link to it
7 years ago
Mike Fährmann 84a52a9256
add DownloadArchive class
7 years ago
Mike Fährmann 619387cbb1
update extractor unittest results
7 years ago
Mike Fährmann db91cf871c
document message identifiers
7 years ago
Mike Fährmann 0dd48d644f
update test results
7 years ago
Mike Fährmann 1e93955170
[batoto] remove module
7 years ago
Mike Fährmann 76509a6d3c
[imgur] update test results
7 years ago
Mike Fährmann 9fccd7b783
[tumblr] provide fallback URLs (#64)
7 years ago
Mike Fährmann 9d69401391
initial support for multiple URLs per image
7 years ago
Mike Fährmann 91ed147cef
[oauth] use custom key/secret values during oauth:…
7 years ago
Mike Fährmann 421a9740a3
[tumblr] add 'tumblr:' to force Tumblr extractor (#71)
7 years ago
Mike Fährmann 40d35c87bc
[paheal] add tag- and post-extractors (closes #69)
7 years ago
Mike Fährmann cc0c2cca57
[reddit] add extractor for reddit-hosted images (closes #68)
7 years ago
Mike Fährmann f10ffc0839
update extractor blacklist to also allow classes
7 years ago
Mike Fährmann 35e09869d1
[mangapark] fix image URLs and use HTTPS
7 years ago
Mike Fährmann 9a049bdf51
[tumblr] add 'likes' extractor (#65)
7 years ago
Mike Fährmann 67d4462d26
[batoto] rudimentary Cloudflare bypass
7 years ago
Mike Fährmann 29d75fc3fa
[tumblr] add support for OAuth authentication (#65)
7 years ago
Mike Fährmann 4edb25346e
[slideshare] support mobile URLs (closes #67)
7 years ago
Mike Fährmann e420a28bbc
fix cookie tests
7 years ago
Mike Fährmann b33efc99a4
[idolcomplex] add support for idol.sankakucomplex.com
7 years ago
Mike Fährmann 75b2e84b6d
[tumblr] use s3.amazonaws.com for image URLs (#64)
7 years ago
Mike Fährmann 5b094328b5
[puremashiro] add chapter- and manga-extractor (closes #66)
7 years ago
Mike Fährmann 974e73bdbb
[booru] smaller code adjustments
7 years ago
Mike Fährmann 03b8a548cb
[tumblr] change `reblogs` default value to `true` (#61)
7 years ago
Mike Fährmann d235f68f59
[tumblr] add option to filter reblogged posts (#61)
7 years ago
Mike Fährmann a794fffc6d
[batoto] extend chapter-string regex (closes #60)
7 years ago
Mike Fährmann 1219ebb7f5
[danbooru] use alternate subdomains; support safebooru
7 years ago
Mike Fährmann 9e8a84ab6c
[booru] rewrite using Mixin classes (#59)
7 years ago
Mike Fährmann 0876541e43
[seiga] update tests
7 years ago
Mike Fährmann 88bb0798fd
delay initialization of PathFormat objects
7 years ago
Mike Fährmann c24e0e70a7
[pixiv] simplify main loop
7 years ago
Mike Fährmann c1e331edbb
[mangapark] replace manga test
7 years ago
Mike Fährmann 28cd78aae0
[kissmanga] extend chapter-string regex (closes #58)
7 years ago
Mike Fährmann a3e9b51bea
[imgbox] update test results
7 years ago
Mike Fährmann d0886f411e
[gelbooru] re-enable API use (closes #56)
7 years ago
Mike Fährmann 8102aae311
[mangahere] support ".cc" TLD and mobile URLs
7 years ago
Mike Fährmann 676602056c
[reddit] unescape output URLs
7 years ago
Mike Fährmann 2eedbaaaf9
[deviantart] use cache to store new refresh_tokens
7 years ago
Mike Fährmann fc7d165c97
[deviantart] add support for OAuth2 authentication
7 years ago
Mike Fährmann 91c2aed077
[nhentai] fix JSON extraction
7 years ago
Mike Fährmann 444008a14a
[khinsider] use urljoin() to complete page URLs
7 years ago
Mike Fährmann 263741d243
[luscious] update URL pattern (closes #55)
7 years ago
Mike Fährmann 0a9a07a6e1
[slideshare] improve metadata; flake8
7 years ago
Leonardo Taccari a8d2dde8b2
[slideshare] Add a new extractor for slideshare.net (#54)
7 years ago
Mike Fährmann 19a6ae57b2
[sankaku] add pool extractor
7 years ago
Mike Fährmann e52f0cc1ed
[sankaku] add post extractor
7 years ago
Mike Fährmann 595593a35e
[sankaku] rewrite
7 years ago
Mike Fährmann a3924d2072
[sankaku] fix swf extraction (closes #52)
7 years ago
Mike Fährmann 291369eab2
various smaller changes/additions
7 years ago
Mike Fährmann 300346ecdf
[mangazuki] remove extractors
7 years ago
Mike Fährmann d275b1d9a3
[khinsider] fix extraction
7 years ago
Mike Fährmann 6b8e3003df
[hentai2read] ensure consistent extraction results
7 years ago
Mike Fährmann a1980b16f3
[gelbooru] various improvements
7 years ago
Mike Fährmann 93482a1f88
implement 'util.advance()'
7 years ago
Mike Fährmann 038e3b3369
[kissmanga] handle "AreYouHuman" redirects (#51)
7 years ago
Mike Fährmann 2b9a783fc7
[khinsider] fix extraction
7 years ago
Mike Fährmann 214972bc9a
[gelbooru] use manual extraction
7 years ago
Mike Fährmann 55c64cad4b
[khinsider] fix filename extension and test-pattern
7 years ago
Mike Fährmann b14de6ffc2
[tumblr] small improvements
7 years ago
Mike Fährmann 9296a26eae
[tumblr] add warning messages
7 years ago
Mike Fährmann 65c1c53eb8
[khinsider] fix extraction
7 years ago
Mike Fährmann 12de658937
[tumblr] add options to control extraction behavior (#48)
7 years ago
Mike Fährmann 077f8c12be
[tumblr] original video URLs + continuous offset
7 years ago
Mike Fährmann 8eb12ebeae
[tumblr] support more post/media types (#48)
7 years ago
Mike Fährmann b8cdd42cab
[senmanga] fix extraction (again)
7 years ago
Mike Fährmann e6814aebe2
add 'extractor.*.user-agent' config option
7 years ago
Mike Fährmann 6913eeaa40
[powermanga] replace manga extractor unit test
7 years ago
Mike Fährmann 7e0d9257a7
[hbrowse] fix manga extraction
7 years ago
Mike Fährmann 3c576d10c0
[seiga] better metadata + 'skip()' support
7 years ago
Mike Fährmann f72318e593
[seiga] support more than 200 images
7 years ago
Mike Fährmann baf8094868
improve Extractor.request()'s retry behavior
7 years ago
Mike Fährmann 7e7b64162b
[batoto] handle error 10031
7 years ago
Mike Fährmann 92027f67f9
use consistent names for URL constants
7 years ago
Mike Fährmann 69cbc0619f
[mangastream] fix 'next-page' URLs (fixes #49)
7 years ago
Mike Fährmann 980fd3616d
[tumblr] use API v2 (#48)
7 years ago
Mike Fährmann d6bed9f36f
[tumblr] prevent premature exit to get all images (fixes #48)
7 years ago
Mike Fährmann 305da540c3
[mangahere] fix metadata extraction
7 years ago
Mike Fährmann 2d0cfb33e1
[xvideos] add user profile extractor (#45)
7 years ago
Mike Fährmann a393e6e538
[xvideos] add gallery extractor (#45)
7 years ago
Mike Fährmann 3a8a0c1f35
[imgbox] rewrite / fix extraction (closes #47)
7 years ago
Mike Fährmann 035ef655f1
[imagefap] update unit tests
7 years ago
Mike Fährmann 239d7afea7
[hosturimage] fix extraction of larger images
7 years ago
Mike Fährmann 158e60ee89
[3dbooru] enable download continuation
7 years ago
Mike Fährmann c4fcdf2691
Revert "[senmanga] fix extraction and download"
7 years ago
Mike Fährmann 81a7788b40
replace space characters in unit test URLs
7 years ago
Mike Fährmann bf82181359
[jaiminisbox] fix extraction
7 years ago
Mike Fährmann 16783e327f
[common] fix UnboundLocalError in Extractor.request()
7 years ago
Mike Fährmann 2ace5c7b3c
[senmanga] fix extraction and download
7 years ago
Mike Fährmann 4d8387f93b
[pixiv] support mobile URLs (https://touch.pixiv.net/)
7 years ago
Mike Fährmann ab2bf0b0dd
[deviantart] replace collection unittest
7 years ago
Mike Fährmann 289d6b65d2
[danbooru] extend and improve URL regex
7 years ago
Mike Fährmann 5fa42336a2
[sankaku] add warning for unauthenticated users
7 years ago
Mike Fährmann 6af921a952
[sankaku] rewrite/improve (fixes #44)
7 years ago
Mike Fährmann 9aecc67841
[common] explicitly handle HTTP status code 429
7 years ago
Mike Fährmann d68a24aa70
[kissmanga] fix extraction
7 years ago
Mike Fährmann 864a63ed33
fix typo
7 years ago
Mike Fährmann f3fbaa5c3e
[reddit] allow users to override the API User-Agent
7 years ago
Mike Fährmann 31ea6001e8
[dynastyscans] improve metadata and filename formats
7 years ago
Mike Fährmann 2ef3c35c98
smaller textual changes
7 years ago
Mike Fährmann 68a0a7579c
fix/improve some regular expressions
7 years ago
Mike Fährmann 393755ee94
[tumblr] update tests
7 years ago
Mike Fährmann 75d3a1f72f
[deviantart] always download original images
7 years ago
Mike Fährmann a1c8b21cfd
[senmanga] improve metadata
7 years ago
Mike Fährmann 994b2fc1e7
[deviantart] replace 'author[urlname]' keyword
7 years ago
Mike Fährmann 633b376f35
improve/adjust default filename formats for manga sites
7 years ago
Mike Fährmann 41adb99e9c
[pawoo] fix extraction
7 years ago
Mike Fährmann b319f4bab3
smaller code and text changes
7 years ago
Mike Fährmann ad4580800c
[pixiv] add support for more URL patterns
7 years ago
Mike Fährmann 82ea6c0cd3
adjust format strings with optional titles
7 years ago
Mike Fährmann 85a2b2ae59
[khinsider] fix extraction
7 years ago
Mike Fährmann 26a866e7d8
implement (sub)category-transfer between extractors (#41)
7 years ago
Mike Fährmann 1ab4c7986f
[mangahere] fix extraction
7 years ago
Mike Fährmann 8e14714c2b
[imgspice] fix extraction
7 years ago
Mike Fährmann 9c138dfc1f
[common] detect empty HTTP response bodies
7 years ago
Mike Fährmann c51616f8d8
[foolslide] fix minor chapter number
7 years ago
H R X N 77bf923c56
Update imgur.py to include 'title' of single image (#40)
7 years ago
Mike Fährmann a85f06d2d1
[foolslide] restructure; convert suitable values to int
7 years ago
Mike Fährmann deb2e803ba
simplify MangaExtractor class
7 years ago
Mike Fährmann 9fc1d0c901
implement and use 'util.safe_int()'
7 years ago
Mike Fährmann 8963da8fd8
[spectrumnexus] extract manga metadata
7 years ago
Mike Fährmann a3e40734d1
[mangareader] extract manga metadata
7 years ago
Mike Fährmann 9196005a4d
[mangazuki] extract manga metadata
7 years ago
Mike Fährmann 543ba245eb
[deviantart] update test results
7 years ago
Mike Fährmann b7a54a51d0
[mangapark] extract manga metadata + code improvements
7 years ago