Commit Graph

976 Commits (392a08165795b5579c5e8861a9b925520dad469b)

Author SHA1 Message Date
Mike Fährmann d6ef52897c
[imgchili] remove module
7 years ago
Mike Fährmann 7847ab1d5a
[imagehosts] remove even more dead sites
7 years ago
Mike Fährmann 5f37d40a3e
[komikcast] bypass cloudflare challenge
7 years ago
Mike Fährmann f9884e2338
[pixiv] update URL pattern
7 years ago
Mike Fährmann 85ed023c2e
[mangadex] remove the trailing ' - MangaDex' in a better way
7 years ago
Mike Fährmann 32bbd12f08
update extractor tests
7 years ago
Mike Fährmann ca326bd275
[deviantart] fix folder and collection archive IDs
7 years ago
Mike Fährmann e32fe1cdf1
[pinterest] cast IDs to int
7 years ago
Mike Fährmann 179ecee965
[turboimagehost] fix extraction
7 years ago
Mike Fährmann 1400868f53
[mangadex] general improvements
7 years ago
Mike Fährmann 749fbbfa6c
[mangadex] add chapter- and manga-extractor
7 years ago
Mike Fährmann 6e38cf5aab
[mangareader] use 'https://'
7 years ago
Mike Fährmann 1d71123f91
[pixiv] update archive IDs and add metadata-fields
7 years ago
Mike Fährmann 858fdbdb22
[tumblr] improve 'inline' extraction
7 years ago
Mike Fährmann 5008e105ee
update archive IDs
7 years ago
Mike Fährmann 829ddf4ac1
[sankaku] general improvements
7 years ago
Jad 49463f76bb support multi-page URL (#79)
7 years ago
Mike Fährmann 19aefdfde3
[directlink] update test results
7 years ago
Mike Fährmann 74029c50bb
[directlink] unquote metadata fields
7 years ago
Mike Fährmann 8f338347b6
[imagehosts] cleanup
7 years ago
Mike Fährmann edfd3d9fc9
[yeet] remove module
7 years ago
Mike Fährmann 8704d850bf
add explicit proxy support (#76)
7 years ago
Mike Fährmann 367b963d37
[pixiv] fix ugoira extraction ... again (#78)
7 years ago
Mike Fährmann b79f1f2ca7
[pixiv] fix ugoira extraction (closes #78)
7 years ago
Mike Fährmann d122203be1
[mangastream] fix extraction
7 years ago
Mike Fährmann 179bcdd349
adjust archive-ids
7 years ago
Mike Fährmann 3cec533c28
Merge branch 'archive'
7 years ago
Mike Fährmann 20af86b2ea
add more extractor tests
7 years ago
Mike Fährmann 7e0207bcf4
[imgur] strip trailing '?1' from 'ext'
7 years ago
Mike Fährmann cf147dfee9
[hentai2read] fix manga extraction
7 years ago
Mike Fährmann f5f2d29f56
[nijie] fix dojin extraction
7 years ago
Mike Fährmann d38bf2f54c
[tumblr] recognize /image/... URLs
7 years ago
Mike Fährmann 5b3c34aa96
use generic chapter-extractor in more modules
7 years ago
Mike Fährmann 7b5ba69951
[hentaihere] ensure consistent extraction results
7 years ago
Mike Fährmann 377b78b3c9
[hentai2read] fix manga name extraction
7 years ago
Mike Fährmann 54c36a8a34
[subapics] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 2dd3aeeeae
[komikcast] add chapter- and manga-extractor (#70)
7 years ago
Mike Fährmann 7a412f5c32
implement generic manga-chapter extractor
7 years ago
Mike Fährmann 6a07e38366
implement extractor.add() and .add_module()
7 years ago
Mike Fährmann 34873dbd90
set 'archive_fmt' values
7 years ago
Mike Fährmann a34cebc253
[luscious] jump to first image if cover does not link to it
7 years ago
Mike Fährmann 84a52a9256
add DownloadArchive class
7 years ago
Mike Fährmann 619387cbb1
update extractor unittest results
7 years ago
Mike Fährmann db91cf871c
document message identifiers
7 years ago
Mike Fährmann 0dd48d644f
update test results
7 years ago
Mike Fährmann 1e93955170
[batoto] remove module
7 years ago
Mike Fährmann 76509a6d3c
[imgur] update test results
7 years ago
Mike Fährmann 9fccd7b783
[tumblr] provide fallback URLs (#64)
7 years ago
Mike Fährmann 9d69401391
initial support for multiple URLs per image
7 years ago
Mike Fährmann 91ed147cef
[oauth] use custom key/secret values during oauth:…
7 years ago
Mike Fährmann 421a9740a3
[tumblr] add 'tumblr:' to force Tumblr extractor (#71)
7 years ago
Mike Fährmann 40d35c87bc
[paheal] add tag- and post-extractors (closes #69)
7 years ago
Mike Fährmann cc0c2cca57
[reddit] add extractor for reddit-hosted images (closes #68)
7 years ago
Mike Fährmann f10ffc0839
update extractor blacklist to also allow classes
7 years ago
Mike Fährmann 35e09869d1
[mangapark] fix image URLs and use HTTPS
7 years ago
Mike Fährmann 9a049bdf51
[tumblr] add 'likes' extractor (#65)
7 years ago
Mike Fährmann 67d4462d26
[batoto] rudimentary Cloudflare bypass
7 years ago
Mike Fährmann 29d75fc3fa
[tumblr] add support for OAuth authentication (#65)
7 years ago
Mike Fährmann 4edb25346e
[slideshare] support mobile URLs (closes #67)
7 years ago
Mike Fährmann e420a28bbc
fix cookie tests
7 years ago
Mike Fährmann b33efc99a4
[idolcomplex] add support for idol.sankakucomplex.com
7 years ago
Mike Fährmann 75b2e84b6d
[tumblr] use s3.amazonaws.com for image URLs (#64)
7 years ago
Mike Fährmann 5b094328b5
[puremashiro] add chapter- and manga-extractor (closes #66)
7 years ago
Mike Fährmann 974e73bdbb
[booru] smaller code adjustments
7 years ago
Mike Fährmann 03b8a548cb
[tumblr] change `reblogs` default value to `true` (#61)
7 years ago
Mike Fährmann d235f68f59
[tumblr] add option to filter reblogged posts (#61)
7 years ago
Mike Fährmann a794fffc6d
[batoto] extend chapter-string regex (closes #60)
7 years ago
Mike Fährmann 1219ebb7f5
[danbooru] use alternate subdomains; support safebooru
7 years ago
Mike Fährmann 9e8a84ab6c
[booru] rewrite using Mixin classes (#59)
7 years ago
Mike Fährmann 0876541e43
[seiga] update tests
7 years ago
Mike Fährmann 88bb0798fd
delay initialization of PathFormat objects
7 years ago
Mike Fährmann c24e0e70a7
[pixiv] simplify main loop
7 years ago
Mike Fährmann c1e331edbb
[mangapark] replace manga test
7 years ago
Mike Fährmann 28cd78aae0
[kissmanga] extend chapter-string regex (closes #58)
7 years ago
Mike Fährmann a3e9b51bea
[imgbox] update test results
7 years ago
Mike Fährmann d0886f411e
[gelbooru] re-enable API use (closes #56)
7 years ago
Mike Fährmann 8102aae311
[mangahere] support ".cc" TLD and mobile URLs
7 years ago
Mike Fährmann 676602056c
[reddit] unescape output URLs
7 years ago
Mike Fährmann 2eedbaaaf9
[deviantart] use cache to store new refresh_tokens
7 years ago
Mike Fährmann fc7d165c97
[deviantart] add support for OAuth2 authentication
7 years ago
Mike Fährmann 91c2aed077
[nhentai] fix JSON extraction
7 years ago
Mike Fährmann 444008a14a
[khinsider] use urljoin() to complete page URLs
7 years ago
Mike Fährmann 263741d243
[luscious] update URL pattern (closes #55)
7 years ago
Mike Fährmann 0a9a07a6e1
[slideshare] improve metadata; flake8
7 years ago
Leonardo Taccari a8d2dde8b2 [slideshare] Add a new extractor for slideshare.net (#54)
7 years ago
Mike Fährmann 19a6ae57b2
[sankaku] add pool extractor
7 years ago
Mike Fährmann e52f0cc1ed
[sankaku] add post extractor
7 years ago
Mike Fährmann 595593a35e
[sankaku] rewrite
7 years ago
Mike Fährmann a3924d2072
[sankaku] fix swf extraction (closes #52)
7 years ago
Mike Fährmann 291369eab2
various smaller changes/additions
7 years ago
Mike Fährmann 300346ecdf
[mangazuki] remove extractors
7 years ago
Mike Fährmann d275b1d9a3
[khinsider] fix extraction
7 years ago
Mike Fährmann 6b8e3003df
[hentai2read] ensure consistent extraction results
7 years ago
Mike Fährmann a1980b16f3
[gelbooru] various improvements
7 years ago
Mike Fährmann 93482a1f88
implement 'util.advance()'
7 years ago
Mike Fährmann 038e3b3369
[kissmanga] handle "AreYouHuman" redirects (#51)
7 years ago
Mike Fährmann 2b9a783fc7
[khinsider] fix extraction
7 years ago
Mike Fährmann 214972bc9a
[gelbooru] use manual extraction
7 years ago
Mike Fährmann 55c64cad4b
[khinsider] fix filename extension and test-pattern
7 years ago
Mike Fährmann b14de6ffc2
[tumblr] small improvements
7 years ago
Mike Fährmann 9296a26eae
[tumblr] add warning messages
7 years ago
Mike Fährmann 65c1c53eb8
[khinsider] fix extraction
7 years ago
Mike Fährmann 12de658937
[tumblr] add options to control extraction behavior (#48)
7 years ago
Mike Fährmann 077f8c12be
[tumblr] original video URLs + continuous offset
7 years ago
Mike Fährmann 8eb12ebeae
[tumblr] support more post/media types (#48)
7 years ago
Mike Fährmann b8cdd42cab
[senmanga] fix extraction (again)
7 years ago
Mike Fährmann e6814aebe2
add 'extractor.*.user-agent' config option
7 years ago
Mike Fährmann 6913eeaa40
[powermanga] replace manga extractor unit test
7 years ago
Mike Fährmann 7e0d9257a7
[hbrowse] fix manga extraction
7 years ago
Mike Fährmann 3c576d10c0
[seiga] better metadata + 'skip()' support
7 years ago
Mike Fährmann f72318e593
[seiga] support more than 200 images
7 years ago
Mike Fährmann baf8094868
improve Extractor.request()'s retry behavior
7 years ago
Mike Fährmann 7e7b64162b
[batoto] handle error 10031
7 years ago
Mike Fährmann 92027f67f9
use consistent names for URL constants
7 years ago
Mike Fährmann 69cbc0619f
[mangastream] fix 'next-page' URLs (fixes #49)
7 years ago
Mike Fährmann 980fd3616d
[tumblr] use API v2 (#48)
7 years ago
Mike Fährmann d6bed9f36f
[tumblr] prevent premature exit to get all images (fixes #48)
7 years ago
Mike Fährmann 305da540c3
[mangahere] fix metadata extraction
7 years ago
Mike Fährmann 2d0cfb33e1
[xvideos] add user profile extractor (#45)
7 years ago
Mike Fährmann a393e6e538
[xvideos] add gallery extractor (#45)
7 years ago
Mike Fährmann 3a8a0c1f35
[imgbox] rewrite / fix extraction (closes #47)
7 years ago
Mike Fährmann 035ef655f1
[imagefap] update unit tests
7 years ago
Mike Fährmann 239d7afea7
[hosturimage] fix extraction of larger images
7 years ago
Mike Fährmann 158e60ee89
[3dbooru] enable download continuation
7 years ago
Mike Fährmann c4fcdf2691
Revert "[senmanga] fix extraction and download"
7 years ago
Mike Fährmann 81a7788b40
replace space characters in unit test URLs
7 years ago
Mike Fährmann bf82181359
[jaiminisbox] fix extraction
7 years ago
Mike Fährmann 16783e327f
[common] fix UnboundLocalError in Extractor.request()
7 years ago
Mike Fährmann 2ace5c7b3c
[senmanga] fix extraction and download
7 years ago
Mike Fährmann 4d8387f93b
[pixiv] support mobile URLs (https://touch.pixiv.net/)
7 years ago
Mike Fährmann ab2bf0b0dd
[deviantart] replace collection unittest
7 years ago
Mike Fährmann 289d6b65d2
[danbooru] extend and improve URL regex
7 years ago
Mike Fährmann 5fa42336a2
[sankaku] add warning for unauthenticated users
7 years ago
Mike Fährmann 6af921a952
[sankaku] rewrite/improve (fixes #44)
7 years ago
Mike Fährmann 9aecc67841
[common] explicitly handle HTTP status code 429
7 years ago
Mike Fährmann d68a24aa70
[kissmanga] fix extraction
7 years ago
Mike Fährmann 864a63ed33
fix typo
7 years ago
Mike Fährmann f3fbaa5c3e
[reddit] allow users to override the API User-Agent
7 years ago
Mike Fährmann 31ea6001e8
[dynastyscans] improve metadata and filename formats
7 years ago
Mike Fährmann 2ef3c35c98
smaller textual changes
7 years ago
Mike Fährmann 68a0a7579c
fix/improve some regular expressions
7 years ago
Mike Fährmann 393755ee94
[tumblr] update tests
7 years ago
Mike Fährmann 75d3a1f72f
[deviantart] always download original images
7 years ago
Mike Fährmann a1c8b21cfd
[senmanga] improve metadata
7 years ago
Mike Fährmann 994b2fc1e7
[deviantart] replace 'author[urlname]' keyword
7 years ago
Mike Fährmann 633b376f35
improve/adjust default filename formats for manga sites
7 years ago
Mike Fährmann 41adb99e9c
[pawoo] fix extraction
7 years ago
Mike Fährmann b319f4bab3
smaller code and text changes
7 years ago
Mike Fährmann ad4580800c
[pixiv] add support for more URL patterns
7 years ago
Mike Fährmann 82ea6c0cd3
adjust format strings with optional titles
7 years ago
Mike Fährmann 85a2b2ae59
[khinsider] fix extraction
7 years ago
Mike Fährmann 26a866e7d8
implement (sub)category-transfer between extractors (#41)
7 years ago
Mike Fährmann 1ab4c7986f
[mangahere] fix extraction
7 years ago
Mike Fährmann 8e14714c2b
[imgspice] fix extraction
7 years ago
Mike Fährmann 9c138dfc1f
[common] detect empty HTTP response bodies
7 years ago
Mike Fährmann c51616f8d8
[foolslide] fix minor chapter number
7 years ago
H R X N 77bf923c56 Update imgur.py to include 'title' of single image (#40)
7 years ago
Mike Fährmann a85f06d2d1
[foolslide] restructure; convert suitable values to int
7 years ago
Mike Fährmann deb2e803ba
simplify MangaExtractor class
7 years ago
Mike Fährmann 9fc1d0c901
implement and use 'util.safe_int()'
7 years ago
Mike Fährmann 8963da8fd8
[spectrumnexus] extract manga metadata
7 years ago
Mike Fährmann a3e40734d1
[mangareader] extract manga metadata
7 years ago
Mike Fährmann 9196005a4d
[mangazuki] extract manga metadata
7 years ago
Mike Fährmann 543ba245eb
[deviantart] update test results
7 years ago
Mike Fährmann b7a54a51d0
[mangapark] extract manga metadata + code improvements
7 years ago
Mike Fährmann d39b8779af
[mangahere] extract manga metadata
7 years ago
Mike Fährmann c265cc074a
[hbrowse] fix syntax for Python3.3 and 3.4
7 years ago
Mike Fährmann a9e7145651
[hbrowse] extract hmanga metadata & general maintenance
7 years ago
Mike Fährmann 92c8a6cb01
[hentai2read] extract hmanga metadata
7 years ago
Mike Fährmann de174b40d6
[hentaihere] extract hmanga metadata
7 years ago
Mike Fährmann 04cc1ffe34
[kissmanga] extract manga metadata
7 years ago
Mike Fährmann 885bd4cbe2
[readcomiconline] extract comic metadata
7 years ago
Mike Fährmann cebf800a7f
[foolfuuka] add support for more sites (#18)
7 years ago
Mike Fährmann 84d4450410
[fallenangels] extract manga metadata
7 years ago
Mike Fährmann f32b1a0292
[imgyt] fix extraction
7 years ago
Mike Fährmann 4ad903b797
[warosu] fix extraction
7 years ago
Mike Fährmann b84f48dfa5
[batoto] extract manga metadata
7 years ago
Mike Fährmann 4ceb176c6b
[foolslide] extract manga metadata
7 years ago
Mike Fährmann 24e5f154a4
[deviantart] update test results
7 years ago
Mike Fährmann 0dedbe759c
enable '--chapter-filter'
7 years ago
Mike Fährmann 31cd5b1c1d
[luscious] detect high-load responses
7 years ago
Mike Fährmann 470bbe9d8c
fix smaller stuff
7 years ago
Mike Fährmann 6f30cf4c64
change keyword names to valid Python identifiers
7 years ago
Mike Fährmann 54c0715135
allow users to set their own API access_tokens/client_ids
7 years ago
Mike Fährmann 49c7e70c10
[acidimg] add image extractor
7 years ago
Mike Fährmann 9b21d3f13c
add '--filter' command-line option
7 years ago
Mike Fährmann 00420ff202
[booru] consistent order for "popular" results
7 years ago
Mike Fährmann 83cf1e1d6d
[sankaku] unescape image URLs
7 years ago
Mike Fährmann f98e3e8002
[luscious] fix tag extraction
7 years ago
Mike Fährmann 65997d835b
replace popular/ranking tests with older ones
7 years ago
Mike Fährmann be30fb2f98
add common config category for boorus and foolslide
7 years ago
Mike Fährmann c0755a4d5e
[exhentai] revert login-method to its old version (#37)
7 years ago
Mike Fährmann 3ee39ffd93
[exhentai] update login procedure (#37)
7 years ago
Mike Fährmann 88a386977e
[booru] add "popular" extractors for more sites
7 years ago
Mike Fährmann 07214f4007
[booru] place subcategories into base classes
7 years ago
Mike Fährmann 60a888a1e4
[foolfuuka] add common config category
7 years ago
Mike Fährmann 47bcf53ec1
implement support for additional unit test result types
7 years ago
Mike Fährmann 2d0dfe9d56
[exhenai] init headers before login and detect sadpanda
7 years ago
Mike Fährmann c7ec103e15
[batoto] fix extraction of chapter URLs
7 years ago
Mike Fährmann 18e6ed1c7e
[booru] add extractors for "Popular" images
7 years ago
Mike Fährmann f7cdfd4c25
add a simplified version of 'parse_qs'
7 years ago
Mike Fährmann 3b21e0703c
[deviantart] allow distinction between users and groups (#26)
7 years ago
Mike Fährmann e61a3a56d1
[hentai2read] fix and update keywords
7 years ago
Mike Fährmann c45770331a
use 'str.partition()'
7 years ago
Mike Fährmann 017a72f448
[pixiv] improve input validation
7 years ago
Mike Fährmann dcf42c5e89
[pixiv] add extractor for ranking lists
7 years ago
Mike Fährmann 4ea82ea556
[warosu] add thread extractor
7 years ago
Mike Fährmann 9aa95fba8c
[deviantart] adapt download URLs to use https
7 years ago
Mike Fährmann 02e89700fc
[foolfuuka] ensure sorted posts
7 years ago
Mike Fährmann 8bcf88bff7
[flickr] fix extraction
7 years ago
Mike Fährmann 004456d5d5
properly update the config-dictionary
7 years ago
Mike Fährmann cfa479fab5
update error message for unspecified exceptions
7 years ago
Mike Fährmann 7e936e9c06
[luscious] simplify and remove dead code
7 years ago
Mike Fährmann 0245a0ba5f
fix extraction and update test results
7 years ago
Mike Fährmann abd7c559cd
[yonkouprod] remove module
7 years ago
Mike Fährmann da7219ba74
[kisscomic] remove module
7 years ago
Mike Fährmann 852e7acd31
[twitter] ignore "Promoted Tweets"
7 years ago
Mike Fährmann 915a0137de
improve 'extractor.request'
7 years ago
rachmadani haryono dcd573806e chg: dev: fix error (#32)
7 years ago
Mike Fährmann c4713404c8
[directlink] improve URL pattern
7 years ago
Mike Fährmann d443822fdb
[luacious] get correct image URLs (fixes #33)
7 years ago
Mike Fährmann 6950708e52
[hentaicdn] use HTTPS
7 years ago
Mike Fährmann 4f1e6c109f
[deviantart] remove 'invalid escape sequence' warning
7 years ago
Mike Fährmann c864be479e
[directlink] update URL pattern & PEP 8
7 years ago
H R X N 45f9d64c23 Update directlink.py with additional file exts. (#30)
7 years ago
Mike Fährmann 4357966a70
[kissmanga] make URL pattern case-insensitive (fixes 28)
7 years ago
Mike Fährmann 7aa9fa796a
code cleanup and fixes
7 years ago
Mike Fährmann f08af03845
Merge branch 'cookies'
7 years ago
Mike Fährmann 55f048d02b
ignore case of cookiejar magic strings
7 years ago
Mike Fährmann f53bf1a323
[thebarchive] add thread extractor
7 years ago
Mike Fährmann b8cf434bb0
[rebeccablacktech] add thread extractor
7 years ago
Mike Fährmann 808f67ba7d
use 'cookiedomain' for cookies set by object-config-values
7 years ago
Mike Fährmann 390eeded4c
[mangazuki] support 'raws.…' subdomain
7 years ago
Mike Fährmann 4a60f6068a
[mangazuki] add manga extractor
7 years ago
Mike Fährmann 394241cd6f
[2chan] fix extraction
7 years ago
Mike Fährmann a13eb6010f
[fallenangels] fix extraction of chapter URLs
7 years ago
Mike Fährmann 1cb1d2e0a3
[mangazuki] add chapter extractor
7 years ago
Mike Fährmann 2f2e363c97
[imgur] use /a/<key>/all as album-url
7 years ago
Mike Fährmann 1cec03c9c6
[imgur] fix extraction of large albums
7 years ago
Mike Fährmann 0610ae5000
skip login if cookies are present
7 years ago
Mike Fährmann f105782435
[fireden] add thread extractor
7 years ago
Mike Fährmann c93f7d7496
[archiveofsins] add thread extractor
7 years ago
Mike Fährmann 96e13604da
[archivedmoe] add thread extractor
7 years ago
Mike Fährmann 30d3a5f9b2
support redirects on 4chan archives
7 years ago
Mike Fährmann 98464d1f1b
[loveisover] add thread extractor
7 years ago
Mike Fährmann 47692f28da
[2chan] add thread extractor
7 years ago
Mike Fährmann 3460dc8950
update gallery-dl.conf
7 years ago
Mike Fährmann 9be8f7e106
[deviantart] add "extractor.deviantart.flat" option
7 years ago
Mike Fährmann d075627fd9
[deviantart] support group galleries (#26)
7 years ago
Mike Fährmann b37a62501b
[pixiv] unquote tags
7 years ago