Commit Graph

1828 Commits (627d2141d3b3de7beb344c2658c576666a37066e)

Author SHA1 Message Date
Mike Fährmann db35c3b581
[directlink] separate filenames from paths
5 years ago
Mike Fährmann 41a3169c67
[foolfuuka] use '{extension}' in default filename format
5 years ago
Mike Fährmann e9aed62c91
[imgur] unescape image titles
5 years ago
Mike Fährmann 2c332edaad
[plurk] fix comment pagination
5 years ago
Mike Fährmann a3fa45bbb1
[behance] get images from 'media_collection' modules
5 years ago
Mike Fährmann 359c3bc1c5
[deviantart] revert to getting download URLs from OAuth API
5 years ago
Mike Fährmann 42b9633c7e
update test results
5 years ago
Mike Fährmann b28bd1c73e
[bobx] set generated session cookie (closes #482)
5 years ago
Mike Fährmann ae09f87602
improve SharedConfigMixin config lookups
5 years ago
Mike Fährmann f5604492c3
update interface of config functions
5 years ago
Mike Fährmann 4ca883c66f
[smugmug] replace test for custom URLs
5 years ago
Mike Fährmann d45fabb79d
match user profile handling on deviantart and newgrounds
5 years ago
Mike Fährmann ea80dadd09
[deviantart] restore archive keys
5 years ago
Mike Fährmann ea094692c8
[vsco] fix collection extraction (#480)
5 years ago
Mike Fährmann 490831f84a
[bobx] "fix" image download URLs
5 years ago
Mike Fährmann 978cb03f81
update misc test results
5 years ago
Mike Fährmann fca87974fe
[sexcom] fix video downloads by sending specific Referer headers
5 years ago
Mike Fährmann edc080468d
[instagram] make 'video_url' fields optional (fixes #479)
5 years ago
Mike Fährmann 9fdc5e74cb
[deviantart] ensure consistent username capitalization (#455)
5 years ago
Mike Fährmann b1f0609de5
[newgrounds] rewrite (#394)
5 years ago
Mike Fährmann 3ece3976ae
[newgrounds] implement login support (#394)
5 years ago
Mike Fährmann 3a07c06865
[newgrounds] update
5 years ago
Mike Fährmann 5513b66eb0
[vsco] fix user profile extraction
5 years ago
Mike Fährmann abfcb356fc
[flickr] support 3k, 4k, 5k, and 6k photo sizes (closes #472)
5 years ago
Mike Fährmann 521fcd2eb9
[imgbb] fix error in galleries without user info (closes #471)
5 years ago
Mike Fährmann 8061263d4c
[imgbb] improve pagination logic
5 years ago
Mike Fährmann da6789b2b0
disable unique archive id checks for some tests
5 years ago
Mike Fährmann b0197098e6
[imgur] get title from webpage if missing in API response
5 years ago
Mike Fährmann dd5d2b2eac
[deviantart] add user profile extractor (#377, #419)
5 years ago
Mike Fährmann a437e78620
[deviantart] minimize cookie usage during scraps extraction
5 years ago
Mike Fährmann 1a197d2195
store the original cookiejar as Extractor._cookiejar
5 years ago
Mike Fährmann de83ae4576
make 'method' argument of Extractor.request keyword-only
5 years ago
Mike Fährmann 4325695d74
[luscious] expand GraphQL queries
5 years ago
Mike Fährmann 94dbdbf506
[nijie] change default filename format
5 years ago
Mike Fährmann c18fadc221
[instagram] extract videos without youtube-dl (#391)
5 years ago
Mike Fährmann f15eedb634
[sexcom] set Referer header for file downloads (closes #464)
5 years ago
Mike Fährmann 2a3bd4e3c7
rename extractor classes starting with a digit
5 years ago
Mike Fährmann b3b9da6d74
[photobucket] replace test URL
5 years ago
Mike Fährmann 64786363be
[4chan] simplify
5 years ago
Mike Fährmann 557e2c018b
[8chan] remove module
5 years ago
Mike Fährmann e14782a948
[instagram] simplify graphql extraction for post pages
5 years ago
Mike Fährmann c01ff78467
[twitter] extend 'videos' option to force extraction with ytdl
5 years ago
Mike Fährmann f8ac67ce50
[hitomi] extend URL pattern + follow redirects
5 years ago
Mike Fährmann e877ca97c3
[naver] adjust directory names and metadata structure
5 years ago
Mike Fährmann 702f2fbd1f
[issuu] add publication and user extractors (#413)
5 years ago
Mike Fährmann 8361d874d7
[hitomi] fix extraction
5 years ago
Mike Fährmann 5fa6ff04dd
[instagram] extract '__additionalDataLoaded' (#391)
5 years ago
Mike Fährmann 87a87bff7e
[simplyhentai] fix image URLs
5 years ago
Mike Fährmann 4409d00141
embed error messages in StopExtraction exceptions
5 years ago
Mike Fährmann d44f790e81
adjust output for HTTP status related errors
5 years ago
Mike Fährmann 109718a5e3
[blogger] add blog and post extractors (closes #364)
5 years ago
Mike Fährmann 49a6b1b6c0
[twitter] extract video stream info without youtube-dl (#452)
5 years ago
Mike Fährmann 9f0dbf2a72
[twitter] raise proper exception for protected Tweets
5 years ago
Mike Fährmann 6e08ada4fe
[luscious] simplify some metadata entries
5 years ago
Mike Fährmann 9e3a8607ee
[deviantart] update usernames (#455)
5 years ago
Mike Fährmann 2eb38810c5
[twitter] fix image extraction when logged in (#452)
5 years ago
Mike Fährmann 8f38a35b91
[imgur] use API with "public" client_id (#446)
5 years ago
Mike Fährmann b23c822b23
[luscious] use GraphQL
5 years ago
Mike Fährmann ef17d94469
update test results
5 years ago
Mike Fährmann 2057c6ba29
[naver] add blog and post extractors (closes #447)
5 years ago
Mike Fährmann 389d2d7e38
implement 'cookies-update' option (#445)
5 years ago
Mike Fährmann fbc0a6a059
[nozomi] skip unavailable posts (#388)
5 years ago
Mike Fährmann ae98dbcbb3
[nozomi] implement searching for negated terms (#388)
5 years ago
Mike Fährmann 1c03a389df
[twitter] small improvements to search extractor
5 years ago
Mike Fährmann c3042978b8
[deviantart] match "/gallery/all" (closes #449)
5 years ago
Alice bcddcca6db Add search downloading to twitter.py (#448)
5 years ago
Mike Fährmann 1693d97bd3
update extractor class hierarchies
5 years ago
Mike Fährmann 7ebd984e8d
[imgur] print error message if no JSON data is found (#446)
5 years ago
Mike Fährmann 5882b00f2f
[imgur] implement login support (#446)
5 years ago
Mike Fährmann 91643ca54b
[nozomi] add search extractor (#388)
5 years ago
Mike Fährmann df2b3c6888
restore OAuth2 authentication error messages
5 years ago
Mike Fährmann 6779512fc7
[nozomi] add post and tag extractors (#388)
5 years ago
Mike Fährmann 6abe5f5bbb
[patreon] fix pagination (#444)
5 years ago
Mike Fährmann d4ffd6c952
[yaplog] improve metadata extraction (#443)
5 years ago
Mike Fährmann 15af2f8464
[hitomi] fallback to /reader/ page if main page returns 404
5 years ago
Mike Fährmann dc6ad81e2e
[yaplog] prevent crash on empty posts (#443)
5 years ago
Mike Fährmann 94eb7c6cad
[deviantart] fix sta.sh extraction (436)
5 years ago
Mike Fährmann 27b5b2497e
[deviantart] fix download URLs (#436)
5 years ago
Mike Fährmann 93aac8dfea
[yaplog] fix incomplete image URLs (#443)
5 years ago
Mike Fährmann a782b009b8
[yaplog] match blog names with '-' (#443)
5 years ago
Mike Fährmann cf5e716b9d
[hitomi] fix image URLs
5 years ago
Mike Fährmann 5a54efa025
[xhamster] unescape 'title' and 'description'
5 years ago
Mike Fährmann 1b9bf4fc6e
[behance] fix 'tags' extraction
5 years ago
Mike Fährmann bb97e87989
[komikcast] ignore banner image
5 years ago
Mike Fährmann 0ff90a3f7d
[gfycat] include title in default filenames (closes #434)
5 years ago
Mike Fährmann de4e2029d1
[nsfwalbum] update test album
5 years ago
Mike Fährmann 1faec285d1
[nijie] further improvements (closes #423)
5 years ago
Mike Fährmann 6d0a533d68
[reddit] respect 'comments:0' for single submissions (#429)
5 years ago
Mike Fährmann 803d8f814e
[oauth] update scope for reddit tokens (#428)
5 years ago
Mike Fährmann 46ba173ded
[reddit] fix documentation inconsistencies (closes #429)
5 years ago
Mike Fährmann 20eb6c401f
[nijie] improvements and fixes (#423)
5 years ago
Mike Fährmann d1ea08c67d
[weibo] fixes and improvements
5 years ago
Mike Fährmann 38d97f3da6
[deviantart] add debug message about API credentials (#424)
5 years ago
Mike Fährmann 80c2104fb5
[deviantart] fix 429 handling if 'fatal' is False (closes #424)
5 years ago
Mike Fährmann 913460240d
[reddit] fix 'extractor.blacklist()' arguments
5 years ago
Mike Fährmann 22bac14452
[pixiv] match '/artworks/' URLs
5 years ago
Mike Fährmann 66cac207ac
[twitter] match and use 'i/web' status URLs
5 years ago
Mike Fährmann 946f2751e2
[reddit] add 'user' extractor (closes #350)
5 years ago
Mike Fährmann c14abb9fb8
[reddit] improve URL parameter handling for subreddit links
5 years ago
Mike Fährmann ee8b654464
[instagram] implement 'highlights' option (closes #329)
5 years ago
Mike Fährmann f63c3097a9
[instagram] rework some code paths
5 years ago
Mike Fährmann 4330133114
[imgur] add 'favorite' extractor (closes #420)
5 years ago
Mike Fährmann ee5e20221f
[imgth] fix image URLs
5 years ago
Mike Fährmann b63b126808
[hentaicafe] extend URL pattern
5 years ago
Mike Fährmann d780f0357e
[imgur] add user extractor
5 years ago
Mike Fährmann 11ea689013
[simplyhentai] fix image and video URLs
5 years ago
Mike Fährmann 15632a1570
[tsumino] fix extraction
5 years ago
Mike Fährmann d92802fd37
[luscious] fix detection of unavailable galleries
5 years ago
Mike Fährmann f99da2b866
[imgbb] detect invalid album and user profile links
5 years ago
Mike Fährmann 01bc7adadc
[deviantart] improve journal detection (#419)
5 years ago
Mike Fährmann 6e12907de6
[deviantart] improve handling of private deviations (#414)
5 years ago
Mike Fährmann e7690ac694
[vsco] update URL pattern (closes #410)
5 years ago
Mike Fährmann 1848788970
update test results etc
5 years ago
Mike Fährmann d5fbb2d9de
[tumblr] ignore audio links from Spotify etc.
5 years ago
Mike Fährmann b1cddce865
Revert "[simplyhentai] fix extraction; remove image+video extractors"
5 years ago
Mike Fährmann d23660c04d
[hentaicafe] restore default 'request()' behavior
5 years ago
Mike Fährmann 9ae58a6b3e
[exhentai] update image limit checks
5 years ago
Mike Fährmann 6fe9a134bf
[lineblog] add blog and post extractors (closes #404)
5 years ago
Mike Fährmann 4e8a548a61
[livedoor] update metadata extraction
5 years ago
Mike Fährmann f9285f99e6
[pixiv] fix authentication
5 years ago
Mike Fährmann 6f3df3999a
[fuskator] add gallery and search extractor (closes #407)
5 years ago
Mike Fährmann bc0ca66c99
[twitter] small improvements
5 years ago
Mike Fährmann f02a768b5c
[danbooru] add 'ugoira' option (#406)
5 years ago
Mike Fährmann dedea3b4db
[deviantart] fix journal creation (#400)
5 years ago
Mike Fährmann c6c5cb1898
improve 'deviantart.quality' description
5 years ago
Mike Fährmann efb64ad031
[deviantart] generate filenames (#392, #400)
5 years ago
Mike Fährmann b2151f3928
[seiga] support mobile URLs (closes #401)
5 years ago
Mike Fährmann 20fd2d8450
[flickr] skip unavailable images/videos (fixes #398)
5 years ago
Mike Fährmann 5cc7be2536
[piczel] update and improve
5 years ago
Mike Fährmann 49f6d7176d
[deviantart] restore filenames (#392)
5 years ago
Mike Fährmann 63daa68d67
[deviantart] improvements (#392)
5 years ago
Mike Fährmann d1db5180ab
[simplyhentai] fix extraction; remove image+video extractors
5 years ago
Mike Fährmann 30d6e284b0
[deviantart] use NAPI for artworks and scraps (#392)
5 years ago
Mike Fährmann 7d6af936c5
[imgur] simplify gallery extraction
5 years ago
Mike Fährmann 51d10783fc
[patreon] include image info in API results (#383)
5 years ago
Mike Fährmann 7a5e78741c
[booru] build directory path for each file (#385)
5 years ago
Mike Fährmann b1728f512d
[patreon] support multi image posts and post URLs (#383)
5 years ago
Mike Fährmann c50d60a53d
[reactor] fix image URLs
5 years ago
Mike Fährmann 32447d0d24
[pixiv] simplify default filename format
5 years ago
Mike Fährmann 829b1ccf04
[imgur] distinguish album and gallery URLs (#380)
5 years ago
Mike Fährmann 23251356cb
require 'extension' data for each URL (#382)
5 years ago
Mike Fährmann a67413d64f
[xhamster] use input URL domain
5 years ago
Mike Fährmann 423f68f585
[deviantart] fix scraps extraction (closes #376)
5 years ago
Mike Fährmann 3bf20ffb70
[instagram] add support for story highlights
5 years ago
Mike Fährmann a732e9c430
[instagram] update query hashes and headers
5 years ago
Mike Fährmann 2ccf6a9e35
[instagram] make extractor tests happy (#373)
5 years ago
Leonardo Taccari bc5eaf7746 [instagram] Add support for IGTV (#373)
5 years ago
Mike Fährmann eb7da159e2
[imagebam] update URL test results
5 years ago
Mike Fährmann 189acbeac9
[imgbb] add extractor for individual images (closes #363)
5 years ago
Mike Fährmann ad3ac02fbc
[pixiv] update metadata entries (#366)
5 years ago
Mike Fährmann 1ff4c4ec03
[adultempire] consistent artist order
5 years ago
Leonardo Taccari 2df050e627 [instagram] Add support for stories (#371)
5 years ago
Mike Fährmann f4bc75e854
fix rate limit handling for OAuth APIs (#368)
5 years ago
Mike Fährmann 3957d27d79
[deviantart] add 'quality' option (#369)
5 years ago
Mike Fährmann 64b2935d8e
[pixiv] provide 'filename' and change default filename format
5 years ago
Mike Fährmann fa60109e97
[exhentai] don't use e-hentai.org for exhentai URLs
5 years ago
Mike Fährmann 4a0c98bfc9
miscellaneous fixes and adjustments
5 years ago
Mike Fährmann 2c839f3760
[imgbb] add user extractor + login support (#361)
5 years ago
Mike Fährmann 2153206093
[imgbb] add album extractor (#361)
5 years ago
Mike Fährmann beb4fab2e6
[exhentai] improve limit and error handling (#360)
5 years ago
Mike Fährmann 81b35ed3cb
[exhentai] catch more error states (#356, #360)
5 years ago
Mike Fährmann 6ce22f606b
[exhentai] update login procedure and tests
5 years ago
Mike Fährmann dc73d02d87
[exhentai] always use e-hentai.org as domain + set nw cookie
5 years ago
Mike Fährmann 40637556fa
[ngomik] fix extraction
5 years ago
Mike Fährmann 3969f9cbbd
[behance] fix collection extraction
5 years ago
Mike Fährmann 17a3426845
[gelbooru] enable all content when not using API
5 years ago
Mike Fährmann 279db2c5b2
[vsco] add collection & image extractor + video support (#331)
5 years ago
Mike Fährmann d9d44ad953
[tsumino] update test results
5 years ago
Mike Fährmann 60cf40380a
[vsco] add user extractor (#331)
5 years ago
Mike Fährmann 3fe5ccdfa6
[adultempire] add gallery extractor (closes #340)
5 years ago
Mike Fährmann 5d968412ca
[deviantart] case-insensitive folder name matching (fixes #343)
5 years ago
Mike Fährmann a3c736fedc
[500px] fix extraction
5 years ago
Mike Fährmann 1133b7fcbd
[smugmug] update unit tests
5 years ago
Mike Fährmann 21991acc49
add 'ciphers' option; update default User-Agent
5 years ago
Mike Fährmann 84f4d3bc0b
replace urllib3's default cipher list with Firefox's (#342)
5 years ago
Mike Fährmann feb98cf196
[twitter] improve 'content' formatting; add option (#338)
5 years ago
Mike Fährmann 8d1ae9b715
[tumblr] enable date-min/-max/-format options (#337)
5 years ago
Mike Fährmann 09f37fde39
[reddit] move date-min/-max handling into Extractor class
5 years ago
Mike Fährmann 0151e250f5
[twitter] extract 'content' metadata (closes #333)
5 years ago
Mike Fährmann 56c7a66a4a
detect Cloudflare CAPTCHAs and update cipher list
5 years ago
Mike Fährmann a7b42b37a2
[35photo] fix extraction
5 years ago
Mike Fährmann 04b8d0894a
[newgrounds] improve metadata extraction
5 years ago
Mike Fährmann 12da6bd0c9
[simplyhentai] fix/improve extraction
5 years ago
Mike Fährmann fdec59f8e2
replace extractor.request() 'expect' argument
5 years ago
Mike Fährmann 2ff73873f0
[erolord] add gallery extractor (closes #326)
5 years ago
Mike Fährmann b4da8c5a97
[sexcom] add extractor for related pins (#325)
5 years ago
Mike Fährmann 69997e92db
[sexcom] skip unavailable pins (#325)
5 years ago
Mike Fährmann bc6b0cfddc
[shopify] skip consecutive duplicate products
5 years ago
Mike Fährmann b89f0d8d3c
update extractor result tests
5 years ago
Mike Fährmann 69205df68d
allow '-1' for infinite retries (#300)
5 years ago
Mike Fährmann f7b5c4c3e7
use values of 'retries' options correctly
5 years ago
Mike Fährmann 40da44b17f
Merge branch 'v1.9.0'
5 years ago
Mike Fährmann 7a99e85943
[kissmanga] fix download URLs and file extensions
5 years ago
Mike Fährmann 055102431f
[hitomi] handle Game CG galleries with scenes (fixes #321)
5 years ago
Mike Fährmann a9c89085fb
[instagram] implement login support (#195)
5 years ago
Mike Fährmann 7856e5e7dc
]deviantart] "fix" scraps extraction
5 years ago
Mike Fährmann 082cb24acd
[pururin] fix extraction
5 years ago
Mike Fährmann 98554cbab8
[mangoxo] fix login
5 years ago
Mike Fährmann 108963d138
[imagefap] include Referer headers
5 years ago
Mike Fährmann e314621366
[nsfwalbum] fix default directory_fmt (#287)
5 years ago
Mike Fährmann 18a1f8c6cd
[vanillarock] add post and tag extractors (closes #254)
5 years ago
Mike Fährmann f0c5093812
[nsfwalbum] add album extractor (closes #287)
5 years ago
Mike Fährmann 61e413d85d
[hentaifoundry] stop disabling IPv6 addresses
5 years ago
Mike Fährmann 76ae9957c2
[deviantart] force legacy version for single deviations
5 years ago
Mike Fährmann 520c8ba106
[hentaicafe] extract 'tags' and 'artist' metadata (closes #238)
5 years ago
Mike Fährmann b51baa9a4b
[hitomi] fix empty language detection; parse datetime
5 years ago
Mike Fährmann 258e8b2060
[deviantart] small code improvements
5 years ago
Mike Fährmann a77340c647
[keenspot] fix extraction for "TwoKinds"
5 years ago
Mike Fährmann 03e6876fbe
[instagram] provide 'description' metadata (#310)
5 years ago
Mike Fährmann ec3e8601f1
[slickpic] add user extractor (#249)
5 years ago
Mike Fährmann 97ef416218
[8muses] support multi-page listings (#305)
5 years ago
Mike Fährmann f5961ac968
[deviantart] download deviations with no 'content' field
5 years ago
Mike Fährmann 4e07f99e3e
[mangoxo] change token message level to debug
5 years ago
Mike Fährmann d997c10320
[8muses] add album extractor (#305)
5 years ago
Mike Fährmann e05a96db5e
[deviantart] rename 'stash' to 'extra' (#302)
5 years ago
Mike Fährmann 2184e3a86b
[slickpic] add album extractor (#249)
5 years ago
Mike Fährmann c23bf263fe
[deviantart] rename 'external' to 'stash' (#302)
5 years ago
Mike Fährmann c73c2cda50
[pornhub] add gallery & user extractor (#282)
5 years ago
Mike Fährmann 7c6cb908f9
[xhamster] update test results
5 years ago
Mike Fährmann 2fb85178da
[deviantart] add 'external' option (#302)
5 years ago
Mike Fährmann f85e42cffc
[deviantart] fix --range for deviation & stash extractor
5 years ago
Mike Fährmann 40c7eb3424
[livedoor] improve extraction (fixes #301)
5 years ago
Mike Fährmann 62335b9015
[paheal] adjust test results
5 years ago
Mike Fährmann aa1ca4ed35
[shopify] skip deleted products (#175)
5 years ago
Mike Fährmann 096009367b
[xhamster] add gallery & user extractor (#281)
5 years ago
Mike Fährmann 208202b962
[tumblr] improve error handling (#297)
5 years ago
Mike Fährmann c08c340178
[directlink] make pattern case insensitive (fixes #296)
5 years ago
Mike Fährmann 95b4a53b9c
[keenspot] improve pagination (#223)
5 years ago
Mike Fährmann 731c7cbd5b
[keenspot] support all comics and "random" access (#223)
5 years ago
Mike Fährmann 6a34f4b0c1
skip tests on read timeouts; print list of skipped tests
5 years ago
Mike Fährmann 1c36e65e9b
[exhentai] choose site version depending on input URL (#278)
5 years ago
Mike Fährmann 6da3e21237
[downloader:ytdl] provide 'filename' metadata (closes #291)
5 years ago
Mike Fährmann d33f5a7423
[wallhaven] rewrite
5 years ago
Mike Fährmann 5499934ae2
[ngomik] fix extraction
5 years ago
Mike Fährmann f1893b2b5b
[deviantart] add 'folders' option (#276)
5 years ago
Mike Fährmann c849574def
[keenspot] add comic extractor (#223)
5 years ago
Mike Fährmann 8bd5a19515
[hentainexus] add '_extractor' data
5 years ago
Mike Fährmann 2a085a5e96
[sankakucomplex] fix 'date' values (#258)
5 years ago
Mike Fährmann bcd1801aa8
[sankakucomplex] add 'tag' extractor (#258)
5 years ago
Mike Fährmann 74c2415138
[sankakucomplex] move article extractor to its own module (#258)
5 years ago
Mike Fährmann 4465a3ea68
[kissmanga][readcomiconline] add 'captcha' option (#279)
5 years ago
Mike Fährmann 1e3e15c4f3
[sankaku] add article extractor (#258)
5 years ago
Mike Fährmann 48233f00c0
[readcomiconline] detect 'AreYouHuman' redirects (#279)
5 years ago
Mike Fährmann 1cde38110d
[livedoor] return 'date' as datetime object
5 years ago
Mike Fährmann e88824e1a7
[livedoor] fix adjustments for https:// URLs
5 years ago
Mike Fährmann b3e4664715
[hentainexus] fix extraction
5 years ago
Mike Fährmann 399e8e965a
also update urllib3's cipher list for versions >= 1.25
5 years ago
Mike Fährmann f837ea98cb
[deviantart] don't call 'extend()' on folders (fixes #271)
5 years ago
Mike Fährmann bb32a2d490
[patreon] use file extensions from original filenames (#268)
5 years ago
Mike Fährmann efa805c5d7
[sankaku] update pagination end condition (fixes #265)
5 years ago
Mike Fährmann a4ba34c835
[booru] prevent crash when no tags are present (#259)
5 years ago
Mike Fährmann ca3bad1779
[patreon] small fixes and adjustments (#226)
5 years ago
Leonardo Taccari fb09dd962a [instagram] Fix extraction after `rhx_gis' field removal
5 years ago
Mike Fährmann 7a14aaed7d
[luscious] fix extraction
5 years ago
Mike Fährmann e82cadac61
[patreon] add extractors (#226)
5 years ago
Mike Fährmann 4891f4a328
[hentainexus] add search extractor (#256)
5 years ago
Mike Fährmann c02f12ce2f
avoid Cloudflare CAPTCHAs for OpenSSL < 1.1.1
5 years ago
Mike Fährmann 0b4be57a10
[sankaku] fix error when no tags available (closes #259)
5 years ago
Mike Fährmann 9890bfdf23
[flickr] improve code and metadata
5 years ago
Mike Fährmann aa8e366b90
[luscious] fix tag extraction
5 years ago
Mike Fährmann ba8eb1ffec
[hentainexus] add gallery extractor (#256)
5 years ago
Mike Fährmann b1db194c14
[reactor] update and improve
5 years ago
Mike Fährmann b0e85a42e3
apply workaround from 4736912 in parse_datetime() itself
5 years ago
Mike Fährmann 8de5866fd2
[twitter] replace unit test URLs
5 years ago
Mike Fährmann 74c7304c6b
[newgrounds] extract 'date', 'favorites', and 'score'
5 years ago
Mike Fährmann 4736912d4e
[pixiv] work around strptime limitations in Python < 3.7
5 years ago
Mike Fährmann 1f7fa9dc8e
[exhentai] update data extraction code
5 years ago
Mike Fährmann 80fdb11508
[pixiv] add 'date' metadata field (closes #248)
5 years ago
Mike Fährmann 049e9fd6ce
[twitter] fix pagination end condition
5 years ago
Mike Fährmann 51e0e92429
[deviantart] fix GIF downloads (#242)
5 years ago
Leonardo Taccari f347d2d152 [instagram] Fix for missing `edge_media_to_comment' field and add `date' metadata (#250)
5 years ago
Mike Fährmann 5fd94c6b83
import urllib3 from requests.packages
5 years ago
Mike Fährmann 35f343206c
update default SSL cipher list in urllib3 < 1.25
5 years ago
Mike Fährmann fc5e4f2b21
[hitomi] simplify data extraction code
5 years ago
Mike Fährmann 2756cc8dde
[hitomi] set Referer header (fixes #239)
5 years ago
Mike Fährmann dcc1592dbf
[twitter] add fallback URLs (#237)
5 years ago
Mike Fährmann 1c665fd4bd
[mangoxo] fix login
5 years ago
Mike Fährmann add7e693d0
[tumblr] provide parsed 'date' metadata (#232)
5 years ago
Mike Fährmann 9544683d56
[deviantart] provide 'date' metadata (#232)
5 years ago
Mike Fährmann 0d7e8be987
[dynastyscans] simplify image extractor
5 years ago
Mike Fährmann 9aa0bb5afe
[dynastyscans] encode "[]" in search queries
5 years ago
Mike Fährmann fe849382d8
[komikcast] improve extraction
5 years ago
Mike Fährmann 0318c610dc
[sexcom] add extractor for search results (#147)
5 years ago
Mike Fährmann a247c94c34
[sexcom] add pin and board extractors (#147)
5 years ago
Mike Fährmann 6264a46212
use 'utcfromtimestamp()'
5 years ago
Mike Fährmann d84e7c6861
[twitter] extract 'date' metadata (#224)
5 years ago
Mike Fährmann f2cf1c1d73
use 'text.extract_from()' in a few places
5 years ago
Mike Fährmann e25ebc4bff
don't disable certificate checks anymore
6 years ago
Mike Fährmann 70be494161
[plurk] add a 'comments' options (#212)
6 years ago
Mike Fährmann 0b2ff406f6
[plurk] add timeline- and post-extractors (#212)
6 years ago
Mike Fährmann d6ddb74cde
update test results
6 years ago
Mike Fährmann 87b0929bec
Revert "[flickr] restore image quality"
6 years ago
Mike Fährmann e7cd5510d5
[pixnet] add extractors (closes #177)
6 years ago
Mike Fährmann 155e1faeaf
[imagebam] support galleries with >100 images (fixes #219)
6 years ago
Mike Fährmann 9587aea98f
[deviantart] don't rewrite URLs for newer deviations
6 years ago
Mike Fährmann f2220938cb
[mangoxo] improve channel extraction (#184)
6 years ago
Mike Fährmann d9b94a585d
[mangoxo] add login support (#184)
6 years ago
Mike Fährmann 49a6522c38
ensure consistent headers and params ordering
6 years ago
Mike Fährmann e730fc9045
[twitter] add login support (#214)
6 years ago
Mike Fährmann 2c32dc76cb
[yaplog] update metadata structure (#190)
6 years ago
Mike Fährmann 35919a9bb8
[livedoor] add blog- and post-extractors (#190)
6 years ago
Mike Fährmann 3f513f1056
[flickr] restore image quality
6 years ago
Mike Fährmann 060859cc68
fix URL patterns
6 years ago
Mike Fährmann 13526f3624
[yaplog] fix archive_id and posts with more than 24 images
6 years ago
Mike Fährmann 2ff043edfa
[yaplog] add user- and post-extractors (#190)
6 years ago
Mike Fährmann 790f15a56f
[photobucket] use HTTPS
6 years ago
Mike Fährmann 6da665f32e
[mangoxo] add album- and channel-extractors (closes #184)
6 years ago
Mike Fährmann 21e80d60ff
[wikiart] docstring fixes
6 years ago
Mike Fährmann c70b21248d
[wikiart] add extractors (#179)
6 years ago
Mike Fährmann 0f02e85961
[reactor] use "/full/" URLs (closes #210)
6 years ago
Mike Fährmann 17c11393f5
[weibo] allow user-ids in status URLs
6 years ago
Mike Fährmann ec88ff1562
[flickr] relax unit test results
6 years ago
Mike Fährmann 00d604cafb
[luscious] fix SearchExtractor URL-pattern
6 years ago
Mike Fährmann 1384ebf907
[luscious] fix metadata extraction
6 years ago
Mike Fährmann 5398bfbd69
[exhentai] fix search and favorite extraction
6 years ago
Leonardo Taccari 790b1336a6 [instagram] Add support for hashtags
6 years ago
Mike Fährmann a9bdd0f153
[instagram] fix syntax for Python 3.4
6 years ago
Mike Fährmann eacebf41e4
fix typo in README
6 years ago
Leonardo Taccari 1e38f65996 [instagram] Add support for GraphSidecar media types (#201)
6 years ago
Mike Fährmann 6ba67b0537
[hypnohub] add extractors (closes #196)
6 years ago
Mike Fährmann fe27154a10
[komikcast] fix extraction
6 years ago
Mike Fährmann 5ec55ec4fc
[deviantart] improve URLs for non-downloadable deviations
6 years ago
Mike Fährmann c7a6b0ed90
[deviantart] add 'metadata' option (#189)
6 years ago
Mike Fährmann 8d96a8ce4c
[500px] add user-, gallery-, and image-extractors (#185)
6 years ago
Mike Fährmann d0f88c35be
[komikcast] fix extraction
6 years ago
Mike Fährmann 6277a739e4
[35photo] add user-, genre-, and image-extractors (#162)
6 years ago
Mike Fährmann fb14f80d62
[tumblr] fix avatar URLs for non-OAuth1.0 calls (closes #193)
6 years ago
Mike Fährmann 973a720a7a
[weibo] fix unit test URL patterns
6 years ago
Mike Fährmann a2af2d2965
adjust cache maxage values
6 years ago
Mike Fährmann f612284d24
cache cfclearance cookies
6 years ago
Mike Fährmann 591a07f20c
small code changes and cleanups
6 years ago
Mike Fährmann 6f57d44ec2
[seaotterscans] remove extractor
6 years ago
Mike Fährmann 6dae6bee37
automatically detect and bypass cloudflare challenge pages
6 years ago
Mike Fährmann 25aaf55514
[smugmug] improve format selection (closes #183)
6 years ago
Mike Fährmann 7c1cb923a4
[myportfolio] replace unit test
6 years ago
Mike Fährmann fffbfd3dce
[imgspice] fix extraction
6 years ago
Mike Fährmann 4ca4631bad
simplify auto-disabling certificate verification
6 years ago
Mike Fährmann 09d872a2b1
generalize extractor creation code
6 years ago
Mike Fährmann 8dc6be246b
[shopify] add custom retry logic for 430 status codes (#175)
6 years ago
Mike Fährmann 0887fb61f4
[komikcast] update test results
6 years ago
Mike Fährmann 976ccb267f
[myportfolio] combine gallery and user extractors
6 years ago
Mike Fährmann efd104e45e
[instagram] reject more non-user URLs (#180)
6 years ago
HRXN 56e0e92e0d [shopify] cosmetic changes in shopify.py (#181)
6 years ago
Mike Fährmann 9c0e2f294b
[shopify] add generic collection and product extractors (#175)
6 years ago
Mike Fährmann 26c4365baa
adjust metadata types for GalleryExtractors
6 years ago
Mike Fährmann 13e0f2a78f
[deviantart] add 'scraps' extractor (closes #168)
6 years ago
Mike Fährmann 3ea11f5d5e
[nhentai] rewrite
6 years ago
Mike Fährmann 3595cd582f
use GalleryExtractor as common base class
6 years ago
Mike Fährmann a138d5873d
[hentaifoundry] improve/fix extraction
6 years ago
Mike Fährmann 280531c8ff
[pururin] add gallery extractor (closes #174)
6 years ago
Mike Fährmann 3159dd79d5
[seiga] use HTTPS
6 years ago
Mike Fährmann f6734142ee
[komikcast] remove 'width' and 'height' info
6 years ago
Mike Fährmann d0059cab79
[tumblr] check for null URLs (closes #165)
6 years ago
Mike Fährmann e687a6095e
[luscious] raise exception if album is not available
6 years ago
Mike Fährmann 22d3a2fcc8
[artstation] add extractor for artwork listings (#80)
6 years ago
Mike Fährmann 937a802b49
[dynastyscans] add extractors for images and image searches
6 years ago
Mike Fährmann b09a8184ca
move TestJob into test module; test _extractor values
6 years ago
Mike Fährmann 19860655a3
[weibo] add 'user' and 'status' extractors
6 years ago
Mike Fährmann f8782c05f2
[paheal] rename "tags" to "search_tags"
6 years ago
Mike Fährmann c7b8421333
[deviantart] don't match 'www' as a potential username
6 years ago
Mike Fährmann 5530871b5a
change results of text.nameext_from_url()
6 years ago
Mike Fährmann 32edf4fc7b
add '_extractor' info to manga extractor results
6 years ago
Mike Fährmann 89ee8cd7e4
filter "private" kwdict entries
6 years ago
Mike Fährmann 61741d7333
provide type information for Queue messages
6 years ago
Mike Fährmann 2e516a1e3e
store the full original URL in Extractor.url
6 years ago
Mike Fährmann 580baef72c
change Chapter and MangaExtractor classes
6 years ago
Mike Fährmann 4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann ade86da7a1
[tsumino] replace test
6 years ago
Mike Fährmann 1f3422c28b
[mangahere] fix extraction
6 years ago
Mike Fährmann 84ae72b8d8
[ngomik] fix extraction
6 years ago
Mike Fährmann 02d733d219
[simplyhentai] fix and improve tag extraction
6 years ago
Mike Fährmann 3a0b4af744
[seiga] recognize /thumb/ URLs
6 years ago
Mike Fährmann 8fc6fbfa34
[artstation] recognize shortened project URLs
6 years ago
Mike Fährmann 9a9cd32461
implement alternative constructor for extractors
6 years ago
Mike Fährmann abbd45d0f4
update handling of extractor URL patterns
6 years ago
Mike Fährmann 6284731107
simplify extractor constants
6 years ago
Mike Fährmann 34bab080ae
rewrite URL patterns to use only 1 per extractor
6 years ago
Mike Fährmann 0e46db6f45
rename some base classes
6 years ago
Mike Fährmann 793b24e513
[imagehosts] fix and improve various extractors
6 years ago
Mike Fährmann bc0951d974
allow for simplified test data structures
6 years ago
Mike Fährmann 050bc1aa4a
[reactor] simplify tests
6 years ago
Mike Fährmann 2f3a021d72
[hentaicafe] restore functionality
6 years ago
Mike Fährmann 347398f692
fix various tests
6 years ago
Mike Fährmann 00dc37ccbf
replace AsynchronousMixin Extractor with a Mixin
6 years ago
Mike Fährmann 4d656a81ca
replace SharedConfigExtractor class with a Mixin
6 years ago
Mike Fährmann ccb95d0ba4
[mastodon] changes/improvements based on foolfuuka/-slide
6 years ago
Mike Fährmann 12ff750111
[foolfuuka] smaller code changes and updates
6 years ago
Mike Fährmann e1bf3b225e
[foolslide] dynamically generate extractor classes
6 years ago
Mike Fährmann 58a9eede38
[foolfuuka] dynamically generate extractor classes
6 years ago
Mike Fährmann 22d7a783d5
update extraction result tests
6 years ago
Mike Fährmann 197d0e99a4
[tsumino] more useful error message (#161)
6 years ago
Mike Fährmann d36ec51e5a
[tsumino] add extractor for search results (#161)
6 years ago
Mike Fährmann 1c1367ec5b
[behance] fix empty docstring
6 years ago
Mike Fährmann 45e529ab91
[behance] fix extraction
6 years ago
Mike Fährmann bfbbac4495
[tsumino] add login capabilities (#161)
6 years ago
Mike Fährmann dd358b4564
improve cookie handling during logins
6 years ago
Mike Fährmann 6126615698
update URLs for supportedsites.rst
6 years ago
Mike Fährmann 80a75a1ecf
[tsumino] add gallery extractor (#161)
6 years ago
Mike Fährmann 2d2953a5bf
add 'text.parse_float()' + cleanup in text.py
6 years ago
Mike Fährmann 0c32dc5858
[hentaifox] add extractor for search results (#160)
6 years ago