Commit Graph

1261 Commits (c73c2cda50c20c51f8117b04ee76d3724ec9dff8)

Author SHA1 Message Date
Mike Fährmann 80fdb11508
[pixiv] add 'date' metadata field (closes #248)
5 years ago
Mike Fährmann 049e9fd6ce
[twitter] fix pagination end condition
5 years ago
Mike Fährmann 51e0e92429
[deviantart] fix GIF downloads (#242)
5 years ago
Leonardo Taccari f347d2d152 [instagram] Fix for missing `edge_media_to_comment' field and add `date' metadata (#250)
5 years ago
Mike Fährmann 5fd94c6b83
import urllib3 from requests.packages
5 years ago
Mike Fährmann 35f343206c
update default SSL cipher list in urllib3 < 1.25
5 years ago
Mike Fährmann fc5e4f2b21
[hitomi] simplify data extraction code
5 years ago
Mike Fährmann 2756cc8dde
[hitomi] set Referer header (fixes #239)
5 years ago
Mike Fährmann dcc1592dbf
[twitter] add fallback URLs (#237)
5 years ago
Mike Fährmann 1c665fd4bd
[mangoxo] fix login
5 years ago
Mike Fährmann add7e693d0
[tumblr] provide parsed 'date' metadata (#232)
5 years ago
Mike Fährmann 9544683d56
[deviantart] provide 'date' metadata (#232)
5 years ago
Mike Fährmann 0d7e8be987
[dynastyscans] simplify image extractor
5 years ago
Mike Fährmann 9aa0bb5afe
[dynastyscans] encode "[]" in search queries
5 years ago
Mike Fährmann fe849382d8
[komikcast] improve extraction
5 years ago
Mike Fährmann 0318c610dc
[sexcom] add extractor for search results (#147)
5 years ago
Mike Fährmann a247c94c34
[sexcom] add pin and board extractors (#147)
5 years ago
Mike Fährmann 6264a46212
use 'utcfromtimestamp()'
5 years ago
Mike Fährmann d84e7c6861
[twitter] extract 'date' metadata (#224)
5 years ago
Mike Fährmann f2cf1c1d73
use 'text.extract_from()' in a few places
5 years ago
Mike Fährmann e25ebc4bff
don't disable certificate checks anymore
6 years ago
Mike Fährmann 70be494161
[plurk] add a 'comments' options (#212)
6 years ago
Mike Fährmann 0b2ff406f6
[plurk] add timeline- and post-extractors (#212)
6 years ago
Mike Fährmann d6ddb74cde
update test results
6 years ago
Mike Fährmann 87b0929bec
Revert "[flickr] restore image quality"
6 years ago
Mike Fährmann e7cd5510d5
[pixnet] add extractors (closes #177)
6 years ago
Mike Fährmann 155e1faeaf
[imagebam] support galleries with >100 images (fixes #219)
6 years ago
Mike Fährmann 9587aea98f
[deviantart] don't rewrite URLs for newer deviations
6 years ago
Mike Fährmann f2220938cb
[mangoxo] improve channel extraction (#184)
6 years ago
Mike Fährmann d9b94a585d
[mangoxo] add login support (#184)
6 years ago
Mike Fährmann 49a6522c38
ensure consistent headers and params ordering
6 years ago
Mike Fährmann e730fc9045
[twitter] add login support (#214)
6 years ago
Mike Fährmann 2c32dc76cb
[yaplog] update metadata structure (#190)
6 years ago
Mike Fährmann 35919a9bb8
[livedoor] add blog- and post-extractors (#190)
6 years ago
Mike Fährmann 3f513f1056
[flickr] restore image quality
6 years ago
Mike Fährmann 060859cc68
fix URL patterns
6 years ago
Mike Fährmann 13526f3624
[yaplog] fix archive_id and posts with more than 24 images
6 years ago
Mike Fährmann 2ff043edfa
[yaplog] add user- and post-extractors (#190)
6 years ago
Mike Fährmann 790f15a56f
[photobucket] use HTTPS
6 years ago
Mike Fährmann 6da665f32e
[mangoxo] add album- and channel-extractors (closes #184)
6 years ago
Mike Fährmann 21e80d60ff
[wikiart] docstring fixes
6 years ago
Mike Fährmann c70b21248d
[wikiart] add extractors (#179)
6 years ago
Mike Fährmann 0f02e85961
[reactor] use "/full/" URLs (closes #210)
6 years ago
Mike Fährmann 17c11393f5
[weibo] allow user-ids in status URLs
6 years ago
Mike Fährmann ec88ff1562
[flickr] relax unit test results
6 years ago
Mike Fährmann 00d604cafb
[luscious] fix SearchExtractor URL-pattern
6 years ago
Mike Fährmann 1384ebf907
[luscious] fix metadata extraction
6 years ago
Mike Fährmann 5398bfbd69
[exhentai] fix search and favorite extraction
6 years ago
Leonardo Taccari 790b1336a6 [instagram] Add support for hashtags
6 years ago
Mike Fährmann a9bdd0f153
[instagram] fix syntax for Python 3.4
6 years ago
Mike Fährmann eacebf41e4
fix typo in README
6 years ago
Leonardo Taccari 1e38f65996 [instagram] Add support for GraphSidecar media types (#201)
6 years ago
Mike Fährmann 6ba67b0537
[hypnohub] add extractors (closes #196)
6 years ago
Mike Fährmann fe27154a10
[komikcast] fix extraction
6 years ago
Mike Fährmann 5ec55ec4fc
[deviantart] improve URLs for non-downloadable deviations
6 years ago
Mike Fährmann c7a6b0ed90
[deviantart] add 'metadata' option (#189)
6 years ago
Mike Fährmann 8d96a8ce4c
[500px] add user-, gallery-, and image-extractors (#185)
6 years ago
Mike Fährmann d0f88c35be
[komikcast] fix extraction
6 years ago
Mike Fährmann 6277a739e4
[35photo] add user-, genre-, and image-extractors (#162)
6 years ago
Mike Fährmann fb14f80d62
[tumblr] fix avatar URLs for non-OAuth1.0 calls (closes #193)
6 years ago
Mike Fährmann 973a720a7a
[weibo] fix unit test URL patterns
6 years ago
Mike Fährmann a2af2d2965
adjust cache maxage values
6 years ago
Mike Fährmann f612284d24
cache cfclearance cookies
6 years ago
Mike Fährmann 591a07f20c
small code changes and cleanups
6 years ago
Mike Fährmann 6f57d44ec2
[seaotterscans] remove extractor
6 years ago
Mike Fährmann 6dae6bee37
automatically detect and bypass cloudflare challenge pages
6 years ago
Mike Fährmann 25aaf55514
[smugmug] improve format selection (closes #183)
6 years ago
Mike Fährmann 7c1cb923a4
[myportfolio] replace unit test
6 years ago
Mike Fährmann fffbfd3dce
[imgspice] fix extraction
6 years ago
Mike Fährmann 4ca4631bad
simplify auto-disabling certificate verification
6 years ago
Mike Fährmann 09d872a2b1
generalize extractor creation code
6 years ago
Mike Fährmann 8dc6be246b
[shopify] add custom retry logic for 430 status codes (#175)
6 years ago
Mike Fährmann 0887fb61f4
[komikcast] update test results
6 years ago
Mike Fährmann 976ccb267f
[myportfolio] combine gallery and user extractors
6 years ago
Mike Fährmann efd104e45e
[instagram] reject more non-user URLs (#180)
6 years ago
HRXN 56e0e92e0d [shopify] cosmetic changes in shopify.py (#181)
6 years ago
Mike Fährmann 9c0e2f294b
[shopify] add generic collection and product extractors (#175)
6 years ago
Mike Fährmann 26c4365baa
adjust metadata types for GalleryExtractors
6 years ago
Mike Fährmann 13e0f2a78f
[deviantart] add 'scraps' extractor (closes #168)
6 years ago
Mike Fährmann 3ea11f5d5e
[nhentai] rewrite
6 years ago
Mike Fährmann 3595cd582f
use GalleryExtractor as common base class
6 years ago
Mike Fährmann a138d5873d
[hentaifoundry] improve/fix extraction
6 years ago
Mike Fährmann 280531c8ff
[pururin] add gallery extractor (closes #174)
6 years ago
Mike Fährmann 3159dd79d5
[seiga] use HTTPS
6 years ago
Mike Fährmann f6734142ee
[komikcast] remove 'width' and 'height' info
6 years ago
Mike Fährmann d0059cab79
[tumblr] check for null URLs (closes #165)
6 years ago
Mike Fährmann e687a6095e
[luscious] raise exception if album is not available
6 years ago
Mike Fährmann 22d3a2fcc8
[artstation] add extractor for artwork listings (#80)
6 years ago
Mike Fährmann 937a802b49
[dynastyscans] add extractors for images and image searches
6 years ago
Mike Fährmann b09a8184ca
move TestJob into test module; test _extractor values
6 years ago
Mike Fährmann 19860655a3
[weibo] add 'user' and 'status' extractors
6 years ago
Mike Fährmann f8782c05f2
[paheal] rename "tags" to "search_tags"
6 years ago
Mike Fährmann c7b8421333
[deviantart] don't match 'www' as a potential username
6 years ago
Mike Fährmann 5530871b5a
change results of text.nameext_from_url()
6 years ago
Mike Fährmann 32edf4fc7b
add '_extractor' info to manga extractor results
6 years ago
Mike Fährmann 89ee8cd7e4
filter "private" kwdict entries
6 years ago
Mike Fährmann 61741d7333
provide type information for Queue messages
6 years ago
Mike Fährmann 2e516a1e3e
store the full original URL in Extractor.url
6 years ago
Mike Fährmann 580baef72c
change Chapter and MangaExtractor classes
6 years ago
Mike Fährmann 4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago