Commit Graph

2102 Commits (f6c5edb76be125da0bd837a3f79affed8481b015)

Author SHA1 Message Date
Mike Fährmann ea80dadd09
[deviantart] restore archive keys
5 years ago
Mike Fährmann 3fc1e12949
[postprocessor:metadata] filter private entries
5 years ago
Mike Fährmann ea094692c8
[vsco] fix collection extraction (#480)
5 years ago
Mike Fährmann 490831f84a
[bobx] "fix" image download URLs
5 years ago
Mike Fährmann 978cb03f81
update misc test results
5 years ago
Mike Fährmann fca87974fe
[sexcom] fix video downloads by sending specific Referer headers
5 years ago
Mike Fährmann bbbeff4c41
[downloader.http] implement file-specific HTTP headers
5 years ago
Mike Fährmann edc080468d
[instagram] make 'video_url' fields optional (fixes #479)
5 years ago
Mike Fährmann 9fdc5e74cb
[deviantart] ensure consistent username capitalization (#455)
5 years ago
Mike Fährmann b1f0609de5
[newgrounds] rewrite (#394)
5 years ago
Mike Fährmann 3ece3976ae
[newgrounds] implement login support (#394)
5 years ago
Mike Fährmann 3a07c06865
[newgrounds] update
5 years ago
Mike Fährmann 5513b66eb0
[vsco] fix user profile extraction
5 years ago
Mike Fährmann abfcb356fc
[flickr] support 3k, 4k, 5k, and 6k photo sizes (closes #472)
5 years ago
Mike Fährmann 521fcd2eb9
[imgbb] fix error in galleries without user info (closes #471)
5 years ago
Mike Fährmann 8061263d4c
[imgbb] improve pagination logic
5 years ago
Mike Fährmann da6789b2b0
disable unique archive id checks for some tests
5 years ago
Mike Fährmann 67e54ed8ea
release version 1.11.1
5 years ago
Mike Fährmann ce98a86c0e
fix data file inclusion in source distributions
5 years ago
Mike Fährmann 6c86fbfe2a
release version 1.11.0
5 years ago
Mike Fährmann 94a94f3b86
miscellaneous stuff
5 years ago
Mike Fährmann b0197098e6
[imgur] get title from webpage if missing in API response
5 years ago
Mike Fährmann dd5d2b2eac
[deviantart] add user profile extractor (#377, #419)
5 years ago
Mike Fährmann a437e78620
[deviantart] minimize cookie usage during scraps extraction
5 years ago
Mike Fährmann 1a197d2195
store the original cookiejar as Extractor._cookiejar
5 years ago
Mike Fährmann de83ae4576
make 'method' argument of Extractor.request keyword-only
5 years ago
Mike Fährmann a5be08a830
[downloader:ytdl] forward proxy settings
5 years ago
Mike Fährmann 4325695d74
[luscious] expand GraphQL queries
5 years ago
Mike Fährmann 94dbdbf506
[nijie] change default filename format
5 years ago
Mike Fährmann 9e88e7a344
[postprocessor:exec] improve (#421, #413)
5 years ago
Mike Fährmann c18fadc221
[instagram] extract videos without youtube-dl (#391)
5 years ago
Mike Fährmann f15eedb634
[sexcom] set Referer header for file downloads (closes #464)
5 years ago
Mike Fährmann 2a3bd4e3c7
rename extractor classes starting with a digit
5 years ago
Mike Fährmann b3b9da6d74
[photobucket] replace test URL
5 years ago
Mike Fährmann 64786363be
[4chan] simplify
5 years ago
Mike Fährmann 557e2c018b
[8chan] remove module
5 years ago
Mike Fährmann e14782a948
[instagram] simplify graphql extraction for post pages
5 years ago
Mike Fährmann c01ff78467
[twitter] extend 'videos' option to force extraction with ytdl
5 years ago
Mike Fährmann f8ac67ce50
[hitomi] extend URL pattern + follow redirects
5 years ago
Mike Fährmann e877ca97c3
[naver] adjust directory names and metadata structure
5 years ago
Mike Fährmann 702f2fbd1f
[issuu] add publication and user extractors (#413)
5 years ago
Mike Fährmann 8361d874d7
[hitomi] fix extraction
5 years ago
Mike Fährmann 5fa6ff04dd
[instagram] extract '__additionalDataLoaded' (#391)
5 years ago
Mike Fährmann 5af291ba5c
include failed downloads and child extractors in exit status
5 years ago
Mike Fährmann 322c2e7ed4
renaming variables
5 years ago
Mike Fährmann 87a87bff7e
[simplyhentai] fix image URLs
5 years ago
Mike Fährmann 4409d00141
embed error messages in StopExtraction exceptions
5 years ago
Mike Fährmann d5e3910270
adjust 'util.raises()'
5 years ago
Mike Fährmann d44f790e81
adjust output for HTTP status related errors
5 years ago
Mike Fährmann 03e0cec715
return with non-zero exit status on error
5 years ago
Mike Fährmann c887493a80
overhaul exception stuff
5 years ago
Mike Fährmann 109718a5e3
[blogger] add blog and post extractors (closes #364)
5 years ago
Mike Fährmann 244d396b0b
add '--ugoira-conv-lossless' command-line option (#432)
5 years ago
Mike Fährmann 49a6b1b6c0
[twitter] extract video stream info without youtube-dl (#452)
5 years ago
Mike Fährmann 9f0dbf2a72
[twitter] raise proper exception for protected Tweets
5 years ago
Mike Fährmann 083e14ad9a
[downloader:ytdl] add data from '_ytdl_extra' to info_dicts
5 years ago
Mike Fährmann 6e08ada4fe
[luscious] simplify some metadata entries
5 years ago
Mike Fährmann 9e3a8607ee
[deviantart] update usernames (#455)
5 years ago
Mike Fährmann 2eb38810c5
[twitter] fix image extraction when logged in (#452)
5 years ago
Mike Fährmann 8f38a35b91
[imgur] use API with "public" client_id (#446)
5 years ago
Mike Fährmann b23c822b23
[luscious] use GraphQL
5 years ago
Mike Fährmann ef17d94469
update test results
5 years ago
Mike Fährmann 2057c6ba29
[naver] add blog and post extractors (closes #447)
5 years ago
Mike Fährmann 389d2d7e38
implement 'cookies-update' option (#445)
5 years ago
Mike Fährmann fbc0a6a059
[nozomi] skip unavailable posts (#388)
5 years ago
Mike Fährmann ae98dbcbb3
[nozomi] implement searching for negated terms (#388)
5 years ago
Mike Fährmann 1c03a389df
[twitter] small improvements to search extractor
5 years ago
Mike Fährmann c3042978b8
[deviantart] match "/gallery/all" (closes #449)
5 years ago
Alice bcddcca6db
Add search downloading to twitter.py (#448)
5 years ago
Mike Fährmann 1693d97bd3
update extractor class hierarchies
5 years ago
Mike Fährmann 7ebd984e8d
[imgur] print error message if no JSON data is found (#446)
5 years ago
Mike Fährmann 5882b00f2f
[imgur] implement login support (#446)
5 years ago
Mike Fährmann 91643ca54b
[nozomi] add search extractor (#388)
5 years ago
Mike Fährmann df2b3c6888
restore OAuth2 authentication error messages
5 years ago
Mike Fährmann 6779512fc7
[nozomi] add post and tag extractors (#388)
5 years ago
Mike Fährmann 6abe5f5bbb
[patreon] fix pagination (#444)
5 years ago
Mike Fährmann ff1e4a86aa
release version 1.10.6
5 years ago
Mike Fährmann d4ffd6c952
[yaplog] improve metadata extraction (#443)
5 years ago
Mike Fährmann 15af2f8464
[hitomi] fallback to /reader/ page if main page returns 404
5 years ago
Mike Fährmann 8af59a4bba
fix & update docs
5 years ago
Mike Fährmann dc6ad81e2e
[yaplog] prevent crash on empty posts (#443)
5 years ago
Mike Fährmann 94eb7c6cad
[deviantart] fix sta.sh extraction (436)
5 years ago
Mike Fährmann 1032cfa34b
[downloader:http] extend mimetype map with archive formats
5 years ago
Mike Fährmann 27b5b2497e
[deviantart] fix download URLs (#436)
5 years ago
Mike Fährmann 93aac8dfea
[yaplog] fix incomplete image URLs (#443)
5 years ago
Mike Fährmann a782b009b8
[yaplog] match blog names with '-' (#443)
5 years ago
Mike Fährmann cf5e716b9d
[hitomi] fix image URLs
5 years ago
Mike Fährmann ad81c07204
[postprocessor] match logger names of downloader modules
5 years ago
Mike Fährmann 03bc8adfc7
[postprocessor:exec] run after file moved to target location
5 years ago
Mike Fährmann 35958bebd4
[postprocessor:exec] fix filename quoting on Windows (#421)
5 years ago
Mike Fährmann b06c372e4d
[postprocessor:exec] improve; add command-line option (#421)
5 years ago
Mike Fährmann 5a54efa025
[xhamster] unescape 'title' and 'description'
5 years ago
Mike Fährmann 1b9bf4fc6e
[behance] fix 'tags' extraction
5 years ago
Mike Fährmann bb97e87989
[komikcast] ignore banner image
5 years ago
Mike Fährmann 0ff90a3f7d
[gfycat] include title in default filenames (closes #434)
5 years ago
Mike Fährmann fabdc3b0c6
release version 1.10.5
5 years ago
Mike Fährmann de4e2029d1
[nsfwalbum] update test album
5 years ago
Mike Fährmann 1faec285d1
[nijie] further improvements (closes #423)
5 years ago
Mike Fährmann 6d0a533d68
[reddit] respect 'comments:0' for single submissions (#429)
5 years ago
Mike Fährmann 803d8f814e
[oauth] update scope for reddit tokens (#428)
5 years ago
Mike Fährmann 46ba173ded
[reddit] fix documentation inconsistencies (closes #429)
5 years ago
Mike Fährmann 20eb6c401f
[nijie] improvements and fixes (#423)
5 years ago
Mike Fährmann d1ea08c67d
[weibo] fixes and improvements
5 years ago
Mike Fährmann 38d97f3da6
[deviantart] add debug message about API credentials (#424)
5 years ago
Mike Fährmann 80c2104fb5
[deviantart] fix 429 handling if 'fatal' is False (closes #424)
5 years ago
Mike Fährmann 913460240d
[reddit] fix 'extractor.blacklist()' arguments
5 years ago
Mike Fährmann 22bac14452
[pixiv] match '/artworks/' URLs
5 years ago
Mike Fährmann 66cac207ac
[twitter] match and use 'i/web' status URLs
5 years ago
Mike Fährmann 946f2751e2
[reddit] add 'user' extractor (closes #350)
5 years ago
Mike Fährmann c14abb9fb8
[reddit] improve URL parameter handling for subreddit links
5 years ago
Mike Fährmann ee8b654464
[instagram] implement 'highlights' option (closes #329)
5 years ago
Mike Fährmann f63c3097a9
[instagram] rework some code paths
5 years ago
Mike Fährmann 4330133114
[imgur] add 'favorite' extractor (closes #420)
5 years ago
Mike Fährmann ee5e20221f
[imgth] fix image URLs
5 years ago
Mike Fährmann b63b126808
[hentaicafe] extend URL pattern
5 years ago
Mike Fährmann d780f0357e
[imgur] add user extractor
5 years ago
Mike Fährmann 11ea689013
[simplyhentai] fix image and video URLs
5 years ago
Mike Fährmann 15632a1570
[tsumino] fix extraction
5 years ago
Mike Fährmann d92802fd37
[luscious] fix detection of unavailable galleries
5 years ago
Mike Fährmann f99da2b866
[imgbb] detect invalid album and user profile links
5 years ago
Mike Fährmann 01bc7adadc
[deviantart] improve journal detection (#419)
5 years ago
Mike Fährmann 776e9e073f
close archive on job completion (#417)
5 years ago
Mike Fährmann 5ac9732adc
call 'sys.exit()' on Ctrl+c
5 years ago
Mike Fährmann 9178b54eae
handle errors when opening download archive file (#417)
5 years ago
Mike Fährmann 6e12907de6
[deviantart] improve handling of private deviations (#414)
5 years ago
Mike Fährmann 4203931d79
release version 1.10.4
5 years ago
Mike Fährmann e7690ac694
[vsco] update URL pattern (closes #410)
5 years ago
Mike Fährmann 1848788970
update test results etc
5 years ago
Mike Fährmann d5fbb2d9de
[tumblr] ignore audio links from Spotify etc.
5 years ago
Mike Fährmann b1cddce865
Revert "[simplyhentai] fix extraction; remove image+video extractors"
5 years ago
Mike Fährmann d23660c04d
[hentaicafe] restore default 'request()' behavior
5 years ago
Mike Fährmann 9ae58a6b3e
[exhentai] update image limit checks
5 years ago
Mike Fährmann 6fe9a134bf
[lineblog] add blog and post extractors (closes #404)
5 years ago
Mike Fährmann 4e8a548a61
[livedoor] update metadata extraction
5 years ago
Mike Fährmann f9285f99e6
[pixiv] fix authentication
5 years ago
Mike Fährmann 6f3df3999a
[fuskator] add gallery and search extractor (closes #407)
5 years ago
Mike Fährmann bc0ca66c99
[twitter] small improvements
5 years ago
Mike Fährmann 682105b8ee
prevent crash when loading unavailable downloader (#405)
5 years ago
Mike Fährmann 5fcebb69c2
[postprocessor:ugoira] improve error messages (#406)
5 years ago
Mike Fährmann f02a768b5c
[danbooru] add 'ugoira' option (#406)
5 years ago
Mike Fährmann 9646ccb320
release version 1.10.3
5 years ago
Mike Fährmann dedea3b4db
[deviantart] fix journal creation (#400)
5 years ago
Mike Fährmann c6c5cb1898
improve 'deviantart.quality' description
5 years ago
Mike Fährmann 8eaae58045
[downloader:http] change log message level to 'debug'
5 years ago
Mike Fährmann efb64ad031
[deviantart] generate filenames (#392, #400)
5 years ago
Mike Fährmann 0ce98169b8
improve path generation
5 years ago
Mike Fährmann b2151f3928
[seiga] support mobile URLs (closes #401)
5 years ago
Mike Fährmann 20fd2d8450
[flickr] skip unavailable images/videos (fixes #398)
5 years ago
Mike Fährmann 60c8e090da
[postprocessor:zip] fix archive names (closes #397)
5 years ago
Mike Fährmann 7c09545f70
[downloader:ytdl] add 'outtmpl' option (#395)
5 years ago
Mike Fährmann 5cc7be2536
[piczel] update and improve
5 years ago
Mike Fährmann 0c1c7abb4d
release version 1.10.2
5 years ago
Mike Fährmann 49f6d7176d
[deviantart] restore filenames (#392)
5 years ago
Mike Fährmann 63daa68d67
[deviantart] improvements (#392)
5 years ago
Mike Fährmann d1db5180ab
[simplyhentai] fix extraction; remove image+video extractors
5 years ago
Mike Fährmann 30d6e284b0
[deviantart] use NAPI for artworks and scraps (#392)
5 years ago
Mike Fährmann 7d6af936c5
[imgur] simplify gallery extraction
5 years ago
Mike Fährmann 3284c62f22
ensure PathFormat.directory ends with a path separator
5 years ago
Mike Fährmann ebabc5caf1
[downloader:http] treat 416 without downloaded data as error
5 years ago
Mike Fährmann 2495b99347
[postprocessor:classify] improve path generation (fixes #138)
5 years ago
Mike Fährmann e77a656437
optimize directory path generation
5 years ago
Mike Fährmann 51d10783fc
[patreon] include image info in API results (#383)
5 years ago
Mike Fährmann 7a5e78741c
[booru] build directory path for each file (#385)
5 years ago
Mike Fährmann b1728f512d
[patreon] support multi image posts and post URLs (#383)
5 years ago
Mike Fährmann 454bf1ebf9
preserve enumeration index after 'set_extension()' (#306)
5 years ago
Mike Fährmann f5039b897f
replace DownloadArchive.check() with __contains__()
5 years ago
Mike Fährmann 5a210991b6
Remove control characters from filesystem paths
5 years ago
Mike Fährmann c50d60a53d
[reactor] fix image URLs
5 years ago
Mike Fährmann 32447d0d24
[pixiv] simplify default filename format
5 years ago
Mike Fährmann 5f8621b29d
improve output of active post processor modules
5 years ago
Mike Fährmann 2cbbc3dec4
add a 'whitelist' to '--ugoira-conv' (#382)
5 years ago
Mike Fährmann 829b1ccf04
[imgur] distinguish album and gallery URLs (#380)
5 years ago
Mike Fährmann 23251356cb
require 'extension' data for each URL (#382)
5 years ago
Mike Fährmann a67413d64f
[xhamster] use input URL domain
5 years ago
Mike Fährmann 0bb873757a
update PathFormat class
5 years ago
Mike Fährmann 423f68f585
[deviantart] fix scraps extraction (closes #376)
5 years ago
Mike Fährmann 3bf20ffb70
[instagram] add support for story highlights
5 years ago
Mike Fährmann a732e9c430
[instagram] update query hashes and headers
5 years ago
Mike Fährmann 2ccf6a9e35
[instagram] make extractor tests happy (#373)
5 years ago
Mike Fährmann 8dc42bb178
implement 'enumerate' for 'extractor.skip' (#306)
5 years ago
Leonardo Taccari bc5eaf7746
[instagram] Add support for IGTV (#373)
5 years ago
Mike Fährmann b7fb93e2b2
[downloader:http] add 'adjust-extensions' option
5 years ago
Mike Fährmann eb7da159e2
[imagebam] update URL test results
5 years ago
Mike Fährmann 189acbeac9
[imgbb] add extractor for individual images (closes #363)
5 years ago
Mike Fährmann ad3ac02fbc
[pixiv] update metadata entries (#366)
5 years ago
Mike Fährmann 1ff4c4ec03
[adultempire] consistent artist order
5 years ago
Leonardo Taccari 2df050e627
[instagram] Add support for stories (#371)
5 years ago
Mike Fährmann f4bc75e854
fix rate limit handling for OAuth APIs (#368)
5 years ago
Mike Fährmann 3957d27d79
[deviantart] add 'quality' option (#369)
5 years ago
Mike Fährmann 64b2935d8e
[pixiv] provide 'filename' and change default filename format
5 years ago
Mike Fährmann 2f33bac030
release version 1.10.1
5 years ago
Mike Fährmann fa60109e97
[exhentai] don't use e-hentai.org for exhentai URLs
5 years ago
Mike Fährmann dfe552421b
release version 1.10.0
5 years ago
Mike Fährmann 0609afd1e4
update default cache directory ... again
5 years ago
Mike Fährmann 4a0c98bfc9
miscellaneous fixes and adjustments
5 years ago
Mike Fährmann 2c839f3760
[imgbb] add user extractor + login support (#361)
5 years ago
Mike Fährmann a8b60b2bd9
change default cache directory for unix systems
5 years ago
Mike Fährmann 4b6edfbfd2
restrict permissions without importing 'pathlib'
5 years ago
Leonardo Taccari afce1ee1eb
Avoid possible sensitive information disclosure via cache.file
5 years ago
Mike Fährmann 2153206093
[imgbb] add album extractor (#361)
5 years ago
Mike Fährmann beb4fab2e6
[exhentai] improve limit and error handling (#360)
5 years ago
Mike Fährmann 81b35ed3cb
[exhentai] catch more error states (#356, #360)
5 years ago
Mike Fährmann a90280f4e7
[postprocessor:zip] add 'mode' option (#355)
5 years ago
Mike Fährmann 6ce22f606b
[exhentai] update login procedure and tests
5 years ago
Mike Fährmann dc73d02d87
[exhentai] always use e-hentai.org as domain + set nw cookie
5 years ago
Mike Fährmann 40637556fa
[ngomik] fix extraction
5 years ago
Mike Fährmann 3969f9cbbd
[behance] fix collection extraction
5 years ago
Mike Fährmann 20f7b07312
ensure postproc finalize() is called during C-c or crash (#355)
5 years ago
Mike Fährmann 17a3426845
[gelbooru] enable all content when not using API
5 years ago
Mike Fährmann 279db2c5b2
[vsco] add collection & image extractor + video support (#331)
5 years ago
Mike Fährmann 547ea71463
[downloader.ytdl] add 'forward-cookies' option (#352)
5 years ago
Mike Fährmann d9d44ad953
[tsumino] update test results
5 years ago
Mike Fährmann b1bea8aaeb
add 'restrict-filenames' option (#348)
5 years ago
Mike Fährmann 60cf40380a
[vsco] add user extractor (#331)
5 years ago
Mike Fährmann 3fe5ccdfa6
[adultempire] add gallery extractor (closes #340)
5 years ago
Mike Fährmann b3851e01d9
release version 1.9.0
5 years ago
Mike Fährmann 5d968412ca
[deviantart] case-insensitive folder name matching (fixes #343)
5 years ago
Mike Fährmann a3c736fedc
[500px] fix extraction
5 years ago
Mike Fährmann 1133b7fcbd
[smugmug] update unit tests
5 years ago
Mike Fährmann 21991acc49
add 'ciphers' option; update default User-Agent
5 years ago
Mike Fährmann 84f4d3bc0b
replace urllib3's default cipher list with Firefox's (#342)
5 years ago
Mike Fährmann feb98cf196
[twitter] improve 'content' formatting; add option (#338)
5 years ago
Mike Fährmann 1740086d8a
add 'repl' and 'sep' arguments to text.replace_html()
5 years ago
Mike Fährmann 8d1ae9b715
[tumblr] enable date-min/-max/-format options (#337)
5 years ago
Mike Fährmann 09f37fde39
[reddit] move date-min/-max handling into Extractor class
5 years ago
Mike Fährmann 7b77ecc35a
fix paths for files without extension (#220)
5 years ago
Mike Fährmann c41ff9441e
improve find() for downloaders and postprocessors
5 years ago
Mike Fährmann 0151e250f5
[twitter] extract 'content' metadata (closes #333)
5 years ago
Mike Fährmann 16c582aaf9
implement 'mtime' post-processor (#332)
5 years ago
Mike Fährmann 62097284fe
add 'download' option (#220)
5 years ago
Mike Fährmann fe7805de7c
improve attribute access in DownloadJob.handle_url()
5 years ago
Mike Fährmann 56c7a66a4a
detect Cloudflare CAPTCHAs and update cipher list
5 years ago
Mike Fährmann a7b42b37a2
[35photo] fix extraction
5 years ago
Mike Fährmann 04b8d0894a
[newgrounds] improve metadata extraction
5 years ago
Mike Fährmann 12da6bd0c9
[simplyhentai] fix/improve extraction
5 years ago
Mike Fährmann fdec59f8e2
replace extractor.request() 'expect' argument
5 years ago
Mike Fährmann 2ff73873f0
[erolord] add gallery extractor (closes #326)
5 years ago
Mike Fährmann b4da8c5a97
[sexcom] add extractor for related pins (#325)
5 years ago
Mike Fährmann 69997e92db
[sexcom] skip unavailable pins (#325)
5 years ago
Mike Fährmann 8966930c5c
[downloader:http] try to import SSL exception class from OpenSSL
5 years ago
Mike Fährmann bc6b0cfddc
[shopify] skip consecutive duplicate products
5 years ago
Mike Fährmann b89f0d8d3c
update extractor result tests
5 years ago
Mike Fährmann 69205df68d
allow '-1' for infinite retries (#300)
5 years ago
Mike Fährmann f7b5c4c3e7
use values of 'retries' options correctly
5 years ago
Mike Fährmann 6393b47db2
add '-A/--abort'; deprecate '--abort-on-skip'
5 years ago
Mike Fährmann f2000a69aa
implement 'image-unique' and 'chapter-unique' options (#303)
5 years ago
Mike Fährmann 40da44b17f
Merge branch 'v1.9.0'
5 years ago
Mike Fährmann 9a216a6c6c
release version 1.8.7
5 years ago
Mike Fährmann 7a99e85943
[kissmanga] fix download URLs and file extensions
5 years ago
Mike Fährmann 055102431f
[hitomi] handle Game CG galleries with scenes (fixes #321)
5 years ago