Commit Graph

1875 Commits (055c32e0f77302f8f7263366463d115a7df0eaf0)

Author SHA1 Message Date
Mike Fährmann fa2952ac55
[furaffinity] add 'following' extractor (#515)
4 years ago
Mike Fährmann 9b194520db
[newgrounds] add 'following' extractor (closes #684)
4 years ago
Mike Fährmann 6386ee54e1
[deviantart] add extractor info to 'following' results
4 years ago
Mike Fährmann d5273f9b0c
[hiperdex] update domain to hiperdex.net
4 years ago
Mike Fährmann 08674a91f3
[patreon] fix hash extraction from download URLs (closes #693)
4 years ago
Mike Fährmann a6286bb551
[hiperdex] add 'artist' extractor (#606)
5 years ago
Mike Fährmann 291033720a
[hiperdex] fix manga extraction
5 years ago
Mike Fährmann dfc0557807
[vsco] fix collection extraction
5 years ago
Mike Fährmann fd438f0d78
update extractor test results
5 years ago
Mike Fährmann bae1e8ed12
[deviantart] fix JPEG quality replacement pattern
5 years ago
Mike Fährmann cf4cef3d63
[aryion] adjust 'date' to UTC time
5 years ago
Mike Fährmann 6c531be294
[aryion] fix malformed 'last-modified' headers (#390)
5 years ago
Mike Fährmann dc65f7d8dc
[aryion] use generic download URLs (#390)
5 years ago
Mike Fährmann 96b78bcf04
[aryion] include path in default directory format (#390)
5 years ago
Mike Fährmann 6143050980
[aryion] add gallery and post extractors (#390, #673)
5 years ago
Mike Fährmann 9e7dfc0cfc
[myportfolio] fix extraction of galleries without title
5 years ago
Mike Fährmann 88fca0a172
[mastodon] update OAuth credentials for pawoo.net (#665)
5 years ago
Mike Fährmann 4ae8a25567
[mastodon] use 'combine_dict()' to combine extractor info dicts
5 years ago
Mike Fährmann 220c06b86e
[mastodon] handle rate limits
5 years ago
Mike Fährmann d02f7c1118
improve Extractor.wait()
5 years ago
Mike Fährmann 5d7404ab58
[oauth] use the new name for 'DeviantartAPI' (fixes #670)
5 years ago
Mike Fährmann 762c758af4
[hiperdex] fix extraction
5 years ago
Mike Fährmann f9a590f92b
[deviantart] apply HTTP request limits in more places
5 years ago
Mike Fährmann 2587296deb
[mastodon] add access tokens for mastodon.social and baraag.net
5 years ago
Mike Fährmann ff7c0b7eff
[deviantart] handle "Request blocked" errors (#655)
5 years ago
Mike Fährmann c874684f05
[deviantart] retrieve *all* download URLs through OAuth API
5 years ago
Mike Fährmann 5c27b25a8f
[deviantart] improve sta.sh extraction
5 years ago
Mike Fährmann e2fc4eaa6f
[deviantart] detect stash folders (fixes #659)
5 years ago
Mike Fährmann c034159701
[piczel] fix extraction for single images
5 years ago
Mike Fährmann 699036ea0c
[weibo] accept status URLs with non-numeric IDs (#664)
5 years ago
Mike Fährmann fe96f99e4b
[hentainexus] reduce line length (flake8) & update test
5 years ago
墨焓 6f81cac8fa
Add metadata to hentainexus: circle, event, title_conventional. (#661)
5 years ago
Mike Fährmann 6f911aeb1c
[deviantart] add error message for cloudFront blocks (#655)
5 years ago
Mike Fährmann 7499d71d02
[simplyhentai] ignore certificate errors in video test
5 years ago
Mike Fährmann 4203dc0bdc
[mangapark] fix metadata extraction
5 years ago
Mike Fährmann 1b82d36ab2
[deviantart] handle decode errors for extended_fetch results (#655)
5 years ago
Mike Fährmann 09f2271528
[35photo] add 'tag' extractor
5 years ago
Mike Fährmann 77fda8190c
[35photo] simplify/remove tests for the 'genre' extractor
5 years ago
Mike Fährmann fb846c9ee5
[instagram] reduce line lengths and make flake8 happy
5 years ago
Mike Fährmann ad2efa8509
[e621] derive from Danbooru extractors (#651)
5 years ago
Mike Fährmann 9b39e1cd7e
[e621] fix bug in API rate limiting (#651)
5 years ago
Mike Fährmann b607d0ad7f
[twitter] fix typo in 'x-twitter-auth-type' header (#625)
5 years ago
Mike Fährmann 2c3b9e1450
[nozomi] support multiple images per post (#646)
5 years ago
Mike Fährmann c606d0c854
[instagram] update pattern for user profile URLs
5 years ago
Mike Fährmann 2530db3f4d
[mangadex] transform 'date' timestamps to datetime objects
5 years ago
Mike Fährmann ae2a33243b
[newgrounds] catch general Exceptions
5 years ago
Mike Fährmann 32e36d8f02
[sexcom] replace tests
5 years ago
Mike Fährmann 33b42dc847
[nozomi] sort search results (fixes #646)
5 years ago
Mike Fährmann eaa60a438b
[piczel] fix extraction
5 years ago
Mike Fährmann 5bcc7184c9
[danbooru][e621] increase page limits
5 years ago
Mike Fährmann 90d15e3682
[instagram] use 'itertools.chain()'
5 years ago
Leonardo Taccari 160328d21c
[instagram] Add support for user's saved medias (#644)
5 years ago
Mike Fährmann d3482ace7f
[furaffinity] extract more metadata
5 years ago
Mike Fährmann fdd2dd5136
[kabeuchi] add 'user' extractor (closes #561)
5 years ago
Mike Fährmann 59edcdc822
[hitomi] restore metadata fields from before f33b13a
5 years ago
Mike Fährmann 2d5703c493
[twitter] use a simpler data structure to store cookies in cache
5 years ago
Mike Fährmann 87d4f83597
[newgrounds] make post extraction nonfatal
5 years ago
Mike Fährmann 823fbeaae6
[newgrounds] add 'favorite' extractor (#394)
5 years ago
Mike Fährmann a45fbc38ea
[pixiv] implement 'avatar' option (#595, #623)
5 years ago
Mike Fährmann a63a376ad2
[mangoxo] fix login
5 years ago
Mike Fährmann ebc70e87ce
[e621] update to new interface / API endpoints (closes #635)
5 years ago
Mike Fährmann d1cf7ccdb3
[instagram] add 'post_shortcode' metadata field (#525)
5 years ago
Mike Fährmann 32df8d06fe
[twitter] add 'bookmark' extractor (closes #625)
5 years ago
Mike Fährmann 3fb41c34c8
[bcy] reduce requests to '/item/detail/<id>' (#613)
5 years ago
Mike Fährmann f33b13aacf
[hitomi] simplify metadata extraction
5 years ago
Mike Fährmann ce5e2a58fe
[imgbb] update test results
5 years ago
Mike Fährmann f117e32910
[danbooru] restore 'popular' functionality
5 years ago
Mike Fährmann 39b48d665b
[hiperdex] use proper name for 'chapter_minor'
5 years ago
Mike Fährmann 8fbbaa54ff
[bcy] fix partial image URLs (#613)
5 years ago
Mike Fährmann 86c00f9e66
[danbooru] move extractor logic from booru.py
5 years ago
Mike Fährmann 1d4a369ea2
update extractor test results
5 years ago
Mike Fährmann 7625912b31
[piczel] improve and update
5 years ago
Mike Fährmann 913b8333cc
write DeviantArt refresh-tokens to cache (#616)
5 years ago
Mike Fährmann 2a4f227e08
warn about expired cookies
5 years ago
Mike Fährmann 4e361b3008
add tests for specific datetime values
5 years ago
Mike Fährmann 80ecb99089
[hitomi] fix extraction
5 years ago
Mike Fährmann 247c9e1416
[vsco] update gallery URL pattern
5 years ago
Mike Fährmann 19ae6f3fc4
update test results
5 years ago
Mike Fährmann cc5079c844
[hiperdex] add chapter and manga extractors (closes #606)
5 years ago
Mike Fährmann 64bdec8430
[deviantart] check availability of intermediary URLs (fixes #609)
5 years ago
Mike Fährmann 5607dd3646
[hitomi] follow multiple redirects
5 years ago
Mike Fährmann 765b2a0527
[hentaihand] add extractors (closes #605)
5 years ago
Mike Fährmann d94215d119
[tumblr] replace '-' with ' ' in tag searches (fixes #611)
5 years ago
Mike Fährmann e6cd49e78b
update extractor test results
5 years ago
Mike Fährmann 5d9437b398
[vsco] skip "invalid" entities
5 years ago
Mike Fährmann 650f2b6d58
[furaffinity] accept sfw.furaffinity.net URLs (closes #608)
5 years ago
Mike Fährmann 74e684e828
[twitter] change default value for 'videos' to 'true'
5 years ago
Mike Fährmann c7cf9dd111
[furaffinity] support classic layout (#284)
5 years ago
Mike Fährmann 138135c190
[furaffinity] add extractors (#284)
5 years ago
Mike Fährmann b9c574bd1d
[patreon] log skipped files (#590)
5 years ago
Mike Fährmann 80ea9104b8
[8kun] adjust URL pattern
5 years ago
Mike Fährmann ce26070231
[pixiv] reduce calls to '/user/detail'
5 years ago
Mike Fährmann da0d5f6092
[oauth] add 'port' option (#604)
5 years ago
Mike Fährmann 719b63d0ca
[bcy] add user and post extractors (#592)
5 years ago
Mike Fährmann 6426e3efc7
[khinsider] fix and improve metadata extraction
5 years ago
Mike Fährmann b7eb6cecbb
[pixiv] handle tags at the end of new bookmark URLs
5 years ago
Mike Fährmann 109f6c8685
[patreon] filter duplicate files per post (#590)
5 years ago
Mike Fährmann b38cf59711
[sexcom] fix image URLs & parse 'date' fields
5 years ago
Mike Fährmann 1f4c9c5f9d
[8kun] add thread and board extractors (closes #582)
5 years ago
Mike Fährmann facc5daa6d
[twitter] force old login page layout (fixes #584, fixes #598)
5 years ago
Mike Fährmann d1de7dc296
[hitomi] implement workaround for "broken" redirects
5 years ago
Mike Fährmann 40fe062851
[pixiv] fix user id for bookmarks API calls (closes #596)
5 years ago
Mike Fährmann 91aaaf1a9e
[pixiv] add 'rating' metadata field (#595)
5 years ago
Mike Fährmann dff33b260c
[reddit] add 'videos' option
5 years ago
Mike Fährmann 2ad43618cc
[piczel] fix extraction
5 years ago
Mike Fährmann cf7a67d67f
[yaplog] remove module
5 years ago
Mike Fährmann e0dd073ce0
[twitter] replace embedded tweet test
5 years ago
Mike Fährmann ec36df4851
[deviantart] fix video extraction from 'extended_fetch' results
5 years ago
Mike Fährmann 48be2266ed
[deviantart] better error message for 'extended_fetch' (#585)
5 years ago
Mike Fährmann 71851a6241
[pixiv] update URLs of followed users to the new format
5 years ago
Mike Fährmann d086f30b42
[reddit] restore archive keys for i.redd.it images
5 years ago
Mike Fährmann 56f1c96168
implement 'parent-directory' option (#551)
5 years ago
Mike Fährmann ae07f92f7e
[reddit] rewrite extractor logic (closes #551)
5 years ago
Mike Fährmann 2852691d78
[paheal] replace test URL
5 years ago
Mike Fährmann 2a9be48511
improve util.load/save_cookiestxt() and add tests
5 years ago
Mike Fährmann e35c2ea1a6
[weibo] use youtube-dl to download from m3u8 manifests
5 years ago
Mike Fährmann 6703b8a86b
[blogger] implement video extraction (closes #587)
5 years ago
Mike Fährmann c1a6862863
implement functions to load/save cookies.txt files (closes #586)
5 years ago
Mike Fährmann 25d5ec4ff3
[twitter] add option to extract TwitPic embeds (#579)
5 years ago
Mike Fährmann 32d7195d08
[pinterest] improve detection of invalid pin.it links
5 years ago
Mike Fährmann 174117f827
allow multiple hashes for content tests
5 years ago
Alice f498a9057f [twitter] Fix stop before real end (#573)
5 years ago
Mike Fährmann 8bb32ee188
[hitomi] fix image URLs
5 years ago
Mike Fährmann bd5ce9855c
allow GalleryExtractors to set URL-independent extensions
5 years ago
Mike Fährmann af42c75152
[mangadex] revert domain to 'mangadex.org'
5 years ago
Mike Fährmann e89413da22
update test results
5 years ago
Mike Fährmann 33a6e0ac6e
[hentaifoundry] extract more metadata (closes #565)
5 years ago
Mike Fährmann 5cac79c3d9
[erolord] remove extractor
5 years ago
Mike Fährmann b9cbf932b4
[pixiv] update URL patterns (fixes #568)
5 years ago
Mike Fährmann 988cc2ec23
[mangadex] change domain to mangadex.cc (closes #559)
5 years ago
Mike Fährmann f8e137d6b4
[deviantart] show warning about private deviations only once
5 years ago
Mike Fährmann 939fec8ecd
[deviantart] match new search/popular URLs (closes #538)
5 years ago
Mike Fährmann 09cc88b715
[deviantart] match '/favourites/all' URLs (closes #555)
5 years ago
Mike Fährmann 3811fd8a25
fix time formatting for Python 3.4 and 3.5
5 years ago
Mike Fährmann 43ab9572b4
[twitter] handle API rate limits (#526)
5 years ago
Mike Fährmann 569747a78d
implement extractor.wait()
5 years ago
Mike Fährmann 5532e9c158
[twitter] handle quoted tweets (#526)
5 years ago
Mike Fährmann 0b4cb8e57a
[mangahere] send 'isAdult' cookie (fixes #556)
5 years ago
Mike Fährmann 1afb91363c
[imagefap] generalize URL patterns and add tests (#552)
5 years ago
Xope Totec f701e9f33a Handle beta.imagefap.com URLs (#552)
5 years ago
Mike Fährmann ce54b8c04c
let extractors opt-out of cookie option usage
5 years ago
Mike Fährmann 5ad92fc196
[newgrounds] fix tags metadata extraction
5 years ago
Mike Fährmann 82f7f4172a
update test results
5 years ago
Mike Fährmann 1f2a69f3c5
add '_extractor' information to redirect results
5 years ago
Mike Fährmann a27f43dad1
[pixiv] wait and retry after rate limit error (closes #535)
5 years ago
Mike Fährmann 6b373cb7e2
[exhentai] restrict default directory name length (#545)
5 years ago
Mike Fährmann b347bf68c7
[deviantart] add extractor for followed users (#515)
5 years ago
Mike Fährmann c0f391a4e2
[pixiv] support listing followed users (#515)
5 years ago
Mike Fährmann 896896a490
[twitter] fix URLs forwarded to youtube-dl (closes #540)
5 years ago
Mike Fährmann 1e2713b895
[artstation] fix search result pagination (closes #537)
5 years ago
Mike Fährmann bf3df3d0b0
[directlink] send Referer headers (closes #536)
5 years ago
Mike Fährmann 9be7ff600e
[imagetwist] replace test image
5 years ago
Mike Fährmann 66905b1664
[foolslide] add fallback for chapter data extraction
5 years ago
Mike Fährmann 48e42e73fb
[reddit] change default value for 'comments' to '0'
5 years ago
Mike Fährmann 9c0928457a
[reddit] fix errors with 't1_…' submissions
5 years ago
Mike Fährmann bf658fd84b
[vsco] implement 'videos' option
5 years ago
Mike Fährmann 95c90722ee
[instagram] implement 'videos' option (closes #521)
5 years ago
Mike Fährmann d0920e84e9
update test results
5 years ago
Mike Fährmann 8c11e81c9f
Merge commit '63e6993716db8d8bedfb7b0d445c7161493046b6'
5 years ago
Mike Fährmann 63e6993716
merge 'bypost' functionality into metadata postprocessor
5 years ago
Mike Fährmann 31a29835ff
[realbooru] simplify extractors and update tests (#514)
5 years ago
The Oddball 9a4ce20b8e [realbooru] Add Realbooru extractor (#514)
5 years ago
Mike Fährmann 72b8fbfbad
[instagram] make post-page extraction nonfatal
5 years ago
Mike Fährmann 922b8a9595
[weibo] raise NotFoundError for unavailable/deleted statuses
5 years ago
Mike Fährmann 0cd157300e
[patreon] fix regex pattern for posts
5 years ago
Mike Fährmann fe19e233f3
[xvideos] improve
5 years ago
Mike Fährmann d3e44e899d
raise NotFoundErrors for 404 responses in GalleryExtractors
5 years ago
Mike Fährmann a4dd8b3dab
improve _check_cookies()
5 years ago
Mike Fährmann 76e60d10a6
[patreon] raise proper exception if creator/post doesn't exist
5 years ago
Mike Fährmann 9e63804347
[patreon] make retrieving user info nonfatal (#508)
5 years ago
Mike Fährmann 964dc57286
[vsco] improve image resolutions
5 years ago
Mike Fährmann 0629fe8fa4
[vsco] fix user profile extraction … again
5 years ago
Mike Fährmann ab17ea9632
[deviantart] only print warning if 'original' is enabled
5 years ago
Mike Fährmann 2188db6284
[gelbooru] fix non-API tag extraction
5 years ago
Mike Fährmann c4702ec9b6
simplify some logging calls
5 years ago
Gio c0b9ad678d Separate metadata from handle_url into handle_metadata, commenting
5 years ago
Mike Fährmann c9ef1b21c3
[patreon] get partial user info without /api/user/<id> (#507)
5 years ago
Mike Fährmann 0ab9bb1721
[4chan] add extractor for entire boards (closes #510)
5 years ago
Gio cfc70a97ab Added an additional channel for downloading the metadata of an entire post or gallery.
5 years ago
Mike Fährmann 15f9bb3d14
add option to disable pyOpenSSL usage (#508)
5 years ago
Mike Fährmann c8e99e3b3b
[deviantart] fix crash on missing "token" field (#505)
5 years ago
Mike Fährmann 6ed2c7823c
[deviantart] disable original downloads if no cookies set
5 years ago
Mike Fährmann 50deab5265
[deviantart] fix URL generation from /extended_fetch results
5 years ago
Mike Fährmann 1f209da4c0
[pixiv] match new search URLs (closes #507)
5 years ago
Mike Fährmann e17907ee2a
change default value of 'cookies-update' to 'true'
5 years ago
Mike Fährmann 07dafad26d
[twitter] attempt to fix infinite loops (#499)
5 years ago
Mike Fährmann 71acbdabf4
[2chan] fix metadata extraction
5 years ago
Mike Fährmann c0a1241648
[livedoor] force https:// for image URLs
5 years ago
Mike Fährmann 6e23c0da09
[imgur] add extractor for subreddit links (closes #500)
5 years ago
Mike Fährmann 372ffe95ee
[oauth] adjust Flickr redirect URI (fixes #503)
5 years ago
Mike Fährmann 004812258d
[hentaifox] fix extraction
5 years ago
Mike Fährmann e2710702d4
fix Cloudflare bypss
5 years ago
Mike Fährmann 8759403f37
[plurk] add delay between comment requests
5 years ago
Mike Fährmann a28552fd19
update test results
5 years ago
Mike Fährmann dcaa3d01bd
[imagefap] adapt to new image URL format
5 years ago
Mike Fährmann e62c209ca0
[nijie] fix 'date' parsing
5 years ago
Mike Fährmann 3bba763ab9
[twitter] improve
5 years ago
Mike Fährmann db35c3b581
[directlink] separate filenames from paths
5 years ago
Mike Fährmann 41a3169c67
[foolfuuka] use '{extension}' in default filename format
5 years ago
Mike Fährmann e9aed62c91
[imgur] unescape image titles
5 years ago
Mike Fährmann 2c332edaad
[plurk] fix comment pagination
5 years ago
Mike Fährmann a3fa45bbb1
[behance] get images from 'media_collection' modules
5 years ago
Mike Fährmann 359c3bc1c5
[deviantart] revert to getting download URLs from OAuth API
5 years ago
Mike Fährmann 42b9633c7e
update test results
5 years ago
Mike Fährmann b28bd1c73e
[bobx] set generated session cookie (closes #482)
5 years ago
Mike Fährmann ae09f87602
improve SharedConfigMixin config lookups
5 years ago
Mike Fährmann f5604492c3
update interface of config functions
5 years ago
Mike Fährmann 4ca883c66f
[smugmug] replace test for custom URLs
5 years ago
Mike Fährmann d45fabb79d
match user profile handling on deviantart and newgrounds
5 years ago
Mike Fährmann ea80dadd09
[deviantart] restore archive keys
5 years ago
Mike Fährmann ea094692c8
[vsco] fix collection extraction (#480)
5 years ago
Mike Fährmann 490831f84a
[bobx] "fix" image download URLs
5 years ago
Mike Fährmann 978cb03f81
update misc test results
5 years ago
Mike Fährmann fca87974fe
[sexcom] fix video downloads by sending specific Referer headers
5 years ago
Mike Fährmann edc080468d
[instagram] make 'video_url' fields optional (fixes #479)
5 years ago
Mike Fährmann 9fdc5e74cb
[deviantart] ensure consistent username capitalization (#455)
5 years ago
Mike Fährmann b1f0609de5
[newgrounds] rewrite (#394)
5 years ago
Mike Fährmann 3ece3976ae
[newgrounds] implement login support (#394)
5 years ago
Mike Fährmann 3a07c06865
[newgrounds] update
5 years ago
Mike Fährmann 5513b66eb0
[vsco] fix user profile extraction
5 years ago
Mike Fährmann abfcb356fc
[flickr] support 3k, 4k, 5k, and 6k photo sizes (closes #472)
5 years ago
Mike Fährmann 521fcd2eb9
[imgbb] fix error in galleries without user info (closes #471)
5 years ago
Mike Fährmann 8061263d4c
[imgbb] improve pagination logic
5 years ago
Mike Fährmann da6789b2b0
disable unique archive id checks for some tests
5 years ago
Mike Fährmann b0197098e6
[imgur] get title from webpage if missing in API response
5 years ago
Mike Fährmann dd5d2b2eac
[deviantart] add user profile extractor (#377, #419)
5 years ago
Mike Fährmann a437e78620
[deviantart] minimize cookie usage during scraps extraction
5 years ago
Mike Fährmann 1a197d2195
store the original cookiejar as Extractor._cookiejar
5 years ago
Mike Fährmann de83ae4576
make 'method' argument of Extractor.request keyword-only
5 years ago
Mike Fährmann 4325695d74
[luscious] expand GraphQL queries
5 years ago
Mike Fährmann 94dbdbf506
[nijie] change default filename format
5 years ago
Mike Fährmann c18fadc221
[instagram] extract videos without youtube-dl (#391)
5 years ago
Mike Fährmann f15eedb634
[sexcom] set Referer header for file downloads (closes #464)
5 years ago
Mike Fährmann 2a3bd4e3c7
rename extractor classes starting with a digit
5 years ago
Mike Fährmann b3b9da6d74
[photobucket] replace test URL
5 years ago
Mike Fährmann 64786363be
[4chan] simplify
5 years ago
Mike Fährmann 557e2c018b
[8chan] remove module
5 years ago
Mike Fährmann e14782a948
[instagram] simplify graphql extraction for post pages
5 years ago
Mike Fährmann c01ff78467
[twitter] extend 'videos' option to force extraction with ytdl
5 years ago
Mike Fährmann f8ac67ce50
[hitomi] extend URL pattern + follow redirects
5 years ago
Mike Fährmann e877ca97c3
[naver] adjust directory names and metadata structure
5 years ago
Mike Fährmann 702f2fbd1f
[issuu] add publication and user extractors (#413)
5 years ago
Mike Fährmann 8361d874d7
[hitomi] fix extraction
5 years ago
Mike Fährmann 5fa6ff04dd
[instagram] extract '__additionalDataLoaded' (#391)
5 years ago
Mike Fährmann 87a87bff7e
[simplyhentai] fix image URLs
5 years ago
Mike Fährmann 4409d00141
embed error messages in StopExtraction exceptions
5 years ago
Mike Fährmann d44f790e81
adjust output for HTTP status related errors
5 years ago
Mike Fährmann 109718a5e3
[blogger] add blog and post extractors (closes #364)
5 years ago
Mike Fährmann 49a6b1b6c0
[twitter] extract video stream info without youtube-dl (#452)
5 years ago
Mike Fährmann 9f0dbf2a72
[twitter] raise proper exception for protected Tweets
5 years ago