gallery-dl

Commit Graph

Author	SHA1	Message	Date
Mike Fährmann	bfbbac4495	[tsumino] add login capabilities (#161 )	6 years ago
Mike Fährmann	dd358b4564	improve cookie handling during logins	6 years ago
Mike Fährmann	6126615698	update URLs for supportedsites.rst	6 years ago
Mike Fährmann	80a75a1ecf	[tsumino] add gallery extractor (#161 )	6 years ago
Mike Fährmann	2d2953a5bf	add 'text.parse_float()' + cleanup in text.py	6 years ago
Mike Fährmann	0c32dc5858	[hentaifox] add extractor for search results (#160 )	6 years ago
Mike Fährmann	580947bfce	[hentaifox] rename Chapter- to GalleryExtractor (#160 )	6 years ago
Mike Fährmann	8095f5f81a	[mangapark] fix manga title extraction	6 years ago
Mike Fährmann	0156189468	[hentaifox] add chapter extractor (#160 )	6 years ago
Mike Fährmann	e4171d6baf	[luscious] add login capabilities (closes #159 )	6 years ago
Mike Fährmann	4f49fdf065	[mastodon] various improvements and fixes (#144 ) - allow instances to specify their own 'category' - improve config lookup: - first look into extractor.<category>.* - and afterwards look into extractor.mastodon.<instance>.* - add a default entry for pawoo.net in a way that actually works - add an 'instance' keyword and turn 'tags' into a usable list	6 years ago
Mike Fährmann	3f608a84b7	[photobucket] don't crash if JSON data is missing	6 years ago
Mike Fährmann	134487ffb0	[exhentai] stop extraction if image limit is exceeded (#141 ) can be turned off with the `exhentai.limits' option	6 years ago
Mike Fährmann	e868fb4393	[exhentai] improve gallery extraction - match image page URLs and extract galleries from that point onward - add a few more metadata entries: 'parent', 'visible', 'cost'	6 years ago
Mike Fährmann	a50e9faf0e	[newgrounds] recognize direct links	6 years ago
Mike Fährmann	9fba48fbd7	[postprocessor:metadata] add '--write-tags' flag (#135 )	6 years ago
Mike Fährmann	c5559fa07d	[photobucket] improve subalbum extraction (#117 ) The former implementation would produce a complete list of all subalbums for each (sub)album extraction. This would for example result in a level 2 subalbum getting "extracted" twice: once through the root-album (level 0) and once through its parent album on level 1. In the current implementation only the next level of subalbums are returned, which themselves will handle their next level in a recursive fashion.	6 years ago
Mike Fährmann	ecad69100a	[photobucket] add 'image' extractor (#117 )	6 years ago
Mike Fährmann	b50b30f1c9	[photobucket] download subalbums (#117 )	6 years ago
Mike Fährmann	d19bac71be	[photobucket] add 'album' extractor (#117 )	6 years ago
Mike Fährmann	78b5f29a00	[sankaku] unescape tags	6 years ago
Mike Fährmann	277b52101a	add 'category-transfer' option [ci skip]	6 years ago
Mike Fährmann	9b8ac12eed	[behance] enable 'categorytransfer' for collections (#157 )	6 years ago
Mike Fährmann	217a0687ef	[behance] add 'collection' extractor (closes #157 )	6 years ago
Mike Fährmann	b8fed34548	add generalized extractors for Mastodon instances (#144 ) Extractors for Mastodon instances can now be dynamically generated, based on the instance names in the 'extractor.mastodon.*' config path. Example: { "extractor": { "mastodon": { "pawoo.net": { ... }, "mastodon.xyz": { ... }, "tabletop.social": { ... }, ... } } } Each entry requires an 'access-token' value, which can be generated with 'gallery-dl oauth:mastodon:<instance URL>'. An 'access-token' (as well as a 'client-id' and 'client-secret') for pawoo.net is always available, but can be overwritten as necessary.	6 years ago
Mike Fährmann	4b441c162e	release version 1.6.3	6 years ago
Mike Fährmann	66460337f1	[mangapark] fix extraction	6 years ago
Mike Fährmann	8aba2bdebf	[postprocessor:metadata] add 'tags' and 'custom' modes (#135 )	6 years ago
Mike Fährmann	79c01ec7ae	implement J<separator>/ format option J joins list elements by calling <separator>.join(list): Example: {f:J - /} -> "a - b - c" (if "f" is ["a", "b", "c"])	6 years ago
Mike Fährmann	2ffc105887	[exhentai] extract tag metadata	6 years ago
Mike Fährmann	0fb98d1d79	[hbrowse] extract tag metadata	6 years ago
Mike Fährmann	9bbbadd93a	[hbrowse] use HTTPS	6 years ago
Mike Fährmann	2fbf072723	[newgrounds] ensure consistent tag order ... plus some code restructuring	6 years ago
Mike Fährmann	d7a4739cf6	[hbrowse] print error message if site is down ... instead of crashing with a meaningless exception	6 years ago
Mike Fährmann	98c6520384	[pinterest] update root URL of API calls	6 years ago
Mike Fährmann	751e535948	[nhentai] fix extraction (closes #156 ) Use JSON embedded in webpage since API endpoints have been disabled	6 years ago
Mike Fährmann	5f38ac9609	[postprocessor:exec] add a better error message (#155 )	6 years ago
Mike Fährmann	89df37a173	[artstation] use a separate dict for each asset (#154 ) Using the same base-dict for each asset of a project causes unwanted side effects like re-using image filename extensions for videos, resulting in errors with the youtube-dl downloader.	6 years ago
Mike Fährmann	344bbaa71a	remove useless line A remnant from when `filter` and `range` were global and only available as command line options.	6 years ago
Mike Fährmann	1734a6c879	[reactor] detect "circular" redirects (#148 )	6 years ago
Mike Fährmann	e53cdfd6a8	update build_supportedsites.py	6 years ago
Mike Fährmann	1e4d351ad3	[danbooru] add authentication support (closes #151 ) ... via HTTP Basic Auth with username and "password". The password value in this case is not the account password itself, but the"api_key" found in your user profile.	6 years ago
Mike Fährmann	06cbf5f9c4	implement 'chapter-reverse' option (#149 ) Setting it to `true` will start with the latest chapter instead of the first one.	6 years ago
Mike Fährmann	e95b24f056	[reactor] add wait-min & -max options (#148 )	6 years ago
Mike Fährmann	8e01cf0ef8	[reactor] generalize extractors (#148 ) - support *.reactor.cc domains - combine joyreactor and pornreactor modules	6 years ago
Mike Fährmann	38500ad697	[postprocessor:metadata] first implementation (#135 )	6 years ago
Mike Fährmann	1737d7f576	[joyreactor] fix and improve pagination (#148 )	6 years ago
Mike Fährmann	8753627ef4	[joyreactor] improve error handling for faulty JSON (#148 ) - remove all ASCII escape codes, not just \n and \r - ignore faulty posts instead of letting the exception propagate	6 years ago
Mike Fährmann	a36f52a730	[joyreactor] add extractor for search results (#148 )	6 years ago
Mike Fährmann	a303efb597	[mangadex] handle manga pages without chapters	6 years ago
Mike Fährmann	0afa913de4	[tumblr] add tests for hidden and private blogs (#145 ) Hidden / dashboard-only blogs are pretty straightforward and "only" require a valid 'access-token' and 'access-token-secret' for the given 'api-key' and 'api-secret', so that signed OAuth1.0 requests are possible. Private / password protected blogs on the other hand are a bit cumbersome. In addition to a valid 'access-token' and 'access-token-secret', they also require the account belonging to those tokens to be a member of the blog itself. Knowing the password and entering it in the website isn't enough to access a blog through the API. Following a private blog is also impossible, so that option can't work either.	6 years ago
Mike Fährmann	67cc0ac873	release version 1.6.2	6 years ago
Mike Fährmann	fa7fa2f8ff	[deviantart1 update tests]	6 years ago
Mike Fährmann	b7b5456a32	[kissmanga] use HTTPS	6 years ago
Mike Fährmann	259123732f	[readcomiconline] improve comic-page parsing	6 years ago
Mike Fährmann	0328a04a65	[cloudflare] don't output the whole challenge page thanks to the embedded animated gifs this is just a bit too much	6 years ago
Mike Fährmann	4ab0960083	[reddit] add metadata to extracted URLs	6 years ago
Mike Fährmann	2f4f60de33	[tumblr] add tests for each post type	6 years ago
Mike Fährmann	98314aa04c	[mangapark] detect non-existent chapters	6 years ago
Mike Fährmann	6c71e9cf5d	[deviantart] add separate 'sta.sh' extractor (#113 ) - supports multiple stashed deviations per page - explicitly mentions sta.sh support on supportedsites.rst	6 years ago
Mike Fährmann	f9ace0f4a3	[mangapark] fix manga extraction ... again	6 years ago
Mike Fährmann	28f9539551	[tumblr] change default values for post types and inline media	6 years ago
Mike Fährmann	5be95034ba	[tumblr] add option to download avatars (#137 )	6 years ago
Mike Fährmann	7471933d5f	use extractor.request for all other API calls - deviantart - pawoo - pixiv - reddit	6 years ago
Mike Fährmann	995844c915	[instagram] relax test pattern even more	6 years ago
Mike Fährmann	2e5f82e59e	[tumblr] don't follow 'external' Tumblr URLs (#139 )	6 years ago
Mike Fährmann	c5d4f558c9	allow missing field access keys in format strings (#136 )	6 years ago
Mike Fährmann	0c9762f00e	[mangapark] fix extraction	6 years ago
Mike Fährmann	c9ef5ed364	[luscious] ensure URLs have a scheme	6 years ago
Mike Fährmann	851ee9f89f	[sensescans] replace tests the old ones got removed	6 years ago
Mike Fährmann	c14d44e1bc	[downloader:common] retry downloads on SSL errors (#130 )	6 years ago
Mike Fährmann	0be7ee3106	[hitomi] fix image subdomains (closes #142 ) galleries with an ID ending in 1 need some special treatment	6 years ago
Mike Fährmann	fe96835d25	[kissmanga] add fallback for chapter-string parsing (#20 )	6 years ago
Mike Fährmann	4d73cc785d	update test results	6 years ago
Mike Fährmann	049a9575c4	[tumblr] fix inline extraction #2 Using only the "comment" field isn't enough ... [ci skip]	6 years ago
Mike Fährmann	f6bf66f72c	[pixiv] create directory for each "work" item (#136 )	6 years ago
Mike Fährmann	79f6755c60	[postprocessor:classify] handle missing "extension" (#138 )	6 years ago
Mike Fährmann	b7a9f6cc49	[tumblr] improve inline extraction (#137 )	6 years ago
Mike Fährmann	010da8372a	[instagram] relax test pattern	6 years ago
Mike Fährmann	1c6b9ba322	[readcomiconline] use HTTPS	6 years ago
Leonardo Taccari	2655a2ea02	Add support for instagram.com user profiles and pages (#134 ) * [instagram] Add extractor for instagram.com user profiles and pages The extractor scrapes `instagram.com/<user>' timelines and `instagram.com/p/<shortcode>' by mimicking the behaviour of a web browser and extracting the sharedData JSON of the single pages. Please note that this mean that for user timelines we also do an extra request to the `instagram.com/p/<shortcode>' page but this permit to have consistent (and all) information about the media fetched. The MD5 logic used for X-Instagram-GIS was documented in <https://stackoverflow.com/questions/49786980/> * [instagram] Test for keywords, not url for GraphImage and GraphSidecar URLs returned by instagram seems not stable so avoid testing for them and instead test for keyword returned. * [instagram] Improve test of InstagramProfilepageExtractor Also check the count of media returned. * [instagram] Several cleanup and improvements - Change description, subcategories to generate a better description in docs/supportedsite.rst - Remove not needed InstagramExtractor.__init__() - Use text.parse_int() instead of directly using int() (the former is more robust) - Use self.request().json() instead of using json.loads() the self.request().text() - Add `pattern:' to check the URLs where we do not have a stable URLs. It seems that only the subdomain is not stable. Thanks to @mikf!	6 years ago
HRXN	e80ee77d71	tumblr.py: update regex for video (#133 ) There seems to be another sub-domain for videos, apparently.. Not just `vt(.media).tumblr` `vtt(media).tumblr` But also `ve(.media).tumblr`	6 years ago
Mike Fährmann	9a98b6769d	use extractor.request for API calls (#130 ) ... at least for OAuth1.0 based APIs (flickr, smugmug, tumblr)	6 years ago
Mike Fährmann	0225d90078	add exception name and traceback for OSErrors	6 years ago
Mike Fährmann	ad2cefda6b	[tumblr] in case of exception use filename as 'hash' (#129 ) While a filename might not be a real 'hash', or comparable to what tumbler usually provides, it is still better than an empty string. At least as long as "alternatives" in format strings aren't implemented.	6 years ago
Mike Fährmann	95636418ad	[tumblr] catch exception for 'hash' extraction (fixes #129 )	6 years ago
Mike Fährmann	40e30694f3	[pinterest] fix pin.it redirects	6 years ago
Mike Fährmann	770200888e	[gfycat] use public API endpoint	6 years ago
Mike Fährmann	b1e22e8354	release version 1.6.1	6 years ago
Mike Fährmann	5d6e219fb2	[joyreactor] update tests	6 years ago
Mike Fährmann	c59f56fe7e	[gfycat] fix extraction /cajax/get/<id> doesn't work anymore	6 years ago
Mike Fährmann	ba56827f36	[newgrounds] add user-, video-, image-extractors (#119 )	6 years ago
Mike Fährmann	15890930ea	[mangafox] fix extraction use mobile version since desktop version is obfuscated	6 years ago
Mike Fährmann	a4263fb253	[luscious] add extractor for search results (closes #127 )	6 years ago
Mike Fährmann	fb53b5dd55	fix control+c during -j and range tests	6 years ago
Mike Fährmann	a0ae156edc	[pornreactor] add tag-, user-, post-extractors (#114 )	6 years ago
Mike Fährmann	bacbc2e7bd	[joyreactor] try to prevent JsonDecodeErrors (#114 )	6 years ago
Mike Fährmann	503d42a1c2	[joyreactor] add tag-, user-, post-extractors (#114 )	6 years ago
Mike Fährmann	59bb434ba5	[flickr] add ability to download all albums of a user for example with 'https://www.flickr.com/photos/shona_s/albums'	6 years ago
Mike Fährmann	13cb270326	set target directory before postprocessor init (fixes #126 )	6 years ago

1 2 3 4 5 ...

1460 Commits (77551bf01b2a8af15e8f4fbfb5d4c5f3ac3a5ec4)