gallery-dl

Commit Graph

Author	SHA1	Message	Date
Leonardo Taccari	1e38f65996	[instagram] Add support for GraphSidecar media types (#201) * [instagram] Add support for GraphSidecar media types Refactor _extract_postpage() to always return a list of medias. Fetch common keywords and gracefully handle GraphSidecar media type by extracting each single media and adding `sidecar_media_id' and `sidecar_shortcode' keywords to indicate the parent of sidecar childrens. While here join the copyright comment lines in a single one. Closes #178. * [instagram] Use `yield from' instead of `for ... yield' (thanks @mikf)! * [instagram] Adjust filename for GraphSidecar medias Add a possible leading `media_id' of the sidecar for GraphSidecar media. Thanks to @mikf for the suggestion! * [instagram] Add extra metadata for youtube-dl in GraphSidecar childrens GraphSidecar children ytdl: URLs when consumed by youtube-dl redirects to the URL of their parent. In GraphSidecar-s with multiple GraphVideo-s this leads to downloading the same video multiple times. Add a `_ytdl_index' field to indicate the index of the youtube-dl playlist corresponding the children of the sidecar. This will be used by the `ytdl' downloader.	6 years ago
Mike Fährmann	6ba67b0537	[hypnohub] add extractors (closes #196)	6 years ago
Mike Fährmann	fe27154a10	[komikcast] fix extraction ... again	6 years ago
Mike Fährmann	5ec55ec4fc	[deviantart] improve URLs for non-downloadable deviations	6 years ago
Mike Fährmann	c7a6b0ed90	[deviantart] add 'metadata' option (#189)	6 years ago
Mike Fährmann	8d96a8ce4c	[500px] add user-, gallery-, and image-extractors (#185)	6 years ago
Mike Fährmann	d0f88c35be	[komikcast] fix extraction	6 years ago
Mike Fährmann	6277a739e4	[35photo] add user-, genre-, and image-extractors (#162)	6 years ago
Mike Fährmann	fb14f80d62	[tumblr] fix avatar URLs for non-OAuth1.0 calls (closes #193)	6 years ago
Mike Fährmann	973a720a7a	[weibo] fix unit test URL patterns	6 years ago
Mike Fährmann	a2af2d2965	adjust cache maxage values	6 years ago
Mike Fährmann	f612284d24	cache cfclearance cookies	6 years ago
Mike Fährmann	591a07f20c	small code changes and cleanups	6 years ago
Mike Fährmann	6f57d44ec2	[seaotterscans] remove extractor http://seaotterscans.com/ now redirects to their MangaDex profile	6 years ago
Mike Fährmann	6dae6bee37	automatically detect and bypass cloudflare challenge pages TODO: cache and re-apply cfclearance cookies	6 years ago
Mike Fährmann	25aaf55514	[smugmug] improve format selection (closes #183) - use original image if available - support video formats - remove user info for ImageExtractor (it is no longer possible to get image owner information for a single image)	6 years ago
Mike Fährmann	7c1cb923a4	[myportfolio] replace unit test the old gallery got removed	6 years ago
Mike Fährmann	fffbfd3dce	[imgspice] fix extraction	6 years ago
Mike Fährmann	4ca4631bad	simplify auto-disabling certificate verification if no certificate bundle is found	6 years ago
Mike Fährmann	09d872a2b1	generalize extractor creation code	6 years ago
Mike Fährmann	8dc6be246b	[shopify] add custom retry logic for 430 status codes (#175)	6 years ago
Mike Fährmann	0887fb61f4	[komikcast] update test results	6 years ago
Mike Fährmann	976ccb267f	[myportfolio] combine gallery and user extractors An URL alone isn't good enough to distinguish between a gallery or a gallery-listing, so the new extractor decides what to do based on the page's content.	6 years ago
Mike Fährmann	efd104e45e	[instagram] reject more non-user URLs (#180)	6 years ago
HRXN	56e0e92e0d	[shopify] cosmetic changes in shopify.py (#181) Glanced over the commits, randomly spotted some minor things.	6 years ago
Mike Fährmann	9c0e2f294b	[shopify] add generic collection and product extractors (#175) with fashionnova.com as a default domain	6 years ago
Mike Fährmann	26c4365baa	adjust metadata types for GalleryExtractors	6 years ago
Mike Fährmann	13e0f2a78f	[deviantart] add 'scraps' extractor (closes #168)	6 years ago
Mike Fährmann	3ea11f5d5e	[nhentai] rewrite - use GalleryExtractor as base class - extract a lot more metadata (artist, tags, etc.)	6 years ago
Mike Fährmann	3595cd582f	use GalleryExtractor as common base class	6 years ago
Mike Fährmann	a138d5873d	[hentaifoundry] improve/fix extraction - Sometimes an ad interfered when trying to get a download URL - Resolving "www.hentai-foundry.com" yields an invalid(?) IPv6 address (2607:5300:60:ca9e:feed:dead:beef:1) and urllib3 only tries to connect to the IPv4 variant after a rather long wait time	6 years ago
Mike Fährmann	280531c8ff	[pururin] add gallery extractor (closes #174)	6 years ago
Mike Fährmann	3159dd79d5	[seiga] use HTTPS	6 years ago
Mike Fährmann	f6734142ee	[komikcast] remove 'width' and 'height' info	6 years ago
Mike Fährmann	d0059cab79	[tumblr] check for null URLs (closes #165)	6 years ago
Mike Fährmann	e687a6095e	[luscious] raise exception if album is not available	6 years ago
Mike Fährmann	22d3a2fcc8	[artstation] add extractor for artwork listings (#80) like https://www.artstation.com/artwork?sorting=latest or https://www.artstation.com/artwork?sorting=picks	6 years ago
Mike Fährmann	937a802b49	[dynastyscans] add extractors for images and image searches (closes #163)	6 years ago
Mike Fährmann	b09a8184ca	move TestJob into test module; test _extractor values	6 years ago
Mike Fährmann	19860655a3	[weibo] add 'user' and 'status' extractors	6 years ago
Mike Fährmann	f8782c05f2	[paheal] rename "tags" to "search_tags" to better match field names of other booru extractors	6 years ago
Mike Fährmann	c7b8421333	[deviantart] don't match 'www' as a potential username	6 years ago
Mike Fährmann	5530871b5a	change results of text.nameext_from_url() Instead of getting a complete 'filename' from an URL and splitting that into 'name' and 'extension', the new approach gets rid of the complete version and renames 'name' to 'filename'. (Using anything other than {extension} for a filename extension doesn't really work anyway) Example: "https://example.org/path/filename.ext" before: - filename : filename.ext - name : filename - extension: ext now: - filename : filename - extension: ext	6 years ago
Mike Fährmann	32edf4fc7b	add '_extractor' info to manga extractor results	6 years ago
Mike Fährmann	89ee8cd7e4	filter "private" kwdict entries	6 years ago
Mike Fährmann	61741d7333	provide type information for Queue messages Child extractors are now directly constructed with Extractor.from_url() if the extractor class is known beforehand, instead of using extractor.find() and searching through all possible extractor classes.	6 years ago
Mike Fährmann	2e516a1e3e	store the full original URL in Extractor.url	6 years ago
Mike Fährmann	580baef72c	change Chapter and MangaExtractor classes - unify and simplify constructors - rename get_metadata and get_images to just metadata() and images() - rename self.url to chapter_url and manga_url	6 years ago
Mike Fährmann	4b1880fa5e	propagate 'match' to base extractor constructor	6 years ago
Mike Fährmann	ade86da7a1	[tsumino] replace test	6 years ago
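
The result change described in commit 5530871b5a (text.nameext_from_url()) can be illustrated with a small sketch. This is not gallery-dl's actual implementation: the helper names `split_old` and `split_new` are hypothetical and only reproduce the before/after key layout given in the commit message, using the standard library.

```python
import os.path
from urllib.parse import unquote, urlsplit


def _last_path_segment(url):
    # Hypothetical helper: take the final path component of the URL.
    return unquote(urlsplit(url).path.rpartition("/")[2])


def split_old(url):
    # Old layout per the commit message: the complete file name is kept
    # as 'filename' and additionally split into 'name' and 'extension'.
    full = _last_path_segment(url)
    name, ext = os.path.splitext(full)
    return {"filename": full, "name": name, "extension": ext[1:]}


def split_new(url):
    # New layout per the commit message: the complete version is dropped
    # and the former 'name' is now called 'filename'.
    name, ext = os.path.splitext(_last_path_segment(url))
    return {"filename": name, "extension": ext[1:]}


url = "https://example.org/path/filename.ext"
print(split_old(url))
print(split_new(url))
```

With the new layout, a filename format string would combine {filename} and {extension} explicitly, which matches the commit's remark that only {extension} works reliably as a file extension.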


1160 Commits (e47a24afc7ff61b834a27eb851abeaa1d7334cfb)