Mike Fährmann
2007cb2f59
[tests] check extractor category values
8 months ago
Mike Fährmann
93b4120e77
[gelbooru] support 'all' and empty tag ( #5076 )
8 months ago
Mike Fährmann
a416d4c3d5
[sankaku] support post URLs with alphanumeric IDs ( #5073 )
8 months ago
Mike Fährmann
ea553a1d55
[wikimedia] generalize ( #1443 )
...
- support mediawiki.org
- support mariowiki.com (#3660 )
- combine code into a single extractor
(use prefix as subcategory)
- handle non-wiki instances
- unescape titles
8 months ago
Mike Fährmann
c3c1635ef3
[wikimedia] update
...
- rewrite using BaseExtractor
- support most Wiki* domains
- update docs/supportedsites
- add tests
8 months ago
Mike Fährmann
3d68eda4ab
[kemonoparty] add 'revision_hash' metadata ( #4706 , #4727 , #5013 )
...
A SHA1 hexdigest of other relevant metadata fields like
title, content, file and attachment URLs.
This value does NOT reflect which revisions are listed on the website.
Neither does 'edited' or any other metadata field (combinations).
8 months ago
Mike Fährmann
799a8206ad
merge #5061 : [webtoons] extract more metadata
...
- author_name
- comic_name
- episode_name
- username
8 months ago
Mike Fährmann
8ffa0cd3c8
[webtoons] small optimization
...
don't extract the entire 'author_area' and
avoid creating a second 'text.extract_from()' object
8 months ago
Mike Fährmann
68196589c4
[2ch] update
...
- simplify extractor code
- more metadata
- add tests
8 months ago
Mike Fährmann
69726fc82c
[tests] skip tests requiring auth when non is provided
8 months ago
blankie
bb446b1598
[webtoons] extract more metadata
8 months ago
Mike Fährmann
355b909f46
merge #5041 : [steamgriddb] add support ( #5033 )
8 months ago
Mike Fährmann
71e2c3e5a2
merge #5037 : [hatenablog] add support ( #5036 )
8 months ago
Mike Fährmann
b97af09e03
[tests] include URL in failure report
8 months ago
Mike Fährmann
58e0665fbc
[tests] load config from external file
8 months ago
Mike Fährmann
2dcfb012ea
[patreon] download 'm3u8' manifests with ytdl
8 months ago
Mike Fährmann
2191e29e14
[nijie] fix image URL for single image posts ( #5049 )
8 months ago
Mike Fährmann
39904c9e4e
[deviantart:avatar] add 'formats' option ( #4995 )
8 months ago
Mike Fährmann
887ade30a5
[batoto] support more mirror domains ( #5042 )
8 months ago
blankie
2ccb7d3bd3
[steamgriddb] add support
8 months ago
blankie
2cfe788f93
[hatenablog] fix extractor naming errors
9 months ago
blankie
61f3b2f820
[hatenablog] add support
9 months ago
Mike Fährmann
657ed93a22
[batoto] improve v2 manga URL pattern
...
and add tests
9 months ago
Mike Fährmann
33f228756a
[mangadex] add 'list' extractor ( #5025 )
...
supports listing manga and chapters from list feed
9 months ago
Mike Fährmann
c25bdbae91
[komikcast] fix 'manga' extractor ( #5027 )
9 months ago
Mike Fährmann
8e1a2b5446
[komikcast] update domain to 'komikcast.lol' ( #5027 )
9 months ago
Mike Fährmann
a441249ea2
merge #4979 : [batoto] add 'chapter' and 'manga' extractors ( #1434 , #2111 )
9 months ago
Mike Fährmann
b11c352d66
[bato] rename to 'batoto'
...
to use the same category name as the previous bato.to site
9 months ago
Mike Fährmann
3aa24c3744
[bato] simplify and update
9 months ago
Mike Fährmann
11150a7d72
[nudecollect] remove module
9 months ago
Mike Fährmann
c158927c38
merge #5016 : [zzup] add 'gallery' extractor ( #4517 , #4604 , #4659 , #4863 )
9 months ago
Mike Fährmann
217fa7f8a1
include 'test/results' in flake8 checks
9 months ago
Mike Fährmann
e61f016465
[szurubooru] support 'snootbooru.com' ( #5023 )
9 months ago
Mike Fährmann
b4bcf40278
[weibo] fix AttributeError in 'user' extractor ( #5022 )
...
yet another bug caused by a383eca7
9 months ago
Mike Fährmann
0ab0a10d2d
[jpgfish] update domain
9 months ago
enduser420
0f30136109
[zzup] add 'gallery' extractor
9 months ago
Mike Fährmann
7eaf648f2e
[fanbox] add 'metadata' option ( #4921 )
...
extracts 'plan' and extended 'user' metadata
9 months ago
Mike Fährmann
4f3671458e
[deviantart] add 'avatar' and 'background' extractors ( #4995 )
9 months ago
Mike Fährmann
63f649cd92
[idolcomplex] fix extraction & update URL patterns ( #5002 )
9 months ago
Mike Fährmann
7aa1c9671b
[tests] fix 'invalid escape sequence' warnings
9 months ago
Mike Fährmann
b6903a4c90
[nijie] add 'count' metadata field
...
https://github.com/mikf/gallery-dl/issues/146#issuecomment-1812849102
9 months ago
Mike Fährmann
b93b351db9
merge #4962 : [poringa] add support ( #4675 )
9 months ago
Mike Fährmann
9f21c839ad
[poringa] improvements and fixes
...
- add 'num' and 'count' metadata fields
- prevent crash for "private" posts
- prevent crash when there's no 'main-info'
- update tests
9 months ago
Mike Fährmann
caceb14fc2
[tests] fail when a results file contains syntax errors
...
or is otherwise not importable
9 months ago
Mike Fährmann
085411f3f1
[rule34] recognize URLs with 'www' subdomain ( #4984 )
9 months ago
Antonio
e348da7a06
[poringa] add support
9 months ago
bug-assassin
74c225f94e
[bato] add support
9 months ago
Mike Fährmann
f9544194c0
[paheal] restore 'extension' metadata ( #4976 )
9 months ago
Mike Fährmann
77d46e6f0c
[lynxchan] update 'bbw-chan' domain ( #4970 )
9 months ago
Mike Fährmann
108c978073
merge #4919 : [postmill] add support ( #4917 )
9 months ago
Mike Fährmann
2a60645095
[deviantart] set 'is_original' for intermediary URLs to 'false'
9 months ago
Mike Fährmann
01bb75f6cb
merge #4945 : {shimmie2[ support 'rule34hentai.net' ( #861 , #4789 )
9 months ago
Mike Fährmann
79e4606893
[rule34hentai] cleanup
...
- fix using 'self._posts_rule34hentai'
- fix 'file_url' for posts
- update docs/supportedsites
- add tests
9 months ago
Mike Fährmann
627ed794a2
[danbooru] provide 'tags' as list ( #4942 )
...
keep the old 'tag_string' values around, similar to sankaku
a lot of repeat code ...
would be a lot less bad if "".split(" ") returned an empty list
9 months ago
Mike Fährmann
99aa923322
[inkbunny] improve '/submissionsviewall.php' patterns ( #4934 )
...
allow 'mode=…' to be in any position
don't require it to be somewhere in the middle
9 months ago
Mike Fährmann
3f9c113d78
[mastodon] Support non-numeric status IDs ( #4936 )
9 months ago
Mike Fährmann
2852404e49
[inkbunny] add 'unread' extractor ( #4934 )
9 months ago
Mike Fährmann
a37b7759bc
[myhentaigallery] recognize '/g/' URLs ( #4920 )
9 months ago
blankie
fbe14a2745
[postmill] add support
9 months ago
Mike Fährmann
bf74eb5c46
merge #4886 : [urlgalleries] add 'gallery' extractor ( #919 , #1184 , #2905 )
10 months ago
Mike Fährmann
ade93c5397
[urlgalleries] add tests
10 months ago
Mike Fährmann
4eb3590103
[nijie] fix image URLs of multi-image posts ( #4876 )
10 months ago
Mike Fährmann
c83fbe6c2d
merge #4855 : [nitter] fix video extraction ( #4853 )
10 months ago
Mike Fährmann
1137d72d48
[tests] skip test_init for BaseExtractor classes without instances
10 months ago
Mike Fährmann
625e94fa7d
update extractor test results
...
still not everything, but good enough for now
10 months ago
enduser420
1e9bacd169
[nitter] fix video extraction
10 months ago
Mike Fährmann
95c1dfb089
[tests] swap assertEqual argument order
...
before this, it would show test failures as
+ test value
- extracted value
when it should be the other way round
10 months ago
Mike Fährmann
bdb3ce7217
[foolslide] remove 'powermanga.org'
10 months ago
Mike Fährmann
f9dac43be9
[warosu] fix file URLs
10 months ago
Mike Fährmann
645b4627ef
[sankaku] update URL patterns
10 months ago
Mike Fährmann
119755a5a3
[tests] implement skipping/failing tests when pressing ctrl+c
10 months ago
Mike Fährmann
1ae43d8123
merge #4841 : [fapello] support '.su' TLD ( #4840 )
10 months ago
Mike Fährmann
e1404827a6
[pixeldrain] add 'file' and 'album' extractors ( #4839 )
10 months ago
enduser420
2402162e8a
[fapello] support '.su' TLD
10 months ago
Mike Fährmann
725c8dd55a
[tmohentai] 'categories' -> 'genres'
...
quite likely that the site meant 'genres' by "Genders"
10 months ago
Mike Fährmann
ce7c4cb544
merge #4832 : [tmohentai] add 'gallery' extractor ( #4808 )
10 months ago
Mike Fährmann
c4a201ed42
[tmohentai] simplify + tests
10 months ago
Mike Fährmann
e17a48fe56
[blogger] inherit from BaseExtractor
...
- support www.micmicidol.club (#4759 )
10 months ago
Mike Fährmann
0fa85360a0
merge #4812 : [erome] add 'count' metadata field
10 months ago
Mike Fährmann
a43cf78bb7
[erome] tests
10 months ago
Mike Fährmann
07cb584231
[behance] add 'modules' option ( #4799 )
10 months ago
Mike Fährmann
ea78f67860
[downloader:http] skip files not passing filesize-min/-max ( #4821 )
...
instead of failing the download
10 months ago
Mike Fährmann
3f591d5a4e
[mastodon] update test results
10 months ago
Mike Fährmann
6402f2950f
[pp:metadata] ignore non-string tag values ( #4764 )
11 months ago
Mike Fährmann
007c433677
[patreon] support 'id:<campaign_id>' in place of a user name
...
https://patreon.com/id:12345
… and remove 'campaign-id' config option
11 months ago
Mike Fährmann
43a3d93467
merge #4755 : [twitter] recognize fixupx.com URLs
11 months ago
Mike Fährmann
cdf77e326f
[twitter] add test for fixupx.com
11 months ago
Mike Fährmann
fc8f86bf24
[hitomi] recognize 'imageset' gallery URLs ( #4756 )
11 months ago
Mike Fährmann
72b18d701f
represent util.NONE as 'null' in JSON output
...
was '"None"' before
11 months ago
Mike Fährmann
68e72a836c
[exhentai] fix extraction ( #4730 )
...
- update to new API response layout
- use proper API server URL
- fix 'filesize' metadata
11 months ago
Mike Fährmann
fd8f58ad76
[behance] unescape embed URLs ( #4742 )
11 months ago
Mike Fährmann
c9a2be36d4
[sankaku] support '/posts/' tag search URLs ( #4740 )
11 months ago
Mike Fährmann
218295a4c6
[twitter] fix avatars without 'date' information ( #4696 )
11 months ago
Mike Fährmann
d0effcae20
[kemonoparty] add 'revision_index' metadata field ( #4727 )
11 months ago
Mike Fährmann
3bbaa875f1
[kemonoparty] fix parsing of non-standard 'dates' ( #4676 )
11 months ago
Mike Fährmann
a09df34bcf
merge #4714 : [4archive] add 'thread' and 'board' extractors
...
(#1262 , #2418 , #4400 , #4710 )
11 months ago
enduser420
acb713b95a
[4archive] update
11 months ago
Mike Fährmann
6766877524
merge #4693 : [reddit] support Reddit Mobile share links
11 months ago
Mike Fährmann
1042278bec
[misskey] support 'misskey.design' ( #4713 )
11 months ago
enduser420
c0714d5585
[4archive] add 'thread' and 'board' extractors
11 months ago