Mike Fährmann
7d874e2497
[bluesky] improve API error messages
7 months ago
Mike Fährmann
d921d860f1
automatically create directory path for logging files ( #5249 )
7 months ago
Mike Fährmann
24106d9994
exclude scripts/pyprint.py from linting for Python<3.8
7 months ago
Mike Fährmann
495c9ee126
[bluesky] add 'reposts' option ( #4438 , #5248 )
7 months ago
Mike Fährmann
c8b591303f
[paheal] cleanup
7 months ago
Mike Fährmann
ba062712ad
[tests] '__main__' -> "__main__"
7 months ago
Mike Fährmann
2501adeda0
move 'pprint()' into its own module
...
to reuse its code in create_test_data.py later
rename to 'pyprint' since 'pprint' is already used by stdlib module
7 months ago
Mike Fährmann
8a11b72253
remove extractor/test.py ( #4504 )
7 months ago
Mike Fährmann
fde9e25c9f
[tests:kemonoparty] '.party' -> '.su'
7 months ago
Mike Fährmann
311a21bfb2
[bluesky] fix '/follows' not spawning child extractors ( #5246 )
7 months ago
Mike Fährmann
d3dca68225
[xvideos] fix galleries with more than 500 images ( #5244 )
7 months ago
Mike Fährmann
13443f40a3
[xvideos] support '/channels/' URLs ( #5244 )
7 months ago
Mike Fährmann
c60ebc6519
[deviantart] improve fetching extended metadata ( #5175 )
...
use multiple metadata API calls per chunk of deviations if necessary
7 months ago
Mike Fährmann
cc6b9e4c18
[zerochan] use API by default ( #3669 )
...
add 'pagination' option
7 months ago
Mike Fährmann
efccd3d3d1
merge #5097 : update Dockerfile
...
- remove a layer and reduce image size
- update pip and apk
7 months ago
Mike Fährmann
a2b55d5dde
[skeb] retry 429 responses containing a 'request_key' cookie ( #5210 )
7 months ago
Mike Fährmann
e51ee6b132
fix HttpError.status value
...
'response' with error status code evaluates to False
7 months ago
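The pitfall behind this fix can be sketched as follows. This is a minimal illustration, not the project's code: `FakeResponse` is a hypothetical stand-in that mimics how `requests.Response` defines truthiness, and `status_buggy` / `status_fixed` are illustrative names.

```python
class FakeResponse:
    """Hypothetical stand-in mimicking requests.Response truthiness."""

    def __init__(self, status_code):
        self.status_code = status_code

    def __bool__(self):
        # requests.Response.__bool__ returns `self.ok`,
        # which is False for 4xx and 5xx status codes
        return not 400 <= self.status_code < 600


def status_buggy(response):
    # An error response is falsy, so a 404 is silently reported as 0
    return response.status_code if response else 0


def status_fixed(response):
    # Compare against None explicitly instead of relying on truthiness
    return response.status_code if response is not None else 0
```

With a 404 response, `status_buggy` yields 0 while `status_fixed` yields 404; both yield 0 when no response object exists at all.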
Mike Fährmann
b4c46de4b8
merge #5224 : [artstation] update URL patterns to recognize usernames with dashes
7 months ago
blankie
962f55cc68
[artstation] fix handling usernames with dashes
7 months ago
Mike Fährmann
fe7e2281ac
[nijie] increase default delay between requests ( #5221 )
...
1-2s is not enough
7 months ago
Mike Fährmann
a34312e3ac
[instagram] make accessing 'like_count' non-fatal ( #5218 )
7 months ago
Mike Fährmann
741fd00cec
[deviantart] extend 'metadata' option ( #5175 )
...
allow fetching extended metadata in addition to the usual
'description', 'tags', etc. by setting 'metadata' to a list of
'camera', 'stats', 'submission', 'collection', and 'gallery',
for example "metadata": "stats,submission"
7 months ago
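One way such an option value could be normalized is sketched below. This is an assumption-laden illustration rather than the extractor's actual code: `parse_metadata_option` and `EXTENDED` are hypothetical names, and the handling of non-string, non-list values is a guess.

```python
# Extended metadata categories named in the commit message
EXTENDED = ("camera", "stats", "submission", "collection", "gallery")


def parse_metadata_option(value):
    """Return the requested extended categories as a list (hypothetical helper)."""
    if isinstance(value, str):
        # "stats,submission" -> ["stats", "submission"]
        value = value.split(",")
    elif not isinstance(value, (list, tuple)):
        # Assumption: plain true/false requests no extended categories
        return []
    # Strip whitespace and drop names that are not known categories
    return [name.strip() for name in value if name.strip() in EXTENDED]
```

Under these assumptions, the string form `"stats,submission"` and the list form `["stats", "submission"]` select the same two categories.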
Mike Fährmann
fc46177578
release version 1.26.8
7 months ago
Mike Fährmann
8a63801311
[vsco] add 'spaces' extractor ( #5202 )
...
for spaces listed on a user page
7 months ago
Mike Fährmann
ccb413df71
[wikimedia] support 'pidgi.net' and 'bulbapedia.bulbagarden.net' ( #5205 , #5206 )
7 months ago
Mike Fährmann
7033cc14e9
[vsco] add 'space' extractor ( #5202 )
7 months ago
Mike Fährmann
770aec922d
[fapachi] ignore empty entries
7 months ago
Mike Fährmann
c9efccc959
[tests] update extractor results
7 months ago
Mike Fährmann
c413834dfc
[bluesky] extend tests
7 months ago
Mike Fährmann
ee7c054855
[bluesky] add 'search' extractor ( #4438 )
...
Both https://bsky.app/search?q=QUERY and https://bsky.app/search/QUERY
are recognized as search URLs, where QUERY gets forwarded unmodified as
'q' parameter for app.bsky.feed.searchPosts .
User searches are not supported yet.
7 months ago
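The URL handling described here can be sketched with a single regular expression. The pattern and the `search_query` helper are hypothetical and only illustrate the idea; the extractor's real pattern may differ.

```python
import re

# Hypothetical pattern: accepts both /search?q=QUERY and /search/QUERY
SEARCH_PATTERN = re.compile(r"https://bsky\.app/search(?:/|\?q=)(.+)")


def search_query(url):
    """Return the raw QUERY part of a search URL, or None if it is not one."""
    match = SEARCH_PATTERN.match(url)
    # The query is returned as-is (no unquoting), mirroring "forwarded unmodified"
    return match.group(1) if match else None
```

In this sketch both URL forms produce the same raw query string, which would then be passed along as the 'q' parameter.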
Mike Fährmann
91e5c4fdfe
[bluesky] add 'avatar' and 'background' extractors ( #4438 )
7 months ago
Mike Fährmann
24c1317e0d
[batoto] fix crash when manga/chapter contains a '-' ( #5200 )
7 months ago
Mike Fährmann
0abd9723af
[bluesky] add 'metadata' option ( #4438 )
...
allow extracting 'user' metadata and
make 'facets' extraction optional
7 months ago
Mike Fährmann
7e036ea290
[bluesky] add 'depth' option ( #4438 )
...
and reduce default depth and parentHeight values
7 months ago
Mike Fährmann
42335ea880
[zerochan] fix skipping every other post
7 months ago
Mike Fährmann
c97b92cc35
[fanbox] add 'home' and 'supporting' extractors ( #5138 )
7 months ago
Mike Fährmann
04e4ffc64c
[deviantart] combine 'png' option with 'quality' ( #4846 )
...
"quality": "png" to download PNGs instead of JPEGs
7 months ago
Mike Fährmann
9cc4ec2c58
[deviantart] add 'png' option ( #4846 )
7 months ago
Mike Fährmann
966c8608e6
[deviantart] move image content extraction into separate function
7 months ago
Mike Fährmann
61a50da086
merge #5195 : [pornpics] support multiple 'channel' values
...
i.e. change 'channel' from string to list
use '{channel[0]}' to get the old behavior
7 months ago
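The effect of the string-to-list change on format strings can be illustrated with Python's built-in `str.format`, which supports index access inside replacement fields. The metadata values below are made up, and the sketch assumes the project's format strings behave like `str.format` for this simple case.

```python
# Made-up metadata; 'channel' is now a list instead of a single string
metadata = {"channel": ["Channel A", "Channel B"]}

# '{channel[0]}' selects the first entry, matching the old single-value behavior
first = "{channel[0]}".format(**metadata)

# '{channel}' now renders the whole list
whole = "{channel}".format(**metadata)
```

Here `first` is `Channel A`, whereas `whole` is the list's string representation, `['Channel A', 'Channel B']`.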
Mike Fährmann
1d1ffe3317
[pornpics] update 'channel' extraction & add test
...
change 'channel' to a list, since extracting both 'channel' and
'channels' does not really work with text.extract_from()
7 months ago
cc1234
32472d7d6c
Add support for multi channels
7 months ago
Mike Fährmann
139ff3f6ab
[kemonoparty] add 'posts' extractor ( #5194 )
7 months ago
Mike Fährmann
814ad9321e
[deviantart] skip locked/blurred posts ( #4567 , #5193 )
7 months ago
Mike Fährmann
f7f8ef8684
[twitter] support communities ( #4913 )
7 months ago
Mike Fährmann
8f27f43d4d
[tests] implement explicitly disabling auth
7 months ago
Mike Fährmann
cae77e85f8
[twitter] update query hashes
...
... as well as 'variables' and 'features' values
also remove unused legacy API code
7 months ago
Mike Fährmann
06cb518d97
[bunkr] fix extraction ( #5088 , #5151 , #5153 )
...
- remove legacy code
- map legacy domains to bunkr.sk
- use input URL domain for newer domains
- update tests (some files got slightly modified or deleted)
7 months ago
Mike Fährmann
dcc6e3f65c
merge #5134 : [bunkr] add new bunkr domains ( #5130 )
7 months ago
Mike Fährmann
4641937ca3
[imagetwist] add 'gallery' extractor ( #5190 )
7 months ago