Mike Fährmann
a453335a9f
remove test results in extractor modules
...
and add generic example URLs
1 year ago
Mike Fährmann
1d2b5d0c60
update test comment positions
...
always put them above the test they're referring to
1 year ago
Mike Fährmann
ba842981af
[imagevenue] fix extraction ( #4473 )
1 year ago
Mike Fährmann
d12a5e440a
update docs/supportedsites
1 year ago
Mike Fährmann
7eab101144
[acidimg] fix extraction
...
swap ' and " again (2e309a13
)
and add a fallback in case this happens yet another time
1 year ago
Mike Fährmann
62fce6a75f
[imagehosts] adjust variable names ( #4358 )
...
prefix them with underscores to prevent a clash
with the new 'self.cookies' from d97b8c2f
1 year ago
Mike Fährmann
2d9e3093ca
merge #4134 : [postimage] add gallery support, update image extractor
1 year ago
chio0hai
2e309a13a7
[acidimg] fix extractor
1 year ago
chio0hai
92178b369c
[postimage] add gallery support, update image extractor to download
...
original image instead of main image
1 year ago
Mike Fährmann
5fb7107f2b
[imxto] fix 'gallery' extraction
...
support both single and double quotes
1 year ago
Mike Fährmann
5b635f2317
[imxto] add 'gallery' extractor ( #1289 )
1 year ago
Mike Fährmann
706ec70e89
[imagetwist] simplify pattern and add tests
2 years ago
Prinz23
29f0830b53
[imagetwist] add phun.imagetwist.com and imagehaha.com alias to imagetwist extractor
2 years ago
Mike Fährmann
5f57a27ba6
[imagetwist] fix extraction
2 years ago
enduser420
54844944ab
[pixhost] add 'gallery' support ( #3353 )
2 years ago
enduser420
93ea8ca8e3
[imxto] extract additional metadata ( #3175 )
2 years ago
Mike Fährmann
b0cb4a1b9c
replace 'text.extract()' with 'text.extr()' where possible
2 years ago
Mike Fährmann
7b5dad075d
[fappic] fix extraction
2 years ago
Mike Fährmann
e0bdacd932
[fappic] add 'image' extractor ( closes #1898 )
3 years ago
Mike Fährmann
bd08ee2859
remove most 'yield Message.Version' statements
...
only leave them in oauth.py as noop results
3 years ago
Mike Fährmann
64240c8d42
[imagevenue] fix extraction
...
(closes #1677 )
3 years ago
Mike Fährmann
204523611c
[imgclick] use 'http://' for image URLs
...
The TLS certificate for main.imgclick.net is invalid.
4 years ago
Mike Fährmann
ca6b0fc2ac
[imagehosts] cleanup
4 years ago
Mike Fährmann
95a66bdad6
[imgclick] add 'image' extractor ( closes #1307 )
...
basically reverts b0e8daf415
4 years ago
Mike Fährmann
e5e591b848
[vipr] simplify and add test ( #1258 )
4 years ago
v-delta
e707e060cb
[vipr] add image extractor ( #1258 )
...
* [vipr] add image extractor
Adds support for images hosted on https://vipr.im
* Fix codestyle issues
4 years ago
Mike Fährmann
1e3dd7330e
merge SharedConfigMixin functionality into Extractor
4 years ago
Mike Fährmann
968d3e8465
remove '&' from URL patterns
...
'/?&#' -> '/?#' and '?&#' -> '?#'
According to https://www.ietf.org/rfc/rfc3986.txt , URLs are
"organized hierarchically" by using "the slash ("/"), question
mark ("?"), and number sign ("#") characters to delimit components"
4 years ago
Mike Fährmann
9be7ff600e
[imagetwist] replace test image
...
the old one expired, it seems
5 years ago
Mike Fährmann
d0920e84e9
update test results
5 years ago
Mike Fährmann
a2af2d2965
adjust cache maxage values
6 years ago
Mike Fährmann
fffbfd3dce
[imgspice] fix extraction
6 years ago
Mike Fährmann
5530871b5a
change results of text.nameext_from_url()
...
Instead of getting a complete 'filename' from an URL and splitting that
into 'name' and 'extension', the new approach gets rid of the complete
version and renames 'name' to 'filename'. (Using anything other than
{extension} for a filename extension doesn't really work anyway)
Example: "https://example.org/path/filename.ext "
before:
- filename : filename.ext
- name : filename
- extension: ext
now:
- filename : filename
- extension: ext
6 years ago
Mike Fährmann
2e516a1e3e
store the full original URL in Extractor.url
6 years ago
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann
abbd45d0f4
update handling of extractor URL patterns
...
When loading extractor classes during 'extractor.find(…)', their
'pattern' attribute will be replaced with a compiled version of itself.
6 years ago
Mike Fährmann
6284731107
simplify extractor constants
...
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
6 years ago
Mike Fährmann
34bab080ae
rewrite URL patterns to use only 1 per extractor
6 years ago
Mike Fährmann
793b24e513
[imagehosts] fix and improve various extractors
6 years ago
Mike Fährmann
6126615698
update URLs for supportedsites.rst
6 years ago
Mike Fährmann
e53cdfd6a8
update build_supportedsites.py
6 years ago
Mike Fährmann
fd8ed35591
[turboimagehost] fix extraction
6 years ago
Mike Fährmann
f3793660ef
update tests
6 years ago
Mike Fährmann
ecdc3475b8
[pixhost] support .to TLDs
6 years ago
Mike Fährmann
95392554ee
use text.urljoin()
6 years ago
Mike Fährmann
564e12ca8f
replace 'imgyt' with 'imxto'
...
https://img.yt/ wasn't available for a couple of days, but has now
re-emerged as https://imx.to/ with a new web-interface.
Links to older images still work (see tests).
7 years ago
Mike Fährmann
7847ab1d5a
[imagehosts] remove even more dead sites
...
All removed sites either
- reject all incoming connections or
- display a message from their domain registrar
7 years ago
Mike Fährmann
179ecee965
[turboimagehost] fix extraction
7 years ago
Mike Fährmann
8f338347b6
[imagehosts] cleanup
...
removed
- chronos.to - unable to resolve hostname
- coreimg.net - same
- imgmaid.net - same
- hosturimage.com - everything returns 404
- imageontime.org - redirects to some shady site
- imgupload.yt - cloudflare error 522, host down
- img4ever.net - read timeout
7 years ago
Mike Fährmann
34873dbd90
set 'archive_fmt' values
...
These are going to be used to create an unique id for each image.
7 years ago