Mike Fährmann
c5e3971b18
[newgrounds] extract image embeds ( closes #1033 )
4 years ago
Mike Fährmann
f9c1684af7
[newgrounds] restore original video URLs ( #1042 )
4 years ago
Mike Fährmann
5b927c15df
[newgrounds] fix video extraction ( closes #1042 )
4 years ago
Mike Fährmann
e17d4f44f6
[newgrounds] fix favorites extraction
4 years ago
Mike Fährmann
6294e2c540
add 'text.ensure_http_scheme()'
4 years ago
Mike Fährmann
c56a751dae
[newgrounds] fix URLs produced by 'followng' extractors ( #684 )
4 years ago
Mike Fährmann
9b194520db
[newgrounds] add 'following' extractor ( closes #684 )
4 years ago
Mike Fährmann
ae2a33243b
[newgrounds] catch general Exceptions
5 years ago
Mike Fährmann
87d4f83597
[newgrounds] make post extraction nonfatal
5 years ago
Mike Fährmann
823fbeaae6
[newgrounds] add 'favorite' extractor ( #394 )
5 years ago
Mike Fährmann
4e361b3008
add tests for specific datetime values
5 years ago
Mike Fährmann
5ad92fc196
[newgrounds] fix tags metadata extraction
5 years ago
Mike Fährmann
42b9633c7e
update test results
5 years ago
Mike Fährmann
d45fabb79d
match user profile handling on deviantart and newgrounds
5 years ago
Mike Fährmann
b1f0609de5
[newgrounds] rewrite ( #394 )
...
- restructure extractor hierarchy
- extract more metadata
- extract videos without youtube-dl
- be more resilient to errors
TODO:
- favorites
- games, but that might be near impossible for non-flash titles
5 years ago
Mike Fährmann
3ece3976ae
[newgrounds] implement login support ( #394 )
5 years ago
Mike Fährmann
3a07c06865
[newgrounds] update
...
- create directory per post
- rename variables and methods
5 years ago
Mike Fährmann
a732e9c430
[instagram] update query hashes and headers
5 years ago
Mike Fährmann
1133b7fcbd
[smugmug] update unit tests
...
The account used for tests before has been deleted.
5 years ago
Mike Fährmann
04b8d0894a
[newgrounds] improve metadata extraction
5 years ago
Mike Fährmann
b89f0d8d3c
update extractor result tests
5 years ago
Mike Fährmann
74c7304c6b
[newgrounds] extract 'date', 'favorites', and 'score'
5 years ago
Mike Fährmann
5530871b5a
change results of text.nameext_from_url()
...
Instead of getting a complete 'filename' from an URL and splitting that
into 'name' and 'extension', the new approach gets rid of the complete
version and renames 'name' to 'filename'. (Using anything other than
{extension} for a filename extension doesn't really work anyway)
Example: "https://example.org/path/filename.ext "
before:
- filename : filename.ext
- name : filename
- extension: ext
now:
- filename : filename
- extension: ext
6 years ago
Mike Fährmann
4b1880fa5e
propagate 'match' to base extractor constructor
6 years ago
Mike Fährmann
6284731107
simplify extractor constants
...
- single strings for URL patterns
- tuples instead of lists for 'directory_fmt' and 'test'
- single-tuple tests where applicable
6 years ago
Mike Fährmann
a50e9faf0e
[newgrounds] recognize direct links
6 years ago
Mike Fährmann
66460337f1
[mangapark] fix extraction
6 years ago
Mike Fährmann
2fbf072723
[newgrounds] ensure consistent tag order
...
... plus some code restructuring
6 years ago
Mike Fährmann
98314aa04c
[mangapark] detect non-existent chapters
6 years ago
Mike Fährmann
ba56827f36
[newgrounds] add user-, video-, image-extractors ( #119 )
6 years ago