Mike Fährmann
6996f5c118
[mangahere] fix and improve chapter extraction
6 years ago
Mike Fährmann
f3d770d4e2
Merge branch '1.4-dev'
6 years ago
Mike Fährmann
f43d446692
[mangahere] extract chapter titles
6 years ago
Mike Fährmann
95392554ee
use text.urljoin()
6 years ago
Mike Fährmann
cc36f88586
rename safe_int to parse_int; move parse_* to text module
7 years ago
Mike Fährmann
d11fcf4804
smaller changes and fixes
- fix the cloudflare challenge result if the last decimal places
are zero (JS's toFixed() removes trailing zeroes)
- fix downloading of kissmanga chapter-pages hosted on blogspot
(accessing blogspot with "kissmanga.com" as referrer yields a 401)
- disable certificate validation for 'mangahere' tests
- update flickr test result
7 years ago
Mike Fährmann
5b3c34aa96
use generic chapter-extractor in more modules
7 years ago
Mike Fährmann
8102aae311
[mangahere] support ".cc" TLD and mobile URLs
7 years ago
Mike Fährmann
305da540c3
[mangahere] fix metadata extraction
7 years ago
Mike Fährmann
633b376f35
improve/adjust default filename formats for manga sites
7 years ago
Mike Fährmann
1ab4c7986f
[mangahere] fix extraction
would switch to HTTPS, but there seem to be certificate issues
7 years ago
Mike Fährmann
9fc1d0c901
implement and use 'util.safe_int()'
same as Python's 'int()', except it doesn't raise any exceptions and
accepts a default value
7 years ago
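The behavior described for 'util.safe_int()' in commit 9fc1d0c901 (like 'int()', but it never raises and it accepts a default value) can be illustrated with a minimal sketch. The signature, the default of 0, and the set of caught exceptions are assumptions made for illustration; the commit message does not spell them out.

```python
def safe_int(value, default=0):
    """Convert 'value' to int; return 'default' if the conversion fails.

    Hypothetical sketch: the parameter names and the default of 0 are
    assumptions, not taken from the commit itself.
    """
    try:
        return int(value)
    except (ValueError, TypeError, OverflowError):
        # ValueError: non-numeric string; TypeError: e.g. None;
        # OverflowError: e.g. float("inf")
        return default


chapter = safe_int("12")         # numeric string converts normally
missing = safe_int(None)         # falls back to the default
fallback = safe_int("n/a", -1)   # caller-supplied default
```

A helper like this keeps metadata parsing from aborting when a scraped field is empty or malformed.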
Mike Fährmann
d39b8779af
[mangahere] extract manga metadata
7 years ago
Mike Fährmann
6f30cf4c64
change keyword names to valid Python identifiers
This commit mostly replaces all minus-signs ('-') in keyword names with
underscores ('_') to allow them to be used in filter-expressions. For
example 'gallery-id' got renamed to 'gallery_id'.
(It is theoretically possible to access any variable, regardless of its
name, with 'locals()["NAME"]', but that seems a bit too convoluted if
just 'NAME' could be enough)
7 years ago
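The motivation given in commit 6f30cf4c64 can be shown with a short sketch: a name such as 'gallery-id' is parsed by Python as the subtraction 'gallery - id', whereas 'gallery_id' is a plain identifier that a filter expression can use directly. The helper name 'normalize_keywords' and the way the expression is evaluated here are assumptions for illustration only, not the project's actual implementation.

```python
def normalize_keywords(keywords):
    """Return a copy of 'keywords' with '-' replaced by '_' in every name.

    Hypothetical helper, used only to illustrate the rename.
    """
    return {name.replace("-", "_"): value for name, value in keywords.items()}


metadata = normalize_keywords({"gallery-id": 12345, "title": "example"})

# With valid identifiers, a filter expression can refer to the name directly.
# Evaluating against the metadata dict is an assumed mechanism for this sketch.
expression = "gallery_id > 10000"
matches = eval(expression, {"__builtins__": {}}, metadata)
```

With the old name, the same expression would have needed the 'locals()["gallery-id"]' workaround mentioned in the commit message.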
Mike Fährmann
7aa9fa796a
code cleanup and fixes
7 years ago
Mike Fährmann
f226417420
simplify code by using a MangaExtractor base class
7 years ago
Mike Fährmann
94e10f249a
code adjustments according to pep8 nr2
8 years ago
Mike Fährmann
4c55275305
update tests
8 years ago
Mike Fährmann
56d810c896
update keyword hashes for tests
8 years ago
Mike Fährmann
19c2d4ff6f
remove explicit (sub)category keywords
8 years ago
Mike Fährmann
d7e168799d
consistent extractor naming scheme + docstrings
8 years ago
Mike Fährmann
2faa7393b1
[mangahere] adjust for image domain
8 years ago
Mike Fährmann
0736fe29e2
[mangahere] fix parsing
9 years ago
Mike Fährmann
ba99506c72
more extractor test-cases
9 years ago
Mike Fährmann
f7c47a6018
add subcategories to extractors
9 years ago
Mike Fährmann
1497da07de
remove unused format-strings
9 years ago
Mike Fährmann
f48712c9c9
docstrings
9 years ago
Mike Fährmann
914062d172
use text.extract_iter where applicable
9 years ago
Mike Fährmann
332d9e393b
[mangahere] support sub-chapters (e.g. ch4.5)
9 years ago
Mike Fährmann
88739a3564
[mangahere] add manga-extractor
9 years ago
Mike Fährmann
d1673d912a
[mangahere] add chapter-extractor
9 years ago