Commit Graph

55 Commits (7d5415c2017440c05046eb53c79f0c9e292235a6)

Author SHA1 Message Date
Mike Fährmann 5227bb6b1d
[text] catch general Exceptions
5 months ago
Mike Fährmann 76581c13f7
handle URLs without '/' after their TLD (#5252)
7 months ago
Mike Fährmann 05255f5be0
add 'default' argument to 'text.extr()'
2 years ago
Mike Fährmann eb33e6cf2d
add 'text.extr()'
2 years ago
Mike Fährmann 67bad04dda
[formatter] add 'g' conversion to sluGify a string (#2410)
2 years ago
Mike Fährmann bddcec49f1
implement 'text.root_from_url()'
3 years ago
Mike Fährmann bc0e853d30
combine KeyError & IndexError to common base class LookupError
3 years ago
Mike Fährmann bc868e7bb8
consider apparently long extensions as part of the filename
3 years ago
Mike Fährmann 387fe415d5
unescape items in text.split_html()
4 years ago
Mike Fährmann 78fd63b8f0
remove 'text.clean_xml()'
4 years ago
Mike Fährmann 8553b218d9
replace calls to 'os.path.splitext()' with 'str.rpartition()'
4 years ago
Mike Fährmann a09f42f6b3
improve filename_from_url() performance
4 years ago
Mike Fährmann 37d71f6e09
strip microseconds in text.parse_datetime()
4 years ago
Mike Fährmann 6294e2c540
add 'text.ensure_http_scheme()'
4 years ago
Mike Fährmann a0f4c295c0
add optional 'utcoffset' argument to 'parse_datetime()'
5 years ago
Mike Fährmann f6c5edb76b
pre-compile regex pattern for remove_html() and split_html()
5 years ago
Mike Fährmann b1bea8aaeb
add 'restrict-filenames' option (#348)
5 years ago
Mike Fährmann 1740086d8a
add 'repl' and 'sep' arguments to text.replace_html()
5 years ago
Mike Fährmann b171befa87
implement 'parse_unicode_escapes()'
5 years ago
Mike Fährmann 2b1999476e
implement 'text.rextract()'
5 years ago
Mike Fährmann 2316e0ed3d
fix strptime workaround from b0e85a4
5 years ago
Mike Fährmann b0e85a42e3
apply workaround from 4736912 in parse_datetime() itself
5 years ago
Mike Fährmann d09864b581
implement text.parse_datetime()
5 years ago
Mike Fährmann 6264a46212
use 'utcfromtimestamp()'
5 years ago
Mike Fährmann d670de0344
implement 'text.parse_timestamp()'
5 years ago
Mike Fährmann 21a7e395a7
implement convenience wrapper for text.extract functionality
6 years ago
Mike Fährmann 8f249f1d54
improve text.extract_iter() performance
6 years ago
Mike Fährmann 5530871b5a
change results of text.nameext_from_url()
6 years ago
Mike Fährmann e1d3e9a926
add 'ext_from_url' to text.py
6 years ago
Mike Fährmann 2d2953a5bf
add 'text.parse_float()' + cleanup in text.py
6 years ago
Mike Fährmann ae9a37a528
implement text.split_html()
6 years ago
Mike Fährmann cc36f88586
rename safe_int to parse_int; move parse_* to text module
7 years ago
Mike Fährmann 4ffa94f634
remove 'shorten_path()' and 'shorten_filename()'
7 years ago
Mike Fährmann 27eab4e467
rewrite text tests and improve functions
7 years ago
Mike Fährmann e3f2bd4087
add tests for 'text.clean_xml()' and improve it
7 years ago
Mike Fährmann 6d8b191ea7
improve 'parse_query()' and add tests
7 years ago
Mike Fährmann 731ffd4986
improve text.filename_from_url() performance
7 years ago
Mike Fährmann f7cdfd4c25
add a simplified version of 'parse_qs'
7 years ago
Mike Fährmann e5f79ae839
[deviantart] add support for all media types
7 years ago
Mike Fährmann ed94d9b92d
fix/improve various things
8 years ago
Mike Fährmann 619c74159a
[seiga] fix file extension and xml parsing
8 years ago
Mike Fährmann 4f123b8513
code adjustments according to pep8
8 years ago
Mike Fährmann 8780abcc77
fix a small spelling error
8 years ago
Mike Fährmann 00074a71d7
several changes to make travis build work
8 years ago
Mike Fährmann 91c446805b
replace platform.system() with os.name
8 years ago
Mike Fährmann 8a49a28d13
replace deprecated 'unescape' method
9 years ago
Mike Fährmann 99b4fbb081
implement text.extract_iter
9 years ago
Mike Fährmann 7fd284a705
always provide lowercase fileextensions
9 years ago
Mike Fährmann ca523b9f64
add helper method to text module
9 years ago
Mike Fährmann d0bebd9ce3
allow adding values to existing dict
9 years ago