Icedream
2508971be1
parsers/youtube: Fix #11 by using a full reset in the template.
2016-07-06 04:16:26 +02:00
Icedream
2608df9727
parsers/youtube: Add NSFW marker for age-restricted videos.
2016-07-05 16:18:14 +02:00
Icedream
b61927108b
parsers/youtube: Fix information for channels.
2016-07-05 16:13:03 +02:00
Icedream
f97c872b2e
parsers/wikipedia: Fix HTTP(S) URL filter logic.
2016-07-03 19:03:32 +02:00
Icedream
6de3faa8e0
parsers/wikipedia: Only accept HTTP(S) links.
2016-07-03 18:59:13 +02:00
Icedream
7a131adfb8
parsers/wikipedia: Fix handling of www.wikipedia.org and wikipedia.org links.
2016-07-03 18:57:04 +02:00
Icedream
d6a32315f6
parsers/web: Remove extra logging.
2016-06-20 02:45:29 +02:00
Icedream
5c5f5ef478
parsers/web: Compare URLs by their string representations instead.
...
The Path can be different in that the original URL is missing the "/" at the beginning, however the resulting URL may very well contain a "/" at the beginning. In the resulting string representation this doesn't make any difference.
2016-06-20 02:44:51 +02:00
Icedream
dc5597c054
parsers/web: Remove hash reference when parsing URL.
...
Fixes #8 .
2016-06-20 02:43:30 +02:00
Icedream
280da493fb
parsers/web: Add test functions.
2016-06-20 02:30:27 +02:00
Icedream
2163bfc99f
parsers/web: Remove extra logging lines.
2016-06-20 02:30:10 +02:00
Icedream
ae1dce4bce
New-line fix caused extra spaces between each character.
2016-06-19 23:34:57 +02:00
Icedream
8696313f8e
Move "(no title)" text to its own constant noTitleStr.
...
Main purpose is for making integration testing easier later.
2016-06-19 23:32:27 +02:00
Icedream
ec899f0ddf
Replace new-line characters in HTML title with space.
...
Targets #2 .
2016-06-19 23:31:57 +02:00
Icedream
6775fe5100
parsers/web: Limit HTML parsing to first 8 kB and use Content-Length header.
...
Targets #2 .
2016-06-19 23:09:22 +02:00
Icedream
4a60f6142a
Initial commit.
...
Source code not properly documented yet and barely any testing but this is the code as running on the server at the moment.
2016-06-11 16:08:13 +02:00