Commit Graph

17 Commits (b37a15ac1f29a898699180ffd90a796798a6ba63)

Author SHA1 Message Date
Icedream b37a15ac1f
Introduce "--images" flag to enable image link parsing and disable image link parsing by default. 2016-07-26 20:36:36 +02:00
Icedream 2508971be1 parsers/youtube: Fix #11 by using a full reset in the template. 2016-07-06 04:16:26 +02:00
Icedream 2608df9727 parsers/youtube: Add NSFW marker for age-restricted videos. 2016-07-05 16:18:14 +02:00
Icedream b61927108b parsers/youtube: Fix information for channels. 2016-07-05 16:13:03 +02:00
Icedream f97c872b2e parsers/wikipedia: Fix HTTP(S) URL filter logic. 2016-07-03 19:03:32 +02:00
Icedream 6de3faa8e0 parsers/wikipedia: Only accept HTTP(S) links. 2016-07-03 18:59:13 +02:00
Icedream 7a131adfb8 parsers/wikipedia: Fix handling of www.wikipedia.org and wikipedia.org links. 2016-07-03 18:57:04 +02:00
Icedream d6a32315f6 parsers/web: Remove extra logging. 2016-06-20 02:45:29 +02:00
Icedream 5c5f5ef478 parsers/web: Compare URLs by their string representations instead.
The Path can be different in that the original URL is missing the "/" at the beginning, however the resulting URL may very well contain a "/" at the beginning. In the resulting string representation this doesn't make any difference.
2016-06-20 02:44:51 +02:00
Icedream dc5597c054 parsers/web: Remove hash reference when parsing URL.
Fixes #8.
2016-06-20 02:43:30 +02:00
Icedream 280da493fb parsers/web: Add test functions. 2016-06-20 02:30:27 +02:00
Icedream 2163bfc99f parsers/web: Remove extra logging lines. 2016-06-20 02:30:10 +02:00
Icedream ae1dce4bce New-line fix caused extra spaces between each character. 2016-06-19 23:34:57 +02:00
Icedream 8696313f8e Move "(no title)" text to its own constant noTitleStr.
Main purpose is for making integration testing easier later.
2016-06-19 23:32:27 +02:00
Icedream ec899f0ddf Replace new-line characters in HTML title with space.
Targets #2.
2016-06-19 23:31:57 +02:00
Icedream 6775fe5100 parsers/web: Limit HTML parsing to first 8 kB and use Content-Length header.
Targets #2.
2016-06-19 23:09:22 +02:00
Icedream 4a60f6142a Initial commit.
Source code not properly documented yet and barely any testing but this is the code as running on the server at the moment.
2016-06-11 16:08:13 +02:00