rss2maildir.git
16 years ago* unittest for mixture of different types of lists
Brett Parker [Mon, 7 Jan 2008 01:00:44 +0000 (01:00 +0000)]
* unittest for mixture of different types of lists

16 years ago* Serious reworking of HTML2Text to handle nested lists reasonably
Brett Parker [Sun, 6 Jan 2008 22:39:04 +0000 (22:39 +0000)]
* Serious reworking of HTML2Text to handle nested lists reasonably
* Adding more unittests for the nested lists

16 years ago* Small improvements to the HTML2Text code
Brett Parker [Sun, 6 Jan 2008 11:43:44 +0000 (11:43 +0000)]
* Small improvements to the HTML2Text code
* Reorganize unittests for parsing to make it easier to add more tests later

16 years ago* serious reworking of the HTML2Text parser
Brett Parker [Sat, 5 Jan 2008 21:06:27 +0000 (21:06 +0000)]
* serious reworking of the HTML2Text parser

16 years ago* fix README to have a more complete config example
Brett Parker [Sat, 5 Jan 2008 17:00:57 +0000 (17:00 +0000)]
* fix README to have a more complete config example
* stop text width from being hardcoded

16 years ago* add missing source files for unit tests
Brett Parker [Sat, 5 Jan 2008 15:49:44 +0000 (15:49 +0000)]
* add missing source files for unit tests
* small fix to paragraph handling

16 years ago* add (first draft of) full test suite runner
Brett Parker [Sat, 5 Jan 2008 13:00:48 +0000 (13:00 +0000)]
* add (first draft of) full test suite runner
* add test for well formed paragraph handling
* update UnorderedListTests to have better test naming scheme
* add suite function to UnorderedListTests

16 years agoUpdate li handling a bit, and make the expected test results be what we'd
Brett Parker [Sat, 5 Jan 2008 10:06:32 +0000 (10:06 +0000)]
Update li handling a bit, and make the expected test results be what we'd
actually want (previous version might have been a bit of a work around)

16 years ago* Move some of the list handling above the paragraph handling so that it
Brett Parker [Mon, 31 Dec 2007 03:08:57 +0000 (03:08 +0000)]
* Move some of the list handling above the paragraph handling so that it
  doesn't get confused (bless it!)
* Make expected output match actual output, unittest now passes

16 years ago* Add unit test for some of the badly formed lists that we get after the
Brett Parker [Mon, 31 Dec 2007 02:56:28 +0000 (02:56 +0000)]
* Add unit test for some of the badly formed lists that we get after the
  feedparser "sanitizer" has a word with the HTML (currently fails)

16 years ago* Add unittest for unordered list
Brett Parker [Mon, 24 Dec 2007 11:38:12 +0000 (11:38 +0000)]
* Add unittest for unordered list
* make sure that the string that we use for plain text always ends in a new
  line character

16 years agoReformat code ready for adding test suite
Brett Parker [Mon, 24 Dec 2007 08:15:34 +0000 (08:15 +0000)]
Reformat code ready for adding test suite

16 years ago* Update TODO list with further escaping needs
Brett Parker [Sat, 22 Dec 2007 22:08:55 +0000 (22:08 +0000)]
* Update TODO list with further escaping needs

16 years ago* Add item url to html parts
Brett Parker [Sat, 22 Dec 2007 20:27:32 +0000 (20:27 +0000)]
* Add item url to html parts

16 years ago* Add item url to bottom of text only part
Brett Parker [Sat, 22 Dec 2007 19:33:11 +0000 (19:33 +0000)]
* Add item url to bottom of text only part

16 years agoUpdate TODO list
Brett Parker [Sat, 22 Dec 2007 18:33:09 +0000 (18:33 +0000)]
Update TODO list

16 years ago* multiple posts with the same link but different guid support - still
Brett Parker [Sat, 22 Dec 2007 18:32:52 +0000 (18:32 +0000)]
* multiple posts with the same link but different guid support - still
  threaded, but don't keep delivering the same messages everytime until they
  leave the feed

16 years ago* Add redirect support
Brett Parker [Sat, 22 Dec 2007 01:02:37 +0000 (01:02 +0000)]
* Add redirect support
* Try to get a URL 3 times (redirects are included in the count...)
* Refactor connection creation in to it's own function to lower duplication of
  code

16 years agoUpdate TODO list
Brett Parker [Sat, 22 Dec 2007 00:09:45 +0000 (00:09 +0000)]
Update TODO list

16 years ago* fix typo for a particular entity
Brett Parker [Fri, 21 Dec 2007 22:05:23 +0000 (22:05 +0000)]
* fix typo for a particular entity

16 years ago* Add a prelimanary todo list
Brett Parker [Fri, 21 Dec 2007 21:29:38 +0000 (21:29 +0000)]
* Add a prelimanary todo list

16 years ago* be slightly more forgiving on connection resets
Brett Parker [Fri, 21 Dec 2007 21:29:17 +0000 (21:29 +0000)]
* be slightly more forgiving on connection resets
* if there's no date in the feed, use todays date/time

16 years agoOnly download feeds that have changed (or that don't give us enough data to
Brett Parker [Fri, 21 Dec 2007 20:52:11 +0000 (20:52 +0000)]
Only download feeds that have changed (or that don't give us enough data to
workout if they've changed without downloading it all anyways)

16 years agoRefactor <br /> handling code so that there's no duplication
Brett Parker [Fri, 21 Dec 2007 19:14:29 +0000 (19:14 +0000)]
Refactor <br /> handling code so that there's no duplication

16 years ago* updated posts are now "threaded" - adds a References header with the previous
Brett Parker [Fri, 21 Dec 2007 18:51:40 +0000 (18:51 +0000)]
* updated posts are now "threaded" - adds a References header with the previous
  message-id in it, then adds the previous message id to the current message-id
  so that further updates can reference that properly

16 years agoFurther reformatting to < 80 chars per line
Brett Parker [Fri, 21 Dec 2007 16:03:40 +0000 (16:03 +0000)]
Further reformatting to < 80 chars per line

16 years ago* improve handling of unicode data
Brett Parker [Fri, 21 Dec 2007 15:40:51 +0000 (15:40 +0000)]
* improve handling of unicode data

16 years ago* tidy code to be mostly < 80 chars per line
Brett Parker [Fri, 21 Dec 2007 15:14:22 +0000 (15:14 +0000)]
* tidy code to be mostly < 80 chars per line
* add unordered list support
* tidy paragraph handling code to work better

16 years agoRemove references to mailbox module (doesn't let you write to maildir, which is
Brett Parker [Fri, 21 Dec 2007 13:29:17 +0000 (13:29 +0000)]
Remove references to mailbox module (doesn't let you write to maildir, which is
what we want, until python 2.5)

16 years agobetter utf-8 handling (though, we currently don't take in to account what
Brett Parker [Fri, 21 Dec 2007 13:26:13 +0000 (13:26 +0000)]
better utf-8 handling (though, we currently don't take in to account what
encoding we should be handling, so this could be "interesting" at best)

16 years ago* improved entity handling
Brett Parker [Fri, 21 Dec 2007 00:31:37 +0000 (00:31 +0000)]
* improved entity handling

16 years ago* fix blockquote support
Brett Parker [Thu, 20 Dec 2007 23:55:32 +0000 (23:55 +0000)]
* fix blockquote support
* improve headings support
* add pre support

16 years agoFix documentation up a bit
Brett Parker [Thu, 20 Dec 2007 23:10:04 +0000 (23:10 +0000)]
Fix documentation up a bit

16 years ago* blockquote support - indents a blockquote with a "> "
Brett Parker [Thu, 20 Dec 2007 23:05:54 +0000 (23:05 +0000)]
* blockquote support - indents a blockquote with a "> "

16 years ago* make db key actually unique for feed url + link url
Brett Parker [Thu, 20 Dec 2007 22:13:08 +0000 (22:13 +0000)]
* make db key actually unique for feed url + link url

16 years ago* add support for &nbsp;
Brett Parker [Thu, 20 Dec 2007 22:03:01 +0000 (22:03 +0000)]
* add support for &nbsp;
* add text wrapping for paragraphs (this is going to need more work, really)

16 years agoAdd licence information
Brett Parker [Thu, 20 Dec 2007 21:16:31 +0000 (21:16 +0000)]
Add licence information

16 years agoAdd basic HTML -> plain text parser
Brett Parker [Thu, 20 Dec 2007 21:03:09 +0000 (21:03 +0000)]
Add basic HTML -> plain text parser

16 years agoUpdate example file with planet alug and planet debian
Brett Parker [Thu, 20 Dec 2007 19:30:17 +0000 (19:30 +0000)]
Update example file with planet alug and planet debian

16 years ago* make mail messages multipart/alternative messages with a text/plain and
Brett Parker [Thu, 20 Dec 2007 19:29:54 +0000 (19:29 +0000)]
* make mail messages multipart/alternative messages with a text/plain and
  text/html part
* create a seen database that logs wether or not we've seen an item before by
  using the url of the item as the key, then check the md5sum to see if we need
  to see that item anyways

16 years agoAdd a Message-ID header and set the type to the type of the content in the rss
Brett Parker [Thu, 20 Dec 2007 14:14:51 +0000 (14:14 +0000)]
Add a Message-ID header and set the type to the type of the content in the rss
feed.

16 years agoMuch better filename creation for the tmp file
Brett Parker [Thu, 20 Dec 2007 01:14:02 +0000 (01:14 +0000)]
Much better filename creation for the tmp file

16 years ago* Parsing of the RSS feed using feedparser
Brett Parker [Wed, 19 Dec 2007 20:09:30 +0000 (20:09 +0000)]
* Parsing of the RSS feed using feedparser
* Creation of files for the maildir

16 years agoCreate Maildirs and Maildir root if possible for the feeds
Brett Parker [Wed, 19 Dec 2007 14:40:35 +0000 (14:40 +0000)]
Create Maildirs and Maildir root if possible for the feeds

16 years agoStarting point of rss2maildir:
Brett Parker [Wed, 19 Dec 2007 10:58:03 +0000 (10:58 +0000)]
Starting point of rss2maildir:
    * Config parser
    * Options parser