Dan Olsen’s Weblog

My insights and comments on technical issues and anything else I feel like saying.

ROSE Status: Sept. 8, 2004

The Rose project is nearing completion in it’s first stage. The project has been making slow progress as of lately due to problems between the program, which is written in Python, and the Berkeley XML database.

At the moment there is a piece that currently updates the RSS feeds in the database. This part of the project is not having any problems at the current moment. The part that is having the problems is the part that takes an OPML file and checks for the feeds in the file to see if they are in the database. If the feed is not in the database then it is retrieved over the internet and inserted into the database. It seems to work on a majority of feeds but crashes on a few feeds. I have been working with the folks at Sleepy Cat to see if we can find what the problem might be.

To be a little more specific as to what the problem is I will explain that below.

When the program reads in the RSS feeds I check them against the database, if the feed is not in the database then the program will retrieve them as stated above. However, on some feeds the program just quits with the message “Aborted”. I have narrowed the problem down to the putDocument call on the XML database. I have then taken the XML that is supposed to be put in the database and put it into a small program that implements the same procedure as in the main program. The feeds that crash the program work fine so it is not a problem with the structure of the XML. The problem lies somewhere within the rest of the program. I am looking into the possibility of the problem occurring due to the loop that goes through the array of feeds.

I am continuing to debug the process and find why certain feeds crash the program. I am also looking into other implementations that I may be able to use to get around the problem. I will post more of my implementation ideas I sort through the problem.

|