Classifier4J 0.3 "I now have Classifier4J and nntp//rss working together to do Bayesian classification of RSS feeds."
RDF extraction from HTML "Currently it extracts a set of blurbs of RDF and a list of URI's it believes contain RDF. Currently the only documents parsed are HTML.
There are a great many ways to include RDF in HTML files. I located one summary of possible methods. I am also aware that Creative Commons advocates storing the RDF that describes the license for a page in a comment tag."
It's official - we are not stupid! Is another commentry on Mark Butler's paper.