Monday, July 05, 2004
A Simpler Time
In each of three sets of horizontal lines of random lengths, the demo sorted the collection by size, from shortest to longest, by actually moving them up and down in the browser. The audience had never seen anything but static images in a browser before this: The lines were moving, as if being sorted by unseen hands!
Suddenly, everyone in the room was rethinking the potential of the Internet. Far from the crash-and-burn scenario Gosling had first envisioned, his demo had jolted a very influential audience off their seats, and they were delivering enthusiastic applause. And within this technology-entertainment crowd, word would spread quickly."
That draw dropping demo still runs, too.
| 0 comments | Link me |
Sunday, July 04, 2004
Java Rules
The manual says: "The mandarax inference engine uses backward reasoning, and the reference implementation uses an object oriented version of backward reasoning similar to the algorithm used in Prolog. On the other hand, most commercial rule systems such as ILOG and popular open source solutions like CLIPS and JESS use forward reasoning, in particular an algorithm called RETE. This algorithm keeps the derivation structure in memory and propagates changes in the rule and fact base."
A description of the RETE algorithm is here.
| 0 comments | Link me |
Desktop Metadata
Comments link to: WinFS is not filesystem, Spotlight, rdf semweb winfs, Haystack, Questions about Longhorn, Pike and libferris.
| 0 comments | Link me |
Friday, July 02, 2004
Blast from the Past
Many wasted hours spent playing this the first time...
| 0 comments | Link me |
Thursday, July 01, 2004
Updated Kowari Site
| 0 comments | Link me |
Word! The DOPE Project
"Current per-formance problems stem mostly from query procedures between the Sesame system and the Collexis-SOAP interface. We plan to address these problems by expanding DOPE with other data sources and thesauri."
Download and project page here.
| 0 comments | Link me |
FOAF next for Feedster?
| 0 comments | Link me |
RDF Query Languages
| 0 comments | Link me |
RDF - Just don't mention the Semantic Web
"First introduced in 1998, the W3C's Resource Description Framework is a computer-processible way of describing things. And that's about as simple as it gets. Despite being mired for some time in controversy over an awkward XML expression, the current view and consensus over RDF is in terms of its data model. The data model is simple and expressive, and is the best starting point for understanding RDF."
| 0 comments | Link me |
Tuesday, June 29, 2004
Jena Ported to .NET
| 1 comments | Link me |
Spotlight - Metadata Done Right?
The list of file formats are fairly standard but the cool stuff is probably the API layers. Part of Spotlight. See also, Does Spotlight == BFS?.
The example screen shots show rendering Slashdot's RSS feed and they also claim support for Adobe Photoshop. So that seems to suggest a limited support support of RDF - RSS 1.0 and perhaps XMP. Back in 2003 I found mention of metadata in OS X; it's a shame it's taken this long ;-).
And the new 30" monitors save you money. Buy 10, buy 100...
| 0 comments | Link me |
Monday, June 28, 2004
Radar Networks Decloaks
"Of course for this to work lots of new infrastructure is required. There are also numerous issues that arise as soon as one digs deeper into the subject -- issues related to data synchronization, privacy and security, ontology integration, policies and many other subjects. My company, Radar Networks' software platform (to be released as LGPL in 2005) will provide an open-source framework that solves many of these problems and will enable the open community to collaborate to solve any that remain or are found in the future."
Danny beat me to it.
| 0 comments | Link me |
Accessing vs Naming Models
Our existing scheme for naming models takes the form:
protocol://hostname/servername#modelname
Which usually becomes:
rmi://www.kowari.org/server1#model1
This is pretty brittle given the above use case. We've known for quite some time that our model naming scheme confused two distinctive properties:
* How to access the model, and
* The name of the model.
By ignoring this issue we could get around to adding views, security and the iTQL FROM and IN clause.
However, we now have the need to uniquely identify the model as well as continuing to support the old naming scheme. The currently proposed idea is to first map the URL of a model to a URN so that:
<rmi://fully.qualified.hostname/server1#model1>
Becomes:
<urn:fully-qualified-hostname:server1:model1>
This also allows us some added flexibility such that you can now have names for RSS feeds and external models that can be described using:
<urn:slashdot-org:news-rss> <access> <http://slashdot.org/news.rss>
<urn:my-com:server1:hr> <access> <rmi://localhost/server1>
The second entry refers to a local copy, which can be changed to:
<urn:my-com:server1:hr> <access> <rmi://my.com/server1#hr>
Which will access the live version of the database. It will break some of the current functionality in iTQL. This is where it creates a model on the fly if it happens to use an external model. For example:
select $s $p $o
from <http://slashdot.org/rss.xml>
where $s $p $o ;
This downloads the RDF and puts it in a model. Subsequent calls will be done against the local model. Also, iTQL doesn't implicitly have a default host so you must give it another way of giving the connection.
The simplest way is to create the mapping when creating a model:
create <urn:slashdot-org:rss-feed> <http://slashdot.org/rss.xml>
In the future, resolvers will have configuration settings so you could make statements such as:
<urn:slashdot-org:rss-feed> <updateOnlyIfChanged> <every 5 minutes>
Once created, it can then be accessed in the FROM:
select $s $p $o
from <urn:my-com:server1:hr> and <urn:slashdot-org:rss-feed> <rmi://my.com/server1>
where $s $p $o ;
This will do a set operation over the two listed models as defined in the Kowari/TKS server at "my.com".
However, this can get verbose so we'll probably borrow something from RDQL and the like and add a USING clause:
select $s $p $o
from <urn:my-com:hr> and <urn:pi-com:cal>
where $s $p $o
using <rmi://my.com/server1> ;
Or alternatively, we could set the server before doing a query using:
set server <rmi://localhost/server1> ;
To turn it from a local query to a distributed query (a TKS only feature):
set server <rmi://my.com/server1> for <urn:my-com:hr> ;
set server <rmi://pi.com/server1> for <urn:pi-com:cal> ;
| 0 comments | Link me |
Sunday, June 27, 2004
Illegal iPod
Prelude to a Fake Complaint "Apple's iPod music player seemed particularly vulnerable to attack. Any major record label could bring a strong lawsuit against Apple for "intentionally inducing" infringement under this new law with the iPod, both because it's plausible to argue that having an iPod enhances the lure of using P2P to download music (gotta fill all that space!) and because all the major record labels still believe that private sharing of songs from your CDs with friends is copyright infringement. We still disagree with the labels on these points, but the reality is that no court has yet convinced them that their legal theories are flawed. We also threw in Toshiba for making the iPod's hard drive and CNET for showing people how to move the iPod's music files."
| 0 comments | Link me |
Saturday, June 26, 2004
Jena Jumble
* Introduction to Jena "RDF is increasingly recognized as an excellent choice for representing and processing semi-structured data. In this article, Web Developer Philip McCarthy shows you how to use the Jena Semantic Web Toolkit to exploit RDF data models in your Java applications."
* BRQL "This document describes the Bristol RDF Query Language, or BRQL (pronounced 'burkol'), designed to meet the requirements and design objectives described in the W3C RDF Data Access Working Group (DAWG) document RDF Data Access Use Cases and Requirements (and its working version). This query language is based on RDQL Member Submission."
| 0 comments | Link me |
Friday, June 25, 2004
Kowari Bloggers
This seems like the ideal time to mention that Andrae is joing the Kowari blogging craze with Etymon.
| 0 comments | Link me |
Kowari on XML.com
Updated documentation is going to be released to coincide with 1.0.4. Hopefully, this should make it a little more accessible to most.
"According to Tom Adams, "Our triplestore is really a quad store, the 4th tuple being the group/model that a triple belongs to.""
Apparently, I made this quote but it doesn't matter.
Paul said, "a better name might be "platypus"" - that was already taken by a previous project.
Thursday, June 24, 2004
Semantic Clarity
Triple-based Computing "[Johanson & Fox, 2004] expect ubiquitous computing as the “killer app” for tuplespace-based computing because of the model’s portability, extensibility, flexibility, and ability to deal with heterogeneous environments...Since applications are decoupled in reference, time, and space many issues in protocol and process alignment disappear because they are provided by the underlying middleware that implements the tuplespace."
Semantic Web bringing clarity to the Universal Server concept "I would also like to conclude that what we know today, as the monolithic "point of presence" on the web called a "Web Site" (which infers browsing and page serving), is naturally going to morph into a different kind of "point of presence" that is capable of delivering the following from a single process:
1. Serve up Semantic Data from existing data sources
2. Provide execution endpoints for Web Services
3. Provide an instigation point for events that trigger Service Orchestratio
This is what Virtuoso is all about, and why it is described as a "Universal Server"; a server instance that speaks many protocols, delivering a plethora of functionality (Database, Web Services Platform, Orchestration Engine, and more)."
| 0 comments | Link me |
Wednesday, June 23, 2004
D2RQ
* access information in a non-RDF database using the Jena model API.
* query a non-RDF database using RDQL or find(spo).
* do RDFS and OWL inferencing over the content of a non-RDF database using the Jena ontology API.
* publish the content of a non-RDF database on the Semantic Web using the Joseki RDF server."
D2R is already integrated into TKS 2.0 and hopefully we'll get D2RQ in there. Good to see it using RDF for configuration.
| 0 comments | Link me |
Tuesday, June 22, 2004
News Services go RDF
Photographers and news services around the world use IPTC metadata schemas to embed identifying characteristics such as captions, credits, location and time and dates within digital photos and scans. This information can be read by content management systems, used by virtually every newspaper publisher, to review and identify photos they want to publish to accompany a story. Adobe and IPTC also announced plans to develop future implementations that will be accessible via the Adobe(R) Creative Suite, and related point products including Photoshop(R) CS, through a customizable metadata user interface. This is intended to streamline the data capture process and allow relevant IPTC metadata to be preserved as the file is utilized across news and derivative workflows. "
| 0 comments | Link me |
Eve of Sleepycat
"Who's building applications in Java? Does it span the spectrum?
I think so. People can use a B-tree anywhere a fast, relatively constant time lookup is needed, regardless of the size of the data. This happens all over the place.
We're actually in the process of building an embeddable, pure-Java LDAP server called Eve. She's a beaut, introducing triggers and stored procedures to the world of LDAP. "
" What do you think of Berkeley DB Java Edition's performance capabilities?
It uses NIO [new input/output], which should have a considerable effect on performance due to the way memory is accessed. The NIO packages are new APIs that allow Java to have the power of C where IO is concerned. These APIs are new in the 1.4 JDK and make Java-based servers much more effective. Furthermore, a binding API makes mapping objects to records very intuitive while avoiding the overheads in serialization.
Other implementations often leave this up to users, who often use serialization and wind up paying for it with a massive performance hit. "
There was also a recent comparison between C++ and Java showing Java is faster in all but three tests.
| 0 comments | Link me |
The Next Web - Web Services
| 0 comments | Link me |
Scaling Taxonomies
VCC uses rules to define how documents should be classified. Once taxonomies have been set up, VCC automatically classifies new documents as spiders discover them. The customer can control automatic classification by defining how well VCC thinks a document matches a category. For instance, if VCC’s confidence level for a candidate category exceeds 70 percent, then automatically publish it into that category; if not, route the decision to an assigned knowledge worker. Verity calls this “automated classification with manual oversight.”
Verity worked with DuPont to develop VCC. Whitney said the company uses VCC to manage a 25,000 node taxonomy. Internal users include “everyone from a bench chemist to a knowledge engineer—whoever.” He said VCC provides DuPont with “frictionless review between highly specialized knowledge workers, many with PhDs, and the knowledge engineering staff.”"
| 0 comments | Link me |
Monday, June 21, 2004
Semantic Web for 5th Graders
It says: "Twenty-two lessons show students how to build a vocabulary...(KWL, SQ3R, semantic web making, outlining)...". Of course, it doesn't look like it's *the* Semantic Web.
Searching for Java by itself reveals some interesting results. Searching for Java in the Computers category reveals no hits; the category function seems to be broken.
| 0 comments | Link me |
Thursday, June 17, 2004
Global Scalability
There are many interesting responses in the thread including one from Frederick Kintanar: "...I do think the Web makes a big difference between earlier knowledge representation efforts and Semantic Web initiative. A big one is global scalability, where the key element is URI's (and their already deployed global acceptance). Hypertext was already a relatively mature technology in the research community, when Tim Berners-Lee hit upon what is needed to make it scalable: global identification..."
And no this isn't scaling a single agreed upon ontology across the Web.
| 0 comments | Link me |
Spicy Searching
"Microsoft, though, isn't standing still. It is working on its own distributed search plan with Longhorn and a new release of its SQL Server database, code-named Yukon, and plans to build its own Internet search service. BEA Systems and others are working on similar technology."
"Information Integrator is a software layer than can pull data from different software--Oracle databases, Microsoft Excel, IBM's own DB2 and Lotus databases--with a single query. IBM and other companies are touting this "federated" database approach, in which searches tap into spread-out data sources, as a potentially cheaper alternative to shipping and storing large amounts of information in a single database."
| 0 comments | Link me |
Ruined by Developers
"WinFS, advertised as a way to make searching work by making the file system be a relational database, ignores the fact that the real way to make searching work is by making searching work. Don't make me type metadata for all my files that I can search using a query language. Just do me a favor and search the damned hard drive, quickly, for the string I typed, using full-text indexes and other technologies that were boring in 1973."
"RSS became fragmented with several different versions, inaccurate specs and lots of political fighting, and the attempt to clean everything up by creating yet another format called Atom has resulted in several different versions of RSS plus one version of Atom, inaccurate specs and lots of political fighting. When you try to unify two opposing forces by creating a third alternative, you just end up with three opposing forces. You haven't unified anything and you haven't really fixed anything."
| 0 comments | Link me |
Sunday, June 13, 2004
Loom and Drools
An article theServerSide.com introduces "...the JSR-94 Rules Engine API and an Open Source product called Drools, the forerunner implementation of this up-and-coming technology...it can scale to incorporate and execute hundreds of thousands of rules in a manner which is an order of magnitude more efficient then the next best algorithm." Drools Homepage.
| 0 comments | Link me |
WSDL2OWL-S 1.1 Released
| 0 comments | Link me |
Friday, June 11, 2004
TKS the RDF Database
He referenced one of his non-RDF projects that uses explicit database tables to make certain data associations. This approach is neither scalable nor flexible. The unique nature of RDF, however, can provide a flexible mechanism that would allow far greater associative capabilities, thereby increasing the ability to query and make inferences on topic matters not explicitly hard-wired into tables.
Daconta added that good strides have been made in commercial products supporting Semantic Web technologies, including RDF databases (such as Tucana Knowledge Server), ontology editors and inference engines (such as Network Inference's Cerebra Server), and data source integration engines (such as Unicorn System)."
The author is Ken Fromm from Loomia who are working on "a semantic technology software company building an RDF/FoaF-based identity and relationship stack".
| 0 comments | Link me |
Behind the Wall
The other previously missing ingredient is that real organizations have at least something approximating an implicit ontology in their database schema, standard operating procedures, official vocabularies, etc. It is at least arguable that the technologies that have emerged from the Semantic Web efforts allow all this diverse stuff to be pulled together in a useful way -- ontology editors, inferencing engines, semantic metadata repositories, etc. I'm seeing real success stories in my day job, and a coherent story is starting to be told by a number of vendors, analysts, etc.
Champion here makes a similar point to the one I argued in an article last fall ("Commercializing the Semantic Web"), namely, that there exist today several startups and fledgling ventures that are selling Semantic Web technologies to corporate clients, including Network Inference, Tucana Technologies, and others."
"Honestly, I don't know whether to laugh, because with WinFS Microsoft seems to be buying into the Semantic Web idea, or cry, because with WinFS Microsoft seems to be embracing-and-extending the Semantic Web idea. Oh well -- outside of the realm of unenforced US antitrust legislation, Microsoft is like gravity. Eventually, you just learn to work around it."
| 0 comments | Link me |
Thursday, June 10, 2004
Semantic Web Tutorial
| 0 comments | Link me |
WinFS and RDF to Square Off?
Where do you get Longhorn today? RDF and schemas for RDF are available now. Taking the both bottom up and top down schemas and deployed locally and globally. Last year I asked, Should the RDF model be integrated into the File System?, the answer still seems to be 'yes'.
Danny has a good summary of part 1 of this discussion.
| 0 comments | Link me |
Wednesday, June 09, 2004
Can't get enough of the Semantic Web?
| 0 comments | Link me |
Tuesday, June 08, 2004
Cynical Development
| 0 comments | Link me |
Monday, June 07, 2004
Sesame Speed
| 0 comments | Link me |
iTQL evaluated
| 0 comments | Link me |
Sunday, June 06, 2004
Datalog and Inferencing
Two other related papers: Bubo - Implementing OWL in rule-based systems and Description Logic Programs: Combining Logic Programs with Description Logic.
This maps well to 3VL and the recursive predicate functions trans and walk.
| 0 comments | Link me |
Not Relational Enough
In, XML, the New Database Heresy "One of the major benefits of using XML in relational databases is that it is a lot easier to deal with fluid schemas or data with sparse entries with XML. When the shape of the data tends to change or is not fixed the relational model is simply not designed to deal with this. Constantly changing your database schema is simply not feasible and there is no easy way to provide the extensibility of XML where one can say "after the X element, any element from any namespace can appear". How would one describe the capacity to store “any data” in a traditional relational database without resorting to an opaque blob?
I do tend to agree that some people are going overboard and trying to model their data hierarchically instead of relationally which experience has thought us is a bad idea."
Edd Dumbill wrote in, Ron Bourret on XML and databases "My guess is that everything will pick up on this front in a year or two, with companies moving towards what I consider the holy grail of XML support in relational databases: native storage behind a first-class XML data type, XQuery support with extensions for (a) including relational data or SQL queries and (b) updates, SQL/XML support with extensions for embedded XQuery queries, and support for JSR 225 (see below)."
One of the aspects we've found in implementing TKS/Kowari was that it pays to stick closely to the relational model. Everything comes down to tuples and I think that the problems with putting RDF on top of SQL database is that they aren't relational or "relational enough" (whatever that means).
Actually, the DAWG is asking for comments, maybe ensuring that the queries are expressed in relational algebra. That should prevent things like NULL getting in there.
| 0 comments | Link me |
Labels: kowari, relational model, sql, xml
Why FOAF? Why XFN?
Social software snippets "So Jonas poses a good question, why was FOAF invented? Perhaps an attempt to justify RDF? Is this an example of a solution looking for a problem? A clever acronym looking for a reason to exist?
Speaking of which, the biggest irony I see about FOAF is its name, which stands for "Friend of a Friend", and yet the technology has nothing to do with "friends". Like Jonas said, it's simply vCard recast in RDF, except for the "knows" relationship, which itself is quite meaningless (quite ironic for a Semantic Web effort), as it provides no more meaning than a plain hyperlink. Why work so hard for so little?
FOAF could be saved however, simply by adding an XFN module, thus enabling FOAF to finally fulfill its namesake, and represent friendship rather than just claiming to."
| 0 comments | Link me |