Wednesday, October 02, 2002

The Whole, General Mish Mash

"Latent semantic indexing adds an important step to the document indexing process. In addition to recording which keywords a document contains, the method examines the document collection as a whole, to see which other documents contain some of those same words. LSI considers documents that have many words in common to be semantically close, and ones with few words in common to be semantically distant."

A very clear explanation for extracting concepts and seeing which documents are related to others.

http://javelina.cet.middlebury.edu/lsa/out/cover_page.htm

No comments: