| http://www.w3.org/ns/prov#value | - dnet), by observing the structure of hyperlinks originating from and terminating in the documents, and by using statistical word distribution metrics such as term frequency and inverse document frequency (TF.IDF) to provide information indicative of the similarity between two documents. [0036] Known techniques for establishing a similarity measure between two documents are given in Dumais et al.,
|