| http://www.w3.org/ns/prov#value | - The above discussion is based on one embodiment of the data structures storing information of duplicate documents, e.g., CFT, UFT and PRT. As described above, the document address space of the Internet may be partitioned into N segments (FIG. 1), where N is an integer greater than one, and the web crawler system 200 processes one segment per epoch.
|