"FIG. 4 is a flow chart of an exemplary method for determining a final set of near duplicate documents from an initial set of near duplicate documents in a manner consistent with the present invention." . . . .