. "For example, if the document is a Microsoft Word file, then the textual content of the file is extracted and the specific MS Word formatting is removed." . . .