The capture information processing component 165 is configured to detect the presence of rendered documents, extract text regions from those documents, and analyze the document information to recognize document and text features, such as absolute and relative layout information; paragraph, line, and word shadows or profiles; glyph-related features; and character encodings.
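
By way of illustration only, one way a word profile of the kind mentioned above might be derived is sketched below using projection profiles over a binarized image. This is a minimal sketch under stated assumptions, not the implementation of component 165: the bitmap representation (rows of 0/1 values, with 1 denoting ink), the function names `runs`, `line_spans`, `word_spans`, and `word_width_profile`, and the `max_gap` parameter are all hypothetical and do not appear in the description.

```python
from typing import List, Tuple

# Assumed representation: a page is a list of rows, each a list of 0/1 ints,
# where 1 marks an ink pixel and 0 marks background.
Bitmap = List[List[int]]
Span = Tuple[int, int]  # half-open interval (start, end)


def runs(flags: List[bool], max_gap: int = 0) -> List[Span]:
    """Return spans where flags are True, merging spans that are
    separated by at most max_gap False entries."""
    spans: List[Span] = []
    start = None
    for i, flag in enumerate(flags):
        if flag and start is None:
            start = i
        elif not flag and start is not None:
            spans.append((start, i))
            start = None
    if start is not None:
        spans.append((start, len(flags)))

    merged: List[Span] = []
    for s, e in spans:
        if merged and s - merged[-1][1] <= max_gap:
            merged[-1] = (merged[-1][0], e)  # bridge a small gap
        else:
            merged.append((s, e))
    return merged


def line_spans(bitmap: Bitmap) -> List[Span]:
    """Find text lines as runs of rows that contain any ink."""
    return runs([any(row) for row in bitmap])


def word_spans(bitmap: Bitmap, line: Span, max_gap: int = 1) -> List[Span]:
    """Find words in one text line as runs of inked columns; gaps of
    max_gap columns or fewer are treated as inter-character spacing."""
    top, bottom = line
    width = len(bitmap[0])
    inked = [any(bitmap[y][x] for y in range(top, bottom)) for x in range(width)]
    return runs(inked, max_gap)


def word_width_profile(bitmap: Bitmap, max_gap: int = 1) -> List[List[int]]:
    """Return, for each detected line, the pixel widths of its words."""
    return [
        [end - start for start, end in word_spans(bitmap, line, max_gap)]
        for line in line_spans(bitmap)
    ]
```

In this sketch the resulting list of word widths per line serves as a coarse layout signature that can be computed without recognizing any characters. The `max_gap` threshold is a design choice that would in practice depend on image resolution and font size.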