In such a case, the method of the present invention causes a computer to combine a processing of a document having a plurality of pages with a processing of voice generated with reference to the document, the method including the steps of causing the computer to determine, among subtitles obtained by recognizing the voice, a specific subtitle obtained by recognizing voice generated with reference