The system then determines whether speech recognition (2.6) or audio modification (2.7) is required. Accordingly, the system performs speech recognition (2.8) or audio modification (2.9) and, depending upon the output mode (2.15), sends the modified audio or the recognized words, including keywords, to the output. The system also changes the words, including the keywords, to audio (2.10).
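The branching described in this step can be illustrated with a minimal sketch. It is not the claimed implementation. The names `Task`, `OutputMode`, `Result`, and `process`, and the stub functions `recognize`, `modify`, and `synthesize`, are hypothetical and introduced only for illustration. The sketch also assumes that the output mode (2.15) distinguishes a text output from an audio output, and that the conversion of words to audio (2.10) is applied when the audio output is selected; the description does not state either point explicitly.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Callable, List


class Task(Enum):
    RECOGNITION = "recognition"    # decision 2.6, performed at 2.8
    MODIFICATION = "modification"  # decision 2.7, performed at 2.9


class OutputMode(Enum):            # output mode 2.15 (values are assumed)
    TEXT = "text"
    AUDIO = "audio"


@dataclass
class Result:
    kind: str        # "text" or "audio"
    payload: object  # list of words, or raw audio bytes


def process(
    audio: bytes,
    task: Task,
    output_mode: OutputMode,
    recognize: Callable[[bytes], List[str]],
    modify: Callable[[bytes], bytes],
    synthesize: Callable[[List[str]], bytes],
) -> Result:
    """Dispatch to recognition or modification, then route to the output."""
    if task is Task.MODIFICATION:
        # 2.9: modify the audio and send the modified audio to the output.
        return Result("audio", modify(audio))

    # 2.8: recognize the words, including any keywords.
    words = recognize(audio)
    if output_mode is OutputMode.AUDIO:
        # 2.10: change the recognized words back to audio (assumed condition).
        return Result("audio", synthesize(words))
    return Result("text", words)


# Placeholder stubs so the sketch runs; a real system would call
# an actual recognizer, audio processor, and speech synthesizer.
def recognize(audio: bytes) -> List[str]:
    return audio.decode("utf-8").split()


def modify(audio: bytes) -> bytes:
    return audio[::-1]


def synthesize(words: List[str]) -> bytes:
    return " ".join(words).encode("utf-8")


if __name__ == "__main__":
    print(process(b"call home", Task.RECOGNITION, OutputMode.TEXT,
                  recognize, modify, synthesize))
```

Passing the three processing functions as parameters keeps the dispatch logic separate from any particular recognizer or synthesizer, so each stage can be replaced without changing the branching.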