"As to sound information such as speech, BGM, etc, too, the video object can be similarly defined as the data structure which is caused to correspond on a 1:1 basis to an arbitrary partial period of the sound information space having the time axis." . . . .