An image processing method comprising the steps of: producing, from an input image output by a single video camera, a low-resolution image of a subject that includes a prescribed part; locating the prescribed part using the low-resolution image; and extracting the prescribed part, as a higher-resolution image, from the input image output by the single video camera.
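For illustration only, the three recited steps (producing a low-resolution image, locating the prescribed part in it, and extracting that part from the original input image) might be sketched as below. This is a minimal sketch under stated assumptions, not the claimed implementation: the input image is assumed to be a grayscale frame held as a list of rows, the low-resolution image is assumed to be produced by block averaging, and the locator is a hypothetical stand-in that simply picks the brightest low-resolution pixel. The names `downsample`, `locate_part`, `extract_part`, and `factor` are all hypothetical.

```python
def downsample(image, factor):
    """Produce a low-resolution image by averaging factor x factor blocks.

    Assumption: block averaging; the claim does not say how the
    low-resolution image is produced.
    """
    rows = len(image) // factor
    cols = len(image[0]) // factor
    low = []
    for r in range(rows):
        low_row = []
        for c in range(cols):
            block = [
                image[r * factor + dr][c * factor + dc]
                for dr in range(factor)
                for dc in range(factor)
            ]
            low_row.append(sum(block) / len(block))
        low.append(low_row)
    return low


def locate_part(low):
    """Hypothetical locator: return (row, col) of the brightest pixel.

    A real system would use a detector suited to the prescribed part;
    this stand-in only shows that the search runs on the small image.
    """
    best_pos = (0, 0)
    best_val = low[0][0]
    for r, low_row in enumerate(low):
        for c, value in enumerate(low_row):
            if value > best_val:
                best_val = value
                best_pos = (r, c)
    return best_pos


def extract_part(image, low_pos, factor):
    """Crop the full-resolution region that maps to one low-res pixel."""
    r, c = low_pos
    top, left = r * factor, c * factor
    return [row[left:left + factor] for row in image[top:top + factor]]


# Usage: an 8 x 8 frame with one bright 2 x 2 patch.
frame = [[0] * 8 for _ in range(8)]
for y in (4, 5):
    for x in (2, 3):
        frame[y][x] = 200

factor = 2
low = downsample(frame, factor)           # step 1: low-resolution image
pos = locate_part(low)                    # step 2: locate on the small image
part = extract_part(frame, pos, factor)   # step 3: crop from the input image
```

The point of the arrangement is that the search in step 2 touches only the reduced image, while the crop in step 3 is taken from the original frame, so the extracted part keeps the full resolution of the camera output.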