An audio and image-based on-demand content annotation framework for augmenting the video viewing experience on mobile devices