Dense captioning of Boston Dynamics Atlas Robot from vision caption Watch Video
Preview(s):
Gallery
Play Video: (Note: The default playback of the video is HD VERSION. If your browser is buffering the video slowly, please play the REGULAR MP4 VERSION or Open The Video below for better experience. Thank you!)
Description: This is the result of running the Densecap captioning system implemented at Stanford Vision Lab on the video of the Atlas humanoid robot designed by Boston Dynamics. (all links at bottom)nnDensecap captions salient regions of images using a recurrent neural network and a fully convolutional localization network (FCLN). The FCLN processes an image, proposing regions of interest and conditioning a recurrent neural network which generates the associated captions. The whole system is trained end-to-
Play Video: (Note: The default playback of the video is HD VERSION. If your browser is buffering the video slowly, please play the REGULAR MP4 VERSION or Open The Video below for better experience. Thank you!)