KeypointNet

#artificialintelligence 

This is frame-by-frame prediction with no temporal constraints. This is a frame-by-frame keypoint prediction on each animation frame. No temporal information is used. We show how the network is able to utilize the same keypoints across object instances and consistently predict keypoints across viewing angles, even when parts are occluded such as the back legs. Your browser does not support the video tag.