Joint Level Generation and Translation Using Gameplay Videos