Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding