LOIS: Looking Out of Instance Semantics for Visual Question Answering