Understanding user interfaces with screen parsing