Dependency Graph-to-String Statistical Machine Translation
Li, Liangyou, Way, Andy, Liu, Qun
arXiv.org Artificial Intelligence
We present graph-based translation models that translate source graphs into target strings. Source graphs are constructed from dependency trees with extra links added so that non-syntactic phrases are connected. Inspired by phrase-based models, we first introduce a translation model that segments a graph into a sequence of disjoint subgraphs and generates a translation by combining subgraph translations left to right using beam search. However, like phrase-based models, this model is weak at phrase reordering. We therefore introduce a second model based on a synchronous node replacement grammar, which learns recursive translation rules. We provide two implementations of this model with different restrictions so that source graphs can be parsed efficiently. Experiments on Chinese–English and German–English show that our graph-based models are significantly better than the corresponding sequence- and tree-based baselines.
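As a rough illustration of why extra links matter, the sketch below builds a source graph from a dependency tree and checks whether a set of words forms a connected subgraph. It is not the authors' implementation. The abstract does not say which links are added, so linking each pair of adjacent words is an assumption made here, and the names `build_graph` and `is_connected` and the toy sentence are hypothetical.

```python
from collections import defaultdict


def build_graph(heads, add_adjacent_links=True):
    """Return undirected edges for a sentence.

    heads[i] is the index of word i's dependency head, or -1 for the root.
    ASSUMPTION: the "extra links" join each pair of adjacent words.
    """
    edges = set()
    for child, head in enumerate(heads):
        if head >= 0:
            edges.add(frozenset((child, head)))
    if add_adjacent_links:
        for i in range(len(heads) - 1):
            edges.add(frozenset((i, i + 1)))
    return edges


def is_connected(nodes, edges):
    """True if the subgraph induced by `nodes` is non-empty and connected."""
    nodes = set(nodes)
    if not nodes:
        return False
    # Keep only edges whose endpoints both lie inside the candidate subgraph.
    adjacency = defaultdict(set)
    for edge in edges:
        a, b = tuple(edge)
        if a in nodes and b in nodes:
            adjacency[a].add(b)
            adjacency[b].add(a)
    # Depth-first search from an arbitrary node.
    start = next(iter(nodes))
    seen = {start}
    stack = [start]
    while stack:
        current = stack.pop()
        for neighbour in adjacency[current]:
            if neighbour not in seen:
                seen.add(neighbour)
                stack.append(neighbour)
    return seen == nodes


# Toy sentence: 0 "I", 1 "saw", 2 "the", 3 "big", 4 "dog"
heads = [1, -1, 4, 4, 1]
tree_only = build_graph(heads, add_adjacent_links=False)
with_links = build_graph(heads)

# "saw the" is not a syntactic phrase: its words share no dependency edge.
print(is_connected({1, 2}, tree_only))
print(is_connected({1, 2}, with_links))
```

Under this assumption, any contiguous word sequence becomes a connected subgraph, so phrases that a plain dependency tree would split can still be candidate translation units.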
Mar-20-2021