A Unified Generative Approach to Product Attribute-Value Identification
Shinzato, Keiji, Yoshinaga, Naoki, Xia, Yandi, Chen, Wei-Te
–arXiv.org Artificial Intelligence
Product attribute-value identification (PAVI) has been studied to link products on e-commerce sites with their attribute values (e.g., ) using product text as clues. Technical demands from real-world e-commerce platforms require PAVI methods to handle unseen values, multi-attribute values, and canonicalized values, which are only partly addressed in existing extraction- and classification-based approaches. Motivated by this, we explore a generative approach to the PAVI task. We finetune a pre-trained generative model, T5, to decode a set of attribute-value pairs as a target sequence from the given product text. Since the attribute value pairs are unordered set elements, how to linearize them will matter; we, thus, explore methods of composing an attribute-value pair and ordering the pairs for the task. Experimental results confirm that our generation-based approach outperforms the existing extraction and classification-based methods on large-scale real-world datasets meant for those methods.
arXiv.org Artificial Intelligence
Jun-8-2023
- Country:
- Asia
- China (0.04)
- India > Telangana
- Hyderabad (0.04)
- Japan > Honshū
- Chūbu > Aichi Prefecture
- Nagoya (0.04)
- Kantō > Tokyo Metropolis Prefecture
- Tokyo (0.04)
- Tōhoku (0.04)
- Chūbu > Aichi Prefecture
- Macao (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Europe
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy (0.05)
- United Kingdom > Scotland
- City of Edinburgh > Edinburgh (0.04)
- Ireland > Leinster
- North America
- Canada > Quebec (0.04)
- Puerto Rico > San Juan
- San Juan (0.04)
- United States
- California
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- Colorado > Boulder County
- Boulder (0.04)
- Illinois > Cook County
- Chicago (0.04)
- New Mexico (0.04)
- New York > New York County
- New York City (0.04)
- Washington > King County
- Seattle (0.04)
- California
- Asia
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Information Technology (1.00)
- Technology: