IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents
–Neural Information Processing Systems
In this paper, we introduce IMPACT (Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents), a large-scale multimodal patent dataset with detailed captions for design patent figures. Our dataset includes half a million design patents comprising 3.61 million figures along with captions from patents granted by the United States Patent and Trademark Office (USPTO) over a 16-year period from 2007 to 2022. We incorporate the metadata of each patent application with elaborate captions that are coherent with multiple viewpoints of designs. Even though patents themselves contain a variety of design figures, titles, and descriptions of viewpoints, we find that they lack detailed descriptions that are necessary to perform multimodal tasks such as classification and retrieval. IMPACT closes this gap thereby providing researchers with necessary ingredients to instantiate a variety of multimodal tasks.
Neural Information Processing Systems
May-27-2025, 19:47:55 GMT
- Country:
- North America > United States (0.98)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence > Vision (0.62)