The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications
–Neural Information Processing Systems
With more than 4.5 million patent documents, HUPD is two to three times larger than comparable corpora.
Neural Information Processing Systems
Feb-16-2026, 16:33:50 GMT
- Country:
- Asia > Japan (0.04)
- South America > Chile
- North America > United States
- Michigan (0.04)
- Nevada (0.04)
- Alaska (0.04)
- Texas > Travis County
- Austin (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- California > Santa Clara County
- Palo Alto (0.04)
- Europe
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Grand Est
- Bas-Rhin > Strasbourg (0.04)
- United Kingdom > England
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology:
- Information Technology
- Information Management (1.00)
- Data Science (1.00)
- Communications (1.00)
- Artificial Intelligence
- Natural Language (1.00)
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.67)
- Information Technology