The Harvard USPTO Patent Dataset: A Large-Scale, Well-Structured, and Multi-Purpose Corpus of Patent Applications
–Neural Information Processing Systems
With more than 4.5 million patent documents, HUPD is two to three times larger than comparable corpora.
Neural Information Processing Systems
Feb-16-2026, 16:33:50 GMT
- Country:
- Asia > Japan (0.04)
- Europe
- France > Grand Est
- Bas-Rhin > Strasbourg (0.04)
- Italy > Tuscany
- Florence (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.04)
- France > Grand Est
- North America > United States
- Alaska (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Michigan (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada (0.04)
- Texas > Travis County
- Austin (0.04)
- South America > Chile
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Statistical Learning (0.67)
- Natural Language (1.00)
- Machine Learning
- Communications (1.00)
- Data Science (1.00)
- Information Management (1.00)
- Artificial Intelligence
- Information Technology