Feature Hashing for Scalable Machine Learning – Inside Machine learning
Feature hashing is a powerful technique for handling sparse, high-dimensional features in machine learning. It is fast, simple, memory-efficient, and well suited to online learning scenarios. While an approximation, it has surprisingly low accuracy tradeoffs in many machine learning problems. In this post, I will cover the basics of feature hashing and how to use it for flexible, scalable feature encoding and engineering. I'll also mention feature hashing in the context of Apache Spark's MLlib machine learning library.
Mar-28-2017, 00:40:29 GMT
- AI-Alerts:
- 2017 > 2017-04 > AAAI AI-Alert for Apr 4, 2017 (1.00)
- Industry:
- Education (0.57)
- Information Technology > Services (0.34)
- Technology: