Applying Machine Learning to Text Mining with Amazon S3 and RapidMiner


By some estimates, 80% of an organization's data is unstructured content. This content includes web pages, call center transcripts, surveys, feedback forms, legal documents, forums, social media, and blog articles. To gain insight and boost performance, organizations therefore need to analyze textual content as well as transactional information. Text mining is a powerful way to analyze this textual content. It typically applies machine learning techniques such as clustering, classification, association rules, and predictive modeling.