Chinese Natural Language Processing in Practice

#artificialintelligence

Text mining is a thriving area of data science that lets data scientists work with textual content. However, some common text-mining practices, such as stopwords and stemming, are not applicable to Chinese texts because of differences in language structure. Meanwhile, a study from Internet World Stats showed that Chinese-language Internet users accounted for 23.2% of the world's Internet users as of December 31, 2013, making them the second-largest group of users (native English users are the largest group at 28.6%). The business world clearly has a strong demand for text-mining skills for Chinese texts, so it is important to provide the knowledge and tools needed to extend data scientists' text-mining capacity to Chinese text content.



r/datascience - Applying basic NLP / text-mining to texts with girlfriend -- results

#artificialintelligence

As part of the repository of data science tools I have been building over the past few weeks, I wanted to include some basic NLP / text-mining techniques in R. In thinking of a use case, I decided to kill two birds with one stone: teach myself a new skill, and build a cool gift for my SO! The result (link included below) is pretty basic, but hopefully you'll find it helpful! If you have ever wanted to work with text message data, are intrigued by NLP / text-mining in general, or you're the foremost expert on all things NLP (which I DEFINITELY am not), I hope there's something fun and insightful for all of you here. I look forward to your constructive criticism, and hope you enjoy this and/or any of the other material I've assembled :)


KDD 2006 Conference - Panel

AITopics Original Links

It should be relevant to data mining and knowledge discovery and be based on analysis of large volumes of data, preferably publicly available data. It should be sufficiently important and difficult that its solution will advance the field and benefit society at large. It should be interesting and exciting enough to attract researchers, public and press attention, and funding. This requires a simple and concise problem statement of one or two sentences. The required domain knowledge should be relatively accessible.


Seth Redmore on Twitter

#artificialintelligence

Modern technology marketer wielding messaging, data and systems as weapons to drive business.