Machine Learning Best Algorithms: Gradient Boosting Machines (GBM)
We'll have a main talk (30 mins) and 3 excellent lightning talks about the machine learning algorithm that usually achieves the best accuracy on structured/tabular data (e.g. in industry/business applications or in Kaggle competitions): Abstract: With all the hype about deep learning and "AI", it is not well publicized that for structured/tabular data widely encountered in business applications it is actually another machine learning algorithm, the gradient boosting machine (GBM) that most often achieves the highest accuracy in supervised learning tasks. In this talk we'll review some of the main GBM implementations available as R and Python packages such as xgboost, h2o, lightgbm etc, we'll discuss some of their main features and characteristics, and we'll see how tuning GBMs and creating ensembles of the best models can achieve the best prediction accuracy for many business problems. Bio: Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then more than a decade ago moved to become the Chief Scientist of a tech company in Santa Monica doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He is the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la.
Jun-22-2018, 02:16:26 GMT
- Country:
- Europe (0.06)
- North America > United States
- California > Los Angeles County
- Santa Monica (0.27)
- Los Angeles (0.26)
- California > Los Angeles County
- Genre:
- Contests & Prizes (0.38)
- Industry:
- Information Technology (1.00)
- Media
- Television (0.34)
- Film (0.34)
- Technology: