How Feature Engineering Can Help You Do Well in a Kaggle Competition – Part 2
In the first part of this series, I introduced the Outbrain Click Prediction machine learning competition. That post described some preliminary and important data science tasks like exploratory data analysis and feature engineering performed for the competition, using a Spark cluster deployed on Google Dataproc. In this post, I describe the competition evaluation, the design of my cross-validation strategy and my baseline models using statistics and trees ensembles. In that competition, Kagglers were required to rank recommended ads by decreasing predicted likelihood of being clicked. Sponsored search advertising, contextual advertising, display advertising and real-time bidding auctions have all relied heavily on the ability of learned models to predict ad click–through rates (CTRs) accurately, quickly and reliably.
Jun-27-2017, 23:15:06 GMT
- Genre:
- Contests & Prizes (0.40)
- Industry:
- Marketing (1.00)
- Information Technology > Services (0.54)
- Technology: