Goto

Collaborating Authors

 spark cluster


Fine-grained large-scale content recommendations for MSX sellers

Singh, Manpreet, Pasricha, Ravdeep, Kondapalli, Ravi Prasad, R, Kiran, Singh, Nitish, Agarwalla, Akshita, R, Manoj, Prabhakar, Manish, Boué, Laurent

arXiv.org Artificial Intelligence

One of the most critical tasks of Microsoft sellers is to meticulously track and nurture potential business opportunities through proactive engagement and tailored solutions. Recommender systems play a central role to help sellers achieve their goals. In this paper, we present a content recommendation model which surfaces various types of content (technical documentation, comparison with competitor products, customer success stories etc.) that sellers can share with their customers or use for their own self-learning. The model operates at the opportunity level which is the lowest possible granularity and the most relevant one for sellers. It is based on semantic matching between metadata from the contents and carefully selected attributes of the opportunities. Considering the volume of seller-managed opportunities in organizations such as Microsoft, we show how to perform efficient semantic matching over a very large number of opportunity-content combinations. The main challenge is to ensure that the top-5 relevant contents for each opportunity are recommended out of a total of $\approx 40,000$ published contents. We achieve this target through an extensive comparison of different model architectures and feature selection. Finally, we further examine the quality of the recommendations in a quantitative manner using a combination of human domain experts as well as by using the recently proposed "LLM as a judge" framework.


Machine Learning Project - Loan Approval Prediction - Projects Based Learning

#artificialintelligence

Welcome to this project on predict whether a customer is eligible for Home loan or not in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project, we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing. That's why I haven't included any purely theoretical lectures in this tutorial: you will learn everything on the way and be able to put it into practice straight away. Seeing the way each feature works will help you learn Apache Spark machine learning thoroughly by heart.


YouTube Spam Comment Prediction - Projects Based Learning

#artificialintelligence

Process Comma-separated values file (ie file with .csv Convert String data to Numeric format so we can process the data in Apache Spark ML Library. Welcome to this project on creating prediction model to Identify spam comment in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing.


Predicting Possible Loan Default Using Machine Learning - Projects Based Learning

#artificialintelligence

Welcome to this project on Loan Prediction Based on Customer Behavior in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project, we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing. That's why I haven't included any purely theoretical lectures in this tutorial: you will learn everything on the way and be able to put it into practice straight away. Seeing the way each feature works will help you learn Apache Spark machine learning thoroughly by heart.


Drug Classification Part 1 - Projects Based Learning

#artificialintelligence

Since as a beginner in machine learning it would be a great opportunity to try some techniques to predict the outcome of the drugs that might be accurate for the patient. The main problem here is not just the feature sets and target sets but also the approach that is taken in solving these types of problems as a beginner. Welcome to this project on Drug Classification in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project, we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing.


Predict Ads Click - Practice Data Analysis and Logistic Regression Prediction - Projects Based Learning

#artificialintelligence

In this project we will be working with a data set, indicating whether or not a particular internet user clicked on an Advertisement. We will try to create a model that will predict whether or not they will click on an ad based off the features of that user. Welcome to this project on predict Ads Click in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project, we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing.


Predicting the age of abalone from physical measurements Part 1 - Projects Based Learning

#artificialintelligence

Abalone is a common name for any of a group of small to very large sea snails, marine gastropod molluscs in the family Haliotidae. Other common names are ear shells, sea ears, and muttonfish or muttonshells in Australia, ormer in the UK, perlemoen in South Africa, and paua in New Zealand. The age of abalone is determined by cutting the shell through the cone, staining it, and counting the number of rings through a microscope a boring and time consuming task. Other measurements, which are easier to obtain, are used to predict the age. Given is the attribute name, attribute type, the measurement unit and a brief description.


Prediction task is to determine whether a person makes over 50K a year Part 1 - Projects Based Learning

#artificialintelligence

Prediction task is to determine whether a person makes over 50K a year.(Income Convert String data to Numeric format so we can process the data in Apache Spark ML Library. Welcome to this project on predict whether a person makes over 50K a year in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id. In this project we explore Apache Spark and Machine Learning on the Databricks platform. I am a firm believer that the best way to learn is by doing.


Machine Learning Project – Predict Forest Cover Part 1 - Projects Based Learning

#artificialintelligence

In this project, we'll predict Forest Cover supported various attributes (cartographic variables) of the Forest. Hence, this is often a classification problem. Given is the attribute name, attribute type, the measurement unit, and a brief description. The forest cover type is the classification problem. Welcome to this project on predict Forest Cover in Apache Spark Machine Learning using Databricks platform community edition server which allows you to execute your spark code, free of cost on their server just by registering through email id.


Mobile Price Classification - Projects Based Learning

#artificialintelligence

Bob has started his own mobile company. He wants to give a tough fight to big companies like Apple, Samsung etc. He does not know how to estimate the price of mobiles his company creates. In this competitive mobile phone market, you cannot simply assume things. To solve this problem he collects sales data of mobile phones of various companies.