AITopics | high cardinality

Handling Large-scale Cardinality in building recommendation systems

Kurra, Dhruva Dixith; Ling, Bo; Zh, Chun; Ashrafzadeh, Seyedshahin

arXiv.org Artificial Intelligence, Jan-17-2024

Effective recommendation systems rely on capturing user preferences, which often requires incorporating numerous features such as universally unique identifiers (UUIDs) of entities. However, the exceptionally high cardinality of UUIDs poses a significant challenge: the resulting sparsity degrades model quality and increases model size. This paper presents two techniques to address the challenge of high cardinality in recommendation systems. Specifically, we propose a bag-of-words approach, combined with layer sharing, to substantially decrease the model size while improving performance. Our techniques were evaluated through offline and online experiments on Uber use cases, and the promising results demonstrate our approach's effectiveness in optimizing recommendation systems and enhancing their overall performance.
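The abstract does not spell out how the bag-of-words encoding or the layer sharing is implemented, so the following is only a minimal sketch under stated assumptions. It assumes each UUID is split into overlapping character n-grams, that the n-gram vectors are mean-pooled without regard to order, and that one small embedding table is reused by every UUID-valued feature. The names `uuid_to_tokens`, `SharedEmbedding`, and `per_id_params` are hypothetical and are not taken from the paper.

```python
import random


def uuid_to_tokens(u: str, n: int = 2) -> list[str]:
    """Split a UUID into overlapping hex n-grams (an assumed tokenization)."""
    hex_chars = u.replace("-", "").lower()
    return [hex_chars[i:i + n] for i in range(len(hex_chars) - n + 1)]


class SharedEmbedding:
    """One small token table reused by every UUID feature (assumed form of layer sharing)."""

    def __init__(self, n: int = 2, dim: int = 8, seed: int = 0):
        rng = random.Random(seed)
        self.n = n
        self.dim = dim
        # The vocabulary is every hex n-gram: 16**n rows, independent of how many UUIDs exist.
        self.table = {
            format(i, f"0{n}x"): [rng.uniform(-0.1, 0.1) for _ in range(dim)]
            for i in range(16 ** n)
        }

    def embed(self, u: str) -> list[float]:
        """Bag-of-words embedding: mean of the token vectors, ignoring token order."""
        vectors = [self.table[t] for t in uuid_to_tokens(u, self.n)]
        return [sum(column) / len(vectors) for column in zip(*vectors)]

    def num_params(self) -> int:
        return len(self.table) * self.dim


def per_id_params(num_ids: int, dim: int) -> int:
    """Parameter count of a conventional table with one row per distinct UUID."""
    return num_ids * dim


shared = SharedEmbedding(n=2, dim=8)
# The same table serves two different UUID features (hypothetical feature names).
user_vec = shared.embed("123e4567-e89b-12d3-a456-426614174000")
store_vec = shared.embed("9b2f0c1a-7d4e-4f6a-8c3b-5e1d2a0f9c77")
```

With one million distinct UUIDs and 8-dimensional vectors, a one-row-per-ID table needs `per_id_params(1_000_000, 8)` = 8,000,000 parameters, while the shared bigram table in this sketch needs `shared.num_params()` = 2,048. The trade-off is that different UUIDs no longer have fully independent representations.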

building recommendation system, handling large-scale cardinality, recommendation system, (12 more...)

arXiv:2401.09572

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > California > San Francisco County > San Francisco (0.04)

Genre: Research Report > Promising Solution (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

Edelweiss improves cross-sell using machine learning on Amazon SageMaker

#artificialintelligence, Jun-2-2021, 23:04:16 GMT

This post is co-written by Nikunj Agarwal, lead data scientist at Edelweiss Tokio Life Insurance. Edelweiss Tokio Life Insurance Company Ltd is a leading life insurance company in India. Its broad spectrum of offerings includes life insurance, health insurance, retirement policies, wealth enhancement schemes, education funding, and more. How are you being recommended a credit card based on your savings account behavior? How about a life insurance product when you buy car insurance, or a side dish when you order a main course on your food ordering app?

customer, hyperparameter, sagemaker, (15 more...)

Country: Asia > India (0.25)

Industry: Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

High number of unique values and tree based models

#artificialintelligence, Mar-12-2021, 14:35:50 GMT

Columns of data with high cardinality can adversely affect the performance of your models. The idea for this article stemmed from my personal experience of employing tree-based solutions in various projects. In this article I will attempt to show the effects of high cardinality on a couple of datasets using a simple decision tree. In the machine learning context, cardinality refers to the number of unique values in a column. Examples of fields with a high number of unique values include cities, countries, medical diagnosis codes, movie categories on Netflix, flavours of ice cream, etc.
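The article's own datasets and code are not included in this excerpt, so the following is a toy sketch rather than the author's experiment. It measures the cardinality of each column, then stands in for a fully grown decision tree with a per-value lookup table, which is what such a tree reduces to when it can isolate every category of a single column. The rows, column names, and helper functions are all made up for illustration.

```python
from collections import Counter, defaultdict


def cardinality(rows, column):
    """Number of distinct values in a column."""
    return len({row[column] for row in rows})


def fit_lookup(rows, column, target="label"):
    """Majority label per category, plus a global majority fallback for unseen values."""
    by_value = defaultdict(Counter)
    overall = Counter()
    for row in rows:
        by_value[row[column]][row[target]] += 1
        overall[row[target]] += 1
    table = {value: counts.most_common(1)[0][0] for value, counts in by_value.items()}
    fallback = overall.most_common(1)[0][0]
    return table, fallback


def accuracy(model, rows, column, target="label"):
    table, fallback = model
    hits = sum(table.get(row[column], fallback) == row[target] for row in rows)
    return hits / len(rows)


# Toy data: the label depends only on the city, and customer_id is unique per row.
train = [
    {"customer_id": "c1", "city": "A", "label": 1},
    {"customer_id": "c2", "city": "A", "label": 1},
    {"customer_id": "c3", "city": "A", "label": 1},
    {"customer_id": "c4", "city": "A", "label": 1},
    {"customer_id": "c5", "city": "B", "label": 0},
    {"customer_id": "c6", "city": "B", "label": 0},
]
held_out = [
    {"customer_id": "c7", "city": "B", "label": 0},
    {"customer_id": "c8", "city": "B", "label": 0},
    {"customer_id": "c9", "city": "A", "label": 1},
]

id_model = fit_lookup(train, "customer_id")
city_model = fit_lookup(train, "city")
id_train_acc = accuracy(id_model, train, "customer_id")
id_test_acc = accuracy(id_model, held_out, "customer_id")
city_test_acc = accuracy(city_model, held_out, "city")
```

Here `customer_id` has cardinality 6 across 6 rows, so the lookup memorizes the training set (accuracy 1.0) but falls back to the majority label on every held-out row (accuracy 1/3). The 2-value `city` column generalizes perfectly on the same held-out rows. Real tree implementations differ in how they encode and split categorical columns, so this only illustrates the memorization risk.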

high cardinality, high number, unique value and tree, (5 more...)

Industry: Information Technology (0.41)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Decision Tree Learning (0.51)