AITopics | production model

Collaborating Authors

production model

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Meta-Learning Perspective on Cold-Start Recommendations for Items

Manasi Vartak, Arvind Thiagarajan, Conrado Miranda, Jeshua Bratman, Hugo Larochelle

Neural Information Processing SystemsNov-21-2025, 08:56:57 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, machine learning, recommendation, (18 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

Add feedback

From Hard Refusals to Safe-Completions: Toward Output-Centric Safety Training

Yuan, Yuan, Sriskandarajah, Tina, Brakman, Anna-Luisa, Helyar, Alec, Beutel, Alex, Vallone, Andrea, Jain, Saachi

arXiv.org Artificial IntelligenceAug-14-2025

Large Language Models used in ChatGPT have traditionally been trained to learn a refusal boundary: depending on the user's intent, the model is taught to either fully comply or outright refuse. While this is a strong mitigation for explicitly malicious prompts, focusing safety training on refusals can lead to brittleness for prompts with obscured user intent. Binary refusal boundaries are especially ill-suited for dual-use cases (such as biology or cybersecurity), where a user request can be answered safely at a high level, but in some cases can lead to malicious uplift if sufficiently detailed or actionable. As an alternative, we propose safe-completions: a safety-training approach that centers on the safety of the assistant's output, rather than a binary classification of the user's intent. Safe-completions seek to maximize helpfulness within the safety policy's constraints. We incorporated this approach into GPT-5 and find that across both production comparisons and internally controlled experiments, safe-completion training improves safety (especially on dual-use prompts), reduces the severity of residual safety failures, and substantially increases model helpfulness.

helpfulness, large language model, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2508.09224

Genre: Research Report > Experimental Study (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Government (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DV365: Extremely Long User History Modeling at Instagram

Lyu, Wenhan, Tyagi, Devashish, Yang, Yihang, Li, Ziwei, Somani, Ajay, Shanmugasundaram, Karthikeyan, Andrejevic, Nikola, Adeputra, Ferdi, Zeng, Curtis, Singh, Arun K., Ransan, Maxime, Jain, Sagar

arXiv.org Artificial IntelligenceJun-3-2025

Long user history is highly valuable signal for recommendation systems, but effectively incorporating it often comes with high cost in terms of data center power consumption and GPU. In this work, we chose offline embedding over end-to-end sequence length optimization methods to enable extremely long user sequence modeling as a cost-effective solution, and propose a new user embedding learning strategy, multi-slicing and summarization, that generates highly generalizable user representation of user's long-term stable interest. History length we encoded in this embedding is up to 70,000 and on average 40,000. This embedding, named as DV365, is proven highly incremental on top of advanced attentive user sequence models deployed in Instagram. Produced by a single upstream foundational model, it is launched in 15 different models across Instagram and Threads with significant impact, and has been production battle-proven for >1 year since our first launch.

artificial intelligence, machine learning, social media, (18 more...)

arXiv.org Artificial Intelligence

2506.0045

Country: North America > United States > New York > New York County > New York City (0.15)

Genre: Research Report > New Finding (0.46)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.48)

Add feedback

360Brew: A Decoder-only Foundation Model for Personalized Ranking and Recommendation

Firooz, Hamed, Sanjabi, Maziar, Englhardt, Adrian, Gupta, Aman, Levine, Ben, Olgiati, Dre, Polatkan, Gungor, Melnychuk, Iuliia, Ramgopal, Karthik, Talanine, Kirill, Srinivasan, Kutta, Simon, Luke, Sivasubramoniapillai, Natesh, Ayan, Necip Fazil, Song, Qingquan, Sriram, Samira, Ghosh, Souvik, Song, Tao, Dharamsi, Tejas, Kothapalli, Vignesh, Zhai, Xiaoling, Xu, Ya, Wang, Yu, Dai, Yun

arXiv.org Artificial IntelligenceFeb-7-2025

Ranking and recommendation systems are the foundation for numerous online experiences, ranging from search results to personalized content delivery. These systems have evolved into complex, multilayered architectures that leverage vast datasets and often incorporate thousands of predictive models. The maintenance and enhancement of these models is a labor intensive process that requires extensive feature engineering. This approach not only exacerbates technical debt but also hampers innovation in extending these systems to emerging problem domains. In this report, we present our research to address these challenges by utilizing a large foundation model with a textual interface for ranking and recommendation tasks. We illustrate several key advantages of our approach: (1) a single model can manage multiple predictive tasks involved in ranking and recommendation, (2) decoder models with textual interface due to their comprehension of reasoning capabilities, can generalize to new recommendation surfaces and out-of-domain problems, and (3) by employing natural language interfaces for task definitions and verbalizing member behaviors and their social connections, we eliminate the need for feature engineering and the maintenance of complex directed acyclic graphs of model dependencies. We introduce our research pre-production model, 360Brew V1.0, a 150B parameter, decoder-only model that has been trained and fine-tuned on LinkedIn's data and tasks. This model is capable of solving over 30 predictive tasks across various segments of the LinkedIn platform, achieving performance levels comparable to or exceeding those of current production systems based on offline metrics, without task-specific fine-tuning. Notably, each of these tasks is conventionally addressed by dedicated models that have been developed and maintained over multiple years by teams of a similar or larger size than our own.

arxiv preprint arxiv, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2501.1645

Country:

North America > United States > Texas (0.04)
North America > United States > New York (0.04)
North America > United States > California > Santa Clara County > Sunnyvale (0.04)

Genre: Research Report (0.64)

Industry: Information Technology > Services (0.89)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback

A Meta-Learning Perspective on Cold-Start Recommendations for Items

Manasi Vartak, Arvind Thiagarajan, Conrado Miranda, Jeshua Bratman, Hugo Larochelle

Neural Information Processing SystemsOct-3-2024, 10:11:04 GMT

Neural Information Processing Systems http://nips.cc/

architecture, item history, recommendation, (15 more...)

Neural Information Processing Systems

Country:

North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > New South Wales > Sydney (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > California > Los Angeles County > Long Beach (0.04)

Industry: Information Technology > Services (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.70)

Add feedback

Counterfactual Evaluation of Ads Ranking Models through Domain Adaptation

Radwan, Mohamed A., Bhattacharjee, Himaghna, Lanners, Quinn, Zhang, Jiasheng, Karakulak, Serkan, Nassif, Houssam, Bayir, Murat Ali

arXiv.org Artificial IntelligenceSep-29-2024

We propose a domain-adapted reward model that works alongside an Offline A/B testing system for evaluating ranking models. This approach effectively measures reward for ranking model changes in large-scale Ads recommender systems, where model-free methods like IPS are not feasible. Our experiments demonstrate that the proposed technique outperforms both the vanilla IPS method and approaches using non-generalized reward models.

artificial intelligence, machine learning, reward model, (14 more...)

arXiv.org Artificial Intelligence

2409.19824

Country:

Europe > Italy > Apulia > Bari (0.06)
North America > United States > New York > New York County > New York City (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
Asia > Middle East > Israel (0.04)

Genre: Research Report (0.84)

Industry: Marketing (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (0.38)

Add feedback

Collaborative-Enhanced Prediction of Spending on Newly Downloaded Mobile Games under Consumption Uncertainty

Sun, Peijie, Wang, Yifan, Zhang, Min, Wu, Chuhan, Fang, Yan, Zhu, Hong, Fang, Yuan, Wang, Meng

arXiv.org Artificial IntelligenceApr-12-2024

With the surge in mobile gaming, accurately predicting user spending on newly downloaded games has become paramount for maximizing revenue. However, the inherently unpredictable nature of user behavior poses significant challenges in this endeavor. To address this, we propose a robust model training and evaluation framework aimed at standardizing spending data to mitigate label variance and extremes, ensuring stability in the modeling process. Within this framework, we introduce a collaborative-enhanced model designed to predict user game spending without relying on user IDs, thus ensuring user privacy and enabling seamless online training. Our model adopts a unique approach by separately representing user preferences and game features before merging them as input to the spending prediction module. Through rigorous experimentation, our approach demonstrates notable improvements over production models, achieving a remarkable \textbf{17.11}\% enhancement on offline data and an impressive \textbf{50.65}\% boost in an online A/B test. In summary, our contributions underscore the importance of stable model training frameworks and the efficacy of collaborative-enhanced models in predicting user spending behavior in mobile gaming.

collaborative signal, proceedings, production model, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3589335.3648297

2404.08301

Country:

Asia > Singapore > Central Region > Singapore (0.05)
Asia > China > Beijing > Beijing (0.05)
Asia > China > Guangdong Province > Shenzhen (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Information Technology (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.34)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Rankitect: Ranking Architecture Search Battling World-class Engineers at Meta Scale

Wen, Wei, Liu, Kuang-Hung, Fedorov, Igor, Zhang, Xin, Yin, Hang, Chu, Weiwei, Hassani, Kaveh, Sun, Mengying, Liu, Jiang, Wang, Xu, Jiang, Lin, Chen, Yuxin, Zhang, Buyun, Liu, Xi, Cheng, Dehua, Chen, Zhengxing, Zhao, Guang, Han, Fangqiu, Yang, Jiyan, Hao, Yuchen, Xiong, Liang, Chen, Wen-Yen

arXiv.org Artificial IntelligenceNov-13-2023

Neural Architecture Search (NAS) has demonstrated its efficacy in computer vision and potential for ranking systems. However, prior work focused on academic problems, which are evaluated at small scale under well-controlled fixed baselines. In industry system, such as ranking system in Meta, it is unclear whether NAS algorithms from the literature can outperform production baselines because of: (1) scale - Meta ranking systems serve billions of users, (2) strong baselines - the baselines are production models optimized by hundreds to thousands of world-class engineers for years since the rise of deep learning, (3) dynamic baselines - engineers may have established new and stronger baselines during NAS search, and (4) efficiency - the search pipeline must yield results quickly in alignment with the productionization life cycle. In this paper, we present Rankitect, a NAS software framework for ranking systems at Meta. Rankitect seeks to build brand new architectures by composing low level building blocks from scratch. Rankitect implements and improves state-of-the-art (SOTA) NAS methods for comprehensive and fair comparison under the same search space, including sampling-based NAS, one-shot NAS, and Differentiable NAS (DNAS). We evaluate Rankitect by comparing to multiple production ranking models at Meta. We find that Rankitect can discover new models from scratch achieving competitive tradeoff between Normalized Entropy loss and FLOPs. When utilizing search space designed by engineers, Rankitect can generate better models than engineers, achieving positive offline evaluation and online A/B test at Meta scale.

dimension, rankitect, supernet, (11 more...)

arXiv.org Artificial Intelligence

2311.0843

Country:

Asia > Singapore (0.05)
North America > United States > Virginia > Arlington County > Arlington (0.04)
North America > United States > New York > New York County > New York City (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

MLOps and ModelOps: What's the Difference and Why it Matters

#artificialintelligenceOct-28-2021, 21:30:33 GMT

These two terms are often used interchangeably. However, there are key distinctions between the functionality and features each provide, and the AI value and scalability at your organization depend on them. Did you know approximately half of the AI models that are developed never actually make it into production? If you want to understand why and prevent the waste of data scientist time and other resources from happening at your organization, then it is important to understand the difference between MLOps and ModelOps. They aren't the same, but the terms are often used interchangeably.

mlop and modelop, modelop, modelop platform, (8 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

MODELOPS VS MLOPS: HERE IS WHAT YOU NEED TO KNOW

#artificialintelligenceOct-5-2021, 19:48:46 GMT

One area marked by confusion today is understanding the differences between ModelOps vs. MLOps. ModelOps is the missing link for today's approach, connecting together existing data management solutions and model training tools to the value delivered via business applications. By incorporating ModelOps into your AI pipeline, you'll move past last-mile challenges with operationalizing AI and begin to see the return on your investments in the form of reduced costs, increased revenues, and better risk management. Recently, ModelOps has emerged as the critical link to addressing last-mile delivery challenges for AI deployments. ModelOps is a superset of MLOps, which refers to the processes involved to operationalize and manage AI models in use in production systems.

business application, modelop, modelop platform, (12 more...)

#artificialintelligence

Industry:

Information Technology > Software (0.39)
Information Technology > Security & Privacy (0.36)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.58)

Add feedback