model number
Infinite forecast combinations based on Dirichlet process
Ren, Yinuo, Li, Feng, Kang, Yanfei, Wang, Jue
Forecast combination integrates information from various sources by consolidating multiple forecasts of the target time series. Rather than requiring the selection of a single optimal forecasting model, this paper introduces a deep learning ensemble forecasting model based on the Dirichlet process. First, the learning rate is sampled with three basis distributions as hyperparameters to convert the infinite mixture into a finite one. All checkpoints are then collected to establish a pool of deep learning sub-models, and weight-adjustment and diversity strategies are developed during the combination process. The main advantage of this method is its ability to generate the required base learners through a single training process, using a decaying strategy to tackle the challenge that the stochastic nature of gradient descent poses for determining the optimal learning rate. To demonstrate the method's generalizability and competitiveness, this paper conducts an empirical analysis on the weekly dataset from the M4 competition and explores sensitivity to the number of models combined. The results demonstrate that the proposed ensemble model offers substantial improvements in prediction accuracy and stability compared to a single benchmark model.
- Research Report > New Finding (0.48)
- Research Report > Experimental Study (0.47)
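The checkpoint-combination idea above can be sketched in a few lines: collect forecasts from checkpoints saved during one training run, then average them with weights that favor checkpoints with lower validation error. This is a minimal illustration, not the paper's actual algorithm; the function name, the softmax weighting over negative errors, and the toy data are all assumptions.

```python
import math

def combine_forecasts(checkpoint_preds, val_errors):
    """Weighted average of checkpoint forecasts.

    Weights are a softmax over negative validation errors, so
    checkpoints that validated better contribute more.
    """
    exps = [math.exp(-e) for e in val_errors]
    total = sum(exps)
    weights = [x / total for x in exps]
    horizon = len(checkpoint_preds[0])
    return [
        sum(w * preds[h] for w, preds in zip(weights, checkpoint_preds))
        for h in range(horizon)
    ]

# Three checkpoints from one training run, forecasting a 2-step horizon.
combined = combine_forecasts(
    [[10.0, 11.0], [12.0, 13.0], [30.0, 31.0]],
    [0.1, 0.2, 2.0],  # the third checkpoint validated poorly
)
```

With equal validation errors the combination reduces to a plain average; as one checkpoint's error grows, its weight decays exponentially toward zero.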
Analysis of frequent trading effects of various machine learning models
In recent years, high-frequency trading has emerged as a crucial strategy in stock trading. This study aims to develop an advanced high-frequency trading algorithm and compare the performance of three different mathematical models: the combination of the cross-entropy loss function and the quasi-Newton algorithm, the FCNN model, and the support vector machine (SVM). The proposed algorithm employs neural network predictions to generate trading signals and execute buy and sell operations based on specific conditions. By harnessing the power of neural networks, the algorithm enhances the accuracy and reliability of the trading strategy. To assess the effectiveness of the algorithm, the study evaluates the performance of the three mathematical models. The combination of the cross-entropy loss function and the quasi-Newton algorithm is a widely used logistic regression approach. The FCNN model, on the other hand, is a deep learning algorithm that can extract and classify features from stock data. Meanwhile, the SVM is a supervised learning algorithm recognized for achieving improved classification results by mapping data into high-dimensional spaces. By comparing the performance of these three models, the study aims to determine the most effective approach for high-frequency trading. This research makes a valuable contribution by introducing a novel methodology for high-frequency trading, thereby providing investors with a more accurate and reliable stock trading strategy.
- Asia > China > Hubei Province (0.04)
- North America > Trinidad and Tobago > Trinidad > Arima > Arima (0.04)
- Asia > China > Shanghai > Shanghai (0.04)
- Asia > China > Guangdong Province > Shenzhen (0.04)
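A trading algorithm of the kind described, one that turns a model's probability outputs into buy/sell/hold decisions under specific conditions, can be sketched with a simple threshold rule. This is a hypothetical illustration: the thresholds and the flat/long position logic are assumptions, not the study's actual conditions.

```python
def generate_signals(prob_up, buy_threshold=0.6, sell_threshold=0.4):
    """Turn predicted up-move probabilities into buy/sell/hold signals.

    Holds at most one long position: buy when flat and the model is
    confident the price will rise; sell when long and confidence drops.
    """
    signals = []
    long_position = False
    for p in prob_up:
        if not long_position and p >= buy_threshold:
            signals.append("buy")
            long_position = True
        elif long_position and p <= sell_threshold:
            signals.append("sell")
            long_position = False
        else:
            signals.append("hold")
    return signals

# Probabilities as a classifier (logistic regression, FCNN, or an SVM
# with probability calibration) might emit them on each tick.
print(generate_signals([0.7, 0.55, 0.3, 0.8]))  # ['buy', 'hold', 'sell', 'buy']
```

Any of the three compared models can sit in front of this rule, which is what makes a like-for-like comparison of their trading performance possible.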
Revisiting Design Choices in Model-Based Offline Reinforcement Learning
Lu, Cong, Ball, Philip J., Parker-Holder, Jack, Osborne, Michael A., Roberts, Stephen J.
Offline reinforcement learning enables agents to leverage large pre-collected datasets of environment transitions to learn control policies, circumventing the need for potentially expensive or unsafe online data collection. Significant progress has been made recently in offline model-based reinforcement learning, approaches which leverage a learned dynamics model. This typically involves constructing a probabilistic model, and using the model uncertainty to penalize rewards where there is insufficient data, solving for a pessimistic MDP that lower bounds the true MDP. Existing methods, however, exhibit a breakdown between theory and practice, whereby pessimistic return ought to be bounded by the total variation distance of the model from the true dynamics, but is instead implemented through a penalty based on estimated model uncertainty. This has spawned a variety of uncertainty heuristics, with little to no comparison between differing approaches. In this paper, we compare these heuristics, and design novel protocols to investigate their interaction with other hyperparameters, such as the number of models, or imaginary rollout horizon. Using these insights, we show that selecting these key hyperparameters using Bayesian Optimization produces superior configurations that are vastly different to those currently used in existing hand-tuned state-of-the-art methods, and result in drastically stronger performance.
- North America > United States (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Europe > France > Hauts-de-France > Nord > Lille (0.04)
- Research Report > New Finding (0.46)
- Research Report > Promising Solution (0.34)
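The uncertainty-penalty idea the paper investigates, penalizing rewards where the ensemble of learned dynamics models disagrees, can be sketched as reward minus a scaled disagreement term. This is a minimal sketch under assumptions: `penalized_reward` is a hypothetical name, disagreement is taken as the standard deviation of scalar ensemble predictions, and real methods operate on full next-state predictions.

```python
def penalized_reward(reward, ensemble_predictions, lam=1.0):
    """Pessimistic reward: subtract lam times the ensemble's disagreement.

    Disagreement is the standard deviation of the models' (scalar)
    predictions; where the dynamics models agree, no penalty is paid.
    """
    n = len(ensemble_predictions)
    mean = sum(ensemble_predictions) / n
    variance = sum((p - mean) ** 2 for p in ensemble_predictions) / n
    return reward - lam * variance ** 0.5

# Full agreement: no penalty.  Disagreement: the reward is discounted.
print(penalized_reward(1.0, [0.5, 0.5, 0.5]))      # 1.0
print(penalized_reward(1.0, [0.0, 1.0], lam=1.0))  # 0.5
```

The hyperparameters the paper tunes, such as `lam`, the number of ensemble members, and the rollout horizon, all interact with how aggressive this penalty is in practice.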
Interpretable Methods for Identifying Product Variants
West, Rebecca, Jadda, Khalifeh Al, Ahsan, Unaiza, Qu, Huiming, Cui, Xiquan
For e-commerce companies with large product selections, the organization and grouping of products in meaningful ways is important for creating great customer shopping experiences and cultivating an authoritative brand image. One important way of grouping products is to identify a family of product variants, where the variants are mostly the same with slight and yet distinct differences (e.g. color or pack size). In this paper, we introduce a novel approach to identifying product variants. It combines both constrained clustering and tailored NLP techniques (e.g. extraction of product family name from unstructured product title and identification of products with similar model numbers) to achieve superior performance compared with an existing baseline using a vanilla classification approach. In addition, we design the algorithm to meet certain business criteria, including meeting high accuracy requirements on a wide range of categories (e.g. appliances, decor, tools, and building materials, etc.) as well as prioritizing the interpretability of the model to make it accessible and understandable to all business partners.
- Asia > Taiwan > Taiwan Province > Taipei (0.06)
- North America > United States > Georgia > Fulton County > Atlanta (0.05)
- North America > United States > New York > New York County > New York City (0.04)
- Research Report (0.84)
- Overview (0.66)
- Materials > Construction Materials (0.54)
- Information Technology > Services > e-Commerce Services (0.35)
- Information Technology > Information Management (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
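One of the tailored NLP heuristics mentioned, identifying products with similar model numbers, can be illustrated with a toy pipeline: pull the alphanumeric model-number token from an unstructured title, strip a trailing suffix such as a color code, and group on the remaining core. All function names and the suffix heuristic here are assumptions for illustration, not the paper's algorithm.

```python
import re

def model_number(title):
    """Heuristic: the first token mixing letters and digits is the model number."""
    for token in title.split():
        if re.search(r"[A-Za-z]", token) and re.search(r"\d", token):
            return token.upper()
    return None

def variant_key(mn):
    """Strip a trailing hyphenated suffix (often a color or pack-size code)."""
    return mn.split("-")[0] if mn else None

def group_variants(titles):
    """Group titles whose model numbers share the same core."""
    groups = {}
    for title in titles:
        groups.setdefault(variant_key(model_number(title)), []).append(title)
    return groups

families = group_variants([
    "Frigidaire FFTR1821TS-BLK 18 cu. ft. Refrigerator, Black",
    "Frigidaire FFTR1821TS-WHT 18 cu. ft. Refrigerator, White",
    "DeWalt DCD771C2 20V Drill/Driver Kit",
])  # the two refrigerators land in one family, the drill in another
```

In the paper's setting this signal is combined with constrained clustering and family-name extraction rather than used alone; a lone regex heuristic would fail on many of the categories listed (decor, building materials, etc.).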
Accelerating Multi-Model Inference by Merging DNNs of Different Weights
Jeong, Joo Seong, Kim, Soojeong, Yu, Gyeong-In, Lee, Yunseong, Chun, Byung-Gon
Standardized DNN models that have been proven to perform well on machine learning tasks are widely used and often adopted as-is to solve downstream tasks, forming the transfer learning paradigm. However, when serving multiple instances of such DNN models from a cluster of GPU servers, existing techniques to improve GPU utilization, such as batching, are inapplicable because the models often do not share weights due to fine-tuning. We propose NetFuse, a technique for merging multiple DNN models that share the same architecture but have different weights and different inputs. NetFuse is made possible by replacing operations with more general counterparts that allow a set of weights to be associated with only a certain set of inputs. Experiments on ResNet-50, ResNeXt-50, BERT, and XLNet show that NetFuse can speed up DNN inference time by up to 3.6x on an NVIDIA V100 GPU and up to 3.0x on a TITAN Xp GPU when merging 32 model instances, while using only a small additional amount of GPU memory.
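The "more general counterpart" idea behind NetFuse, letting each set of weights apply only to its own inputs, can be illustrated with a block-diagonal construction: stacking per-instance weight matrices on the diagonal turns N small matrix multiplies into one larger one. This is a conceptual sketch in plain Python, not NetFuse's GPU implementation; `matmul` and `block_diag` are hypothetical helpers.

```python
def matmul(A, B):
    """Plain row-by-column matrix multiply on nested lists."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]

def block_diag(mats):
    """Place each weight matrix on the diagonal of one larger matrix."""
    rows = sum(len(m) for m in mats)
    cols = sum(len(m[0]) for m in mats)
    out = [[0.0] * cols for _ in range(rows)]
    r = c = 0
    for m in mats:
        for i, row in enumerate(m):
            for j, v in enumerate(row):
                out[r + i][c + j] = v
        r += len(m)
        c += len(m[0])
    return out

# Two fine-tuned instances of the same layer: same shape, different weights.
W1 = [[1.0, 0.0], [0.0, 1.0]]
W2 = [[2.0, 0.0], [0.0, 2.0]]
x1, x2 = [1.0, 2.0], [3.0, 4.0]

# One multiply serves both instances; each input only "sees" its own weights.
fused = matmul([x1 + x2], block_diag([W1, W2]))
print(fused)  # [[1.0, 2.0, 6.0, 8.0]]
```

The zeros off the diagonal are what enforce the "only a certain set of inputs" association; a real GPU kernel would exploit that sparsity (e.g. via grouped or batched matmuls) rather than materialize it.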
Approximate Model Counting by Partial Knowledge Compilation
Model counting is the problem of computing the number of satisfying assignments of a given propositional formula. Although exact model counters can be naturally furnished by most knowledge compilation (KC) methods, in practice they fail to generate the compiled results needed for exact counting on certain formulas due to the explosion in sizes. Decision-DNNF is an important KC language that captures most practical compilers. We propose a generalized Decision-DNNF (referred to as partial Decision-DNNF) by introducing a class of new leaf vertices (called unknown vertices), and then propose an algorithm called PartialKC to generate random partial Decision-DNNF formulas from the given formulas. An unbiased estimate of the model count can be computed via a random partial Decision-DNNF formula. Each call to PartialKC consists of multiple calls to MicroKC, each of which is an importance-sampling process equipped with KC technologies. The experimental results show that PartialKC is more accurate than both SampleSearch and SearchTreeSampler, PartialKC scales better than SearchTreeSampler, and the KC technologies can noticeably accelerate sampling.
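The unbiased-estimator idea can be illustrated with the simplest possible sampling counter: plain Monte Carlo over random assignments, whose expectation is exactly the model count. This is not PartialKC (which samples via partial Decision-DNNF compilation); it is a hedged stand-in showing only why sampling yields an unbiased estimate. Clauses use DIMACS-style signed integers.

```python
import random

def estimate_model_count(clauses, n_vars, samples=5000, seed=0):
    """Monte Carlo #SAT estimate: 2^n times the satisfying fraction.

    Each literal is a DIMACS-style signed integer (3 means x3, -3 means
    not-x3).  The estimate's expectation equals the true model count,
    i.e. the estimator is unbiased.
    """
    rng = random.Random(seed)
    hits = 0
    for _ in range(samples):
        assignment = [rng.random() < 0.5 for _ in range(n_vars)]
        if all(
            any(assignment[abs(lit) - 1] == (lit > 0) for lit in clause)
            for clause in clauses
        ):
            hits += 1
    return (2 ** n_vars) * hits / samples

# (x1 or x2) and (not-x1 or x2): the models are exactly the two
# assignments with x2 true, so the true count is 2.
print(estimate_model_count([[1, 2], [-1, 2]], n_vars=2))
```

Uniform sampling like this degrades badly when satisfying assignments are rare; the importance sampling inside MicroKC exists precisely to keep the estimator's variance manageable in that regime.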
Will Apple's new iPhone SE have a notch?
Apple's hugely popular iPhone SE is overdue an overhaul - and it could see the end of the headphone jack. The latest renders claiming to show the next-generation phone reveal the jack has gone - but that a new 'notch' has appeared. The notch, first seen in the iPhone X, would give the phone Face ID capabilities. The latest leaks also reveal the home button and headphone jack are gone, bringing the iPhone SE into line with the rest of Apple's line. The images were posted by @onleaks, although even he admitted they could be fake, tweeting 'Now that u aware I can't confirm if this one is partially or completely accurate or even exists but despite of that decided to share it for discussion purposes only.'
- Information Technology > Communications > Mobile (1.00)
- Information Technology > Artificial Intelligence > Vision > Face Recognition (0.54)
Singlecue Gen 2 review: This gesture-recognition device nearly provoked us into making some rude gestures
I have an Amazon Echo and a Logitech Harmony Remote in my bedroom, but my goal is to eliminate as many remotes as possible so I can control the TV, cable box, Amazon Fire TV, and other gadgets in my house as quickly and efficiently as possible. I want to be able to do things like dim the lights, turn on the TV, and tune to my favorite program in a single step, without needing to reach for a switch or fumble with multiple remotes. It's against that backdrop--and a desire to simplify my life--that I eagerly took the Singlecue gesture-control device out of the box and plugged it in. I was hoping that Singlecue's promise to let me control my home's smart devices with a wave of my hand would further my mission to eliminate remotes altogether. At first blush, Singlecue is a compelling device.
Constructing Reference Sets from Unstructured, Ungrammatical Text
Michelson, M., Knoblock, C. A.
Vast amounts of text on the Web are unstructured and ungrammatical, such as classified ads, auction listings, forum postings, etc. We call such text posts. Despite their inconsistent structure and lack of grammar, posts are full of useful information. This paper presents work on semi-automatically building tables of relational information, called reference sets, by analyzing such posts directly. Reference sets can be applied to a number of tasks such as ontology maintenance and information extraction. Our reference-set construction method starts with just a small amount of background knowledge, and constructs tuples representing the entities in the posts to form a reference set. We also describe an extension to this approach for the special case where even this small amount of background knowledge is impossible to discover and use. To evaluate the utility of the machine-constructed reference sets, we compare them to manually constructed reference sets in the context of reference-set-based information extraction. Our results show the reference sets constructed by our method outperform manually constructed reference sets. We also compare the reference-set-based extraction approach using the machine-constructed reference set to supervised extraction approaches using generic features. These results demonstrate that using machine-constructed reference sets outperforms the supervised methods, even though the supervised methods require training data.
- North America > United States > California > San Francisco County > San Francisco (0.04)
- North America > United States > California > San Diego County > San Diego (0.04)
- North America > United States > California > Los Angeles County > El Segundo (0.04)
- Automobiles & Trucks > Manufacturer (1.00)
- Transportation > Passenger (0.93)
- Transportation > Ground > Road (0.93)
- Government > Regional Government > North America Government > United States Government (0.67)
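The tuple-construction step described, starting from a small amount of background knowledge and forming entity tuples from posts, can be sketched with a toy rule: treat any token following a known seed value (e.g. a car make) as a related attribute (e.g. the model). The seed-and-adjacency rule is an assumption for illustration; the paper's method is considerably more sophisticated.

```python
def build_reference_set(posts, seed_makes):
    """Construct (make, model) tuples from ungrammatical posts.

    Toy rule: whenever a token matches a known make from the background
    knowledge, take the following token as that entity's model.
    """
    tuples = set()
    for post in posts:
        tokens = post.lower().split()
        for i, token in enumerate(tokens[:-1]):
            if token in seed_makes:
                tuples.add((token, tokens[i + 1]))
    return tuples

posts = [
    "93 honda civic runs great obo",
    "selling toyota camry 2001 clean title",
]
reference_set = build_reference_set(posts, seed_makes={"honda", "toyota"})
print(sorted(reference_set))  # [('honda', 'civic'), ('toyota', 'camry')]
```

Building the reference set from the posts themselves is what guarantees it overlaps with the entities actually mentioned, which is the paper's argument for why machine-constructed sets can beat hand-built ones.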
Exploiting Background Knowledge to Build Reference Sets for Information Extraction
Michelson, Matthew (Fetch Technologies) | Knoblock, Craig A. (University of Southern California / Information Sciences Institute)
Previous work on information extraction from unstructured, ungrammatical text (e.g. classified ads) showed that exploiting a set of background knowledge, called a "reference set," greatly improves the precision and recall of the extractions. However, finding a source for this reference set is often difficult, if not impossible. Further, even if a source is found, it might not overlap well with the text for extraction. In this paper we present an approach to building the reference set directly from the text itself. Our approach eliminates the need to find the source for the reference set, and ensures better overlap between the text and reference set. Starting with a small amount of background knowledge, our technique constructs tuples representing the entities in the text to form a reference set. Our results show that our method outperforms manually constructed reference sets, since hand built reference sets may not overlap with the entities in the unstructured, ungrammatical text. We also ran experiments comparing our method to the supervised approach of Conditional Random Fields (CRFs) using simple, generic features. These results show our method achieves an improvement in F1-measure for 6/9 attributes and is competitive in performance on the others, and this is without training data.
- Automobiles & Trucks > Manufacturer (1.00)
- Transportation > Passenger (0.94)
- Transportation > Ground > Road (0.94)
- Government (0.68)
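The downstream use of such a reference set, reference-set-based information extraction, can be sketched as labeling post tokens by matching them against reference-set attribute values. This is a naive exact-match illustration with hypothetical names; the actual approach handles the noisy, approximate matches that ungrammatical posts require.

```python
def extract_with_reference_set(post, reference_set):
    """Tag each post token with the reference-set attribute it matches, else 'O'."""
    value_to_attr = {}
    for record in reference_set:
        for attr, value in record.items():
            value_to_attr[value.lower()] = attr
    return [(token, value_to_attr.get(token.lower(), "O")) for token in post.split()]

reference_set = [
    {"make": "Honda", "model": "Civic"},
    {"make": "Toyota", "model": "Camry"},
]
print(extract_with_reference_set("93 Honda Civic low miles", reference_set))
# [('93', 'O'), ('Honda', 'make'), ('Civic', 'model'), ('low', 'O'), ('miles', 'O')]
```

Unlike the CRF baseline in the comparison, this kind of lookup needs no labeled training data; all of its knowledge lives in the reference set itself.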