AITopics | supermodel

Collaborating Authors

supermodel

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Segregated Graphs and Marginals of Chain Graph Models

Ilya Shpitser

Neural Information Processing SystemsOct-2-2025, 11:38:16 GMT

Neural Information Processing Systems http://nips.cc/

artificial intelligence, graph, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Strength High (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Vaccines (0.47)
Health & Medicine > Therapeutic Area > Immunology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.31)

Add feedback

David Gandy: 'Britain produces some of the greatest models. We want to keep it that way'

BBC NewsSep-19-2025, 23:06:38 GMT

David Gandy: 'Britain produces some of the greatest models. We want to keep it that way' The Essex-born supermodel is sitting in his light-filled kitchen, sipping a glass of water and reflecting on his almost 25-year career. At 45, Gandy's striking dark brown hair, sharp cheekbones and piercing blue eyes have been at the centre of some of fashion's most iconic campaigns of the last two decades, and he is one of the few male models to become a household name. I always say that I was inspired by the female supermodels, Gandy says, name-checking Cindy Crawford, Kate Moss and Naomi Campbell. You don't even need to say the surnames.

britain produce, gandy, greatest model, (15 more...)

BBC News

Country:

South America (0.15)
North America > United States (0.15)
North America > Central America (0.15)
(14 more...)

Industry: Government > Regional Government > Europe Government > United Kingdom Government (0.49)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Segregated Graphs and Marginals of Chain Graph Models

Neural Information Processing SystemsJan-13-2025, 20:16:56 GMT

Bayesian networks are a popular representation of asymmetric (for example causal) relationships between random variables. Markov random fields (MRFs) are a complementary model of symmetric relationships used in computer vision, spatial modeling, and social and gene expression networks. A chain graph model under the Lauritzen-Wermuth-Frydenberg interpretation (hereafter a chain graph model) generalizes both Bayesian networks and MRFs, and can represent asymmetric and symmetric relationships together.As in other graphical models, the set of marginals from distributions in a chain graph model induced by the presence of hidden variables forms a complex model. One recent approach to the study of marginal graphical models is to consider a well-behaved supermodel. Such a supermodel of marginals of Bayesian networks, defined only by conditional independences, and termed the ordinary Markov model, was studied at length in (Evans and Richardson, 2014).In this paper, we show that special mixed graphs which we call segregated graphs can be associated, via a Markov property, with supermodels of a marginal of chain graphs defined only by conditional independences.

chain graph model, segregated graph and marginal, supermodel, (4 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.70)

Add feedback

A Unified Approach to Routing and Cascading for LLMs

Dekoninck, Jasper, Baader, Maximilian, Vechev, Martin

arXiv.org Artificial IntelligenceOct-14-2024

The widespread applicability of large language models (LLMs) has increased the availability of many fine-tuned models of various sizes targeting specific tasks. Given a set of such specialized models, to maximize overall performance, it is important to figure out the optimal strategy for selecting the right model for a given user query. An effective strategy could drastically increase overall performance and even offer improvements over a single large monolithic model. Existing approaches typically fall into two categories: routing, where a single model is selected for each query, and cascading, which runs a sequence of increasingly larger models until a satisfactory answer is obtained. However, both have notable limitations: routing commits to an initial model without flexibility, while cascading requires executing every model in sequence, which can be inefficient. Additionally, the conditions under which these strategies are provably optimal remain unclear. In this work, we derive optimal strategies for both routing and cascading. Building on this analysis, we propose a novel approach called cascade routing, which combines the adaptability of routing with the cost-efficiency of cascading. Our experiments demonstrate that cascade routing consistently outperforms both routing and cascading across a variety of settings, improving both output quality and lowering computational cost, thus offering a unified and efficient solution to the model selection problem.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2410.10347

Country:

Europe > Austria > Vienna (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(8 more...)

Genre:

Research Report > New Finding (0.68)
Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Segregated Graphs and Marginals of Chain Graph Models

Neural Information Processing SystemsMar-13-2024, 01:59:25 GMT

artificial intelligence, bayesian inference, machine learning, (16 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Francisco County > San Francisco (0.14)
North America > United States > California > Los Angeles County > Los Angeles (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (0.68)
Research Report > Strength High (0.46)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Vaccines (0.47)
Health & Medicine > Therapeutic Area > Immunology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.51)

Add feedback

CheX-Nomaly: Segmenting Lung Abnormalities from Chest Radiographs using Machine Learning

Singh, Sanskriti

arXiv.org Artificial IntelligenceNov-3-2023

The global challenge in chest radiograph X-ray (CXR) abnormalities often being misdiagnosed is primarily associated with perceptual errors, where healthcare providers struggle to accurately identify the location of abnormalities, rather than misclassification errors. We currently address this problem through disease-specific segmentation models. Unfortunately, these models cannot be released in the field due to their lack of generalizability across all thoracic diseases. A binary model tends to perform poorly when it encounters a disease that isn't represented in the dataset. We present CheX-nomaly: a binary localization U-net model that leverages transfer learning techniques with the incorporation of an innovative contrastive learning approach. Trained on the VinDr-CXR dataset, which encompasses 14 distinct diseases in addition to 'no finding' cases, my model achieves generalizability across these 14 diseases and others it has not seen before. We show that we can significantly improve the generalizability of an abnormality localization model by incorporating a contrastive learning method and dissociating the bounding boxes with its disease class. We also introduce a new loss technique to apply to enhance the U-nets performance on bounding box segmentation. By introducing CheX-nomaly, we offer a promising solution to enhance the precision of chest disease diagnosis, with a specific focus on reducing the significant number of perceptual errors in healthcare.

abnormality, dataset, supermodel, (16 more...)

arXiv.org Artificial Intelligence

2311.01777

Country:

North America > United States (0.04)
North America > Canada > Quebec > Montreal (0.04)
Asia > Vietnam > Hanoi > Hanoi (0.04)
Asia > Bangladesh (0.04)

Genre: Research Report > Promising Solution (0.48)

Industry:

Health & Medicine > Nuclear Medicine (1.00)
Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.32)

Add feedback

SpeedLimit: Neural Architecture Search for Quantized Transformer Models

Chai, Yuji, Bailey, Luke, Jin, Yunho, Karle, Matthew, Ko, Glenn G., Brooks, David, Wei, Gu-Yeon, Kung, H. T.

arXiv.org Artificial IntelligenceOct-13-2023

While research in the field of transformer models has primarily focused on enhancing performance metrics such as accuracy and perplexity, practical applications in industry often necessitate a rigorous consideration of inference latency constraints. Addressing this challenge, we introduce SpeedLimit, a novel Neural Architecture Search (NAS) technique that optimizes accuracy whilst adhering to an upper-bound latency constraint. Our method incorporates 8-bit integer quantization in the search process to outperform the current state-of-the-art technique. Our results underline the feasibility and efficacy of seeking an optimal balance between performance and latency, providing new avenues for deploying state-of-the-art transformer models in latency-sensitive environments.

architecture, latency, speedlimit, (11 more...)

arXiv.org Artificial Intelligence

2209.12127

Country: Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Add feedback

On Testability and Goodness of Fit Tests in Missing Data Models

Nabi, Razieh, Bhattacharya, Rohit

arXiv.org Artificial IntelligenceJun-10-2023

Significant progress has been made in developing identification and estimation techniques for missing data problems where modeling assumptions can be described via a directed acyclic graph. The validity of results using such techniques rely on the assumptions encoded by the graph holding true; however, verification of these assumptions has not received sufficient attention in prior work. In this paper, we provide new insights on the testable implications of three broad classes of missing data graphical models, and design goodness-of-fit tests for them. The classes of models explored are: sequential missing-at-random and missing-not-at-random models which can be used for modeling longitudinal studies with dropout/censoring, and a no self-censoring model which can be applied to cross-sectional studies and surveys.

artificial intelligence, data quality, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2203.00132

Country:

North America > United States > Massachusetts (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > Experimental Study (0.93)

Industry: Health & Medicine (0.94)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Quality (0.84)

Add feedback

Conformal Prediction with Partially Labeled Data

Javanmardi, Alireza, Sale, Yusuf, Hofman, Paul, Hüllermeier, Eyke

arXiv.org Artificial IntelligenceJun-1-2023

While the predictions produced by conformal prediction are set-valued, the data used for training and calibration is supposed to be precise. In the setting of superset learning or learning from partial labels, a variant of weakly supervised learning, it is exactly the other way around: training data is possibly imprecise (set-valued), but the model induced from this data yields precise predictions. In this paper, we combine the two settings by making conformal prediction amenable to set-valued training data. We propose a generalization of the conformal prediction procedure that can be applied to set-valued training and calibration data. We prove the validity of the proposed method and present experimental studies in which it compares favorably to natural baselines.

artificial intelligence, machine learning, prediction, (15 more...)

arXiv.org Artificial Intelligence

2306.01191

Country: Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Research Report > New Finding (0.48)
Research Report > Experimental Study (0.48)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.66)

Add feedback

Small wonders: stunning exhibition celebrates artistry of model buildings

The GuardianDec-9-2022, 18:12:55 GMT

When the eerily accurate AI image generator Dall-E 2 was released for public experimentation by OpenAI this summer, most people immediately used it to create whimsical scenes such as "samurai dolphin painted in the style of Rembrandt" or "Bruce Willis angrily devouring a cheeseburger on the moon". True, if you looked too closely at Bruce's left ear you might have noticed it wasn't there – but the freaky glitches were, though somewhat unsettling, part of the fun, not to mention a calming reminder that AI cannot entirely trick us that its images are real – yet. But more than one panicked architect also typed in, "Four-storey family home in forest in the style of Mies van der Rohe" or "Japanese-Scandi lounge area in office building lobby", and let out a tiny scream when the results resembled the renders of projects that architects otherwise spend long hours churning out. If an AI could knock out a decent interior in seconds, did it promise to be a fabulous time-saver – or would it put everyone out of a job? Not only does it celebrate the painstaking construction of physical structures, complete with tiny people and fake trees like a model railway set, which clearly took ages to make and no AI could come close to replicating – yet, but these models are also animatronic: they move, open, chirp, whirr, creak and close like Victorian clockwork figurines or the childlike works of Rodney Peppe.

model building, small wonder, stunning exhibition celebrate artistry, (9 more...)

The Guardian

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.93)

Add feedback