AITopics | halvorsen

Collaborating Authors

halvorsen

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Multi-Task Learning for Visually Grounded Reasoning in Gastrointestinal VQA

Safwan, Itbaan, Shaikh, Muhammad Annas, Haaris, Muhammad, Khan, Ramail, Tahir, Muhammad Atif

arXiv.org Artificial IntelligenceNov-7-2025

We present a multi-task framework for the MediaEval Medico 2025 challenge, leveraging a LoRA-tuned Florence-2 model for simultaneous visual question answering (VQA), explanation generation, and visual grounding. The proposed system integrates three curated datasets: (1) Kvasir-VQA-x1 for question-answer learning, (2) a synthetically enriched explanation dataset offering structured medical reasoning, and (3) text-to-region pairs linking visual features with segmentation masks. This multi-task setup enables the model to jointly learn visual grounding, reasoning, and interpretation, producing responses that are both accurate and interpretable. Extensive evaluation demonstrates that our approach substantially improves over single-task baselines in both answer accuracy and visual localization, highlighting the effectiveness of grounded multi-task learning for medical VQA applications.

explanation, machine learning, question answering, (14 more...)

arXiv.org Artificial Intelligence

2511.04384

Country:

North America > United States (0.14)
Asia > Pakistan (0.14)

Genre: Research Report (0.42)

Industry: Health & Medicine > Diagnostic Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.37)
Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (0.35)

Add feedback

DPERC: Direct Parameter Estimation for Mixed Data

Vo, Tuan L., Do, Quan Huu, Dang, Uyen, Nguyen, Thu, Halvorsen, Pål, Riegler, Michael A., Nguyen, Binh T.

arXiv.org Machine LearningJan-17-2025

The covariance matrix is a foundation in numerous statistical and machine-learning applications such as Principle Component Analysis, Correlation Heatmap, etc. However, missing values within datasets present a formidable obstacle to accurately estimating this matrix. While imputation methods offer one avenue for addressing this challenge, they often entail a trade-off between computational efficiency and estimation accuracy. Consequently, attention has shifted towards direct parameter estimation, given its precision and reduced computational burden. In this paper, we propose Direct Parameter Estimation for Randomly Missing Data with Categorical Features (DPERC), an efficient approach for direct parameter estimation tailored to mixed data that contains missing values within continuous features. Our method is motivated by leveraging information from categorical features, which can significantly enhance covariance matrix estimation for continuous features. Our approach effectively harnesses the information embedded within mixed data structures. Through comprehensive evaluations of diverse datasets, we demonstrate the competitive performance of DPERC compared to various contemporary techniques. In addition, we also show by experiments that DPERC is a valuable tool for visualizing the correlation heatmap.

artificial intelligence, covariance matrix, machine learning, (17 more...)

arXiv.org Machine Learning

2501.1054

Country:

Asia > Vietnam (0.17)
Europe > Norway (0.15)

Genre: Research Report (0.50)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Polyp and Surgical Instrument Segmentation with Double Encoder-Decoder Networks

Galdran, Adrian

arXiv.org Artificial IntelligenceJun-6-2024

This paper describes a solution for the MedAI competition, in which participants were required to segment both polyps and surgical instruments from endoscopic images. Our approach relies on a double encoder-decoder neural network which we have previously applied for polyp segmentation, but with a series of enhancements: a more powerful encoder architecture, an improved optimization procedure, and the post-processing of segmentations based on tempered model ensembling. Experimental results show that our method produces segmentations that show a good agreement with manual delineations provided by medical experts.

instrument segmentation, johansen, segmentation, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.5617/nmi.9107

2406.03901

Country: Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.05)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Health Care Technology (0.75)
Health & Medicine > Diagnostic Medicine > Imaging (0.72)
Health & Medicine > Surgery (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

Combining datasets to increase the number of samples and improve model fitting

Nguyen, Thu, Khadka, Rabindra, Phan, Nhan, Yazidi, Anis, Halvorsen, Pål, Riegler, Michael A.

arXiv.org Artificial IntelligenceMay-16-2023

For many use cases, combining information from different datasets can be of interest to improve a machine learning model's performance, especially when the number of samples from at least one of the datasets is small. However, a potential challenge in such cases is that the features from these datasets are not identical, even though there are some commonly shared features among the datasets. To tackle this challenge, we propose a novel framework called Combine datasets based on Imputation (ComImp). In addition, we propose a variant of ComImp that uses Principle Component Analysis (PCA), PCA-ComImp in order to reduce dimension before combining datasets. This is useful when the datasets have a large number of features that are not shared between them. Furthermore, our framework can also be utilized for data preprocessing by imputing missing data, i.e., filling in the missing entries while combining different datasets. To illustrate the power of the proposed methods and their potential usages, we conduct experiments for various tasks: regression, classification, and for different data types: tabular data, time series data, when the datasets to be combined have missing data. We also investigate how the devised methods can be used with transfer learning to provide even further model training improvement. Our results indicate that the proposed methods are somewhat similar to transfer learning in that the merge can significantly improve the accuracy of a prediction model on smaller datasets. In addition, the methods can boost performance by a significant margin when combining small datasets together and can provide extra improvement when being used with transfer learning.

artificial intelligence, dataset, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2210.05165

Country:

Europe > Norway > Eastern Norway > Oslo (0.05)
Asia > Vietnam > Hồ Chí Minh City > Hồ Chí Minh City (0.04)
Europe > Norway > Western Norway > Vestland > Bergen (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.49)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.77)

Add feedback

Mask-conditioned latent diffusion for generating gastrointestinal polyp images

Macháček, Roman, Mozaffari, Leila, Sepasdar, Zahra, Parasa, Sravanthi, Halvorsen, Pål, Riegler, Michael A., Thambawita, Vajira

arXiv.org Artificial IntelligenceApr-11-2023

In order to take advantage of AI solutions in endoscopy diagnostics, we must overcome the issue of limited annotations. These limitations are caused by the high privacy concerns in the medical field and the requirement of getting aid from experts for the time-consuming and costly medical data annotation process. In computer vision, image synthesis has made a significant contribution in recent years as a result of the progress of generative adversarial networks (GANs) and diffusion probabilistic models (DPM). Novel DPMs have outperformed GANs in text, image, and video generation tasks. Therefore, this study proposes a conditional DPM framework to generate synthetic GI polyp images conditioned on given generated segmentation masks. Our experimental results show that our system can generate an unlimited number of high-fidelity synthetic polyp images with the corresponding ground truth masks of polyps. To test the usefulness of the generated data, we trained binary image segmentation models to study the effect of using synthetic data. Results show that the best micro-imagewise IOU of 0.7751 was achieved from DeepLabv3+ when the training data consists of both real data and synthetic data. However, the results reflect that achieving good segmentation performance with synthetic data heavily depends on model architectures.

artificial intelligence, diffusion model, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2304.05233

Country:

Europe > Switzerland > Zürich > Zürich (0.14)
Europe > Norway > Eastern Norway > Oslo (0.05)
Oceania > Australia > Victoria > Melbourne (0.04)
(5 more...)

Genre: Research Report > New Finding (0.68)

Industry:

Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Diagnostic Medicine (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A contrastive learning approach for individual re-identification in a wild fish population

Olsen, Ørjan Langøy, Sørdalen, Tonje Knutsen, Goodwin, Morten, Malde, Ketil, Knausgård, Kristian Muri, Halvorsen, Kim Tallaksen

arXiv.org Artificial IntelligenceJan-2-2023

In both terrestrial and marine ecology, physical tagging is a frequently used method to study population dynamics and behavior. However, such tagging techniques are increasingly being replaced by individual re-identification using image analysis. This paper introduces a contrastive learning-based model for identifying individuals. The model uses the first parts of the Inception v3 network, supported by a projection head, and we use contrastive learning to find similar or dissimilar image pairs from a collection of uniform photographs. We apply this technique for corkwing wrasse, Symphodus melops, an ecologically and commercially important fish species. Photos are taken during repeated catches of the same individuals from a wild population, where the intervals between individual sightings might range from a few days to several years. Our model achieves a one-shot accuracy of 0.35, a 5-shot accuracy of 0.56, and a 100-shot accuracy of 0.88, on our dataset.

accuracy, artificial intelligence, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2301.00596

Country:

Europe > Norway > Western Norway > Vestland > Bergen (0.04)
Europe > Norway > Northern Norway > Troms > Tromsø (0.04)

Genre: Research Report (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Pyramid-Focus-Augmentation: Medical Image Segmentation with Step-Wise Focus

Thambawita, Vajira, Hicks, Steven, Halvorsen, Pål, Riegler, Michael A.

arXiv.org Artificial IntelligenceDec-14-2020

Segmentation of findings in the gastrointestinal tract is a challenging but also an important task which is an important building stone for sufficient automatic decision support systems. In this work, we present our solution for the Medico 2020 task, which focused on the problem of colon polyp segmentation. We present our simple but efficient idea of using an augmentation method that uses grids in a pyramid-like manner (large to small) for segmentation. Our results show that the proposed methods work as indented and can also lead to comparable results when competing with other methods.

grid size, johansen, segmentation, (12 more...)

arXiv.org Artificial Intelligence

2012.0743

Country: Europe > Norway > Eastern Norway > Oslo (0.04)

Genre: Research Report > New Finding (0.54)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.51)
Health & Medicine > Therapeutic Area > Gastroenterology (0.36)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.30)

Add feedback

Medico

#artificialintelligenceJul-30-2020, 09:14:45 GMT

The "Medico automatic polyp segmentation task" aims to develop computer-aided diagnosis systems for automatic polyp segmentation to detect all types of polyps (for example, irregular polyp, smaller or flat polyps) with high efficiency and accuracy. The main goal of the challenge is to benchmark semantic segmentation algorithms on a publicly available dataset, emphasizing robustness, speed, and generalization. Participants will get access to a dataset consisting of 1,000 segmented polyp images from the gastrointestinal tract and a separate testing dataset. The challenge consists of two mandatory tasks, each focused on a different requirement for efficient polyp detection. We hope that this task encourages multimedia researchers to apply their vast knowledge to the medical field and make an impact that may affect real lives.

artificial intelligence, dataset, polyp, (17 more...)

#artificialintelligence

Country: Europe > Norway (0.05)

Industry:

Health & Medicine > Therapeutic Area > Gastroenterology (0.97)
Health & Medicine > Therapeutic Area > Oncology (0.80)

Technology: Information Technology > Artificial Intelligence > Vision (0.36)

Add feedback

An Extensive Study on Cross-Dataset Bias and Evaluation Metrics Interpretation for Machine Learning applied to Gastrointestinal Tract Abnormality Classification

Thambawita, Vajira, Jha, Debesh, Hammer, Hugo Lewi, Johansen, Håvard D., Johansen, Dag, Halvorsen, Pål, Riegler, Michael A.

arXiv.org Machine LearningMay-8-2020

Precise and efficient automated identification of Gastrointestinal (GI) tract diseases can help doctors treat more patients and improve the rate of disease detection and identification. Currently, automatic analysis of diseases in the GI tract is a hot topic in both computer science and medical-related journals. Nevertheless, the evaluation of such an automatic analysis is often incomplete or simply wrong. Algorithms are often only tested on small and biased datasets, and cross-dataset evaluations are rarely performed. A clear understanding of evaluation metrics and machine learning models with cross datasets is crucial to bring research in the field to a new quality level. Towards this goal, we present comprehensive evaluations of five distinct machine learning models using Global Features and Deep Neural Networks that can classify 16 different key types of GI tract conditions, including pathological findings, anatomical landmarks, polyp removal conditions, and normal findings from images captured by common GI tract examination instruments. In our evaluation, we introduce performance hexagons using six performance metrics such as recall, precision, specificity, accuracy, F1-score, and Matthews Correlation Coefficient to demonstrate how to determine the real capabilities of models rather than evaluating them shallowly. Furthermore, we perform cross-dataset evaluations using different datasets for training and testing. With these cross-dataset evaluations, we demonstrate the challenge of actually building a generalizable model that could be used across different hospitals. Our experiments clearly show that more sophisticated performance metrics and evaluation methods need to be applied to get reliable models rather than depending on evaluations of the splits of the same dataset, i.e., the performance metrics should always be interpreted together rather than relying on a single metric.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Machine Learning

2005.03912

Country:

Europe > Norway > Eastern Norway > Oslo (0.05)
Europe > Norway > Northern Norway > Troms > Tromsø (0.04)
Africa > Cameroon > Far North Region > Maroua (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Oncology (1.00)
Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
Health & Medicine > Diagnostic Medicine (0.95)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.93)

Add feedback

The Medico-Task 2018: Disease Detection in the Gastrointestinal Tract using Global Features and Deep Learning

Thambawita, Vajira, Jha, Debesh, Riegler, Michael, Halvorsen, Pål, Hammer, Hugo Lewi, Johansen, Håvard D., Johansen, Dag

arXiv.org Machine LearningOct-31-2018

We have proposed a system based on global features and deep neural networks. The best approach combines two neural networks, and the reproducible experimental results signify the efficiency of the proposed model with an accuracy rate of 95.80%, a precision of 95.87%, and an F1-score of 95.80%.

artificial intelligence, machine learning, proceedings, (15 more...)

arXiv.org Machine Learning

1810.13278

Country: Europe > Norway (0.30)

Genre: Research Report (0.50)

Industry: Health & Medicine > Therapeutic Area > Gastroenterology (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback