AITopics | subimage

Collaborating Authors

subimage

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Domain-decomposed image classification algorithms using linear discriminant analysis and convolutional neural networks

Klawonn, Axel, Lanser, Martin, Weber, Janine

arXiv.org Artificial IntelligenceOct-30-2024

In many modern computer application problems, the classification of image data plays an important role. Among many different supervised machine learning models, convolutional neural networks (CNNs) and linear discriminant analysis (LDA) as well as sophisticated variants thereof are popular techniques. In this work, two different domain decomposed CNN models are experimentally compared for different image classification problems. Both models are loosely inspired by domain decomposition methods and in addition, combined with a transfer learning strategy. The resulting models show improved classification accuracies compared to the corresponding, composed global CNN model without transfer learning and besides, also help to speed up the training process. Moreover, a novel decomposed LDA strategy is proposed which also relies on a localization approach and which is combined with a small neural network model. In comparison with a global LDA applied to the entire input data, the presented decomposed LDA approach shows increased classification accuracies for the considered test problems.

classification accuracy, cnn, neural network, (12 more...)

arXiv.org Artificial Intelligence

2410.23359

Country:

Europe > Germany > North Rhine-Westphalia > Cologne Region > Cologne (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Discriminant Analysis (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.56)

Add feedback

Model Parallel Training and Transfer Learning for Convolutional Neural Networks by Domain Decomposition

Klawonn, Axel, Lanser, Martin, Weber, Janine

arXiv.org Artificial IntelligenceAug-26-2024

Deep convolutional neural networks (CNNs) have been shown to be very successful in a wide range of image processing applications. However, due to their increasing number of model parameters and an increasing availability of large amounts of training data, parallelization strategies to efficiently train complex CNNs are necessary. In previous work by the authors, a novel model parallel CNN architecture was proposed which is loosely inspired by domain decomposition. In particular, the novel network architecture is based on a decomposition of the input data into smaller subimages. For each of these subimages, local CNNs with a proportionally smaller number of parameters are trained in parallel and the resulting local classifications are then aggregated in a second step by a dense feedforward neural network (DNN). In the present work, we compare the resulting CNN-DNN architecture to less costly alternatives to combine the local classifications into a final, global decision. Additionally, we investigate the performance of the CNN-DNN trained as one coherent model as well as using a transfer learning strategy, where the parameters of the pre-trained local CNNs are used as initial values for a subsequently trained global coherent CNN-DNN model.

architecture, cnn-dnn model, local cnn, (13 more...)

arXiv.org Artificial Intelligence

2408.14442

Country: Europe > Germany > North Rhine-Westphalia > Cologne Region > Cologne (0.05)

Genre: Research Report (0.70)

Industry: Health & Medicine (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

DDU-Net: A Domain Decomposition-based CNN for High-Resolution Image Segmentation on Multiple GPUs

Verburg, Corné, Heinlein, Alexander, Cyr, Eric C.

arXiv.org Artificial IntelligenceJul-31-2024

The segmentation of ultra-high resolution images poses challenges such as loss of spatial information or computational inefficiency. In this work, a novel approach that combines encoder-decoder architectures with domain decomposition strategies to address these challenges is proposed. Specifically, a domain decomposition-based U-Net (DDU-Net) architecture is introduced, which partitions input images into non-overlapping patches that can be processed independently on separate devices. A communication network is added to facilitate inter-patch information exchange to enhance the understanding of spatial context. Experimental validation is performed on a synthetic dataset that is designed to measure the effectiveness of the communication network. Then, the performance is tested on the DeepGlobe land cover classification dataset as a real-world benchmark data set. The results demonstrate that the approach, which includes inter-patch communication for images divided into $16\times16$ non-overlapping subimages, achieves a $2-3\,\%$ higher intersection over union (IoU) score compared to the same network without inter-patch communication. The performance of the network which includes communication is equivalent to that of a baseline U-Net trained on the full image, showing that our model provides an effective solution for segmenting ultra-high-resolution images while preserving spatial context. The code is available at https://github.com/corne00/HiRes-Seg-CNN.

ddu-net, feature map, subimage, (14 more...)

arXiv.org Artificial Intelligence

2407.21266

Country:

Europe > Netherlands > South Holland > Delft (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Energy (1.00)
Government > Regional Government > North America Government > United States Government (0.93)
Health & Medicine > Diagnostic Medicine (0.68)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Communications > Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.74)

Add feedback

ComCLIP: Training-Free Compositional Image and Text Matching

Jiang, Kenan, He, Xuehai, Xu, Ruize, Wang, Xin Eric

arXiv.org Artificial IntelligenceNov-13-2023

Contrastive Language-Image Pretraining (CLIP) has demonstrated great zero-shot performance for matching images and text. However, it is still challenging to adapt vision-lanaguage pretrained models like CLIP to compositional image and text matching -- a more challenging image and text matching task requiring the model understanding of compositional word concepts and visual components. Towards better compositional generalization in zero-shot image and text matching, in this paper, we study the problem from a causal perspective: the erroneous semantics of individual entities are essentially confounders that cause the matching failure. Therefore, we propose a novel \textbf{\textit{training-free}} compositional CLIP model (ComCLIP). ComCLIP disentangles input images into subjects, objects, and action sub-images and composes CLIP's vision encoder and text encoder to perform evolving matching over compositional text embedding and sub-image embeddings. In this way, ComCLIP can mitigate spurious correlations introduced by the pretrained CLIP models and dynamically evaluate the importance of each component. Experiments on four compositional image-text matching datasets: SVO, ComVG, Winoground, and VL-checklist, and two general image-text retrieval datasets: Flick30K, and MSCOCO demonstrate the effectiveness of our plug-and-play method, which boosts the \textbf{\textit{zero-shot}} inference ability of CLIP, SLIP, and BLIP2 even without further training or fine-tuning. Our codes can be found at https://github.com/eric-ai-lab/ComCLIP.

comclip, dataset, predicate, (12 more...)

arXiv.org Artificial Intelligence

2211.13854

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Better Understanding Differences in Attribution Methods via Systematic Evaluations

Rao, Sukrut, Böhle, Moritz, Schiele, Bernt

arXiv.org Artificial IntelligenceMar-21-2023

Deep neural networks are very successful on many vision tasks, but hard to interpret due to their black box nature. To overcome this, various post-hoc attribution methods have been proposed to identify image regions most influential to the models' decisions. Evaluating such methods is challenging since no ground truth attributions exist. We thus propose three novel evaluation schemes to more reliably measure the faithfulness of those methods, to make comparisons between them more fair, and to make visual inspection more systematic. To address faithfulness, we propose a novel evaluation setting (DiFull) in which we carefully control which parts of the input can influence the output in order to distinguish possible from impossible attributions. To address fairness, we note that different methods are applied at different layers, which skews any comparison, and so evaluate all methods on the same layers (ML-Att) and discuss how this impacts their performance on quantitative metrics. For more systematic visualizations, we propose a scheme (AggAtt) to qualitatively evaluate the methods on complete datasets. We use these evaluation schemes to study strengths and shortcomings of some widely used attribution methods over a wide range of models. Finally, we propose a post-processing smoothing step that significantly improves the performance of some attribution methods, and discuss its applicability.

artificial intelligence, attribution, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2303.11884

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Europe > France > Auvergne-Rhône-Alpes > Isère > Grenoble (0.04)
(5 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Publishers use AI to catch bad scientists doctoring data

#artificialintelligenceSep-15-2022, 10:12:03 GMT

Analysis Shady scientists trying to publish bad research may want to think twice as academic publishers are increasingly using AI software to automatically spot signs of data tampering. Duplications of images, where the same picture of a cluster of cells, for example, is copied, flipped, rotated, shifted, or cropped is, unfortunately, quite common. In cases where the errors aren't accidental, the doctored images are created to look as if the researchers have more data and conducted more experiments then they really did. Image duplication was the top reason papers were retracted for the American Association for Cancer Research (AACR) over 2016 to 2020, according to Daniel Evanko, the company's Director of Journal Operations and Systems. Having to retract a paper damages the authors and the publishers' reputation.

proofig, publisher, software, (14 more...)

#artificialintelligence

Country:

North America > United States > Illinois > Cook County > Chicago (0.05)
Asia > Middle East > Israel (0.05)

Genre: Research Report (0.32)

Industry: Health & Medicine > Therapeutic Area (0.35)

Technology: Information Technology > Artificial Intelligence > Applied AI (0.61)

Add feedback

Towards Better Understanding Attribution Methods

Rao, Sukrut, Böhle, Moritz, Schiele, Bernt

arXiv.org Artificial IntelligenceMay-20-2022

Deep neural networks are very successful on many vision tasks, but hard to interpret due to their black box nature. To overcome this, various post-hoc attribution methods have been proposed to identify image regions most influential to the models' decisions. Evaluating such methods is challenging since no ground truth attributions exist. We thus propose three novel evaluation schemes to more reliably measure the faithfulness of those methods, to make comparisons between them more fair, and to make visual inspection more systematic. To address faithfulness, we propose a novel evaluation setting (DiFull) in which we carefully control which parts of the input can influence the output in order to distinguish possible from impossible attributions. To address fairness, we note that different methods are applied at different layers, which skews any comparison, and so evaluate all methods on the same layers (ML-Att) and discuss how this impacts their performance on quantitative metrics. For more systematic visualizations, we propose a scheme (AggAtt) to qualitatively evaluate the methods on complete datasets. We use these evaluation schemes to study strengths and shortcomings of some widely used attribution methods. Finally, we propose a post-processing smoothing step that significantly improves the performance of some attribution methods, and discuss its applicability.

attribution, attribution method, final layer, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/CVPR52688.2022.00998

2205.10435

Country:

Europe > Germany > Saarland > Saarbrücken (0.04)
Europe > Italy > Marche > Ancona Province > Ancona (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Add feedback

Image Segmentation : Part 1

#artificialintelligenceJul-18-2021, 16:10:04 GMT

In this article we will cover Threshold Based and Edge based Segmentation. Other segmentation techniques will be discussed in later parts. Image thresholding segmentation is a simple form of image segmentation. It is a way to create a binary or multi color image based on setting a threshold value on the pixel intensity of the original image. In this thresholding process, we will consider the intensity histogram of all the pixels in the image.

image segmentation, pixel, threshold, (11 more...)

#artificialintelligence

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (0.62)
Information Technology > Artificial Intelligence > Vision (0.40)

Add feedback

AskReddit: Help with a guidance for my graduation thesis • /r/MachineLearning

#artificialintelligenceApr-9-2016, 20:06:44 GMT

Hello, I'm a computer scientist student, I will finish CS this year so I already started my graduation thesis. I work on a Computer Vision - Robotics lab here on my university and my main field of interest and that I want to pursue as an academic field is machine learning / deep learning, so I thought about mixing robotics with machine learning which is something very common. My main idea is Outdoor Autonomous Navigation, I want my robot to know what a grass is, what a tree is, what people and cars are so he can avoid it or do the things I will set it to do, my approach to the problem so far and what I already did is: For every image frame I slice the image into subImages and for each subImage I calculate it's histogram and compare with a huge data base containing tons of histograms of grass/sky/trees (for example) and run a knn/svm to classify the subImage into one of the closest histograms, and if everything goes by the script I will have a full labeled system for the robot, but I'm facing some problems and I'm not a really expert on the field yet so I really wan't some guidance because I don't know what to do, my professor told me this will be kinda hard to do this way and for a graduation thesis, I have implemented a LBP descriptor to classificate some textures like grass and asphalt but I can't use LBP for everything, I don't even know if the LBP will be accurate for grass and asphalt (if my dataset is huge enough), anyways, sorry for the long text, I just don't know what path to seek now, I don't even know if my current approach is a good one or I'm doing something silly.

artificial intelligence, guidance, machine learning, (7 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (0.83)

Add feedback

Fractally Finding the Odd One Out: An Analogical Strategy For Noticing Novelty

McGreggor, Keith (Georgia Institute of Technology) | Goel, Ashok (Georgia Institute of Technology)

AAAI ConferencesNov-1-2011

The Odd One Out test of intelligence consists of 3x3 matrix reasoning problems organized in 20 levels of difficulty. Addressing problems on this test appears to require integration of multiple cognitive abilities usually associated with creativity, including visual encoding, similarity assessment, pattern detection, and analogical transfer. We describe a novel fractal strategy for addressing visual analogy problems on the Odd One Out test. In our strategy, the relationship between images is encoded fractally, capturing important aspects of similarity as well as inherent self-similarity. The strategy starts with fractal representations encoded at a high level of resolution, but, if that is not sufficient to resolve ambiguity, it automatically adjusts itself to the right level of resolution for addressing a given problem. Similarly, the strategy starts with searching for fractally-derived similarity between simpler relationships, but, if that is not sufficient to resolve ambiguity, it automatically shifts to search for such similarity between higher-order relationships. We present preliminary results and initial analysis from applying the fractal technique on nearly 3,000 problems from the Odd One Out test.

artificial intelligence, representation, subimage, (16 more...)

AAAI Conferences

2011 AAAI Fall Symposium Series

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.88)

Add feedback