AITopics

2304.05898

Country: Asia > Japan > Honshū > Chūgoku > Hiroshima Prefecture > Hiroshima (0.05)

Genre: Research Report > New Finding (0.66)

Industry: Health & Medicine (0.49)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Hanai, Ryo, Domae, Yukiyasu, Ramirez-Alpizar, Ixchel G., Leme, Bruno, Ogata, Tetsuya

Force Map: Learning to Predict Contact Force Distribution from Vision

arXiv.org Artificial IntelligenceApr-12-2023

When humans see a scene, they can roughly imagine the forces applied to objects based on their experience and use them to handle the objects properly. This paper considers transferring this "force-visualization" ability to robots. We hypothesize that a rough force distribution (named "force map") can be utilized for object manipulation strategies even if accurate force estimation is impossible. Based on this hypothesis, we propose a training method to predict the force map from vision. To investigate this hypothesis, we generated scenes where objects were stacked in bulk through simulation and trained a model to predict the contact force from a single image. We further applied domain randomization to make the trained model function on real images. The experimental results showed that the model trained using only synthetic images could predict approximate patterns representing the contact areas of the objects even for real images. Then, we designed a simple algorithm to plan a lifting direction using the predicted force distribution. We confirmed that using the predicted force distribution contributes to finding natural lifting directions for typical real-world scenes. Furthermore, the evaluation through simulations showed that the disturbance caused to surrounding objects was reduced by 26 % (translation displacement) and by 39 % (angular displacement) for scenes where objects were overlapping.

artificial intelligence, machine learning, pattern recognition, (19 more...)

2304.05803

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report > New Finding (0.54)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.86)

Iscen, Ahmet, Fathi, Alireza, Schmid, Cordelia

Improving Image Recognition by Retrieving from Web-Scale Image-Text Data

arXiv.org Artificial IntelligenceApr-11-2023

Retrieval augmented models are becoming increasingly popular for computer vision tasks after their recent success in NLP problems. The goal is to enhance the recognition capabilities of the model by retrieving similar examples for the visual input from an external memory set. In this work, we introduce an attention-based memory module, which learns the importance of each retrieved example from the memory. Compared to existing approaches, our method removes the influence of the irrelevant retrieved examples, and retains those that are beneficial to the input query. We also thoroughly study various ways of constructing the memory dataset. Our experiments show the benefit of using a massive-scale memory dataset of 1B image-text pairs, and demonstrate the performance of different memory representations. We evaluate our method in three different classification tasks, namely long-tailed recognition, learning with noisy labels, and fine-grained classification, and show that it achieves state-of-the-art accuracies in ImageNet-LT, Places-LT and Webvision datasets.

artificial intelligence, machine learning, pattern recognition, (16 more...)

2304.05173

Country:

Europe > Poland (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.40)

arXiv.org Artificial IntelligenceApr-10-2023

A Hybrid Deep Feature-Based Deformable Image Registration Method for Pathology Images

Zhang, Chulong, Jiang, Yuming, Li, Na, Zhang, Zhicheng, Islam, Md Tauhidul, Dai, Jingjing, Liu, Lin, He, Wenfeng, Qin, Wenjian, Xiong, Jing, Xie, Yaoqin, Liang, Xiaokun

Pathologists need to combine information from differently stained pathology slices for accurate diagnosis. Deformable image registration is a necessary technique for fusing multi-modal pathology slices. This paper proposes a hybrid deep feature-based deformable image registration framework for stained pathology samples. We first extract dense feature points via the detector-based and detector-free deep learning feature networks and perform points matching. Then, to further reduce false matches, an outlier detection method combining the isolation forest statistical model and the local affine correction model is proposed. Finally, the interpolation method generates the deformable vector field for pathology image registration based on the above matching points. We evaluate our method on the dataset of the Non-rigid Histology Image Registration (ANHIR) challenge, which is co-organized with the IEEE ISBI 2019 conference. Our technique outperforms the traditional approaches by 17% with the Average-Average registration target error (rTRE) reaching 0.0034. The proposed method achieved state-of-the-art performance and ranked 1st in evaluating the test dataset. The proposed hybrid deep feature-based registration method can potentially become a reliable method for pathology image registration.

machine learning, pattern recognition, registration, (17 more...)

2208.07655

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)
North America > United States > New York > Monroe County > Rochester (0.04)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.50)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceApr-9-2023, 01:10:16 GMT

Learning Distance Metrics with Triplet Loss: Advantages and Challenges - AITechTrend

Triplet loss is a loss function that is widely used in machine learning for tasks such as image recognition, facial recognition, and information retrieval. The idea behind triplet loss is to learn a distance metric between objects such that objects that are similar are close together in the metric space, while objects that are dissimilar are far apart. In this article, we will introduce triplet loss, discuss how it works, and explore some of its applications. Triplet loss is a type of loss function used in machine learning that is designed to learn a distance metric between objects. The goal of triplet loss is to embed objects in a metric space such that objects that are similar are close together in the space, while objects that are dissimilar are far apart.

distance metric, loss function, triplet loss, (13 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.41)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.31)

Bruintjes, Robert-Jan, Motyka, Tomasz, van Gemert, Jan

What Affects Learned Equivariance in Deep Image Recognition Models?

arXiv.org Artificial IntelligenceApr-7-2023

Equivariance w.r.t. geometric transformations in neural networks improves data efficiency, parameter efficiency and robustness to out-of-domain perspective shifts. When equivariance is not designed into a neural network, the network can still learn equivariant functions from the data. We quantify this learned equivariance, by proposing an improved measure for equivariance. We find evidence for a correlation between learned translation equivariance and validation accuracy on ImageNet. We therefore investigate what can increase the learned equivariance in neural networks, and find that data augmentation, reduced model capacity and inductive bias in the form of convolutions induce higher learned equivariance in neural networks.

equivariance, machine learning, pattern recognition, (15 more...)

2304.02628

Country:

Europe > Netherlands > South Holland > Delft (0.04)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
North America > United States > New York (0.04)
Africa (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition > Image Matching (0.41)

Neural Information Processing SystemsApr-6-2023, 19:58:19 GMT

Neural Network Star Pattern Recognition for Spacecraft Attitude Determination and Control

Currently, the most complex spacecraft attitude determination and control tasks are ultimately governed by ground-based systems and personnel. Conventional on-board systems face severe serial microprocessors operating on inherently parallel problems. New computer architectures based on the anatomy of the human brain seem to promise high speed and fault-tolerant solutions to the limitations of serial processing.

neural network star pattern recognition, spacecraft attitude determination and control

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.50)

Neural Information Processing SystemsApr-6-2023, 19:52:51 GMT

Recognizing Hand-Printed Letters and Digits

We are developing a hand-printed character recognition system using a multi(cid:173) layered neural net trained through backpropagation. We report on results of training nets with samples of hand-printed digits scanned off of bank checks and hand-printed letters interactively entered into a computer through a sty(cid:173) lus digitizer. Given a large training set, and a net with sufficient capacity to achieve high performance on the training set, nets typically achieved error rates of 4-5% at a 0% reject rate and 1-2% at a 10% reject rate. The topology and capacity of the system, as measured by the number of connections in the net, have surprisingly little effect on generalization. For those developing practical pattern recognition systems, these results suggest that a large and representative training sample may be the single, most important factor in achieving high recognition accuracy.

cid, error rate, recognizing hand-printed letter, (13 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.59)

Neural Information Processing SystemsApr-6-2023, 19:42:36 GMT

On Stochastic Complexity and Admissible Models for Neural Network Classifiers

In this paper we examine in a general sense the application of Minimum Description Length (MDL) techniques to the problem of selecting a good classifier from a large set of candidate models or hypotheses. Pattern recognition algorithms differ from more conventional statistical modeling techniques in the sense that they typically choose from a very large number of candidate models to describe the available data. Hence, the problem of searching through this set of candidate models is frequently a formidable one, often approached in practice by the use of greedy algorithms. In this context, techniques which allow us to eliminate portions of the hypothesis space are of considerable interest. We will show in this paper that it is possible to use the intrinsic structure of the MDL formalism to eliminate large numbers of candidate models given only minimal information about the data.

candidate model, neural network classifier, stochastic complexity and admissible model, (2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.40)

Neural Information Processing SystemsApr-6-2023, 19:27:06 GMT

Adaptive Elastic Models for Hand-Printed Character Recognition

Hand-printed digits can be modeled as splines that are governed by about 8 control points. Images of digits can be produced by placing Gaussian ink generators uniformly along the spline. Real images can be recognized by finding the digit model most likely to have generated the data. For each digit model we use an elastic matching algorithm to minimize an energy function that includes both the defor(cid:173) mation energy of the digit model and the log probability that the model would generate the inked pixels in the image. If a uniform noise process is included in the model of image generation, some of the inked pixels can be rejected as noise as a digit model is fitting a poorly segmented image.

adaptive elastic model, digit model, hand-printed character recognition, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.40)