AITopics | image look

Collaborating Authors

image look

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization

Kawaharazuka, Kento, Obinata, Yoshiki, Kanazawa, Naoaki, Okada, Kei, Inaba, Masayuki

arXiv.org Artificial IntelligenceSep-26-2024

For example, the robot must recognize whether a door is open, a light is on, water is running, a fire is burning, and so on. In order to change the robot's behavior based on the recognition results, state recognition is usually performed with discrete values of about two or three options. Until now, appropriate individual methods have been used for each state to be recognized, such as direct processing of images or point clouds by human programming [3, 4], creating a dataset with annotations and training neural networks [5], or detecting the state by installing new sensors [6, 7]. However, these methods require us to manually program the process for each state recognition, to train neural networks one by one, and to increase the number of sensors installed. In addition, this will increase the number of programs and trained models needed for each state recognition, which will cause problems in management of source code and computer resource. To cope with these problems, a single program or model should be able to recognize multiple states. In this study, we propose a method to easily recognize various environmental states in a unified manner and through the spoken language (Figure 1). In order to perform state recognition through the spoken language, we use pre-trained large-scale vision-language models (VLMs) [8-12]. Currently, VLMs are being used for map generation [13, 14], scene understanding [15-17], and feature extraction for behav-Corresponding author.

optimization, recognition, state recognition, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1080/01691864.2024.2366995

2409.17519

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.05)

Genre: Research Report > New Finding (0.50)

Industry: Transportation > Air (0.42)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

VQA-based Robotic State Recognition Optimized with Genetic Algorithm

Kawaharazuka, Kento, Obinata, Yoshiki, Kanazawa, Naoaki, Okada, Kei, Inaba, Masayuki

arXiv.org Artificial IntelligenceMar-9-2023

State recognition of objects and environment in robots has been conducted in various ways. In most cases, this is executed by processing point clouds, learning images with annotations, and using specialized sensors. In contrast, in this study, we propose a state recognition method that applies Visual Question Answering (VQA) in a Pre-Trained Vision-Language Model (PTVLM) trained from a large-scale dataset. By using VQA, it is possible to intuitively describe robotic state recognition in the spoken language. On the other hand, there are various possible ways to ask about the same event, and the performance of state recognition differs depending on the question. Therefore, in order to improve the performance of state recognition using VQA, we search for an appropriate combination of questions using a genetic algorithm. We show that our system can recognize not only the open/closed of a refrigerator door and the on/off of a display, but also the open/closed of a transparent door and the state of water, which have been difficult to recognize.

accuracy, image look, recognition, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/ICRA48891.2023.10160390

2303.05052

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.05)
Asia > Japan > Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre: Research Report (0.86)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

The same frightening women is keep on appearing in AI generated Images - Creepy AI

#artificialintelligenceSep-12-2022, 06:17:55 GMT

Loab was created by negatively weighted prompts and her macabre appearance consistely turns up in images, even when the AI is directed away from Loab's prompts. The creepy monster was discovered by Supercomposite who is a Swedish musician. "I discovered this woman, who I call Loab, in April. The AI reproduced her more easily than most celebrities. Her presence is persistent, and she haunts every image she touches," writes Supercomposite.

frightening woman, loab, supercomposite, (6 more...)

#artificialintelligence

Country: North America > United States (0.18)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback

Metapix – Metapix makes your images look creative with funny photo effects in seconds with the help of AI.

#artificialintelligenceFeb-2-2022, 15:41:58 GMT

Celebrate Birthday With Our Happy Birthday Effect & Enjoy and share your Friends and Family. Make your photo Magical with Birthday Effect. These are amazing and stunning photo . These beautiful effects are ideal to your memories and make them unforgettable.

funny photo effect, image look, metapix make, (1 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

SnapArt – SnapArt makes your images look creative with funny photo effects in seconds with the help of AI.

#artificialintelligenceJul-14-2021, 04:50:17 GMT

You have visited an Art gallery and amazed by not only the Art collection but frames around Art. Frames create new perspectives on images and make them more relevant. We provide you a chance to Lose your inner Artist inside you and create amazing frames for your images.

funny photo effect, image look, snapart make

#artificialintelligence

Technology: Information Technology > Artificial Intelligence (0.40)

Add feedback

Do you know which inputs your neural network likes most? :: Päpper's Coding Blog -- Have fun coding.

#artificialintelligenceJan-1-2020, 14:43:28 GMT

Recent advances in training deep neural networks have led to a whole bunch of impressive machine learning models which are able to tackle a very diverse range of tasks. When you are developing such a model, one of the notable downsides is that it is considered a "black-box" approach in the sense that your model learns from data you feed it, but you don't really know what is going on inside the model. To make it clearer: you don't really know what your model actually learned and if you have a flaw in your training / data approach it might work well according to your metrics while having learnt the wrong thing. As a self-respecting developer you want to do better than that, so today I will show you a method you can use to get some better introspection into your model by using visualization techniques. So what is a visualization techniqe when we talk about deep neural networks?

input image, neural network, target number, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.61)

Add feedback