Goto

Collaborating Authors

 recognition ability


A Data Generation Perspective to the Mechanism of In-Context Learning

arXiv.org Artificial Intelligence

In-Context Learning (ICL) empowers Large Language Models (LLMs) with the capacity to learn in context, achieving downstream generalization without gradient updates but with a few in-context examples. Despite the encouraging empirical success, the underlying mechanism of ICL remains unclear, and existing research offers various viewpoints of understanding. These studies propose intuition-driven and ad-hoc technical solutions for interpreting ICL, illustrating an ambiguous road map. In this paper, we leverage a data generation perspective to reinterpret recent efforts and demonstrate the potential broader usage of popular technical solutions, approaching a systematic angle. For a conceptual definition, we rigorously adopt the terms of skill learning and skill recognition. The difference between them is skill learning can learn new data generation functions from in-context data. We also provide a comprehensive study on the merits and weaknesses of different solutions, and highlight the uniformity among them given the perspective of data generation, establishing a technical foundation for future research to incorporate the strengths of different lines of research.


Human-Machine Cooperative Multimodal Learning Method for Cross-subject Olfactory Preference Recognition

arXiv.org Artificial Intelligence

Odor sensory evaluation has a broad application in food, clothing, cosmetics, and other fields. Traditional artificial sensory evaluation has poor repeatability, and the machine olfaction represented by the electronic nose (E-nose) is difficult to reflect human feelings. Olfactory electroencephalogram (EEG) contains odor and individual features associated with human olfactory preference, which has unique advantages in odor sensory evaluation. However, the difficulty of cross-subject olfactory EEG recognition greatly limits its application. It is worth noting that E-nose and olfactory EEG are more advantageous in representing odor information and individual emotions, respectively. In this paper, an E-nose and olfactory EEG multimodal learning method is proposed for cross-subject olfactory preference recognition. Firstly, the olfactory EEG and E-nose multimodal data acquisition and preprocessing paradigms are established. Secondly, a complementary multimodal data mining strategy is proposed to effectively mine the common features of multimodal data representing odor information and the individual features in olfactory EEG representing individual emotional information. Finally, the cross-subject olfactory preference recognition is achieved in 24 subjects by fusing the extracted common and individual features, and the recognition effect is superior to the state-of-the-art recognition methods. Furthermore, the advantages of the proposed method in cross-subject olfactory preference recognition indicate its potential for practical odor evaluation applications.


Zero-Shot Refinement of Buildings' Segmentation Models using SAM

arXiv.org Artificial Intelligence

Foundation models have excelled in various tasks but are often evaluated on general benchmarks. The adaptation of these models for specific domains, such as remote sensing imagery, remains an underexplored area. In remote sensing, precise building instance segmentation is vital for applications like urban planning. While Convolutional Neural Networks (CNNs) perform well, their generalization can be limited. For this aim, we present a novel approach to adapt foundation models to address existing models' generalization dropback. Among several models, our focus centers on the Segment Anything Model (SAM), a potent foundation model renowned for its prowess in class-agnostic image segmentation capabilities. We start by identifying the limitations of SAM, revealing its suboptimal performance when applied to remote sensing imagery. Moreover, SAM does not offer recognition abilities and thus fails to classify and tag localized objects. To address these limitations, we introduce different prompting strategies, including integrating a pre-trained CNN as a prompt generator. This novel approach augments SAM with recognition abilities, a first of its kind. We evaluated our method on three remote sensing datasets, including the WHU Buildings dataset, the Massachusetts Buildings dataset, and the AICrowd Mapping Challenge. For out-of-distribution performance on the WHU dataset, we achieve a 5.47% increase in IoU and a 4.81% improvement in F1-score. For in-distribution performance on the WHU dataset, we observe a 2.72% and 1.58% increase in True-Positive-IoU and True-Positive-F1 score, respectively. We intend to release our code repository, hoping to inspire further exploration of foundation models for domain-specific tasks within the remote sensing community.


Generalized Learning Vector Quantization

Neural Information Processing Systems

We propose a new learning method, "Generalized Learning Vec(cid:173) tor Quantization (GLVQ)," in which reference vectors are updated based on the steepest descent method in order to minimize the cost function . The cost function is determined so that the obtained learning rule satisfies the convergence condition. We prove that Kohonen's rule as used in LVQ does not satisfy the convergence condition and thus degrades recognition ability. Experimental re(cid:173) sults for printed Chinese character recognition reveal that GLVQ is superior to LVQ in recognition ability.


Anxiety hinders ability to read emotions, claims study

Daily Mail - Science & tech

From tingling in the limbs to increased heart rate and blood pressure - anxiety has been said to do a range of unusual things to the body. Now, a new study has found that this nervous disorder can also hinder your ability to interpret other people's emotions. Researchers have discovered that those in a heightened state of anxiety were unable to determine whether a person was happy or angry - and many people reported seeing the latter regardless of the facial expression. A new study has found that this nervous disorder can also hinder your ability to interpret facial expressions. Following the first two portions of the study, researcher had discovered that when individuals inhaled the carbon-dioxide rich air, or had an anxiety attack, they were eight percent worse at correctly identifying facial expressions.


What does your face reveal about you, and who is the better judge: humans or AI?

#artificialintelligence

"I never forget a face", "She's got an honest face", "You could see it in his face", and "She looks young for her age" are just a few of the often-used phrases suggesting that faces are important for our interactions with other people and what we think we know about them. But can people really remember faces as well as they think they do, and can we really tell someone's age from their face? Or can artificial intelligence (AI) do it better? And can we really tell if someone is trustworthy just by looking at their face? Research shows that humans exhibit a wide range of facial recognition abilities.


China's 'The Brain' winner beats AI robot in facial recognition challenge

#artificialintelligence

Wang Yuheng, who won China's'The Brain' last year, beats AIipay AI in facial recognition contest. Wang Yuheng, who is famous in China for his exceptional memory and observation skills, won against artificial intelligence (AI) robot "Mark" in a three-round facial recognition challenge. The live challenge, according to International Business Times, was a publicity stunt held by the largest digital payment service in China, Alipay. Alipay has just launched "Mark," a facial recognition AI feature for its service. Alipay invited Wang, who rose to fame after joining and winning Chinese reality television contest "The Brain" in 2014, to a face-off with Mark.


Generalized Learning Vector Quantization

Neural Information Processing Systems

We propose a new learning method, "Generalized Learning Vector Quantization (GLVQ)," in which reference vectors are updated based on the steepest descent method in order to minimize the cost function. The cost function is determined so that the obtained learning rule satisfies the convergence condition. We prove that Kohonen's rule as used in LVQ does not satisfy the convergence condition and thus degrades recognition ability. Experimental results for printed Chinese character recognition reveal that GLVQ is superior to LVQ in recognition ability.


Generalized Learning Vector Quantization

Neural Information Processing Systems

We propose a new learning method, "Generalized Learning Vector Quantization (GLVQ)," in which reference vectors are updated based on the steepest descent method in order to minimize the cost function. The cost function is determined so that the obtained learning rule satisfies the convergence condition. We prove that Kohonen's rule as used in LVQ does not satisfy the convergence condition and thus degrades recognition ability. Experimental results for printed Chinese character recognition reveal that GLVQ is superior to LVQ in recognition ability.


Generalized Learning Vector Quantization

Neural Information Processing Systems

We propose a new learning method, "Generalized Learning Vector Quantization (GLVQ)," in which reference vectors are updated based on the steepest descent method in order to minimize the cost function. The cost function is determined so that the obtained learning rule satisfies the convergence condition. We prove that Kohonen's rule as used in LVQ does not satisfy the convergence condition and thus degrades recognition ability. Experimental results for printed Chinese character recognition reveal that GLVQ is superior to LVQ in recognition ability.