AITopics | desired

Collaborating Authors

desired

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training

Su, Tongkun, Li, Jun, Zhang, Xi, Jin, Haibo, Chen, Hao, Wang, Qiong, Lv, Faqin, Zhao, Baoliang, Hu, Yin

arXiv.org Artificial IntelligenceApr-8-2024

Multimodal pre-training demonstrates its potential in the medical domain, which learns medical visual representations from paired medical reports. However, many pre-training tasks require extra annotations from clinicians, and most of them fail to explicitly guide the model to learn the desired features of different pathologies. To the best of our knowledge, we are the first to utilize Visual Question Answering (VQA) for multimodal pre-training to guide the framework focusing on targeted pathological features. In this work, we leverage descriptions in medical reports to design multi-granular question-answer pairs associated with different diseases, which assist the framework in pre-training without requiring extra annotations from experts. We also propose a novel pre-training framework with a quasi-textual feature transformer, a module designed to transform visual features into a quasi-textual space closer to the textual domain via a contrastive learning strategy. This narrows the vision-language gap and facilitates modality alignment. Our framework is applied to four downstream tasks: report generation, classification, segmentation, and detection across five datasets. Extensive experiments demonstrate the superiority of our framework compared to other state-of-the-art methods. Our code will be released upon acceptance.

international conference, nodule, report generation, (14 more...)

arXiv.org Artificial Intelligence

2404.00226

Country:

Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)
Asia > China > Hong Kong (0.05)
Asia > China > Guangdong Province > Shenzhen (0.05)
Africa > Cameroon > Far North Region > Maroua (0.04)

Genre: Research Report (0.70)

Industry:

Health & Medicine > Health Care Technology (0.87)
Health & Medicine > Diagnostic Medicine > Imaging (0.72)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.61)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Swashplateless-elevon Actuation for a Dual-rotor Tail-sitter VTOL UAV

Chen, Nan, Kong, Fanze, Li, Haotian, Liu, Jiayuan, Ye, Ziwei, Xu, Wei, Zhu, Fangcheng, Lyu, Ximin, Zhang, Fu

arXiv.org Artificial IntelligenceSep-24-2023

In this paper, we propose a novel swashplateless-elevon actuation (SEA) for dual-rotor tail-sitter vertical takeoff and landing (VTOL) unmanned aerial vehicles (UAVs). In contrast to the conventional elevon actuation (CEA) which controls both pitch and yaw using elevons, the SEA adopts swashplateless mechanisms to generate an extra moment through motor speed modulation to control pitch and uses elevons solely for controlling yaw, without requiring additional actuators. This decoupled control strategy mitigates the saturation of elevons' deflection needed for large pitch and yaw control actions, thus improving the UAV's control performance on trajectory tracking and disturbance rejection performance in the presence of large external disturbances. Furthermore, the SEA overcomes the actuation degradation issues experienced by the CEA when the UAV is in close proximity to the ground, leading to a smoother and more stable take-off process. We validate and compare the performances of the SEA and the CEA in various real-world flight conditions, including take-off, trajectory tracking, and hover flight and position steps under external disturbance. Experimental results demonstrate that the SEA has better performances than the CEA. Moreover, we verify the SEA's feasibility in the attitude transition process and fixed-wing-mode flight of the VTOL UAV. The results indicate that the SEA can accurately control pitch in the presence of high-speed incoming airflow and maintain a stable attitude during fixed-wing mode flight. Video of all experiments can be found in youtube.com/watch?v=Sx9Rk4Zf7sQ

cea, disturbance, uav, (15 more...)

arXiv.org Artificial Intelligence

2309.13559

Country:

Asia > China > Hong Kong (0.04)
Asia > Middle East > Republic of Türkiye > Karaman Province > Karaman (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.66)

Industry:

Transportation > Air (1.00)
Aerospace & Defense > Aircraft (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.48)

Add feedback

Learning Membership Functions in a Function-Based Object Recognition System

Woods, K., Cook, D., Hall, L., Bowyer, K., Stark, L.

Journal of Artificial Intelligence ResearchOct-1-1995

Functionality-based recognition systems recognize objects at the category level by reasoning about how well the objects support the expected function. Such systems naturally associate a ``measure of goodness'' or ``membership value'' with a recognized object. This measure of goodness is the result of combining individual measures, or membership values, from potentially many primitive evaluations of different properties of the object's shape. A membership function is used to compute the membership value when evaluating a primitive of a particular physical property of an object. In previous versions of a recognition system known as Gruff, the membership function for each of the primitive evaluations was hand-crafted by the system designer. In this paper, we provide a learning component for the Gruff system, called Omlet, that automatically learns membership functions given a set of example objects labeled with their desired category measure. The learning algorithm is generally applicable to any problem in which low-level membership values are combined through an and-or tree structure to give a final overall membership value.

definition tree name, desired, pand, (10 more...)

Journal of Artificial Intelligence Research

doi: 10.1613/jair.236

AI Access Foundation

10144

Journal of Artificial Intelligence Research

Technology:

Information Technology > Artificial Intelligence > Vision (0.40)
Information Technology > Artificial Intelligence > Machine Learning (0.40)

Add feedback