best
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > Cuba > Holguín Province > Holguín (0.04)
- Asia (0.04)
- Africa (0.04)
Visual Clues: Bridging Vision and Language Foundations for Image Paragraph Captioning
Xie, Yujia, Zhou, Luowei, Dai, Xiyang, Yuan, Lu, Bach, Nguyen, Liu, Ce, Zeng, Michael
People say, "A picture is worth a thousand words". Then how can we get the rich information out of the image? We argue that by using visual clues to bridge large pretrained vision foundation models and language models, we can do so without any extra cross-modal training. Thanks to the strong zero-shot capability of foundation models, we start by constructing a rich semantic representation of the image (e.g., image tags, object attributes / locations, captions) as a structured textual prompt, called visual clues, using a vision foundation model. Based on visual clues, we use large language model to produce a series of comprehensive descriptions for the visual content, which is then verified by the vision model again to select the candidate that aligns best with the image. We evaluate the quality of generated descriptions by quantitative and qualitative measurement. The results demonstrate the effectiveness of such a structured semantic representation.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Nevada > Clark County > Las Vegas (0.04)
- Asia > Malaysia (0.04)
- (6 more...)
- Transportation > Infrastructure & Services (1.00)
- Transportation > Ground > Rail (1.00)
- Transportation > Ground > Road (0.92)
- Leisure & Entertainment > Sports > Tennis (0.70)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.69)
- Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Smart and Secure CAV Networks Empowered by AI-Enabled Blockchain: Next Frontier for Intelligent Safe-Driving Assessment
Xia, Le, Sun, Yao, Swash, Rafiq, Mohjazi, Lina, Zhang, Lei, Imran, Muhammad Ali
Securing a safe-driving circumstance for connected and autonomous vehicles (CAVs) continues to be a widespread concern despite various sophisticated functions delivered by artificial intelligence for in-vehicle devices. Besides, diverse malicious network attacks become ubiquitous along with the worldwide implementation of the Internet of Vehicles, which exposes a range of reliability and privacy threats for managing data in CAV networks. Combined with another fact that CAVs are now limited in handling intensive computation tasks, it thus renders a pressing demand of designing an efficient assessment system to guarantee autonomous driving safety without compromising data security. To this end, we propose in this article a novel framework of Blockchain-enabled intElligent Safe-driving assessmenT (BEST) to offer a smart and reliable approach for conducting safe driving supervision while protecting vehicular information. Specifically, a promising solution of exploiting a long short-term memory algorithm is first introduced in detail for an intElligent Safe-driving assessmenT (EST) scheme. To further facilitate the EST, we demonstrate how a distributed blockchain obtains adequate efficiency, trustworthiness and resilience with an adopted byzantine fault tolerance-based delegated proof-of-stake consensus mechanism. Moreover, several challenges and discussions regarding the future research of this BEST architecture are presented.
- Asia > China (0.04)
- North America > United States > Oklahoma (0.04)
- Europe > United Kingdom > Scotland > City of Glasgow > Glasgow (0.04)
- (4 more...)
- Information Technology > Security & Privacy (1.00)
- Automobiles & Trucks (1.00)
- Transportation > Ground > Road (0.49)