AITopics

2409.02361

Country:

North America > United States (0.14)
Europe > Czechia > Prague (0.04)
North America > Dominican Republic (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Leisure & Entertainment > Sports > Soccer (0.97)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

arXiv.org Artificial IntelligenceFeb-21-2024

LLMs Meet Long Video: Advancing Long Video Comprehension with An Interactive Visual Adapter in LLMs

Li, Yunxin, Chen, Xinyu, Hu, Baotain, Zhang, Min

Long video understanding is a significant and ongoing challenge in the intersection of multimedia and artificial intelligence. Employing large language models (LLMs) for comprehending video becomes an emerging and promising method. However, this approach incurs high computational costs due to the extensive array of video tokens, experiences reduced visual clarity as a consequence of token aggregation, and confronts challenges arising from irrelevant visual tokens while answering video-related questions. To alleviate these issues, we present an Interactive Visual Adapter (IVA) within LLMs, designed to enhance interaction with fine-grained visual elements. Specifically, we first transform long videos into temporal video tokens via leveraging a visual encoder alongside a pretrained causal transformer, then feed them into LLMs with the video instructions. Subsequently, we integrated IVA, which contains a lightweight temporal frame selector and a spatial feature interactor, within the internal blocks of LLMs to capture instruction-aware and fine-grained visual signals. Consequently, the proposed video-LLM facilitates a comprehensive understanding of long video content through appropriate long video modeling and precise visual interactions. We conducted extensive experiments on nine video understanding benchmarks and experimental results show that our interactive visual adapter significantly improves the performance of video LLMs on long video QA tasks. Ablation studies further verify the effectiveness of IVA in long and short video understandings.

iva, llm, video, (15 more...)

2402.13546

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > France > Provence-Alpes-Côte d'Azur > Bouches-du-Rhône > Marseille (0.04)
Asia > China > Heilongjiang Province > Harbin (0.04)
Asia > China > Guangdong Province > Shenzhen (0.04)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

EngadgetAug-29-2023, 18:30:40 GMT

A Google-powered chatbot is handling GM's non-emergency OnStar calls

General Motors is taking Google's AI chatbot on the road. The automaker announced today that it's using Google Cloud's Dialogflow to automate some non-emergency OnStar features like navigation and call routing. Crucially, the automaker claims the bot can pinpoint keywords indicating an emergency situation and "quickly route the call" to trained humans when needed. GM says the system frees up OnStar Advisors to spend more time with customers requiring a live human. According to GM, the OnStar Interactive Virtual Assistant (IVA) has used Google Cloud's Dialogflow under the hood since IVA's 2022 launch.

google cloud, google-powered chatbot, non-emergency onstar call, (9 more...)

Engadget

Country:

North America > United States (0.06)
North America > Canada (0.06)

Genre: Press Release (0.80)

Industry: Automobiles & Trucks > Manufacturer (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

#artificialintelligenceMar-26-2023, 17:22:02 GMT

2023's Top 4 AI Use Cases in Healthcare Communications

I recently had a scare during the holidays when my octogenarian father, visiting from out of town, fell in the kitchen during our Christmas Eve gathering. My heart skipped a beat at that moment and 10 things ran through my mind about what to do next, like wishing we had a doctor in the family! So going to the ER was not the answer. However, we had concerns for him, and I needed some peace of mind. Could I give him Tylenol, or would that interfere with his current medicines?

healthcare communication, iva, use case, (11 more...)

Industry:

Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Consumer Health (0.71)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.31)

#artificialintelligenceJan-30-2023, 01:20:21 GMT

Airmeez Deploys SoundHound

Airmeez announced they will be working with SoundHound AI, Inc. technology to bring a seamless conversational AI experience to software-as-a-service intelligent virtual assistant, notification and customer engagement solutions. Intelligent Virtual Assistants (IVAs) are AI-powered assistants that achieve the purpose of interactions while generating personalized responses by combining analytics and cognitive computing based on individual customer information, past conversations, and location. SoundHound's advanced voice AI will now be deployed through Airmeez offerings, which include a customer communications platform as a service for building custom IVA, notification and other communication solutions using a no-code drag-and-drop design tool. This voice-enablement will allow callers to interact with Airmeez applications, providing award-winning accuracy of intent detection, convenience and cost savings in healthcare, government, education and other markets. SoundHound's proprietary Speech-to-Meaning system will allow Airmeez to deliver a faster and more positive caller experience, an improved efficiency of call centers, and reduce staffing challenges and costs for their customers.

artificial intelligence, soundhound, speech recognition, (5 more...)

Technology:

Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)

#artificialintelligenceNov-8-2022, 00:07:57 GMT

AI And CCaaS Help Marsh McLennan And McKesson Build Business Efficiencies

Today, organizations understand the pressing need to expand and enhance their digital sales and support solutions to address new customer demands. Consumers are demanding anytime, anywhere services. While many companies have bolstered their web and mobile presence, customers still turn to the contact center when they need additional support. Given this trend, businesses must modernize and improve their contact center capabilities. Five9 recently held a joint analyst call with several of its customer to share how various companies were evolving their contact centers.

marsh mclennan, mckesson, mckesson build business efficiency, (11 more...)

Country: North America (0.05)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.33)

Technology: Information Technology > Artificial Intelligence (1.00)

arXiv.org Artificial IntelligenceOct-26-2022

Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring

Elsayed, Nelly, ElSayed, Zag, Asadizanjani, Navid, Ozer, Murat, Abdelgawad, Ahmed, Bayoumi, Magdy

Understanding human behavior and monitoring mental health are essential to maintaining the community and society's safety. As there has been an increase in mental health problems during the COVID-19 pandemic due to uncontrolled mental health, early detection of mental issues is crucial. Nowadays, the usage of Intelligent Virtual Personal Assistants (IVA) has increased worldwide. Individuals use their voices to control these devices to fulfill requests and acquire different services. This paper proposes a novel deep learning model based on the gated recurrent neural network and convolution neural network to understand human emotion from speech to improve their IVA services and monitor their mental health.

artificial intelligence, emotion, machine learning, (18 more...)

2208.12812

Country:

North America > United States > Louisiana (0.05)
North America > Canada > Ontario > Toronto (0.05)
North America > United States > Ohio (0.04)
(4 more...)

Genre: Research Report (0.64)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

arXiv.org Artificial IntelligenceJul-26-2022

An Adaptive Deep Clustering Pipeline to Inform Text Labeling at Scale

Chen, Xinyu, Beaver, Ian

Mining the latent intentions from large volumes of natural language inputs is a key step to help data analysts design and refine Intelligent Virtual Assistants (IVAs) for customer service and sales support. We created a flexible and scalable clustering pipeline within the Verint Intent Manager (VIM) that integrates the fine-tuning of language models, a high performing k-NN library and community detection techniques to help analysts quickly surface and organize relevant user intentions from conversational texts. The fine-tuning step is necessary because pre-trained language models cannot encode texts to efficiently surface particular clustering structures when the target texts are from an unseen domain or the clustering task is not topic detection. We describe the pipeline and demonstrate its performance and ability to scale on three real-world text mining tasks. As deployed in the VIM application, this clustering pipeline produces high quality results, improving the performance of data analysts and reducing the time it takes to surface intentions from customer service data, thereby reducing the time it takes to build and deploy IVAs in new domains.

data mining, machine learning, natural language, (15 more...)

2202.01211

Country:

Europe > Romania (0.05)
Africa > South Africa (0.05)
Oceania > New Zealand (0.04)
(5 more...)

Genre: Research Report (0.40)

Industry: Leisure & Entertainment > Sports (1.00)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.96)

#artificialintelligenceMar-14-2022, 14:43:42 GMT

Artificial Intelligence: Break the Bias

Women in AI Ireland in collaboration with Women in Research Ireland present this live virtual event: Artificial Intelligence #BreaktheBias. With this event, we celebrate technical competencies in Artificial Intelligence for both industry and academia by highlighting different pathways from academia to industry and challenges therein to mark the International Women's Day 2022! Dr. Georgiana Ifrim is an Associate Professor at the School of Computer Science, UCD, co-lead of the SFI Centre for Research Training in Machine Learning (ML-Labs) and SFI Funded Investigator with the Insight Centre for Data Analytics and VistaMilk SFI Centre. Dr Ifrim holds a PhD and MSc in Machine Learning, from Max-Planck Institute for Informatics, Germany, and a BSc in Computer Science, from University of Bucharest, Romania. Her research focuses on effective approaches for large scale sequence learning, time series classification and text mining.

artificial intelligence, computer science, university, (14 more...)

Country:

Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.25)
Europe > Germany (0.25)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.05)
(4 more...)

Industry: Education > Educational Setting (0.31)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.80)
Information Technology > Artificial Intelligence > Natural Language (0.70)

#artificialintelligenceNov-9-2021, 08:50:30 GMT

Future paths into conversational artificial intelligence

Make way for interactive virtual assistants (IVAs). But can chatbots change the customer experience? Can IVAs like Amelia go beyond the limitations of chatbots? There is plenty to consider. In 50 years, chatbots have evolved first to engage users in dialogues for customer service in many fields and now to conversations on personal medication information.

application, chatbot, conversational artificial intelligence, (11 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)