AITopics | Personal

Personal

LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model

Gao, Peng, Han, Jiaming, Zhang, Renrui, Lin, Ziyi, Geng, Shijie, Zhou, Aojun, Zhang, Wei, Lu, Pan, He, Conghui, Yue, Xiangyu, Li, Hongsheng, Qiao, Yu

arXiv.org Artificial Intelligence Apr-28-2023

Efficiently transforming large language models (LLMs) into instruction followers has recently become a popular research direction, while training LLMs for multi-modal reasoning remains less explored. Although the recent LLaMA-Adapter demonstrates the potential to handle visual inputs with LLMs, it still cannot generalize well to open-ended visual instructions and lags behind GPT-4. In this paper, we present LLaMA-Adapter V2, a parameter-efficient visual instruction model. Specifically, we first augment LLaMA-Adapter by unlocking more learnable parameters (e.g., norm, bias and scale), which distributes the instruction-following ability across the entire LLaMA model in addition to the adapters. Secondly, we propose an early fusion strategy that feeds visual tokens only into the early LLM layers, contributing to better visual knowledge incorporation. Thirdly, we introduce a joint training paradigm for image-text pairs and instruction-following data that optimizes disjoint groups of learnable parameters. This strategy effectively alleviates the interference between the two tasks of image-text alignment and instruction following, and achieves strong multi-modal reasoning with only a small-scale image-text and instruction dataset. During inference, we incorporate additional expert models (e.g., captioning/OCR systems) into LLaMA-Adapter to further enhance its image understanding capability without incurring training costs. Compared to the original LLaMA-Adapter, our LLaMA-Adapter V2 can perform open-ended multi-modal instructions by merely introducing 14M parameters over LLaMA. The newly designed framework also exhibits stronger language-only instruction-following capabilities and even excels in chat interactions. Our code and models are available at https://github.com/ZrrSkywalker/LLaMA-Adapter.
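
The idea of "unlocking" bias and scale parameters on top of frozen weights can be illustrated with a minimal sketch. It assumes one plausible form, y = s * (W x + b), with the scale initialised to one and the bias to zero so that the layer initially behaves like the frozen original. The class and method names are hypothetical and are not taken from the authors' repository.

```python
# Hypothetical sketch of bias/scale tuning on a frozen linear layer.
# Assumed form: y = scale * (W @ x + bias); W stays frozen.

class ScaledBiasedLinear:
    def __init__(self, weight):
        self.weight = weight                # frozen pretrained weights, one row per output
        self.scale = [1.0] * len(weight)    # learnable, initialised to identity
        self.bias = [0.0] * len(weight)     # learnable, initialised to zero

    def forward(self, x):
        out = []
        for row, s, b in zip(self.weight, self.scale, self.bias):
            pre = sum(w * xi for w, xi in zip(row, x))  # frozen projection
            out.append(s * (pre + b))                   # learnable shift and scale
        return out

    def num_trainable(self):
        # Only scale and bias would receive gradient updates.
        return len(self.scale) + len(self.bias)

    def num_frozen(self):
        return sum(len(row) for row in self.weight)


layer = ScaledBiasedLinear([[1.0, 2.0], [0.5, -1.0]])
y_init = layer.forward([3.0, 4.0])   # same as the frozen layer at initialisation

# Simulate a training update that touches only the unlocked parameters.
layer.scale[0] = 2.0
layer.bias[0] = 1.0
y_tuned = layer.forward([3.0, 4.0])
```

The trainable count grows with the number of output units while the frozen count grows with inputs times outputs, which is why this kind of tuning adds few parameters relative to the base model.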

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2304.1501

Country:

Asia > China > Beijing > Beijing (0.04)
Europe > Romania > Sud - Muntenia Development Region > Giurgiu County > Giurgiu (0.04)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Asia > China > Shanghai > Shanghai (0.04)

Genre:

Research Report (0.64)
Personal (0.46)

Industry: Education > Instructional Theory (0.62)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

"Computing and Technology Ethics: Engaging Through Science Fiction" – an interview with the authors

AIHub Apr-26-2023, 14:33:11 GMT

Emanuelle Burton, Judy Goldsmith, Nicholas Mattei, Cory Siler and Sara-Jo Swiatek are the authors of a new book entitled Computing and Technology Ethics: Engaging Through Science Fiction. We caught up with them to find out more about the book, what it covers, and what inspired them to use science fiction as a tool to teach about ethics. In addition to the content chapters, there is a science fiction anthology at the end of the book containing 12 stories from contemporary authors including Ken Liu, T.C. Boyle, Elizabeth Bear, Paolo Bacigalupi, and Rebecca Roanhorse. The book also provides Story Frames for each story, which include an introduction and reflection questions that tie the story, the characters, and their choices to the ethical frameworks. Each of these stories is anchored in multiple places in the content chapters through what we call Story Points, where that story picks up on themes and/or ideas from the chapter.

computing and technology ethics, ethics, science fiction story, (12 more...)

AIHub

Country:

North America > United States > Illinois > Cook County > Chicago (0.08)
North America > United States > Kentucky (0.05)

Genre: Personal > Interview (1.00)

Industry: Education (0.51)

Technology: Information Technology > Artificial Intelligence > Science Fiction (1.00)

Unleashing Infinite-Length Input Capacity for Large-scale Language Models with Self-Controlled Memory System

Liang, Xinnian, Wang, Bing, Huang, Hui, Wu, Shuangzhi, Wu, Peihao, Lu, Lu, Ma, Zejun, Li, Zhoujun

arXiv.org Artificial Intelligence Apr-26-2023

Large-scale Language Models (LLMs) are constrained by their inability to process lengthy inputs. To address this limitation, we propose the Self-Controlled Memory (SCM) system to unleash infinite-length input capacity for large-scale language models. Our SCM system is composed of three key modules: the language model agent, the memory stream, and the memory controller. The language model agent iteratively processes ultra-long inputs and stores all historical information in the memory stream. The memory controller provides the agent with both long-term memory (archived memory) and short-term memory (flash memory) to generate precise and coherent responses. The controller determines which memories from archived memory should be activated and how to incorporate them into the model input. Our SCM system can be integrated with any LLM to enable it to process ultra-long texts without any modification or fine-tuning. Experimental results show that our SCM system enables LLMs that are not optimized for multi-turn dialogue to achieve multi-turn dialogue capabilities comparable to ChatGPT, and to outperform ChatGPT in scenarios involving ultra-long document summarization or long-term conversations. Additionally, we will supply a test set, which covers common long-text input scenarios, for evaluating the abilities of LLMs in processing long documents. (Work in progress. Code: https://github.com/wbbeyourself/SCM4LLMs)
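
The interplay of memory stream, flash memory, and archived memory described above might be sketched as follows. This is an illustration only: the relevance measure (word-set Jaccard overlap), the threshold, and every name in the code are assumptions made here, not details from the abstract or the authors' repository.

```python
from dataclasses import dataclass, field


@dataclass
class Memory:
    turn: int
    text: str


def overlap_score(query, text):
    """Assumed relevance measure: Jaccard overlap of lowercase word sets."""
    q, t = set(query.lower().split()), set(text.lower().split())
    return len(q & t) / len(q | t) if (q | t) else 0.0


@dataclass
class MemoryController:
    stream: list = field(default_factory=list)  # full history (memory stream)
    flash_size: int = 2      # number of most recent turns kept as flash memory (>= 1)
    top_k: int = 1           # max archived memories to activate
    threshold: float = 0.1   # minimum score for activation (hypothetical value)

    def store(self, text):
        self.stream.append(Memory(turn=len(self.stream), text=text))

    def flash(self):
        return self.stream[-self.flash_size:]

    def activate(self, query):
        # Archived memory is everything older than the flash window.
        archived = self.stream[:-self.flash_size]
        scored = [(overlap_score(query, m.text), m) for m in archived]
        relevant = [pair for pair in scored if pair[0] >= self.threshold]
        relevant.sort(key=lambda pair: pair[0], reverse=True)
        return [m for _, m in relevant[:self.top_k]]

    def build_input(self, query):
        lines = ["[archived] " + m.text for m in self.activate(query)]
        lines += ["[recent] " + m.text for m in self.flash()]
        lines.append("[user] " + query)
        return "\n".join(lines)


ctrl = MemoryController()
for turn in ["my dog is named Rex", "I live in Oslo",
             "the weather is cold today", "I had soup for lunch"]:
    ctrl.store(turn)

prompt = ctrl.build_input("what is my dog named")
```

Here only the archived turn about the dog is activated, so the model input stays short no matter how long the stream grows. A real system would likely use embeddings or the LLM itself to judge relevance.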

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2304.13343

Country:

South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
Europe > Ireland > Leinster > County Dublin > Dublin (0.04)
Asia > China > Heilongjiang Province > Harbin (0.04)
(2 more...)

Genre:

Research Report (1.00)
Overview (0.93)
Personal > Interview (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

The face of PlayStation: Shuhei Yoshida on the joy and future of video games

The Guardian Apr-24-2023, 13:05:14 GMT

In early 1993, Shuhei Yoshida joined Sony's nascent PlayStation division as a business development guy – the first member of the team who didn't have an engineering background. When he was working with Ken Kutaragi and the other architects of the original PlayStation, and later producing games from Crash Bandicoot and Gran Turismo alongside game development legends Mark Cerny and Kazunori Yamauchi, he freely admits that he could scarcely believe his luck. When I speak to him, on the eve of receiving Bafta's prestigious fellowship award for his contribution to video games, he still seems endearingly surprised by his own success. "The people who have received [this award] before are all creators! I don't know how I fit in," he says.

developer, playstation, yoshida, (10 more...)

The Guardian

Country:

Asia > Japan (0.05)
North America > United States > California > Los Angeles County > Santa Monica (0.05)
Europe (0.05)

Genre: Personal > Honors (0.92)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology: Information Technology > Artificial Intelligence > Games (0.73)

Can YOU tell the difference between a real person and an AI bot?

Daily Mail - Science & tech Apr-21-2023, 11:46:25 GMT

Popular AI chatbots like ChatGPT and Bard have been designed to replicate human speech as closely as possible. And as deep learning technology gets more and more sophisticated, it's becoming difficult to discern these computer models from real people. Now, a free online game gives you two minutes to have a conversation with someone (or something) and guess whether they're a fellow human or an AI. 'Human or not?' was inspired by the Turing Test, devised by legendary British computer scientist Alan Turing in 1950. A computer passes the so-called test when someone cannot correctly tell the difference between a response from a human and a response from an AI.

google, interrogator, turing test, (12 more...)

Daily Mail - Science & tech

Country:

North America > United States > California (0.05)
Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.05)

Genre: Personal (0.36)

Industry: Information Technology (0.94)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Turing's Test (0.95)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)

Reducing Opinion Echo-Chambers by Intelligent Placement of Moderate-Minded Agents

Jana, Prithwish, Choudhury, Romit Roy, Ganguly, Niloy

arXiv.org Artificial Intelligence Apr-21-2023

In the era of social media, people frequently share their opinions online on various issues and, in the process, are exposed to others' opinions. Whether because of the selective exposure created by news feed recommendation algorithms or our own inclination to listen to opinions that support ours, the result is that we are increasingly exposed to opinions close to our own. Further, any population is inherently heterogeneous, i.e., people hold a varied range of opinions on a topic and show varying degrees of openness to being influenced by others. In this paper, we demonstrate the different behavior exhibited by open- and close-minded agents towards an issue when they are allowed to freely intermix and communicate. We show that intermixing among people leads to the formation of opinion echo chambers, i.e., small closed networks of people who hold similar opinions and are not affected by the opinions of people outside the network. Echo chambers are evidently harmful to a society because they inhibit free, healthy communication among all and thus prevent the exchange of opinions, spread misinformation, and increase extremist beliefs. This calls for a reduction in echo chambers, since a total consensus of opinion is neither possible nor welcome. We show that the number of echo chambers depends on the number of close-minded agents and cannot be lessened by increasing the number of open-minded agents. We identify certain 'moderate'-minded agents, who possess the capability of manipulating and reducing the number of echo chambers. The paper proposes an algorithm for the intelligent placement of moderate-minded agents in the opinion-time spectrum, by which the opinion echo chambers can be maximally reduced. With various experimental setups, we demonstrate that the proposed algorithm fares well when compared to the placement of other agents (open- or close-minded) and the random placement of 'moderate'-minded agents.

agent, artificial intelligence, social media, (18 more...)

arXiv.org Artificial Intelligence

2304.10745

Country:

Asia > India > West Bengal > Kharagpur (0.04)
North America > United States > Illinois (0.04)
North America > Canada > Ontario > Waterloo Region > Waterloo (0.04)

Genre:

Personal (0.46)
Research Report (0.40)

Industry: Media > News (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents > Agent Societies (0.68)

The End of Recommendation Letters

The Atlantic - Technology Apr-20-2023, 15:53:06 GMT

I was lunching with a group of fellow professors, and, as happens these days when we assemble, generative artificial intelligence was discussed. Are your students using it? What are you doing to prevent cheating? Heads were shaken in chagrin as iced teas were sipped for comfort. But then, one of my colleagues wondered: Could he use AI to generate a reference letter for a student?

chatgpt, proposal, recommendation letter, (10 more...)

The Atlantic - Technology

Country: North America > United States > Texas (0.05)

Genre: Personal > Interview (0.36)

Industry: Education > Educational Setting > Higher Education (0.84)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.82)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.62)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.62)

A list of resources, articles, and opinion pieces relating to large language models & robotics

Robohub Apr-19-2023, 15:12:24 GMT

Figuring out how humans and robots can collaborate to effectively carry out tasks together is a rapidly growing area of interest. For successful collaboration between humans and robots, communication is key.

language model & robotic, large language model, natural language, (4 more...)

Robohub

Genre: Personal > Opinion (0.40)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)

Broad Recommender System: An Efficient Nonlinear Collaborative Filtering Approach

Huang, Ling, Guan, Can-Rong, Huang, Zhen-Wei, Gao, Yuefang, Kuang, Yingjie, Wang, Chang-Dong, Chen, C. L. Philip

arXiv.org Artificial Intelligence Apr-19-2023

Recently, Deep Neural Networks (DNNs) have been widely introduced into Collaborative Filtering (CF) to produce more accurate recommendation results, owing to their capability of capturing the complex nonlinear relationships between items and users. However, DNN-based models usually suffer from high computational complexity, i.e., they consume very long training time and store a huge number of trainable parameters. To address these problems, we propose a new broad recommender system called Broad Collaborative Filtering (BroadCF), which is an efficient nonlinear collaborative filtering approach. Instead of DNNs, the Broad Learning System (BLS) is used as a mapping function to learn the complex nonlinear relationships between users and items, which avoids the above issues while achieving very satisfactory recommendation performance. However, it is not feasible to directly feed the original rating data into BLS. To this end, we propose a user-item rating collaborative vector preprocessing procedure to generate low-dimensional user-item input data, which is able to harness quality judgments of the most similar users/items. Extensive experiments conducted on seven benchmark datasets have confirmed the effectiveness of the proposed BroadCF algorithm.

artificial intelligence, broadcf, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2204.11602

Country:

Asia > China > Guangdong Province > Guangzhou (0.05)
Asia > Macao (0.04)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
(5 more...)

Genre:

Personal (0.68)
Research Report (0.50)

Industry: Education (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.66)

Award-winning photograph revealed to be AI-generated image, photographer turns down prize

FOX News Apr-18-2023, 14:36:54 GMT

Fox News correspondent Grady Trimble has the latest on fears the technology will spiral out of control on 'Special Report.' A German artist who won a major prize for photography has turned down the award after revealing his work was created with help from artificial intelligence (AI). Photographer Boris Eldagsen won the creative category of the open competition for the Sony World Photography Awards 2023 with his "photograph," titled "Pseudomnesia: The Electrician." The image, which depicted an older woman holding a younger in black and white, was "the first AI generated image to win in a prestigious international Photography competition," Eldagsen said in a statement posted on his website. "How many of you knew or suspected that it was AI generated? Something about this doesn't feel right, does it? "AI images and photography should not compete with each other in an award like this.

ai-generated image, eldagsen, photography, (10 more...)

FOX News

Genre: Personal (0.37)

Industry:

Media > Photography (1.00)
Media > News (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.32)
