Large Language Model
Biden meets with tech company critics on AI
Microsoft, Google, OpenAI and other major tech companies are rushing to develop new AI tools and push them out to millions of people. The companies have been lobbying in Washington and to other governments around the world, suggesting potential regulation while stressing the importance of allowing them to continue develop the tech. Critics have warned that the companies are focused on profit, and are trying to head-off strict government controls, or have a hand in shaping them to their own benefit.
Apple Is an AI Company Now
After more than a decade, autocorrect "fails" could be on their way out. Apple's much-maligned spelling software is getting upgraded by artificial intelligence: Using sophisticated language models, the new autocorrect won't just check words against a dictionary, but will be able to consider the context of the word in a sentence. In theory, it won't suggest consolation when you mean consolidation, because it'll know that those words aren't interchangeable. The next generation of autocorrect was one of several small updates to the iPhone experience that Apple announced earlier this month. The Photos app will be able to differentiate between your dog and other dogs, automatically recognizing your pup the same way it recognizes people who frequently appear in your pictures.
How do you prevent an AI-generated game from losing the plot?
Did you ever get to the end of Wizard of Oz and have notes โ the nagging intuition that you could have taken down all those pesky flying monkeys or handled the backstabbing intricacies of Munchkin guild politics more effectively than Dorothy and her band of misfits did in the books? Thanks to the new AI storytelling platform Hidden Door, which plops players into TTRPG-like adventures based in their favorite literary universes, you'll soon have the chance to walk the Yellow Brick Road however you see fit. Hidden Door is both the company and the game. Hidden Door, the company, was co-founded by Hilary Mason, who is also CEO, and Matt Brandwein in 2020 with a mission to "inspire creativity through play with narrative AI." The staff is split nearly evenly between machine learning engineers and traditional game designers, Mason told Engadget.
Exclusive: OpenAI Lobbied the E.U. to Water Down AI Regulation
The CEO of OpenAI, Sam Altman, has spent the last month touring world capitals where, at talks to sold-out crowds and in meetings with heads of governments, he has repeatedly spoken of the need for global AI regulation. But behind the scenes, OpenAI has lobbied for significant elements of the most comprehensive AI legislation in the world--the E.U.'s AI Act--to be watered down in ways that would reduce the regulatory burden on the company, according to documents about OpenAI's engagement with E.U. officials obtained by TIME from the European Commission via freedom of information requests. In several cases, OpenAI proposed amendments that were later made to the final text of the E.U. law--which was approved by the European Parliament on June 14, and will now proceed to a final round of negotiations before being finalized as soon as January. In 2022, OpenAI repeatedly argued to European officials that the forthcoming AI Act should not consider its general purpose AI systems--including GPT-3, the precursor to ChatGPT, and the image generator Dall-E 2--to be "high risk," a designation that would subject them to stringent legal requirements including transparency, traceability, and human oversight. That argument brought OpenAI in line with Microsoft, which has invested $13 billion into the AI lab, and Google, both of which have previously lobbied E.U. officials in favor of loosening the Act's regulatory burden on large AI providers.
Exploring New Frontiers in Agricultural NLP: Investigating the Potential of Large Language Models for Food Applications
Rezayi, Saed, Liu, Zhengliang, Wu, Zihao, Dhakal, Chandra, Ge, Bao, Dai, Haixing, Mai, Gengchen, Liu, Ninghao, Zhen, Chen, Liu, Tianming, Li, Sheng
This paper explores new frontiers in agricultural natural language processing by investigating the effectiveness of using food-related text corpora for pretraining transformer-based language models. In particular, we focus on the task of semantic matching, which involves establishing mappings between food descriptions and nutrition data. To accomplish this, we fine-tune a pre-trained transformer-based language model, AgriBERT, on this task, utilizing an external source of knowledge, such as the FoodOn ontology. To advance the field of agricultural NLP, we propose two new avenues of exploration: (1) utilizing GPT-based models as a baseline and (2) leveraging ChatGPT as an external source of knowledge. ChatGPT has shown to be a strong baseline in many NLP tasks, and we believe it has the potential to improve our model in the task of semantic matching and enhance our model's understanding of food-related concepts and relationships. Additionally, we experiment with other applications, such as cuisine prediction based on food ingredients, and expand the scope of our research to include other NLP tasks beyond semantic matching. Overall, this paper provides promising avenues for future research in this field, with potential implications for improving the performance of agricultural NLP applications.
Blackbird language matrices (BLM), a new task for rule-like generalization in neural networks: Motivations and Formal Specifications
We motivate and formally define a new task for fine-tuning rule-like generalization in large language models. It is conjectured that the shortcomings of current LLMs are due to a lack of ability to generalize. It has been argued that, instead, humans are better at generalization because they have a tendency at extracting rules from complex data. We try to recreate this tendency to rule-based generalization. When exposed to tests of analytic intelligence, for example, the visual RAVEN IQ test, human problem-solvers identify the relevant objects in the picture and their relevant attributes and reason based on rules applied to these objects and attributes. Based on the induced rules, they are able to provide a solution to the test. We propose a task that translates this IQ task into language. In this paper, we provide the formal specification for the task and the generative process of its datasets.
Retrieval-Based Transformer for Table Augmentation
Glass, Michael, Wu, Xueqing, Naik, Ankita Rajaram, Rossiello, Gaetano, Gliozzo, Alfio
Data preparation, also called data wrangling, is considered one of the most expensive and time-consuming steps when performing analytics or building machine learning models. Preparing data typically involves collecting and merging data from complex heterogeneous, and often large-scale data sources, such as data lakes. In this paper, we introduce a novel approach toward automatic data wrangling in an attempt to alleviate the effort of end-users, e.g. data analysts, in structuring dynamic views from data lakes in the form of tabular data. We aim to address table augmentation tasks, including row/column population and data imputation. Given a corpus of tables, we propose a retrieval augmented self-trained transformer model. Our self-learning strategy consists in randomly ablating tables from the corpus and training the retrieval-based model to reconstruct the original values or headers given the partial tables as input. We adopt this strategy to first train the dense neural retrieval model encoding table-parts to vectors, and then the end-to-end model trained to perform table augmentation tasks. We test on EntiTables, the standard benchmark for table augmentation, as well as introduce a new benchmark to advance further research: WebTables. Our model consistently and substantially outperforms both supervised statistical methods and the current state-of-the-art transformer-based models.
ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling
Yang, Linyao, Chen, Hongyang, Li, Zhao, Ding, Xiao, Wu, Xindong
Recently, ChatGPT, a representative large language model (LLM), has gained considerable attention due to its powerful emergent abilities. Some researchers suggest that LLMs could potentially replace structured knowledge bases like knowledge graphs (KGs) and function as parameterized knowledge bases. However, while LLMs are proficient at learning probabilistic language patterns based on large corpus and engaging in conversations with humans, they, like previous smaller pre-trained language models (PLMs), still have difficulty in recalling facts while generating knowledge-grounded contents. To overcome these limitations, researchers have proposed enhancing data-driven PLMs with knowledge-based KGs to incorporate explicit factual knowledge into PLMs, thus improving their performance to generate texts requiring factual knowledge and providing more informed responses to user queries. This paper reviews the studies on enhancing PLMs with KGs, detailing existing knowledge graph enhanced pre-trained language models (KGPLMs) as well as their applications. Inspired by existing studies on KGPLM, this paper proposes to enhance LLMs with KGs by developing knowledge graph-enhanced large language models (KGLLMs). KGLLM provides a solution to enhance LLMs' factual reasoning ability, opening up new avenues for LLM research.
Recognizing Unseen Objects via Multimodal Intensive Knowledge Graph Propagation
Wu, Likang, Li, Zhi, Zhao, Hongke, Wang, Zhefeng, Liu, Qi, Huai, Baoxing, Yuan, Nicholas Jing, Chen, Enhong
Zero-Shot Learning (ZSL), which aims at automatically recognizing unseen objects, is a promising learning paradigm to understand new real-world knowledge for machines continuously. Recently, the Knowledge Graph (KG) has been proven as an effective scheme for handling the zero-shot task with large-scale and non-attribute data. Prior studies always embed relationships of seen and unseen objects into visual information from existing knowledge graphs to promote the cognitive ability of the unseen data. Actually, real-world knowledge is naturally formed by multimodal facts. Compared with ordinary structural knowledge from a graph perspective, multimodal KG can provide cognitive systems with fine-grained knowledge. For example, the text description and visual content can depict more critical details of a fact than only depending on knowledge triplets. Unfortunately, this multimodal fine-grained knowledge is largely unexploited due to the bottleneck of feature alignment between different modalities. To that end, we propose a multimodal intensive ZSL framework that matches regions of images with corresponding semantic embeddings via a designed dense attention module and self-calibration loss. It makes the semantic transfer process of our ZSL framework learns more differentiated knowledge between entities. Our model also gets rid of the performance limitation of only using rough global features. We conduct extensive experiments and evaluate our model on large-scale real-world data. The experimental results clearly demonstrate the effectiveness of the proposed model in standard zero-shot classification tasks.
Interactive Molecular Discovery with Natural Language
Zeng, Zheni, Yin, Bangchen, Wang, Shipeng, Liu, Jiarui, Yang, Cheng, Yao, Haishen, Sun, Xingzhi, Sun, Maosong, Xie, Guotong, Liu, Zhiyuan
Natural language is expected to be a key medium for various human-machine interactions in the era of large language models. When it comes to the biochemistry field, a series of tasks around molecules (e.g., property prediction, molecule mining, etc.) are of great significance while having a high technical threshold. Bridging the molecule expressions in natural language and chemical language can not only hugely improve the interpretability and reduce the operation difficulty of these tasks, but also fuse the chemical knowledge scattered in complementary materials for a deeper comprehension of molecules. Based on these benefits, we propose the conversational molecular design, a novel task adopting natural language for describing and editing target molecules. To better accomplish this task, we design ChatMol, a knowledgeable and versatile generative pre-trained model, enhanced by injecting experimental property information, molecular spatial knowledge, and the associations between natural and chemical languages into it. Several typical solutions including large language models (e.g., ChatGPT) are evaluated, proving the challenge of conversational molecular design and the effectiveness of our knowledge enhancement method. Case observations and analysis are conducted to provide directions for further exploration of natural-language interaction in molecular discovery.