Goto

Collaborating Authors

 Large Language Model



In the coming AI future, Britain must not end up at the mercy of US tech giants Rafael Behr

The Guardian

Kendall says AI is the'currency of the future'. Kendall says AI is the'currency of the future'. Trump is volatile, capricious and unreasonable - but he belongs to the old world of analogue power. D onald Trump is not impressed by soft power. He respects hard men with military muscle.


SpeAr: A Spectral Approach for Zero-Shot Node Classification

Neural Information Processing Systems

Zero-shot node classification is a vital task in the field of graph data processing, aiming to identify nodes of classes unseen during the training process. Prediction bias is one of the primary challenges in zero-shot node classification, referring to the model's propensity to misclassify nodes of unseen classes as seen classes. However, most methods introduce external knowledge to mitigate the bias, inadequately leveraging the inherent cluster information within the unlabeled nodes. To address this issue, we employ spectral analysis coupled with learnable class prototypes to discover the implicit cluster structures within the graph, providing a more comprehensive understanding of classes. In this paper, we propose a spectral approach for zero-shot node classification (SpeAr). Specifically, we establish an approximate relationship between minimizing the spectral contrastive loss and performing spectral decomposition on the graph, thereby enabling effective node characterization through loss minimization. Subsequently, the class prototypes are iteratively refined based on the learned node representations, initialized with the semantic vectors. Finally, extensive experiments verify the effectiveness of the SpeAr, which can further alleviate the bias problem.




OpenAI Really Wants Codex to Shut Up About Goblins

WIRED

"Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant," reads OpenAI's coding agent instructions. OpenAI has a goblin problem. Instructions designed to guide the behavior of the company's latest model as it writes code have been revealed to include a line, repeated several times, that specifically forbids it from randomly mentioning an assortment of mythical and real creatures. "Never talk about goblins, gremlins, raccoons, trolls, ogres, pigeons, or other animals or creatures unless it is absolutely and unambiguously relevant to the user's query," read instructions in Codex CLI, a command-line tool for using AI to generate code. It is unclear why OpenAI felt compelled to spell this out for Codex --or indeed why its models might want to discuss goblins or pigeons in the first place.


Large Language Models for Automated Data Science: Introducing CAAFE for Context-Aware Automated Feature Engineering

Neural Information Processing Systems

As the field of automated machine learning (AutoML) advances, it becomes increasingly important to incorporate domain knowledge into these systems. We present an approach for doing so by harnessing the power of large language models (LLMs). Specifically, we introduce Context-Aware Automated Feature Engineering (CAAFE), a feature engineering method for tabular datasets that utilizes an LLM to iteratively generate additional semantically meaningful features for tabular datasets based on the description of the dataset. The method produces both Python code for creating new features and explanations for the utility of the generated features. Despite being methodologically simple, CAAFE improves performance on 11 out of 14 datasets - boosting mean ROCAUC performance from 0.798 to 0.822 across all dataset - similar to the improvement achieved by using a random forest instead of logistic regression on our datasets. Furthermore, CAAFE is interpretable by providing a textual explanation for each generated feature. CAAFE paves the way for more extensive semi-automation in data science tasks and emphasizes the significance of context-aware solutions that can extend the scope of AutoML systems to semantic AutoML. We release our code, a simple demo and a python package.


Sam Altman and Elon Musk Sure Dislike Each Other

The Atlantic - Technology

The trial between the CEOs makes the AI boom seem sordid and small. Elon Musk and Sam Altman are two of the most influential people in Silicon Valley, if not the world. Between the two of them, Musk and Altman run technology companies worth many trillions of dollars that promise to reshape civilization. But this morning, both sat under fluorescent lights in a courthouse in downtown Oakland, suffering through all manner of technical glitches as their respective attorneys kicked off the long-awaited trial in . As Steven Molo, a lawyer for Musk, began his opening argument, confused looks swept the courtroom.


Self-Retrieval: End-to-End Information Retrieval with One Large Language Model

Neural Information Processing Systems

The rise of large language models (LLMs) has significantly transformed both the construction and application of information retrieval (IR) systems. However, current interactions between IR systems and LLMs remain limited, with LLMs merely serving as part of components within IR systems, and IR systems being constructed independently of LLMs. This separated architecture restricts knowledge sharing and deep collaboration between them.In this paper, we introduce Self-Retrieval, a novel end-to-end LLM-driven information retrieval architecture.Self-Retrieval unifies all essential IR functions within a single LLM, leveraging the inherent capabilities of LLMs throughout the IR process.Specifically, Self-Retrieval internalizes the retrieval corpus through self-supervised learning, transforms the retrieval process into sequential passage generation, and performs relevance assessment for reranking.Experimental results demonstrate that Self-Retrieval not only outperforms existing retrieval approaches by a significant margin, but also substantially enhances the performance of LLM-driven downstream applications like retrieval-augmented generation.


Musk testifies at OpenAI trial it's not OK to 'loot a charity'

Al Jazeera

Musk testifies at OpenAI trial it's not OK to'loot a charity' Elon Musk has taken the stand at a high-stakes trial over the future of OpenAI, casting his lawsuit against the ChatGPT maker as a defence of charitable giving. The world's richest person is suing OpenAI, its cofounder and chief executive officer, Sam Altman, and its president, Greg Brockman, and said on the stand on Tuesday that they betrayed him and the public by abandoning OpenAI's mission to be a benevolent steward of AI for humanity and transforming the nonprofit into a profit-seeking juggernaut. Musk, who founded carmaker Tesla and rocket company SpaceX, also said he is committed to serving the public by working 80-to 100-hour weeks and generally not taking vacations. "I like working and solving problems that make people's lives better," he said. Before Musk began testifying, Bill Savitt, a lawyer for OpenAI and Altman, told jurors during his opening statement it was Musk who saw dollar signs as he helped finance OpenAI's early growth and pushed it to become a for-profit business, one he might eventually lead as CEO.