snape
SHARP: Unlocking Interactive Hallucination via Stance Transfer in Role-Playing Agents
Kong, Chuyi, Luo, Ziyang, Lin, Hongzhan, Fan, Zhiyuan, Fan, Yaxin, Sun, Yuxi, Ma, Jing
The advanced role-playing capabilities of Large Language Models (LLMs) have paved the way for developing Role-Playing Agents (RPAs). However, existing benchmarks in social interaction such as HPD and SocialBench have not investigated hallucination and face limitations like poor generalizability and implicit judgments for character fidelity. To address these issues, we propose a generalizable, explicit and effective paradigm to unlock the interactive patterns in diverse worldviews. Specifically, we define the interactive hallucination based on stance transfer and construct a benchmark, SHARP, by extracting relations from a general commonsense knowledge graph and leveraging the inherent hallucination properties of RPAs to simulate interactions across roles. Extensive experiments validate the effectiveness and stability of our paradigm. Our findings further explore the factors influencing these metrics and discuss the trade-off between blind loyalty to roles and adherence to facts in RPAs.
SnapE -- Training Snapshot Ensembles of Link Prediction Models
Snapshot ensembles have been widely used in various fields of prediction. They allow for training an ensemble of prediction models at the cost of training a single one. They are known to yield more robust predictions by creating a set of diverse base models. In this paper, we introduce an approach to transfer the idea of snapshot ensembles to link prediction models in knowledge graphs. Moreover, since link prediction in knowledge graphs is a setup without explicit negative examples, we propose a novel training loop that iteratively creates negative examples using previous snapshot models. An evaluation with four base models across four datasets shows that this approach constantly outperforms the single model approach, while keeping the training time constant.
A Beginner's Introduction to Named Entity Recognition (NER)
It is the process of identifying proper nouns from a piece of text and classifying them into appropriate categories. These categories can be generic like'Organization', 'Person', 'Location', etc. or they can be tailor-made for a particular application, e.g. When I first heard about NER, it sounded terribly boring and even trivial. You might be thinking, "Isn't this a simple lookup problem? Why do we even need ML here?"
Harry Potter: Written by Artificial Intelligence -- Deep Writing
I trained an LSTM Recurrent Neural Network (a deep learning algorithm) on the first four Harry Potter books. I then asked it to produce a chapter based on what it learned. He looked like Madame Maxime. When she strode up the wrong staircase to visit himself. "I'm afraid I've definitely been suspended from power, no chance -- indeed?" said Snape.