Goto

Collaborating Authors

 target role


Using Prompts to Guide Large Language Models in Imitating a Real Person's Language Style

arXiv.org Artificial Intelligence

Large language models (LLMs), such as GPT series and Llama series have demonstrated strong capabilities in natural language processing, contextual understanding, and text generation. In recent years, researchers are trying to enhance the abilities of LLMs in performing various tasks, and numerous studies have proved that well-designed prompts can significantly improve the performance of LLMs on these tasks. This study compares the language style imitation ability of three different large language models under the guidance of the same zero-shot prompt. It also involves comparing the imitation ability of the same large language model when guided by three different prompts individually. Additionally, by applying a Tree-of-Thoughts (ToT) Prompting method to Llama 3, a conversational AI with the language style of a real person was created. In this study, three evaluation methods were used to evaluate LLMs and prompts. The results show that Llama 3 performs best at imitating language styles, and that the ToT prompting method is the most effective to guide it in imitating language styles. Using a ToT framework, Llama 3 was guided to interact with users in the language style of a specific individual without altering its core parameters, thereby creating a text-based conversational AI that reflects the language style of the individual.


A Mixture-of-Experts Approach to Few-Shot Task Transfer in Open-Ended Text Worlds

arXiv.org Artificial Intelligence

Open-ended worlds are those in which there are no pre-specified goals or environmental reward signal. As a consequence, an agent must know how to perform a multitude of tasks. However, when a new task is presented to an agent, we expect it to be able to reuse some of what it knows from previous tasks to rapidly learn that new task. We introduce a novel technique whereby policies for different a priori known tasks are combined into a Mixture-of-Experts model with an attention mechanism across a mix of frozen and unfrozen experts. The model learns when to attend to frozen task-specific experts when appropriate and learns new experts to handle novel situations. We work in an open-ended text-based environment in which the agent is tasked with behaving like different types of character roles and must rapidly learn behaviors associated with new character role types. We show that our agent both obtains more rewards in the zero-shot setting, and discovers these rewards with greater sample efficiency in the few-shot learning settings.


6 Proven Steps to Land a Job in Data Science

@machinelearnbot

After spending numerous evenings and weekends learning and coding for more than a year, you finally did it! You've now completed your data science program, earned your shiny certificate...now what? Chances are you were looking to get a job in data when you signed up for the course. So let's face this, it is time to get a job! The only thing that's standing between you and success is that first data science job offer.


6 Proven Steps to Land a Job in Data Science

#artificialintelligence

After spending numerous evenings and weekends learning and coding for more than a year, you finally did it! You've now completed your data science program, earned your shiny certificate...now what? Chances are you were looking to get a job in data when you signed up for the course. So let's face this, it is time to get a job! The only thing that's standing between you and success is that first data science job offer.