Goto

Collaborating Authors

 Large Language Model


Unleash the Power of ChatGPT

#artificialintelligence

We are introduced to new discoveries and technologies every day, and one of the best and most popular inventions today is artificial intelligence (AI) and its tools. One of them is Chat GPT, a conversational model of AI that is a powerful chatbot that answers follow-up questions and writes code for the users. The day it was launched, everybody was going gaga over the new technology and the remarkable uses of this AI-powered chatbot. In this blog we will get to know about the perks of ChatGPT for coding. A conversational AI-powered chatbot created by OpenAI is popular right now due to its many applications, including assisting students with their homework projects, offering suggestions for creating websites, and even writing code.


AI Hype Bleeds Into Cryptocurrencies

#artificialintelligence

Digital generated image of bitcoin sign over glowing digital circuit board. Artificial intelligence (AI) crypto tokens are soaring in price this week, but price movements seem to be more of a crypto proxy to the AI bubble. The rally comes as a J.P Morgan report that says traders are turning their attention to AI and away from blockchain. "The rise in the price of AI-related cryptocurrencies can without a doubt be driven by real and tangible developments in the AI and blockchain industries," says Vasco Lopes, blockchain and artificial intelligence researcher at the NOVA school of technology near Lisbon, Portugal. "However, AI-related cryptocurrencies are also influenced by hype and investor sentiment, as the increased popularity of AI and AI-related products, such as the release of OpenAI's ChatGPT language model, generates excitement and interest in the AI sector." AI cryptos have reached a $4.27 billion market cap, up 56% from last week.


People are already trying to get ChatGPT to write malware

#artificialintelligence

The ChatGPT AI chatbot has created plenty of excitement in the short time it has been available and now it seems it has been enlisted by some in attempts to help generate malicious code. AI writing tools can help lighten your workload by writing emails and essays and even doing math. They use artificial intelligence to generate text or answer queries based on user input. ChatGPT is one popular example, but there are other noteworthy AI writers. ChatGPT is an AI-driven natural language processing tool which interacts with users in a human-like, conversational way. Among other things, it can be used to help with tasks like composing emails, essays and code.


University Professor Catches Student Cheating With ChatGPT

#artificialintelligence

The professor first entered the suspect text into ChatGPT software to determine if the written reply was by AI. He received a 99.9% likelihood of matching. The software did not offer any citations, unlike standard plagiarism detection software. Hick tried to create the same essay by asking ChatGPT questions that he thought his student would ask. This resulted in similar answers but not direct matches.


Amex and Microsoft turn to AI to make expense reports less horrible • TechCrunch

#artificialintelligence

ChatGPT is getting all the attention as of late, but modern AI technologies have a range of use cases beyond finally making Bing useful. One emerging trend is putting AI to work to aid with the frustrating and laborious task of filing and auditing corporate expense reports. Today, Microsoft and American Express announced a deal that aims to do just that. The companies agreed to expand their decades-long partnership to build solutions that leverage Microsoft Cloud and AI technologies, starting with expense report management. According to Amex, the initial solution will leverage machine learning and AI to automate expense reporting and approvals.


Big Tech companies use cloud computing arms to pursue alliances with AI groups

#artificialintelligence

The arrangement echoes the $1 billion cash-for-computing investment that Microsoft made in OpenAI three years ago. In January, Microsoft announced a further "multiyear, multibillion-dollar" investment in OpenAI estimated at $10 billion. The deal cemented Microsoft's position as exclusive infrastructure provider to one of the world's leading AI startups. Chief executive Satya Nadella claimed that Microsoft had built a supercomputer to handle the OpenAI work, and that it could now handle some AI calculations at half the cost of its rivals. Reducing cost is key for the compute-intensive development of large language models: Estimates put the cost of running ChatGPT, assuming 10 million monthly users, at $1 million per day.


If ChatGPT Can Disrupt Google In 2023, What About Your Company?

#artificialintelligence

"How did you go bankrupt?" "Two ways," answered Mike, "Gradually, then suddenly." The very same story describes how major industry disruption usually happens--gradually, and then suddenly. For board members and other industry leaders, being on the right side of such disruption typically requires looking years ahead. But with the release of ChatGPT in November, 2022, OpenAI "suddenly" and shockingly threatened to overthrow Google's hitherto total dominance of internet search.


The Debate Over Understanding in AI's Large Language Models

arXiv.org Artificial Intelligence

We survey a current, heated debate in the AI research community on whether large pre-trained language models can be said to "understand" language -- and the physical and social situations language encodes -- in any important sense. We describe arguments that have been made for and against such understanding, and key questions for the broader sciences of intelligence that have arisen in light of these arguments. We contend that a new science of intelligence can be developed that will provide insight into distinct modes of understanding, their strengths and limitations, and the challenge of integrating diverse forms of cognition.


The Wisdom of Hindsight Makes Language Models Better Instruction Followers

arXiv.org Artificial Intelligence

Reinforcement learning has seen wide success in finetuning large language models to better align with instructions via human feedback. The so-called algorithm, Reinforcement Learning with Human Feedback (RLHF) demonstrates impressive performance on the GPT series models. However, the underlying Reinforcement Learning (RL) algorithm is complex and requires an additional training pipeline for reward and value networks. In this paper, we consider an alternative approach: converting feedback to instruction by relabeling the original one and training the model for better alignment in a supervised manner. Such an algorithm doesn't require any additional parameters except for the original language model and maximally reuses the pretraining pipeline. To achieve this, we formulate instruction alignment problem for language models as a goal-reaching problem in decision making. We propose Hindsight Instruction Relabeling (HIR), a novel algorithm for aligning language models with instructions. The resulting two-stage algorithm shed light to a family of reward-free approaches that utilize the hindsightly relabeled instructions based on feedback. We evaluate the performance of HIR extensively on 12 challenging BigBench reasoning tasks and show that HIR outperforms the baseline algorithms and is comparable to or even surpasses supervised finetuning.


Distillation of encoder-decoder transformers for sequence labelling

arXiv.org Artificial Intelligence

Driven by encouraging results on a wide range of tasks, the field of NLP is experiencing an accelerated race to develop bigger language models. This race for bigger models has also underscored the need to continue the pursuit of practical distillation approaches that can leverage the knowledge acquired by these big models in a compute-efficient manner. Having this goal in mind, we build on recent work to propose a hallucination-free framework for sequence tagging that is especially suited for distillation. We show empirical results of new state-of-the-art performance across multiple sequence labelling datasets and validate the usefulness of this framework for distilling a large model in a few-shot learning scenario.