Large Language Model
Metal - Best AI Tools
Embeddings are simple to develop, test, and utilise in apps thanks to Metal, an enterprise-ready, fully-managed ML storage platform. It can accept bespoke embeddings, interface with numerous vector databases, and organise the employment of top models to generate embeddings. Moreover, it provides OpenAI Ada, which adds intelligence to text, and OpenAI CLIP, which links text and images. Save my name, email, and website in this browser for the next time I comment.
Elon Musk Is Bringing the Culture Wars to AI
It was only a matter of time before the culture wars came to AI. Since the release of ChatGPT in late 2022, Elon Musk has railed on Twitter against what he has called "Woke AI." He has specifically criticized ChatGPT's developer, OpenAI, for the features designed to prevent the chatbot from parroting racism and sexism. Now, the billionaire is courting AI researchers with a proposal to start a new AI company to rival the developer of ChatGPT, the tech news site The Information reported on Wednesday. "The danger of training AI to be woke--in other words, lie--is deadly," Musk tweeted in December.
ChatGPT: How to make your work life easier 101 - Startup World Tech
So, in an effort to align your startup business with the modern day and age, you have decided to enlist the artificial intelligence of ChatGPT. Arguably an intimidating step, as the commonplace fears of replacement by means of superior technology plague most of us inevitably. Among many professionals, these ideas about losing jobs to AI have been on the rise in modern times since the invention of the internet, and the advancement of robotics. If your job is to carry as many plates and cutlery as possible without dropping any, one or the other Boston Dynamics YouTube video of a humanoid machine, conveying itself and a gigantic load at inhuman speed, might give you pause. You may feel the same way about ChatGPT.
ChatGPT for Data Science Cheat Sheet - KDnuggets
You probably haven't heard of ChatGPT yet... Aside from stealing your job, spreading lies, and plagiarizing on a mass scale (varying degrees of sarcasm intended for each of those common points of detraction), ChatGPT can also be very useful for the average data scientist in their day to day life. The problem that we are currently grappling with is one of expectations: some expect that ChatGPT is nothing more than a stochastic parrot that produces rubbish and is completely useless, while others seem to think (either pessimistically or optimistically, depending on who you ask) that ChatGPT will be taking over the world, while either ridding us of monotony or laying waste to human civilization. If we all tempered our expectations and recognized this technology for what it is, and what it could be useful for, we would all be far better off. But in the era of outrage and hyperbole, nuance takes a back seat to hot takes. ChatGPT (and, indeed, the most robust and latest versions of GPT3) is meant to assist (that's right... assist!) humans that decide to use it as such, and with a little help from your friends at KDnuggets you will be able to hone your prompt engineering skills to do useful things like generate code, assist in your research process, and analyze data.
The Download: how ChatGPT was made, and a boost for infertility treatment
When OpenAI launched ChatGPT, with zero fanfare, in late November 2022, nobody inside the company was prepared for a viral mega-hit. It was viewed in-house as a "research preview," a tease of a more polished version of a two-year-old technology and a way to iron out some of its flaws. But then it absolutely blew up. The firm has been scrambling to catch up--and capitalize on its success--ever since. To get the inside story behind the chatbot--how it was made, how OpenAI has been updating it since release, and how its makers feel about its success--our senior AI editor Will Douglas Heaven talked to four people who helped build what has become the most popular internet app ever.
The inside story of how ChatGPT was built from the people who made it
To get the inside story behind the chatbot--how it was made, how OpenAI has been updating it since release, and how its makers feel about its success--I talked to four people who helped build what has become one of the most popular internet apps ever. In addition to Agarwal and Fedus, I spoke to John Schulman, a cofounder of OpenAI, and Jan Leike, the leader of OpenAI's alignment team, which works on the problem of making AI do what its users want it to do (and nothing more). What I came away with was the sense that OpenAI is still bemused by the success of its research preview, but has grabbed the opportunity to push this technology forward, watching how millions of people are using it and trying to fix the worst problems as they come up. Since November, OpenAI has already updated ChatGPT several times. The researchers are using a technique called adversarial training to stop ChatGPT from letting users trick it into behaving badly (known as jailbreaking).
ChatGPT can be made to write scam emails and it slashes their cost
Scammers could use ChatGPT to write phishing emails at a fraction of the cost of a human-penned missive, potentially cutting the cost per email by about 96 per cent. The popular chatbot, which is based on a large language model (LLM), was released by OpenAI in November 2022 and has since become a useful tool in many industries.
MathPrompter: Mathematical Reasoning using Large Language Models
Imani, Shima, Du, Liang, Shrivastava, Harsh
Large Language Models (LLMs) have limited performance when solving arithmetic reasoning tasks and often provide incorrect answers. Unlike natural language understanding, math problems typically have a single correct answer, making the task of generating accurate solutions more challenging for LLMs. To the best of our knowledge, we are not aware of any LLMs that indicate their level of confidence in their responses which fuels a trust deficit in these models impeding their adoption. To address this deficiency, we propose `MathPrompter', a technique that improves performance of LLMs on arithmetic problems along with increased reliance in the predictions. MathPrompter uses the Zero-shot chain-of-thought prompting technique to generate multiple Algebraic expressions or Python functions to solve the same math problem in different ways and thereby raise the confidence level in the output results. This is in contrast to other prompt based CoT methods, where there is no check on the validity of the intermediate steps followed. Our technique improves over state-of-the-art on the MultiArith dataset ($78.7\%\rightarrow92.5\%$) evaluated using 175B parameter GPT-based LLM.
CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos
Dong, Hao-Wen, Takahashi, Naoya, Mitsufuji, Yuki, McAuley, Julian, Berg-Kirkpatrick, Taylor
Recent years have seen progress beyond domain-specific sound separation for speech or music towards universal sound separation for arbitrary sounds. Prior work on universal sound separation has investigated separating a target sound out of an audio mixture given a text query. Such text-queried sound separation systems provide a natural and scalable interface for specifying arbitrary target sounds. However, supervised text-queried sound separation systems require costly labeled audio-text pairs for training. Moreover, the audio provided in existing datasets is often recorded in a controlled environment, causing a considerable generalization gap to noisy audio in the wild. In this work, we aim to approach text-queried universal sound separation by using only unlabeled data. We propose to leverage the visual modality as a bridge to learn the desired audio-textual correspondence. The proposed CLIPSep model first encodes the input query into a query vector using the contrastive language-image pretraining (CLIP) model, and the query vector is then used to condition an audio separation model to separate out the target sound. While the model is trained on image-audio pairs extracted from unlabeled videos, at test time we can instead query the model with text inputs in a zero-shot setting, thanks to the joint language-image embedding learned by the CLIP model. Further, videos in the wild often contain off-screen sounds and background noise that may hinder the model from learning the desired audio-textual correspondence. To address this problem, we further propose an approach called noise invariant training for training a query-based sound separation model on noisy data. Experimental results show that the proposed models successfully learn text-queried universal sound separation using only noisy unlabeled videos, even achieving competitive performance against a supervised model in some settings.
Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT
Amin, Mostafa M., Cambria, Erik, Schuller, Björn W.
ChatGPT has shown the potential of emerging general artificial intelligence capabilities, as it has demonstrated competent performance across many natural language processing tasks. In this work, we evaluate the capabilities of ChatGPT to perform text classification on three affective computing problems, namely, big-five personality prediction, sentiment analysis, and suicide tendency detection. We utilise three baselines, a robust language model (RoBERTa-base), a legacy word model with pretrained embeddings (Word2Vec), and a simple bag-of-words baseline (BoW). Results show that the RoBERTa trained for a specific downstream task generally has a superior performance. On the other hand, ChatGPT provides decent results, and is relatively comparable to the Word2Vec and BoW baselines. ChatGPT further shows robustness against noisy data, where Word2Vec models achieve worse results due to noise. Results indicate that ChatGPT is a good generalist model that is capable of achieving good results across various problems without any specialised training, however, it is not as good as a specialised model for a downstream task.