Large Language Model
Hey ChatGPT, What Can You Do For Humans?
An AI taking the world by surprise would have sounded like a fantasy movie or at least something that won't happen anytime soon. But OpenAI -- an AI research laboratory, has turned the odds by creating ChatGPT -- a very powerful AI assistant. They are making some serious advancements in the field of images and language understanding, and it would be exciting to see what they will come up next in 2023! ChatGPT became an internet sensation within a few days of becoming public. In this blog, we will look at every important aspect you need to know about ChatGPT so that you can also leverage this amazing technology in your day-to-day work. Let's hear this directly from the AI: So, it's an AI assistant that provides information and answers questions to the best of its ability.
AI Platforms like ChatGPT Are Easy to Use
Something incredible is happening in artificial intelligence right now--but it's not entirely good. Everybody is talking about systems like ChatGPT, which generates text that seems remarkably human. This makes it fun to play with, but there is a dark side, too. Because they are so good at imitating human styles, there is risk that such chatbots could be used to mass-produce misinformation. To get a sense of what it does best at its best, consider this example generated by ChatGPT, sent to me over e-mail by Henry Minsky (son of Marvin Minsky, one of AI's foundational researchers).
Prepare for a Digital Transformation Bigger Than Before ( Insights Of ChatGPT)
Technology is rapidly changing every day with newer and newer innovations and our current technologically advancing phase is considered the Third Industrial Revolution. While this is also true but calling this an Industrial Revolution raises another question "Is Third Industrial Revolution already ended silently into a newer Fourth Revolution?" Historically, Dark Age started after the Fall Of Rome when the Church started ruling the entire Europe and what the Bible says become Laws. As a result, entire Europe is heavily religionized and philosophers, scholars and artists became outlawed. Then after many years, new ideas and perspectives on the world away from what the Church have been telling emerged in Florence, Italy. Later, that became what is now known as Renaissance. A period of enlightenment and initial steppings into the scientific revolution later contributed to the Industrial Revolution.
Methodological reflections for AI alignment research using human feedback
Hagendorff, Thilo, Fabi, Sarah
The field of artificial intelligence (AI) alignment aims to investigate whether AI technologies align with human interests and values and function in a safe and ethical manner. AI alignment is particularly relevant for large language models (LLMs), which have the potential to exhibit unintended behavior due to their ability to learn and adapt in ways that are difficult to predict. In this paper, we discuss methodological challenges for the alignment problem specifically in the context of LLMs trained to summarize texts. In particular, we focus on methods for collecting reliable human feedback on summaries to train a reward model which in turn improves the summarization model. We conclude by suggesting specific improvements in the experimental design of alignment studies for LLMs' summarization capabilities.
Chatbots in a Botnet World
Question-and-answer formats provide a novel experimental platform for investigating cybersecurity questions. Unlike previous chatbots, the latest ChatGPT model from OpenAI supports an advanced understanding of complex coding questions. The research demonstrates thirteen coding tasks that generally qualify as stages in the MITRE ATT&CK framework, ranging from credential access to defense evasion. With varying success, the experimental prompts generate examples of keyloggers, logic bombs, obfuscated worms, and payment-fulfilled ransomware. The empirical results illustrate cases that support the broad gain of functionality, including self-replication and self-modification, evasion, and strategic understanding of complex cybersecurity goals. One surprising feature of ChatGPT as a language-only model centers on its ability to spawn coding approaches that yield images that obfuscate or embed executable programming steps or links.
When are Lemons Purple? The Concept Association Bias of CLIP
Yamada, Yutaro, Tang, Yingtian, Yildirim, Ilker
Large-scale vision-language models such as CLIP have shown impressive performance on zero-shot image classification and image-to-text retrieval. However, such zero-shot performance of CLIP-based models does not realize in tasks that require a finer-grained correspondence between vision and language, such as Visual Question Answering (VQA). We investigate why this is the case, and report an interesting phenomenon of CLIP, which we call the Concept Association Bias (CAB), as a potential cause of the difficulty of applying CLIP to VQA and similar tasks. CAB is especially apparent when two concepts are present in the given image while a text prompt only contains a single concept. In such a case, we find that CLIP tends to treat input as a bag of concepts and attempts to fill in the other missing concept crossmodally, leading to an unexpected zero-shot prediction. For example, when asked for the color of a lemon in an image, CLIP predicts ``purple'' if the image contains a lemon and an eggplant. We demonstrate the Concept Association Bias of CLIP by showing that CLIP's zero-shot classification performance greatly suffers when there is a strong concept association between an object (e.g. lemon) and an attribute (e.g. its color). On the other hand, when the association between object and attribute is weak, we do not see this phenomenon. Furthermore, we show that CAB is significantly mitigated when we enable CLIP to learn deeper structure across image and text embeddings by adding an additional Transformer on top of CLIP and fine-tuning it on VQA. We find that across such fine-tuned variants of CLIP, the strength of CAB in a model predicts how well it performs on VQA.
Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?
Oh, Byung-Doh, Schuler, William
This work presents a detailed linguistic analysis into why larger Transformer-based pre-trained language models with more parameters and lower perplexity nonetheless yield surprisal estimates that are less predictive of human reading times. First, regression analyses show a strictly monotonic, positive log-linear relationship between perplexity and fit to reading times for the more recently released five GPT-Neo variants and eight OPT variants on two separate datasets, replicating earlier results limited to just GPT-2 (Oh et al., 2022). Subsequently, analysis of residual errors reveals a systematic deviation of the larger variants, such as underpredicting reading times of named entities and making compensatory overpredictions for reading times of function words such as modals and conjunctions. These results suggest that the propensity of larger Transformer-based models to 'memorize' sequences during training makes their surprisal estimates diverge from humanlike expectations, which warrants caution in using pre-trained language models to study human language processing.
Large-scale chemical language representations capture molecular structure and properties
Models based on machine learning can enable accurate and fast molecular property predictions, which is of interest in drug discovery and material design. Various supervised machine learning models have demonstrated promising performance, but the vast chemical space and the limited availability of property labels make supervised learning challenging. Recently, unsupervised transformer-based language models pretrained on a large unlabelled corpus have produced state-of-the-art results in many downstream natural language processing tasks. Inspired by this development, we present molecular embeddings obtained by training an efficient transformer encoder model, MoLFormer, which uses rotary positional embeddings. This model employs a linear attention mechanism, coupled with highly distributed training, on SMILES sequences of 1.1 billion unlabelled molecules from the PubChem and ZINC datasets. We show that the learned molecular representation outperforms existing baselines, including supervised and self-supervised graph neural networks and language models, on several downstream tasks from ten benchmark datasets. They perform competitively on two others. Further analyses, specifically through the lens of attention, demonstrate that MoLFormer trained on chemical SMILES indeed learns the spatial relationships between atoms within a molecule. These results provide encouraging evidence that large-scale molecular language models can capture sufficient chemical and structural information to predict various distinct molecular properties, including quantum-chemical properties. Large language models have recently emerged with extraordinary capabilities, and these methods can be applied to model other kinds of sequence, such as string representations of molecules. Ross and colleagues have created a transformer-based model, trained on a large dataset of molecules, which provides good results on property prediction tasks.
The Near Future of AI is Action-Driven - by John McDonnell
In 2022, large language models (LLMs) finally got good. Specifically, Google and OpenAI have led the way in creating foundation models that respond to instructions more usefully. For OpenAI, this came in the form of Instruct-GPT (OpenAI blogpost), while for Google this was reflected in their FLAN training method (Wei et al. 2022, arxiv). Flan's which beat the Hypermind forecast for MMLU performance two years early: But the best is yet to come. The really exciting applications will be action-driven, where the model acts like an agent choosing actions.
Should marketers be excited or concerned about ChatGPT?
ChatGPT could well be a glimpse into a not-too-distant future where artificial intelligence (AI) can converse, inform and educate our human minds – even offer an immediate solution to my daughter's Year 12 English essay. The model is the creation of OpenAI, a San Francisco-based tech company that wants "to ensure that artificial general intelligence benefits all of humanity". Investors include Silicon Valley luminaries such as Reid Hoffman, Elon Musk and Peter Thiel. OpenAI's biggest benefactor, however, has been Microsoft, which invested $1 billion in the company in 2019 and helped facilitate the ChatGPT project with training on Azure AI supercomputing infrastructure. If you haven't tried it, ChatGPT is free and accessible via a registration link here.