AITopics

Generative AI models have shown impressive ability to produce images with text prompts, which could benefit creativity in visual art creation and self-expression. However, it is unclear how precisely the generated images express contexts and emotions from the input texts. We explored the emotional expressiveness of AI-generated images and developed RePrompt, an automatic method to refine text prompts toward precise expression of the generated images. Inspired by crowdsourced editing strategies, we curated intuitive text features, such as the number and concreteness of nouns, and trained a proxy model to analyze the feature effects on the AI-generated image. With model explanations of the proxy model, we curated a rubric to adjust text prompts to optimize image generation for precise emotion expression. We conducted simulation and user studies, which showed that RePrompt significantly improves the emotional expressiveness of AI-generated images, especially for negative emotions.

large language model, machine learning, natural language, (20 more...)

doi: 10.1145/3544548.3581402

2302.09466

Country:

Asia > Singapore (0.14)
Europe > Germany > Hamburg (0.06)
North America > United States > New York (0.04)
(3 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)
Research Report > Experimental Study (0.68)

Industry: Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
(2 more...)

Muennighoff, Niklas, Tazi, Nouamane, Magne, Loïc, Reimers, Nils

MTEB: Massive Text Embedding Benchmark

Text embeddings are commonly evaluated on a small set of datasets from a single task not covering their possible applications to other tasks. It is unclear whether state-of-the-art embeddings on semantic textual similarity (STS) can be equally well applied to other tasks like clustering or reranking. This makes progress in the field difficult to track, as various models are constantly being proposed without proper evaluation. To solve this problem, we introduce the Massive Text Embedding Benchmark (MTEB). MTEB spans 8 embedding tasks covering a total of 58 datasets and 112 languages. Through the benchmarking of 33 models on MTEB, we establish the most comprehensive benchmark of text embeddings to date. We find that no particular text embedding method dominates across all tasks. This suggests that the field has yet to converge on a universal text embedding method and scale it up sufficiently to provide state-of-the-art results on all embedding tasks. MTEB comes with open-source code and a public leaderboard at https://github.com/embeddings-benchmark/mteb.

data mining, large language model, machine learning, (23 more...)

2210.07316

Country:

Europe > United Kingdom > England (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)
South America > Argentina (0.04)
(6 more...)

Genre:

Overview (0.67)
Research Report (0.64)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.67)
Health & Medicine > Epidemiology (0.67)
(6 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(5 more...)

Behnia, Rouzbeh, Ebrahimi, Mohamamdreza, Pacheco, Jason, Padmanabhan, Balaji

Privately Fine-Tuning Large Language Models with Differential Privacy

Pre-trained Large Language Models (LLMs) are an integral part of modern AI that have led to breakthrough performances in complex AI tasks. Major AI companies with expensive infrastructures are able to develop and train these large models with billions and millions of parameters from scratch. Third parties, researchers, and practitioners are increasingly adopting these pre-trained models and fine-tuning them on their private data to accomplish their downstream AI tasks. However, it has been shown that an adversary can extract/reconstruct the exact training samples from these LLMs, which can lead to revealing personally identifiable information. The issue has raised deep concerns about the privacy of LLMs. Differential privacy (DP) provides a rigorous framework that allows adding noise in the process of training or fine-tuning LLMs such that extracting the training data becomes infeasible (i.e., with a cryptographically small success probability). While the theoretical privacy guarantees offered in most extant studies assume learning models from scratch through many training iterations in an asymptotic setting, this assumption does not hold in fine-tuning scenarios in which the number of training iterations is significantly smaller. To address the gap, we present \ewtune, a DP framework for fine-tuning LLMs based on Edgeworth accountant with finite-sample privacy guarantees. Our results across four well-established natural language understanding (NLU) tasks show that while \ewtune~adds privacy guarantees to LLM fine-tuning process, it directly contributes to decreasing the induced noise to up to 5.6\% and improves the state-of-the-art LLMs performance by up to 1.1\% across all NLU tasks. We have open-sourced our implementations for wide adoption and public testing purposes.

large language model, machine learning, natural language, (16 more...)

doi: 10.1109/ICDMW58026.2022.00078

2210.15042

Country:

Europe > Austria > Vienna (0.14)
North America > United States > Arizona (0.04)
North America > United States > Pennsylvania (0.04)
(6 more...)

Genre: Research Report > New Finding (0.66)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

PanGu-{\Sigma}: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing

Ren, Xiaozhe, Zhou, Pingyi, Meng, Xinfan, Huang, Xinjing, Wang, Yadao, Wang, Weichao, Li, Pengfei, Zhang, Xiaoda, Podolskiy, Alexander, Arshinov, Grigory, Bout, Andrey, Piontkovskaya, Irina, Wei, Jiansheng, Jiang, Xin, Su, Teng, Liu, Qun, Yao, Jun

The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1.085T parameters named PanGu-{\Sigma}. With parameter inherent from PanGu-{\alpha}, we extend the dense Transformer model to sparse one with Random Routed Experts (RRE), and efficiently train the model over 329B tokens by using Expert Computation and Storage Separation(ECSS). This resulted in a 6.3x increase in training throughput through heterogeneous computing. Our experimental findings show that PanGu-{\Sigma} provides state-of-the-art performance in zero-shot learning of various Chinese NLP downstream tasks. Moreover, it demonstrates strong abilities when fine-tuned in application data of open-domain dialogue, question answering, machine translation and code generation.

large language model, machine learning, natural language, (18 more...)

2303.10845

Country:

Asia > India (0.14)
Asia > China > Guangdong Province > Shenzhen (0.05)
Asia > China > Beijing > Beijing (0.04)
(21 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.93)
Law (0.93)
Leisure & Entertainment > Sports > Basketball (0.92)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Rafiepour, Mehrdad, Sartakhti, Javad Salimi

CTRAN: CNN-Transformer-based Network for Natural Language Understanding

Intent-detection and slot-filling are the two main tasks in natural language understanding. In this study, we propose CTRAN, a novel encoder-decoder CNN-Transformer-based architecture for intent-detection and slot-filling. In the encoder, we use BERT, followed by several convolutional layers, and rearrange the output using window feature sequence. We use stacked Transformer encoders after the window feature sequence. For the intent-detection decoder, we utilize self-attention followed by a linear layer. In the slot-filling decoder, we introduce the aligned Transformer decoder, which utilizes a zero diagonal mask, aligning output tags with input tokens. We apply our network on ATIS and SNIPS, and surpass the current state-of-the-art in slot-filling on both datasets. Furthermore, we incorporate the language model as word embeddings, and show that this strategy yields a better result when compared to the language model as an encoder.

cnn-transformer-based network, ctran

doi: 10.1016/j.engappai.2023.107013

2303.10606

Genre: Research Report (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Understanding (0.60)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.60)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

#artificialintelligenceMar-18-2023, 23:59:37 GMT

AI causing concern among professors at Utah Tech University – St George News

Professors at Utah Tech University are worried about AI technologies being used for classwork. Launched on Nov. 30 by OpenAI, ChatGPT is an artificially intelligent chat box that gives human-like, computer generated responses to any prompt it is given. ChatGPT, Moonbeam and Jasper are just a few websites where members can log in, input a prompt or question, and receive human-like artificially generated speech, marketing messages or even full essays. Professors are concerned that this could lead to cheating that is virtually impossible to detect, as well as a decrease in critical thinking among students. Randy Jasmine and Jim Haendiges, English professors at Utah Tech University, addressed the topic in an episode on their podcast, "Being Human UTU Podcast."

george news, haendige, student, (10 more...)

Country:

North America > United States > Utah (0.97)
North America > United States > Idaho (0.06)

Industry: Education > Curriculum > Subject-Specific Education (0.37)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.80)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)

#artificialintelligenceMar-18-2023, 22:31:12 GMT

GPT-4 vs GPT-3.5: The Battle of AI Titans

Despite their impressive capabilities, both GPT-4 and GPT-3.5 have limitations. These models can occasionally generate incorrect or nonsensical information, especially when dealing with ambiguous or complex queries. Additionally, both models may produce text that is overly verbose, repeating the same information in different ways. It's important to keep these limitations in mind when using these models for critical applications.

gpt-3, gpt-4 and gpt-3, gpt-4 vs gpt-3, (8 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceMar-18-2023, 22:30:55 GMT

Will I be replaced by chatGPT?. Will I be replaced by chatGPT?

TL;DR: If you have expertise and vision in your field, possess a keen sense of judgment, and embrace AI as a tool to increase productivity, your job should be safe. Like the invention of the camera, an artist with a great mind and taste still thrives. And also as the old saying, you cannot beat the market if you just follow the crowd. Some jobs may become obsolete with the advent of new technologies, however, it's important to remember that these models have limitations and are not capable of all forms of logical deduction or induction. While some researchers are actively working on areas where ChatGPT may fall short, such as reasoning and multimodal capabilities, this shouldn't be the sole reason to feel safe in your job. It's possible that a model could outperform a human in certain areas of reasoning today, and even solve complex problems like the Riemann Hypothesis in the future.

chatgpt, corpora, rlhf

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceMar-18-2023, 17:20:35 GMT

What is ChatGPT? A guide to understanding the AI – Forbes Advisor Australia

When covering investment and personal finance stories, we aim to inform our readers rather than recommend specific financial product or asset classes. While we may highlight certain positives of a financial product or asset class, there is no guarantee that readers will benefit from the product or investment approach and may, in fact, make a loss if they acquire the product or adopt the approach. To the extent any recommendations or statements of opinion or fact made in a story may constitute financial advice, they constitute general information and not personal financial advice in any form. As such, any recommendations or statements do not take into account the financial circumstances, investment objectives, tax implications, or any specific requirements of readers. Readers of our stories should not act on any recommendation without first taking appropriate steps to verify the information in the stories consulting their independent financial adviser in order to ascertain whether the recommendation (if any) is appropriate, having regard to their investment objectives, financial situation and particular needs.

large language model, machine learning, natural language, (10 more...)

Country: Oceania > Australia (0.51)

Industry: Banking & Finance > Financial Services (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.40)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.40)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

#artificialintelligenceMar-18-2023, 17:20:09 GMT

Ignite Friday Digital Marketing News (Updated Every Friday)

This week: TikTok challenges Google and Microsoft with search ads, GPT-4 is on the way, and social media engagement rates are dropping. Here's what happened this week in digital marketing. OpenAI hasn't been in the news enough lately so it's time for a fresh update. The next version of GPT, unimaginatively called GPT-4, will go live soon. In fact, it might already be live by the time you read this. As far as the updates that make it more worthwhile than GPT-3, it's got multimodal functionality. That means it supports text, speech, images, and even video. GPT-4 also works across multiple languages. If you've noticed that your social media engagement rates are on the decline, you're not alone.

google, mueller, twitter, (13 more...)

Country:

North America > Canada (0.05)
Asia > China (0.05)
Oceania > New Zealand (0.04)
(2 more...)

Industry: Information Technology > Services (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)