AITopics

doi: 10.2196/48904

2304.11567

Country:

North America > United States > Texas (0.04)
North America > United States > New York (0.04)
North America > United States > Massachusetts (0.04)
(2 more...)

Genre: Research Report (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.69)
Health & Medicine > Health Care Technology > Medical Record (0.34)
Health & Medicine > Diagnostic Medicine > Imaging (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

WIREDApr-22-2023, 13:00:00 GMT

Criminals Are Using Tiny Devices to Hack and Steal Cars

Employees of the US Immigration and Customs Enforcement agency (ICE) abused law enforcement databases to snoop on their romantic partners, neighbors, and business associates, WIRED exclusively revealed this week. New data obtained through record requests show that hundreds of ICE staffers and contractors have faced investigations since 2016 for attempting to access medical, biometric, and location data without permission. The revelations raise further questions about the protections ICE places on people's sensitive information. Security researchers at ESET found old enterprise routers are filled with company secrets. After purchasing and analyzing old routers, the firm found many contained login details for company VPNs, hashed root administrator passwords, and details of who the previous owners were.

hack and steal car, security researcher, tiny device, (16 more...)

WIRED

Country:

North America > United States (0.90)
Asia > Russia (0.16)
North America > Canada > Ontario > Toronto (0.15)
(7 more...)

Genre: Research Report > New Finding (0.70)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Immigration & Customs (1.00)
Government > Regional Government > North America Government > United States Government (0.55)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Social Media (0.98)
Information Technology > Communications > Networks (0.91)
(3 more...)

Communications of the ACMApr-22-2023, 10:50:10 GMT

AlphaFold Spreads through Protein Science

Two years ago, as the COVID-19 pandemic swept across the world, researchers at DeepMind, the artificial intelligence (AI) and research laboratory subsidiary of Alphabet Inc., demonstrated how it could use machine learning to achieve a breakthrough in the ability to predict how proteins, the work-horses of the living cell, fold into the intricate shapes they take on. The work gave hope to biologists that they could use this kind of tool to tackle diseases such as the SARS-CoV-2 coronavirus much more quickly in the future. Researchers were able to assess the abilities of DeepMind's AlphaFold2 thanks to its inclusion in the 14th Critical Assessment of Structure Prediction (CASP14), a benchmarking competition that ran through 2020 and which added a parallel program to uncover the structures of key proteins from the SARS-CoV2 virus to try to accelerate vaccine and drug development. The organizers of CASP14 declared the tool represented "an almost complete solution to the problem of computing three-dimensional structure from amino-acid sequences," though some caveats lie behind that statement. In principle, quantum mechanical simulations can predict which collection of folds leads to the lowest combined energy of all the chemical bonds in the shape and the water and other molecules around it.

accuracy, alphafold2, protein, (17 more...)

Communications of the ACM

Country:

North America > United States > New Mexico (0.05)
North America > United States > Illinois > Cook County > Chicago (0.05)
Europe > United Kingdom > England > Surrey (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.05)

Industry: Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.56)

Communications of the ACMApr-22-2023, 10:50:08 GMT

ChatGPT, Can You Tell Me a Story?

As generative AI tools continue to overwhelm "future of technology" discussions at every level, Communications' Senior Editor Ralph Raiola thought it might be interesting to collaborate with OpenAI's ChatGPT on an original sci-fi short story. Here's a full transcript of the process and a partially finished product. COMMUNICATIONS: ChatGPT, would you like to write a sci-fi short story with me?

chatgpt, sci-fi short story

Communications of the ACM

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.84)

The GuardianApr-22-2023, 10:00:07 GMT

Artificial intelligence – coming to a government near you soon?

The recent blizzard of warnings about artificial intelligence and how it is transforming learning, upending legal, financial and organizational functions, and reshaping social and cultural interaction, have mostly left out the role it is already playing in governance. Governments in the US at every level are attempting the transition from a programmatic model of service delivery to a citizen-focused model. Los Angeles, the US's second largest city, is a pioneer in the field, unveiling technologies to help streamline bureaucratic functions from police recruitment to paying parking tickets to filling potholes or locating resources at the library. For now, AI advances are limited to automation. When ChatGPT was asked recently about how it might change how people deal with government, it responded that "the next generation of AI, which includes ChatGPT, has the potential to revolutionize the way governments interact with their citizens."

chatgpt, government, information, (12 more...)

The Guardian

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.25)
North America > United States > Massachusetts (0.05)
Europe > Ukraine (0.05)
(2 more...)

Industry:

Government > Voting & Elections (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law (0.91)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Understanding EFL Student Idea Generation Strategies for Creative Writing with NLG Tools

Woo, David James, Wang, Yanzhi, Susanto, Hengky, Guo, Kai

Natural language generation (NLG) is a process within artificial intelligence where computer systems produce human-comprehensible language texts from information. English as a foreign language (EFL) students' use of NLG tools might facilitate their idea generation, which is fundamental to creative writing. However, little is known about how EFL students interact with NLG tools to generate ideas. This study explores strategies adopted by EFL students when searching for ideas using NLG tools, evaluating ideas generated by NLG tools and selecting NLG tools for ideas generation. Four Hong Kong secondary school students attended workshops where they learned to write stories comprising their own words and words generated by NLG tools. After the workshops, they answered questions to reflect on their writing experience with NLG tools. In a thematic analysis of the written reflections, we found students may have existing ideas when searching for ideas and evaluating ideas with NLG tools. Students showed some aversion to ideas generated by NLG tools and selected NLG tools that generated a greater quantity of ideas. The findings inform our understanding of EFL students' concerns when using NLG tools for idea generation and can inform educators' instruction to implement NLG tools for classroom creative writing.

large language model, machine learning, natural language, (19 more...)

doi: 10.1177/07356331231175999

2207.01484

Country:

Asia > China > Hong Kong (0.25)
North America > United States > New York (0.04)
North America > United States > Massachusetts > Middlesex County > Lowell (0.04)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry: Education > Educational Setting > K-12 Education > Secondary School (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)
Information Technology > Artificial Intelligence > Natural Language > Generation (0.69)

Can ChatGPT Reproduce Human-Generated Labels? A Study of Social Computing Tasks

Zhu, Yiming, Zhang, Peixian, Haq, Ehsan-Ul, Hui, Pan, Tyson, Gareth

The release of ChatGPT has uncovered a range of possibilities whereby large language models (LLMs) can substitute human intelligence. In this paper, we seek to understand whether ChatGPT has the potential to reproduce human-generated label annotations in social computing tasks. Such an achievement could significantly reduce the cost and complexity of social computing research. As such, we use ChatGPT to relabel five seminal datasets covering stance detection (2x), sentiment analysis, hate speech, and bot detection. Our results highlight that ChatGPT does have the potential to handle these data annotation tasks, although a number of challenges remain. ChatGPT obtains an average accuracy 0.609. Performance is highest for the sentiment analysis dataset, with ChatGPT correctly annotating 64.9% of tweets. Yet, we show that performance varies substantially across individual labels. We believe this work can open up new lines of analysis and act as a basis for future research into the exploitation of ChatGPT for human annotation tasks.

large language model, machine learning, natural language, (17 more...)

2304.10145

Country:

Europe > Ukraine (0.30)
Asia > Russia (0.29)
Europe > Russia (0.05)
(5 more...)

Genre: Research Report > New Finding (0.66)

Industry:

Law (0.95)
Health & Medicine > Therapeutic Area > Immunology (0.70)
Information Technology > Services (0.68)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Pipeline MoE: A Flexible MoE Implementation with Pipeline Parallelism

Chen, Xin, Zhang, Hengheng, Gu, Xiaotao, Bi, Kaifeng, Xie, Lingxi, Tian, Qi

The Mixture of Experts (MoE) model becomes an important choice of large language models nowadays because of its scalability with sublinear computational complexity for training and inference. However, existing MoE models suffer from two critical drawbacks, 1) tremendous inner-node and inter-node communication overhead introduced by all-to-all dispatching and gathering, and 2) limited scalability for the backbone because of the bound data parallel and expert parallel to scale in the expert dimension. In this paper, we systematically analyze these drawbacks in terms of training efficiency in the parallel framework view and propose a novel MoE architecture called Pipeline MoE (PPMoE) to tackle them. PPMoE builds expert parallel incorporating with tensor parallel and replaces communication-intensive all-to-all dispatching and gathering with a simple tensor index slicing and inner-node all-reduce. Besides, it is convenient for PPMoE to integrate pipeline parallel to further scale the backbone due to its flexible parallel architecture. Extensive experiments show that PPMoE not only achieves a more than $1.75\times$ speed up compared to existing MoE architectures but also reaches $90\%$ throughput of its corresponding backbone model that is $20\times$ smaller.

large language model, machine learning, natural language, (17 more...)

2304.11414

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

N2G: A Scalable Approach for Quantifying Interpretable Neuron Representations in Large Language Models

Foote, Alex, Nanda, Neel, Kran, Esben, Konstas, Ionnis, Barez, Fazl

Understanding the function of individual neurons within language models is essential for mechanistic interpretability research. We propose $\textbf{Neuron to Graph (N2G)}$, a tool which takes a neuron and its dataset examples, and automatically distills the neuron's behaviour on those examples to an interpretable graph. This presents a less labour intensive approach to interpreting neurons than current manual methods, that will better scale these methods to Large Language Models (LLMs). We use truncation and saliency methods to only present the important tokens, and augment the dataset examples with more diverse samples to better capture the extent of neuron behaviour. These graphs can be visualised to aid manual interpretation by researchers, but can also output token activations on text to compare to the neuron's ground truth activations for automatic validation. N2G represents a step towards scalable interpretability methods by allowing us to convert neurons in an LLM to interpretable representations of measurable quality.

artificial intelligence, large language model, natural language, (18 more...)

2304.12918

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > Netherlands (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report (0.51)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Borisov, Vadim, Seßler, Kathrin, Leemann, Tobias, Pawelczyk, Martin, Kasneci, Gjergji

Language Models are Realistic Tabular Data Generators

Tabular data is among the oldest and most ubiquitous forms of data. However, the generation of synthetic samples with the original data's characteristics remains a significant challenge for tabular data. While many generative models from the computer vision domain, such as variational autoencoders or generative adversarial networks, have been adapted for tabular data generation, less research has been directed towards recent transformer-based large language models (LLMs), which are also generative in nature. To this end, we propose GReaT (Generation of Realistic Tabular data), which exploits an auto-regressive generative LLM to sample synthetic and yet highly realistic tabular data. Furthermore, GReaT can model tabular data distributions by conditioning on any subset of features; the remaining features are sampled without additional overhead. We demonstrate the effectiveness of the proposed approach in a series of experiments that quantify the validity and quality of the produced data samples from multiple angles. We find that GReaT maintains state-of-the-art performance across numerous real-world and synthetic data sets with heterogeneous feature types coming in various sizes.

large language model, machine learning, tabular data, (19 more...)

2210.0628

Country:

Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
North America > United States > California (0.05)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Asia (0.04)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (0.94)
Banking & Finance > Real Estate (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Generation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)