AITopics | callison-burch

Collaborating Authors

callison-burch

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A Novel Dialect-Aware Framework for the Classification of Arabic Dialects and Emotions

Alsadhan, Nasser A

arXiv.org Artificial IntelligenceFeb-13-2025

Arabic is one of the oldest languages still in use today. As a result, several Arabic-speaking regions have developed dialects that are unique to them. Dialect and emotion recognition have various uses in Arabic text analysis, such as determining an online customer's origin based on their comments. Furthermore, intelligent chatbots that are aware of a user's emotions can respond appropriately to the user. Current research in emotion detection in the Arabic language lacks awareness of how emotions are exhibited in different dialects, which motivates the work found in this study. This research addresses the problems of dialect and emotion classification in Arabic. Specifically, this is achieved by building a novel framework that can identify and predict Arabic dialects and emotions from a given text. The framework consists of three modules: A text-preprocessing module, a classification module, and a clustering module with the novel capability of building new dialect-aware emotion lexicons. The proposed framework generated a new emotional lexicon for different dialects. It achieved an accuracy of 88.9% in classifying Arabic dialects, which outperforms the state-of-the-art results by 6.45 percentage points. Furthermore, the framework achieved 89.1-79% accuracy in detecting emotions in the Egyptian and Gulf dialects, respectively.

artificial intelligence, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.3844/jcssp.2025.88.95

2502.09128

Country:

Asia > Middle East > Saudi Arabia > Riyadh Province > Riyadh (0.05)
Africa > Middle East > Tunisia (0.04)
Africa > Middle East > Morocco (0.04)
(9 more...)

Genre: Research Report (1.00)

Industry: Media (0.30)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.86)
(2 more...)

Add feedback

Learning Translations via Matrix Completion

Wijaya, Derry, Callahan, Brendan, Hewitt, John, Gao, Jie, Ling, Xiao, Apidianaki, Marianna, Callison-Burch, Chris

arXiv.org Artificial IntelligenceJun-19-2024

Bilingual Lexicon Induction is the task of learning word translations without bilingual parallel corpora. We model this task as a matrix completion problem, and present an effective and extendable framework for completing the matrix. This method harnesses diverse bilingual and monolingual signals, each of which may be incomplete or noisy. Our model achieves state-of-the-art performance for both high and low resource languages.

computational linguistic, proceedings, translation, (16 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/D17-1152

2406.13195

Country:

North America > United States > Pennsylvania (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)
Europe > Middle East > Malta > Port Region > Southern Harbour District > Valletta (0.04)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

Artificial intelligence (AI) Real or Fake Text? We Can Learn to Spot the Difference

#artificialintelligenceMar-10-2023, 03:50:09 GMT

The most recent generation of chatbots has surfaced longstanding concerns about the growing sophistication and accessibility of artificial intelligence. Fears about the integrity of the job market -- from the creative economy to the managerial class -- have spread to the classroom as educators rethink learning in the wake of ChatGPT. Yet while apprehensions about employment and schools dominate headlines, the truth is that the effects of large-scale language models such as ChatGPT will touch virtually every corner of our lives. These new tools raise society-wide concerns about artificial intelligence's role in reinforcing social biases, committing fraud and identity theft, generating fake news, spreading misinformation and more. A team of researchers at the University of Pennsylvania School of Engineering and Applied Science is seeking to empower tech users to mitigate these risks.

artificial intelligence, callison-burch, intelligence, (9 more...)

#artificialintelligence

Country: North America > United States > Pennsylvania (0.25)

Genre: Research Report > New Finding (0.31)

Industry:

Media > News (0.72)
Education > Educational Setting (0.51)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Improving Paraphrase Detection with the Adversarial Paraphrasing Task

Nighojkar, Animesh, Licato, John

arXiv.org Artificial IntelligenceJun-14-2021

If two sentences have the same meaning, it should follow that they are equivalent in their inferential properties, i.e., each sentence should textually entail the other. However, many paraphrase datasets currently in widespread use rely on a sense of paraphrase based on word overlap and syntax. Can we teach them instead to identify paraphrases in a way that draws on the inferential properties of the sentences, and is not over-reliant on lexical and syntactic similarities of a sentence pair? We apply the adversarial paradigm to this question, and introduce a new adversarial method of dataset creation for paraphrase identification: the Adversarial Paraphrasing Task (APT), which asks participants to generate semantically equivalent (in the sense of mutually implicative) but lexically and syntactically disparate paraphrases. These sentence pairs can then be used both to test paraphrase identification models (which get barely random accuracy) and then improve their performance. To accelerate dataset generation, we explore automation of APT using T5, and show that the resulting dataset also improves accuracy. We discuss implications for paraphrase detection and release our dataset in the hope of making paraphrase detection models better able to detect sentence-level meaning equivalence.

dataset, sentence pair, twitterppdb, (14 more...)

arXiv.org Artificial Intelligence

2106.07691

Country:

North America > United States > Florida > Hillsborough County > Tampa (0.14)
North America > United States > Florida > Broward County > Fort Lauderdale (0.04)
North America > United States > New York > New York County > New York City (0.04)
(5 more...)

Genre: Research Report (0.82)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.94)

Add feedback

Unsupervised Paraphrasing without Translation

Roy, Aurko, Grangier, David

arXiv.org Machine LearningMay-29-2019

Paraphrasing exemplifies the ability to abstract semantic content from surface forms. Recent work on automatic paraphrasing is dominated by methods leveraging Machine Translation (MT) as an intermediate step. This contrasts with humans, who can paraphrase without being bilingual. This work proposes to learn paraphrasing models from an unlabeled monolingual corpus only. To that end, we propose a residual variant of vector-quantized variational auto-encoder. We compare with MT-based approaches on paraphrase identification, generation, and training augmentation. Monolingual paraphrasing outperforms unsupervised translation in all settings. Comparisons with supervised translation are more mixed: monolingual paraphrasing is interesting for identification and augmentation; supervised translation is superior for generation.

machine learning, natural language, translation, (17 more...)

arXiv.org Machine Learning

1905.12752

Country: Asia > North Korea (0.28)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

fast.ai · Making neural nets uncool again

#artificialintelligenceOct-30-2018, 22:01:13 GMT

In machine learning and deep learning we can't do anything without data. So the people that create datasets for us to train our models are the (often under-appreciated) heroes. Some of the most useful and important datasets are those that become important "academic baselines"; that is, datasets that are widely studied by researchers and used to compare algorithmic changes. Some of these become household names (at least, among households that train models!), such as MNIST, CIFAR 10, and Imagenet. We all owe a debt of gratitude to those kind folks who have made datasets available for the research community.

artificial intelligence, deep learning, machine learning, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.58)

Add feedback

Facebook artificial intelligence spots suicidal users - BBC News

#artificialintelligenceMar-2-2017, 02:35:08 GMT

Facebook has begun using artificial intelligence to identify members that may be at risk of killing themselves. The social network has developed algorithms that spot warning signs in users' posts and the comments their friends leave in response. After confirmation by Facebook's human review team, the company contacts those thought to be at risk of self-harm to suggest ways they can seek help. A suicide helpline chief said the move was "not just helpful but critical". The tool is being tested only in the US at present.

artificial intelligence, facebook, social media, (10 more...)

#artificialintelligence

Country: North America > United States (0.36)

Industry:

Information Technology > Services (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

Facebook artificial intelligence spots suicidal users

BBC NewsMar-1-2017, 11:25:06 GMT

artificial intelligence, facebook, social media, (10 more...)

BBC News

Country: North America > United States (0.36)

Industry:

Information Technology > Services (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

How Surfing the Web Improves Machine Learning ENGINEERING.com

#artificialintelligenceNov-19-2016

The new technique makes machine learning a little more like human learning; a more natural fit for natural language processing. In two separate experiments, the new method outperformed conventional machine learning techniques by about 10 percent. Conventional approaches to machine learning information extraction use vast amounts of training data, which increases the capacity of the system to handle difficult problems. The new approach uses much less data, which more realistically represents the amount of info typically available. The system then deals with the limited information in the same way a human would.

information, machine learning, natural language, (9 more...)

#artificialintelligence

Industry:

Education > Curriculum > Subject-Specific Education (0.40)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Why We Need AI to Study America's Gun Violence Epidemic

#artificialintelligenceOct-12-2016, 22:30:39 GMT

Shootings are an epidemic in the US, but federal funding for research into gun violence has been in a deep freeze since 1996, thanks in part to the NRA-backed Dickey Amendment, which prevents the Center for Disease Control from pursuing research "to advocate or promote gun control." Basically, humans can't get money to research the problem of gun violence in the US. To get around this, some scientists want machines to do the job. On September 25, University of Pennsylvania computer scientists Ellie Pavlick and Chris Callison-Burch unveiled a new, human-annotated database of gun violence incidents in the US at the Bloomberg Data for Good Exchange Conference in New York. The database was created by workers on Amazon's Mechanical Turk platform, and carefully highlights information from thousands of news articles over the course of several years, Pavlick told me in an interview.

artificial intelligence, database, natural language, (7 more...)

#artificialintelligence

Country:

North America > United States > Pennsylvania (0.26)
North America > United States > New York (0.26)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language (0.38)

Add feedback