AITopics | Wahle, Jan Philip

Collaborating Authors

Wahle, Jan Philip

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

D3: A Massive Dataset of Scholarly Metadata for Analyzing the State of Computer Science Research

Wahle, Jan Philip, Ruas, Terry, Mohammad, Saif M., Gipp, Bela

arXiv.org Artificial IntelligenceNov-10-2022

DBLP is the largest open-access repository of scientific articles on computer science and provides metadata associated with publications, authors, and venues. We retrieved more than 6 million publications from DBLP and extracted pertinent metadata (e.g., abstracts, author affiliations, citations) from the publication texts to create the DBLP Discovery Dataset (D3). D3 can be used to identify trends in research activity, productivity, focus, bias, accessibility, and impact of computer science research. We present an initial analysis focused on the volume of computer science research (e.g., number of papers, authors, research activity), trends in topics of interest, and citation patterns. Our findings show that computer science is a growing research field ( 15% annually), with an active and collaborative research community. While papers in recent years present more bibliographical entries in comparison to previous decades, the average number of citations has been declining. Investigating papers' abstracts reveals that recent topic trends are clearly reflected in D3. Finally, we list further applications of D3 and pose supplemental research questions. The D3 dataset, our findings, and source code are publicly available for research purposes.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2204.13384

Country:

Europe (0.46)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (0.93)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Analyzing Multi-Task Learning for Abstractive Text Summarization

Kirstein, Frederic, Wahle, Jan Philip, Ruas, Terry, Gipp, Bela

arXiv.org Artificial IntelligenceNov-10-2022

Despite the recent success of multi-task learning and pre-finetuning for natural language understanding, few works have studied the effects of task families on abstractive text summarization. Task families are a form of task grouping during the pre-finetuning stage to learn common skills, such as reading comprehension. To close this gap, we analyze the influence of multi-task learning strategies using task families for the English abstractive text summarization task. We group tasks into one of three strategies, i.e., sequential, simultaneous, and continual multi-task learning, and evaluate trained models through two downstream tasks. We find that certain combinations of task families (e.g., advanced reading comprehension and natural language inference) positively impact downstream performance. Further, we find that choice and combinations of task families influence downstream performance more than the training scheme, supporting the use of task families for abstractive text summarization.

abstractive text summarization, artificial intelligence, natural language, (1 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/2022.gem-1.5

2210.14606

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Testing the Generalization of Neural Language Models for COVID-19 Misinformation Detection

Wahle, Jan Philip, Ashok, Nischal, Ruas, Terry, Meuschke, Norman, Ghosal, Tirthankar, Gipp, Bela

arXiv.org Artificial IntelligenceNov-29-2021

A drastic rise in potentially life-threatening misinformation has been a by-product of the COVID-19 pandemic. Computational support to identify false information within the massive body of data on the topic is crucial to prevent harm. Researchers proposed many methods for flagging online misinformation related to COVID-19. However, these methods predominantly target specific content types (e.g., news) or platforms (e.g., Twitter). The methods' capabilities to generalize were largely unclear so far. We evaluate fifteen Transformer-based models on five COVID-19 misinformation datasets that include social media posts, news articles, and scientific papers to fill this gap. We show tokenizers and models tailored to COVID-19 data do not provide a significant advantage over general-purpose ones. Our study provides a realistic assessment of models for detecting COVID-19 misinformation. We expect that evaluating a broad spectrum of datasets and models will benefit future research in developing misinformation detection systems.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2111.07819

Country:

Oceania > Australia (0.14)
North America > United States (0.14)
Europe > Spain (0.14)
(2 more...)

Genre: Research Report > Experimental Study (0.88)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)

Add feedback

Incorporating Word Sense Disambiguation in Neural Language Models

Wahle, Jan Philip, Ruas, Terry, Meuschke, Norman, Gipp, Bela

arXiv.org Artificial IntelligenceJun-15-2021

We present two supervised (pre-)training methods to incorporate gloss definitions from lexical resources into neural language models (LMs). The training improves our models' performance for Word Sense Disambiguation (WSD) but also benefits general language understanding tasks while adding almost no parameters. We evaluate our techniques with seven different neural LMs and find that XLNet is more suitable for WSD than BERT. Our best-performing methods exceeds state-of-the-art WSD techniques on the SemCor 3.0 dataset by 0.5% F1 and increase BERT's performance on the GLUE benchmark by 1.1% on average.

artificial intelligence, computational linguistics, text processing, (19 more...)

arXiv.org Artificial Intelligence

2106.07967

Country:

Europe (0.93)
North America > United States > New Jersey (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > Colorado (0.14)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.92)

Add feedback

Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection

Wahle, Jan Philip, Ruas, Terry, Meuschke, Norman, Gipp, Bela

arXiv.org Artificial IntelligenceMar-23-2021

The rise of language models such as BERT allows for high-quality text paraphrasing. This is a problem to academic integrity, as it is difficult to differentiate between original and machine-generated content. We propose a benchmark consisting of paraphrased articles using recent language models relying on the Transformer architecture. Our contribution fosters future research of paraphrase detection systems as it offers a large collection of aligned original and paraphrased documents, a study regarding its structure, classification experiments with state-of-the-art systems, and we make our findings publicly available.

artificial intelligence, arxiv, text processing, (14 more...)

arXiv.org Artificial Intelligence

2103.1245

Country: Europe (0.15)

Genre: Research Report > New Finding (0.48)

Industry: Education (0.90)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.50)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.50)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.48)

Add feedback

Identifying Machine-Paraphrased Plagiarism

Wahle, Jan Philip, Ruas, Terry, Foltýnek, Tomáš, Meuschke, Norman, Gipp, Bela

arXiv.org Artificial IntelligenceMar-22-2021

Employing paraphrasing tools to conceal plagiarized text is a severe threat to academic integrity. To enable the detection of machine-paraphrased text, we evaluate the effectiveness of five pre-trained word embedding models combined with machine learning classifiers and state-of-the-art neural language models. We analyze preprints of research papers, graduation theses, and Wikipedia articles, which we paraphrased using different configurations of the tools SpinBot and SpinnerChief. The best performing technique, Longformer, achieved an average F1 score of 80.99% (F1=99.68% for SpinBot and F1=71.64% for SpinnerChief cases), while human evaluators achieved F1=78.4% for SpinBot and F1=65.6% for SpinnerChief cases. We show that the automated classification alleviates shortcomings of widely-used text-matching systems, such as Turnitin and PlagScan. To facilitate future research, all data, code, and two web applications showcasing our contributions are openly available.

arxiv, deep learning, neural network, (21 more...)

arXiv.org Artificial Intelligence

2103.11909

Country:

Europe > Czechia (0.14)
North America > United States (0.14)
North America > Canada (0.14)
(2 more...)

Genre: Research Report > New Finding (0.68)

Industry: Education (0.89)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Add feedback