AITopics | Grobelnik, Marko

The 2021 Tokyo Olympics Multilingual News Article Dataset

Novak, Erik, Calcina, Erik, Mladenić, Dunja, Grobelnik, Marko

arXiv.org Artificial IntelligenceFeb-13-2025

In this paper, we introduce a dataset of multilingual news articles covering the 2021 Tokyo Olympics. A total of 10,940 news articles were gathered from 1,918 different publishers, covering 1,350 sub-events of the 2021 Olympics, and published between July 1, 2021, and August 14, 2021. These articles are written in nine languages from different language families and in different scripts. To create the dataset, the raw news articles were first retrieved via a service that collects and analyzes news articles. Then, the articles were grouped using an online clustering algorithm, with each group containing articles reporting on the same sub-event. Finally, the groups were manually annotated and evaluated. The development of this dataset aims to provide a resource for evaluating the performance of multilingual news clustering algorithms, for which limited datasets are available. It can also be used to analyze the dynamics and events of the 2021 Tokyo Olympics from different perspectives. The dataset is available in CSV format and can be accessed from the CLARIN.SI repository.

data mining, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2502.06648

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.83)
Europe (0.68)

Genre: Research Report (0.64)

Industry: Leisure & Entertainment > Sports > Olympic Games (1.00)

Technology:

Information Technology > Communications (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.69)
Information Technology > Data Science > Data Mining (0.69)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.68)

Add feedback

Classification of news spreading barriers

Sittar, Abdul, Mladenic, Dunja, Grobelnik, Marko

arXiv.org Artificial IntelligenceApr-10-2023

News media is one of the most effective mechanisms for spreading information internationally, and many events from different areas are internationally relevant. However, news coverage for some news events is limited to a specific geographical region because of information spreading barriers, which can be political, geographical, economic, cultural, or linguistic. In this paper, we propose an approach to barrier classification where we infer the semantics of news articles through Wikipedia concepts. To that end, we collected news articles and annotated them for different kinds of barriers using the metadata of news publishers. Then, we utilize the Wikipedia concepts along with the body text of news articles as features to infer the news-spreading barriers. We compare our approach to the classical text classification methods, deep learning, and transformer-based methods. The results show that the proposed approach using Wikipedia concepts based semantic knowledge offers better performance than the usual for classifying the news-spreading barriers.

machine learning, natural language, news article, (22 more...)

arXiv.org Artificial Intelligence

2304.08167

Country:

North America > United States (1.00)
Europe (1.00)
Asia (1.00)
Africa (1.00)

Genre: Research Report > New Finding (1.00)

Industry:

Media > News (1.00)
Government > Regional Government > North America Government > United States Government (0.68)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.67)
Health & Medicine > Therapeutic Area > Immunology (0.67)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Profiling the news spreading barriers using news headlines

Sittar, Abdul, Mladenic, Dunja, Grobelnik, Marko

arXiv.org Artificial IntelligenceApr-7-2023

News headlines can be a good data source for detecting the news spreading barriers in news media, which may be useful in many real-world applications. In this paper, we utilize semantic knowledge through the inference-based model COMET and sentiments of news headlines for barrier classification. We consider five barriers including cultural, economic, political, linguistic, and geographical, and different types of news headlines including health, sports, science, recreation, games, homes, society, shopping, computers, and business. To that end, we collect and label the news headlines automatically for the barriers using the metadata of news publishers. Then, we utilize the extracted commonsense inferences and sentiments as features to detect the news spreading barriers. We compare our approach to the classical text classification methods, deep learning, and transformer-based methods. The results show that the proposed approach using inferences-based semantic knowledge and sentiment offers better performance than the usual (the average F1-score of the ten categories improves from 0.41, 0.39, 0.59, and 0.59 to 0.47, 0.55, 0.70, and 0.76 for the cultural, economic, political, and geographical respectively) for classifying the news-spreading barriers.

machine learning, natural language, sentiment, (21 more...)

arXiv.org Artificial Intelligence

2304.11088

Country:

Europe (1.00)
Asia > Middle East (0.67)
North America > United States (0.46)

Genre: Research Report > New Finding (1.00)

Industry:

Media > News (1.00)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Commonsense-Infused Language-Agnostic Learning Framework for Enhancing Prediction of Political Polarity in Multilingual News Headlines

Swati, Swati, Grobelnik, Adrian Mladenić, Mladenić, Dunja, Grobelnik, Marko

arXiv.org Artificial IntelligenceDec-1-2022

Predicting the political polarity of news headlines is a challenging task that becomes even more challenging in a multilingual setting with low-resource languages. To deal with this, we propose to utilise the Inferential Commonsense Knowledge via a Translate-Retrieve-Translate strategy to introduce a learning framework. To begin with, we use the method of translation and retrieval to acquire the inferential knowledge in the target language. We then employ an attention mechanism to emphasise important inferences. We finally integrate the attended inferences into a multilingual pre-trained language model for the task of bias prediction. To evaluate the effectiveness of our framework, we present a dataset of over 62.6K multilingual news headlines in five European languages annotated with their respective political polarities. We evaluate several state-of-the-art multilingual pre-trained language models since their performance tends to vary across languages (low/high resource). Evaluation results demonstrate that our proposed framework is effective regardless of the models employed. Overall, the best performing model trained with only headlines show 0.90 accuracy and F1, and 0.83 jaccard score. With attended knowledge in our framework, the same model show an increase in 2.2% accuracy and F1, and 3.6% jaccard score. Extending our experiments to individual languages reveals that the models we analyze for Slovenian perform significantly worse than other languages in our dataset. To investigate this, we assess the effect of translation quality on prediction performance. It indicates that the disparity in performance is most likely due to poor translation quality. We release our dataset and scripts at: https://github.com/Swati17293/KG-Multi-Bias for future research. Our framework has the potential to benefit journalists, social scientists, news producers, and consumers.

knowledge management, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2212.00298

Country:

Europe (0.46)
Asia (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.87)

Industry:

Media > News (1.00)
Health & Medicine (1.00)
Information Technology > Security & Privacy (0.92)

Technology:

Information Technology > Knowledge Management > Knowledge Engineering (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
(2 more...)

Add feedback

Reports of the 2016 AAAI Workshop Program

Albrecht, Stefano (The University of Texas at Austin) | Bouchard, Bruno (Université du Québec à Chicoutimi) | Brownstein, John S. (Harvard University) | Buckeridge, David L. (McGill University) | Caragea, Cornelia (University of North Texas) | Carter, Kevin M. (MIT Lincoln Laboratory) | Darwiche, Adnan (University of California, Los Angeles) | Fortuna, Blaz (Bloomberg L.P. and Jozef Stefan Institute) | Francillette, Yannick (Université du Québec à Chicoutimi) | Gaboury, Sébastien (Université du Québec à Chicoutimi) | Giles, C. Lee (Pennsylvania State University) | Grobelnik, Marko (Jozef Stefan Institute) | Hruschka, Estevam R. (Federal University of São Carlos) | Kephart, Jeffrey O. (IBM Thomas J. Watson Research Center) | Kordjamshidi, Parisa (University of Illinois at Urbana-Champaign) | Lisy, Viliam (University of Alberta) | Magazzeni, Daniele (King's College London) | Marques-Silva, Joao (University of Lisbon) | Marquis, Pierre (Université d'Artois) | Martinez, David (MIT Lincoln Laboratory) | Michalowski, Martin (Adventium Labs) | Shaban-Nejad, Arash (University of California, Berkeley) | Noorian, Zeinab (Ryerson University) | Pontelli, Enrico (New Mexico State University) | Rogers, Alex (University of Oxford) | Rosenthal, Stephanie (Carnegie Mellon University) | Roth, Dan (University of Illinois at Urbana-Champaign) | Sinha, Arunesh (University of Southern California) | Streilein, William (MIT Lincoln Laboratory) | Thiebaux, Sylvie (The Australian National University) | Tran, Son Cao (New Mexico State University) | Wallace, Byron C. (University of Texas at Austin) | Walsh, Toby (University of New South Wales and Data61) | Witbrock, Michael (Lucid AI) | Zhang, Jie (Nanyang Technological University)

AI MagazineOct-7-2016

The Workshop Program of the Association for the Advancement of Artificial Intelligence's Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) was held at the beginning of the conference, February 12-13, 2016. Workshop participants met and discussed issues with a selected focus -- providing an informal setting for active exchange among researchers, developers and users on topics of current interest. To foster interaction and exchange of ideas, the workshops were kept small, with 25-65 participants. Attendance was sometimes limited to active participants only, but most workshops also allowed general registration by other interested individuals.

artificial intelligence, management and information, workshop, (3 more...)

AI Magazine

Industry:

Information Technology (1.00)
Leisure & Entertainment > Games (0.38)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.38)

Add feedback

Reports of the 2016 AAAI Workshop Program

Albrecht, Stefano (The University of Texas at Austin) | Bouchard, Bruno (Université du Québec à Chicoutimi) | Brownstein, John S. (Harvard University) | Buckeridge, David L. (McGill University) | Caragea, Cornelia (University of North Texas) | Carter, Kevin M. (MIT Lincoln Laboratory) | Darwiche, Adnan (University of California, Los Angeles) | Fortuna, Blaz (Bloomberg L.P. and Jozef Stefan Institute) | Francillette, Yannick (Université du Québec à Chicoutimi) | Gaboury, Sébastien (Université du Québec à Chicoutimi) | Giles, C. Lee (Pennsylvania State University) | Grobelnik, Marko (Jozef Stefan Institute) | Hruschka, Estevam R. (Federal University of São Carlos) | Kephart, Jeffrey O. (IBM Thomas J. Watson Research Center) | Kordjamshidi, Parisa (University of Illinois at Urbana-Champaign) | Lisy, Viliam (University of Alberta) | Magazzeni, Daniele (King's College London) | Marques-Silva, Joao (University of Lisbon) | Marquis, Pierre (Université d'Artois) | Martinez, David (MIT Lincoln Laboratory) | Michalowski, Martin (Adventium Labs) | Shaban-Nejad, Arash (University of California, Berkeley) | Noorian, Zeinab (Ryerson University) | Pontelli, Enrico (New Mexico State University) | Rogers, Alex (University of Oxford) | Rosenthal, Stephanie (Carnegie Mellon University) | Roth, Dan (University of Illinois at Urbana-Champaign) | Sinha, Arunesh (University of Southern California) | Streilein, William (MIT Lincoln Laboratory) | Thiebaux, Sylvie (The Australian National University) | Tran, Son Cao (New Mexico State University) | Wallace, Byron C. (University of Texas at Austin) | Walsh, Toby (University of New South Wales and Data61) | Witbrock, Michael (Lucid AI) | Zhang, Jie (Nanyang Technological University)

AI MagazineOct-7-2016

The Workshop Program of the Association for the Advancement of Artificial Intelligence’s Thirtieth AAAI Conference on Artificial Intelligence (AAAI-16) was held at the beginning of the conference, February 12-13, 2016. Workshop participants met and discussed issues with a selected focus — providing an informal setting for active exchange among researchers, developers and users on topics of current interest. To foster interaction and exchange of ideas, the workshops were kept small, with 25-65 participants. Attendance was sometimes limited to active participants only, but most workshops also allowed general registration by other interested individuals. The AAAI-16 Workshops were an excellent forum for exploring emerging approaches and task areas, for bridging the gaps between AI and other fields or between subfields of AI, for elucidating the results of exploratory research, or for critiquing existing approaches. The fifteen workshops held at AAAI-16 were Artificial Intelligence Applied to Assistive Technologies and Smart Environments (WS-16-01), AI, Ethics, and Society (WS-16-02), Artificial Intelligence for Cyber Security (WS-16-03), Artificial Intelligence for Smart Grids and Smart Buildings (WS-16-04), Beyond NP (WS-16-05), Computer Poker and Imperfect Information Games (WS-16-06), Declarative Learning Based Programming (WS-16-07), Expanding the Boundaries of Health Informatics Using AI (WS-16-08), Incentives and Trust in Electronic Communities (WS-16-09), Knowledge Extraction from Text (WS-16-10), Multiagent Interaction without Prior Coordination (WS-16-11), Planning for Hybrid Systems (WS-16-12), Scholarly Big Data: AI Perspectives, Challenges, and Ideas (WS-16-13), Symbiotic Cognitive Systems (WS-16-14), and World Wide Web and Population Health Intelligence (WS-16-15).

neural network, optimization problem, workshop, (21 more...)

AI Magazine

Country:

North America > Canada (0.93)
North America > United States > California (0.48)
Asia > Middle East > Israel > Mediterranean Sea (0.24)
North America > United States > Texas > Travis County > Austin (0.14)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Leisure & Entertainment > Games > Poker (0.34)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
(7 more...)

Add feedback

DiversiNews: Surfacing Diversity in Online News

AI MagazineDec-31-2015

For most events of at least moderate significance, there are likely tens, often hundreds or thousands of online articles reporting on it, each from a slightly different perspective. If we want to understand an event in depth, from multiple perspectives, we need to aggregate multiple sources and understand the relations between them. However, current news aggregators do not offer this kind of functionality. As a step towards a solution, we propose DiversiNews, a real-time news aggregation and exploration platfom whose main feature is a novel set of controls that allow users to contrast reports of a selected event based on topical emphases, sentiment differences and/or publisher geolocation. News events are presented in the form of a ranked list of articles pertaining to the event and an automatically generated summary. Both the ranking and the summary are interactive and respond in real time to user’s change of controls. We validated the concept and the user interface through user tests with positive results.

diversity, information management, social media, (23 more...)

AI Magazine

Country: