AITopics | wikinew

Collaborating Authors

wikinew

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Arabic Diacritics in the Wild: Exploiting Opportunities for Improved Diacritization

Elgamal, Salman, Obeid, Ossama, Kabbani, Tameem, Inoue, Go, Habash, Nizar

arXiv.org Artificial IntelligenceJun-9-2024

The widespread absence of diacritical marks in Arabic text poses a significant challenge for Arabic natural language processing (NLP). This paper explores instances of naturally occurring diacritics, referred to as "diacritics in the wild," to unveil patterns and latent information across six diverse genres: news articles, novels, children's books, poetry, political documents, and ChatGPT outputs. We present a new annotated dataset that maps real-world partially diacritized words to their maximal full diacritization in context. Additionally, we propose extensions to the analyze-and-disambiguate approach in Arabic NLP to leverage these diacritics, resulting in notable improvements. Our contributions encompass a thorough analysis, valuable datasets, and an extended diacritization algorithm. We release our code and datasets as open source.

dataset, diacritization, proceedings, (13 more...)

arXiv.org Artificial Intelligence

2406.0576

Country:

Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Slovenia (0.04)
(22 more...)

Genre:

Research Report (0.50)
Overview (0.48)

Industry: Media (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Thorough Examination of Decoding Methods in the Era of LLMs

Shi, Chufan, Yang, Haoran, Cai, Deng, Zhang, Zhisong, Wang, Yifan, Yang, Yujiu, Lam, Wai

arXiv.org Artificial IntelligenceFeb-10-2024

Decoding methods play an indispensable role in converting language models from next-token predictors into practical task solvers. Prior research on decoding methods, primarily focusing on task-specific models, may not extend to the current era of general-purpose large language models (LLMs). Moreover, the recent influx of decoding strategies has further complicated this landscape. This paper provides a comprehensive and multifaceted analysis of various decoding methods within the context of LLMs, evaluating their performance, robustness to hyperparameter changes, and decoding speeds across a wide range of tasks, models, and deployment environments. Our findings reveal that decoding method performance is notably task-dependent and influenced by factors such as alignment, model size, and quantization. Intriguingly, sensitivity analysis exposes that certain methods achieve superior performance at the cost of extensive hyperparameter tuning, highlighting the trade-off between attaining optimal results and the practicality of implementation in varying contexts.

gsm8k, hyperparameter, unaligned model, (15 more...)

arXiv.org Artificial Intelligence

2402.06925

Country:

North America > United States > Pennsylvania (0.04)
Europe > United Kingdom > Northern Ireland > County Down > Belfast (0.04)
Europe > United Kingdom > Northern Ireland > County Antrim > Belfast (0.04)
(12 more...)

Genre: Research Report > New Finding (0.48)

Industry: Leisure & Entertainment > Sports (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Realised Volatility Forecasting: Machine Learning via Financial Word Embedding

Rahimikia, Eghbal, Zohren, Stefan, Poon, Ser-Huang

arXiv.org Artificial IntelligenceMar-1-2023

This study develops FinText, a financial word embedding compiled from 15 years of business news archives. The results show that FinText produces substantially more accurate results than general word embeddings based on the gold-standard financial benchmark we introduced. In contrast to well-known econometric models, and over the sample period from 27 July 2007 to 27 January 2022 for 23 NASDAQ stocks, using stock-related news, our simple natural language processing model supported by different word embeddings improves realised volatility forecasts on high volatility days. This improvement in realised volatility forecasting performance switches to normal volatility days when general hot news is used. By utilising SHAP, an Explainable AI method, we also identify and classify key phrases in stock-related and general hot news that moved volatility.

fintext, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.2139/ssrn.3895272

2108.0048

Country:

Asia > North Korea (0.28)
Asia > China (0.04)
Asia > Japan (0.04)
(9 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Media > News (1.00)
Information Technology (1.00)
Health & Medicine (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.92)

Add feedback

The presence of occupational structure in online texts based on word embedding NLP models

Kmetty, Zoltán, Koltai, Julia, Rudas, Tamás

arXiv.org Artificial IntelligenceMay-17-2021

Research on social stratification is closely linked to analysing the prestige associated with different occupations. This research focuses on the positions of occupations in the semantic space represented by large amounts of textual data. The results are compared to standard results in social stratification to see whether the classical results are reproduced and if additional insights can be gained into the social positions of occupations. The paper gives an affirmative answer to both questions. The results show fundamental similarity of the occupational structure obtained from text analysis to the structure described by prestige and social distance scales. While our research reinforces many theories and empirical findings of the traditional body of literature on social stratification and, in particular, occupational hierarchy, it pointed to the importance of a factor not discussed in the main line of stratification literature so far: the power and organizational aspect.

artificial intelligence, natural language, occupation, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.1140/epjds/s13688-021-00311-9

2005.08612

Country:

Europe > Hungary > Budapest > Budapest (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Illinois > Cook County > Evanston (0.04)
(5 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine > Therapeutic Area (0.94)
Government (0.93)
Media (0.68)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Add feedback

Open Challenge for Correcting Errors of Speech Recognition Systems

Kubis, Marek, Vetulani, Zygmunt, Wypych, Mikołaj, Ziętkiewicz, Tomasz

arXiv.org Artificial IntelligenceJan-9-2020

The paper announces the new long-term challenge for improving the performance of automatic speech recognition systems. The goal of the challenge is to investigate methods of correcting the recognition results on the basis of previously made errors by the speech processing system. The dataset prepared for the task is described and evaluation criteria are presented.

artificial intelligence, speech recognition, wikinew, (17 more...)

arXiv.org Artificial Intelligence

2001.03041

Country: Europe > Bulgaria (0.14)

Genre: Research Report (0.40)

Industry:

Materials > Chemicals > Industrial Gases > Liquified Gas (0.95)
Materials > Chemicals > Commodity Chemicals > Petrochemicals > LNG (0.95)
Energy > Oil & Gas > Midstream (0.95)

Technology: Information Technology > Artificial Intelligence > Speech > Speech Recognition (1.00)

Add feedback