AITopics | english context

Do "English" Named Entity Recognizers Work Well on Global Englishes?

Shan, Alexander, Bauer, John, Carlson, Riley, Manning, Christopher

arXiv.org Artificial Intelligence, Apr-20-2024

The vast majority of the popular English named entity recognition (NER) datasets contain American or British English data, despite the existence of many global varieties of English. As such, it is unclear whether they generalize for analyzing use of English globally. To test this, we build a newswire dataset, the Worldwide English NER Dataset, to analyze NER model performance on low-resource English variants from around the world. We test widely used NER toolkits and transformer models, including models using the pre-trained contextual models RoBERTa and ELECTRA, on three datasets: a commonly used British English newswire dataset, CoNLL 2003; a more American-focused dataset, OntoNotes; and our global dataset. All models trained on the CoNLL or OntoNotes datasets experienced significant performance drops (over 10 F1 in some cases) when tested on the Worldwide English dataset. Upon examination of region-specific errors, we observe the greatest performance drops for Oceania and Africa, while Asia and the Middle East had comparatively strong performance. Lastly, we find that a combined model trained on the Worldwide dataset and either CoNLL or OntoNotes lost only 1 to 2 F1 on both test sets.
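To make the "F1" figures in this abstract concrete, here is a minimal sketch of entity-level F1, assuming the conventional exact-match scoring used for CoNLL-style NER evaluation (a predicted entity counts only if both its span and its type match the gold annotation). The abstract does not describe the authors' scoring script, so the function names `bio_to_spans` and `span_f1` and the toy tag sequences are illustrative assumptions, not the paper's code.

```python
from typing import List, Set, Tuple

# (start token index, end token index exclusive, entity type)
Span = Tuple[int, int, str]


def bio_to_spans(tags: List[str]) -> Set[Span]:
    """Convert one sentence of BIO tags into a set of typed entity spans."""
    spans: Set[Span] = set()
    start, label = None, None
    for i, tag in enumerate(tags + ["O"]):  # sentinel "O" closes a trailing entity
        continues = tag.startswith("I-") and label == tag[2:]
        if label is not None and not continues:
            spans.add((start, i, label))
            start, label = None, None
        # A "B-" tag always opens an entity; a stray "I-" tag opens one too.
        if tag.startswith("B-") or (tag.startswith("I-") and label is None):
            start, label = i, tag[2:]
    return spans


def span_f1(gold: List[List[str]], pred: List[List[str]]) -> float:
    """Exact-match entity-level F1 on a 0-100 scale."""
    tp = n_gold = n_pred = 0
    for g_tags, p_tags in zip(gold, pred):
        g, p = bio_to_spans(g_tags), bio_to_spans(p_tags)
        tp += len(g & p)      # entities with identical span and type
        n_gold += len(g)
        n_pred += len(p)
    if tp == 0:
        return 0.0
    precision, recall = tp / n_pred, tp / n_gold
    return 100 * 2 * precision * recall / (precision + recall)


# Toy example (invented data): the model finds the person but misses the location.
gold = [["B-PER", "I-PER", "O", "B-LOC"]]
pred = [["B-PER", "I-PER", "O", "O"]]
score = span_f1(gold, pred)  # precision 1.0, recall 0.5
```

On this scale, a "drop of over 10 F1" means the score on the Worldwide English test set is more than 10 points lower than the same model's score on its in-domain test set.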

dataset, ontonote, worldwide dataset, (14 more...)

doi: 10.18653/v1/2023.findings-emnlp.788

arXiv: 2404.13465

Country:

Europe > Middle East (0.24)
Africa > Middle East (0.24)
Oceania (0.24)
(14 more...)

Genre: Research Report (0.82)

Industry:

Media (0.68)
Government (0.68)
Leisure & Entertainment (0.46)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)

Māori loanwords project becomes easier with machine learning

#artificialintelligence, Feb-17-2019, 00:46:41 GMT

Researchers from the University of Waikato, in New Zealand, used a machine learning model to narrow down a massive 8 million tweets to a more manageable 1.2 million in order to look at how te reo Māori is being used on Twitter. According to a recent press release, the team focused on 77 Māori loanwords, or te reo Māori words used in an English context, and used them as training data for their machine learning model. Machine learning allows data scientists to provide a computer with a large data set, and teach it to make predictions based on that data. The initial 8 million tweets contained a fair bit of distracting data, or 'noise'. The irrelevant tweets are those in which the loanwords are not used in a New Zealand English context, or that are otherwise unrelated.
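The filtering step described above, separating relevant tweets from 'noise', can be sketched as a binary text classifier. The article does not say which model the Waikato team used, so the multinomial Naive Bayes below is a stand-in chosen for brevity, and the class name `NaiveBayesFilter`, the tokenizer, and every example tweet are invented for illustration.

```python
import math
from collections import Counter
from typing import Dict, List, Tuple


def tokenize(text: str) -> List[str]:
    """Lowercase and split on whitespace (a deliberately crude tokenizer)."""
    return text.lower().split()


class NaiveBayesFilter:
    """Multinomial Naive Bayes with add-one smoothing over word counts."""

    def fit(self, examples: List[Tuple[str, bool]]) -> "NaiveBayesFilter":
        # Assumes at least one example of each class is provided.
        self.word_counts: Dict[bool, Counter] = {True: Counter(), False: Counter()}
        self.doc_counts: Dict[bool, int] = {True: 0, False: 0}
        for text, relevant in examples:
            self.doc_counts[relevant] += 1
            self.word_counts[relevant].update(tokenize(text))
        self.vocab = set(self.word_counts[True]) | set(self.word_counts[False])
        return self

    def _log_score(self, tokens: List[str], label: bool) -> float:
        total_docs = sum(self.doc_counts.values())
        score = math.log(self.doc_counts[label] / total_docs)  # class prior
        denom = sum(self.word_counts[label].values()) + len(self.vocab)
        for tok in tokens:
            if tok in self.vocab:  # words never seen in training are ignored
                score += math.log((self.word_counts[label][tok] + 1) / denom)
        return score

    def is_relevant(self, text: str) -> bool:
        tokens = tokenize(text)
        return self._log_score(tokens, True) > self._log_score(tokens, False)


# Hand-labelled toy training set (invented, not from the study).
train = [
    ("heading to the marae for the hui tonight", True),
    ("kia ora whanau see you at the hui", True),
    ("our whanau had a great day at the beach", True),
    ("kiwi fruit smoothie recipe with banana", False),
    ("buy kiwi fruit online cheap shipping", False),
    ("cheap shipping on banana chips", False),
]
clf = NaiveBayesFilter().fit(train)

tweets = ["the whanau hui is tomorrow", "cheap kiwi fruit shipping"]
kept = [t for t in tweets if clf.is_relevant(t)]  # keeps only the first tweet
```

A real pipeline would need far more labelled tweets and a held-out set to check how many relevant tweets the filter wrongly discards.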

english context, māori loanword, tweet, (2 more...)

Country: Oceania > New Zealand > North Island > Waikato (0.26)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

When machine learning, Twitter and te reo Maori merge - UoW

#artificialintelligence, Feb-11-2019, 02:00:00 GMT

Researchers have whittled down a massive 8 million tweets to a more manageable 1.2 million to look at how te reo Māori is being used in the genre. The team from the University of Waikato have focused on 77 Māori loanwords (te reo Māori words used in an English context) and used them as training data for their machine-learning model. Machine learning allows data scientists to provide a computer with a large data set, and teach it to make predictions based on that data. Computing and Mathematical Sciences student David Trye spent the summer working on the project, with supervisors Dr Andreea Calude and Dr Felipe Bravo Márquez. The initial 8 million tweets contained a fair bit of distracting data, or 'noise'.

artificial intelligence, machine learning, reo maori merge, (8 more...)

Country: Oceania > New Zealand > North Island > Waikato (0.26)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)
