AITopics | mikolov

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Breaking the Activation Function Bottleneck through Adaptive Parameterization

Sebastian Flennerhag, Hujun Yin, John Keane, Mark Elliot

Neural Information Processing Systems | Feb-14-2026, 14:19:32 GMT

Adaptive parameterization is a means of increasing this flexibility and thereby increasing the model's capacity to learn non-linear patterns. We focus on the feed-forward layer, $f(x) := \phi(Wx + b)$, for some activation function $\phi: \mathbb{R} \to \mathbb{R}$. Define the pre-activation layer as $a = A(x) := Wx + b$ and denote by $g(a) := \phi(a)/a$ the activation effect of $\phi$ given $a$, where division is element-wise.
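The definitions in this excerpt imply that the layer output can be rewritten as the activation effect times the pre-activation, φ(a) = g(a) ⊙ a. The following is a minimal sketch that checks this numerically; it assumes tanh as the activation and small hand-picked weights, and the function names are hypothetical. None of these choices come from the paper.

```python
import math


def feed_forward(W, b, x, phi=math.tanh):
    """Return (a, f): pre-activation a = Wx + b and output f = phi(a)."""
    a = [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) + b_i
         for row, b_i in zip(W, b)]
    return a, [phi(a_i) for a_i in a]


def activation_effect(a, phi=math.tanh):
    """Element-wise g(a) = phi(a) / a.

    Assumption: the ratio is undefined at a_i == 0, so we return 1.0 there,
    which is the limit of tanh(a)/a as a -> 0. Other activations would need
    their own limiting value.
    """
    return [phi(a_i) / a_i if a_i != 0.0 else 1.0 for a_i in a]


# Illustrative values only (not from the paper).
W = [[0.5, -1.0], [2.0, 0.25]]
b = [0.1, -0.3]
x = [1.0, 2.0]

a, f = feed_forward(W, b, x)
g = activation_effect(a)

# The layer output is recovered as g(a) * a, element-wise.
reconstructed = [g_i * a_i for g_i, a_i in zip(g, a)]
```

Seen this way, the non-linearity acts as an input-dependent rescaling of the linear map, which is the quantity the excerpt calls the activation effect.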

artificial intelligence, arxivpreprint, machine learning, (17 more...)

Country: North America > Canada > Quebec > Montreal (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)

Review for NeurIPS paper: OTLDA: A Geometry-aware Optimal Transport Approach for Topic Modeling

Neural Information Processing Systems | Feb-6-2025, 20:46:19 GMT

Additional Feedback: Some minor suggestions and typos:

- Line 21: missing an "and".
- Line 33: "while other developed" should read "while other authors developed".
- Line 50, and elsewhere in the paper: it is stated that LDA/PLSI use a squared Euclidean loss/distance. This is untrue: both models use likelihood-based inference with a multinomial model, and/or Bayesian inference. The older LSI model uses a squared loss, but even the PLSI paper argued that this is insufficient (the implicit Gaussian assumption from squared errors does not hold with small counts as in text data), which motivates the probabilistic modeling approach in PLSI and LDA.

The other papers by Mikolov from 2013 are more fundamental references which are better here, especially: Mikolov, T., Sutskever, I., Chen, K., Corrado, G. S., & Dean, J. (2013). Distributed representations of words and phrases and their compositionality.

geometry-aware optimal transport approach, neurips paper, topic modeling, (6 more...)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.59)

Collapse of Self-trained Language Models

Herel, David, Mikolov, Tomas

arXiv.org Artificial Intelligence | Apr-2-2024

In various fields of knowledge creation, including science, new ideas often build on pre-existing information. In this work, we explore this concept within the context of language models. Specifically, we explore the potential of self-training models on their own outputs, akin to how humans learn and build on their previous thoughts and actions. While this approach is intuitively appealing, our research reveals its practical limitations. We find that extended self-training of the GPT-2 model leads to a significant degradation in performance, resulting in repetitive and collapsed token output.

iclr 2024, iteration, language model, (13 more...)

arXiv.org Artificial Intelligence

2404.02305

Country:

Europe > Czechia > Prague (0.05)
North America > United States > California > Monterey County > Pacific Grove (0.04)
Europe > Czechia > South Moravian Region > Brno (0.04)

Genre: Research Report > New Finding (0.70)

Industry: Law (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.53)

Research Papers for NLP Beginners - KDnuggets

#artificialintelligence | Nov-18-2022, 15:57:48 GMT

If you're new to the world of data and have a particular interest in NLP (Natural Language Processing), you're probably looking for resources to help you gain a better understanding. You have probably come across many different research papers and are unsure which one to choose. Because let's face it, they're not short and they do consume a lot of brain power. So it would be smart to choose the right one that will benefit your path to mastering NLP. I have done some research and have collected a few NLP research papers that are highly recommended for newcomers to NLP and for building overall NLP knowledge.

artificial intelligence, machine learning, natural language, (16 more...)

Genre: Instructional Material (0.36)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.73)

Hyperbolic Centroid Calculations for Text Classification

Gerek, Aydın, Ferahlar, Cüneyt, Sert, Bilge Şipal, Yüney, Mehmet Can, Taşdemir, Onur, Kalafat, Zeynep Billur, Kelkit, Mert, Ganiz, Murat Can

arXiv.org Artificial Intelligence | Nov-8-2022

A new development in NLP is the construction of hyperbolic word embeddings. As opposed to their Euclidean counterparts, hyperbolic embeddings are represented not by vectors, but by points in hyperbolic space. This makes the most common basic scheme for constructing document representations, namely the averaging of word vectors, meaningless in the hyperbolic setting. We reinterpret the vector mean as the centroid of the points represented by the vectors, and investigate various hyperbolic centroid schemes and their effectiveness at text classification.
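To make the contrast concrete, the sketch below places the Euclidean vector mean next to one well-known hyperbolic centroid, the Einstein midpoint, computed for points in the Poincaré ball by passing through the Klein model. The abstract does not say which centroid schemes the paper evaluates, so the choice of the Einstein midpoint, the Poincaré-ball representation, and all function names are assumptions made for illustration.

```python
import math


def euclidean_mean(vectors):
    """Plain averaging of word vectors (the Euclidean baseline)."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def poincare_to_klein(p):
    """Map a Poincare-ball point to Klein coordinates: k = 2p / (1 + |p|^2)."""
    s = 1.0 + sum(c * c for c in p)
    return [2.0 * c / s for c in p]


def klein_to_poincare(k):
    """Inverse map: p = k / (1 + sqrt(1 - |k|^2))."""
    s = 1.0 + math.sqrt(1.0 - sum(c * c for c in k))
    return [c / s for c in k]


def einstein_midpoint(points):
    """Centroid of Poincare-ball points via the Einstein midpoint.

    Each point is weighted by its Lorentz factor in Klein coordinates.
    Assumption: every input lies strictly inside the unit ball.
    """
    klein = [poincare_to_klein(p) for p in points]
    gammas = [1.0 / math.sqrt(1.0 - sum(c * c for c in k)) for k in klein]
    total = sum(gammas)
    dim = len(klein[0])
    mid = [sum(g * k[i] for g, k in zip(gammas, klein)) / total
           for i in range(dim)]
    return klein_to_poincare(mid)


# Illustrative 2-D "word embeddings" (made up for this sketch).
words = [[0.9, 0.0], [0.0, 0.0]]
flat_centroid = euclidean_mean(words)
hyperbolic_centroid = einstein_midpoint(words)
```

For these two points the hyperbolic centroid lies farther from the origin than the Euclidean mean, because distances grow rapidly near the boundary of the ball.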

machine learning, natural language, text classification, (18 more...)

arXiv.org Artificial Intelligence

2211.04462

Country:

Europe > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.05)
Asia > Middle East > Republic of Türkiye > Istanbul Province > Istanbul (0.05)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(3 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Nearest Neighbor Methods (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.46)

Word2Vec: Optimal Hyper-Parameters and Their Impact on NLP Downstream Tasks

Adewumi, Tosin P., Liwicki, Foteini, Liwicki, Marcus

arXiv.org Machine Learning | Mar-23-2020

Word2Vec is a prominent tool for Natural Language Processing (NLP) tasks. Similar inspiration is found in distributed embeddings for state-of-the-art (sota) deep neural networks. However, a wrong combination of hyper-parameters can produce poor-quality vectors. The objective of this work is to show that an optimal combination of hyper-parameters exists and to evaluate various combinations. We compare them with the original model released by Mikolov. Both intrinsic and extrinsic (downstream) evaluations, including Named Entity Recognition (NER) and Sentiment Analysis (SA), were carried out. The downstream tasks reveal that the best model is task-specific, that high analogy scores don't necessarily correlate positively with F1 scores, and that the same applies to more data. Increasing vector dimension size after a point leads to poor quality or performance. If ethical considerations to save time, energy and the environment are made, then reasonably smaller corpora may do just as well or even better in some cases. Besides, using a small corpus, we obtain better human-assigned WordSim scores, corresponding Spearman correlation and better downstream (NER & SA) performance compared to Mikolov's model, trained on a 100-billion-word corpus.
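The intrinsic evaluation mentioned here, comparing model similarities against human-assigned WordSim scores via Spearman correlation, can be sketched in a few lines. This is an illustration under stated assumptions rather than the authors' evaluation code: the embeddings and human scores below are invented toy values, and the helper names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def average_ranks(values):
    """1-based ranks, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy embeddings and human similarity scores (invented for illustration).
emb = {"cat": [1.0, 0.0], "dog": [0.9, 0.1],
       "car": [0.0, 1.0], "truck": [0.2, 0.8]}
pairs = [("cat", "dog", 9.0), ("car", "truck", 8.5),
         ("cat", "car", 1.0), ("dog", "truck", 2.0)]

model_scores = [cosine(emb[w1], emb[w2]) for w1, w2, _ in pairs]
human_scores = [score for _, _, score in pairs]
rho = spearman(model_scores, human_scores)
```

Because Spearman correlation depends only on rank order, it rewards a model for ordering word pairs the way human annotators do, regardless of the absolute scale of its similarity scores.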

analogy score, dataset, dimension, (16 more...)

arXiv.org Machine Learning

2003.11645

Country:

Europe > Sweden (0.04)
Europe > Middle East > Malta > Port Region > Southern Harbour District > Valletta (0.04)

Genre: Research Report (0.84)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Towards non-toxic landscapes: Automatic toxic comment detection using DNN

D'Sa, Ashwin Geet, Illina, Irina, Fohr, Dominique

arXiv.org Machine Learning | Nov-19-2019

The spectacular expansion of the Internet led to the development of a new research problem in the natural language processing field: automatic toxic comment detection, since many countries prohibit hate speech in public media. There is no clear and formal definition of hateful, offensive, toxic and abusive speech. In this article, we put all these terms under the "umbrella" of toxic speech. The contribution of this paper is the design of binary classification and regression-based approaches aiming to predict whether a comment is toxic or not. We compare different unsupervised word representations and different DNN classifiers. Moreover, we study the robustness of the proposed approaches to adversarial attacks by adding one (healthy or toxic) word. We evaluate the proposed methodology on the English Wikipedia Detox corpus. Our experiments show that using BERT fine-tuning outperforms feature-based BERT, Mikolov's word embedding or fastText representations with different DNN classifiers.

corpus, mikolov, representation, (13 more...)

arXiv.org Machine Learning

1911.08395

Country: Europe > France > Grand Est > Meurthe-et-Moselle > Nancy (0.04)

Genre: Research Report (0.64)

Industry: Law > Civil Rights & Constitutional Law (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Word Embedding for Response-To-Text Assessment of Evidence

Zhang, Haoran, Litman, Diane

arXiv.org Artificial Intelligence | Aug-6-2019

Manually grading the Response to Text Assessment (RTA) is labor intensive. Therefore, an automatic method is being developed for scoring analytical writing when the RTA is administered in large numbers of classrooms. Our long-term goal is to also use this scoring method to provide formative feedback to students and teachers about students' writing quality. As a first step towards this goal, interpretable features for automatically scoring the evidence rubric of the RTA have been developed. In this paper, we present a simple but promising method for improving evidence scoring by employing the word embedding model. We evaluate our method on corpora of responses written by upper elementary students.

artificial intelligence, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

doi: 10.18653/v1/P17-3013

1908.01969

Country: North America > United States (0.29)

Genre: Research Report > Promising Solution (0.34)

Industry:

Education > Assessment & Standards (0.90)
Education > Educational Technology > Educational Software (0.70)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science (0.70)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)

You have no idea what artificial intelligence really does

#artificialintelligence | Oct-17-2018, 12:34:11 GMT

WHEN SOPHIA THE ROBOT first switched on, the world couldn't get enough. It had a cheery personality, it joked with late-night hosts, it had facial expressions that echoed our own. Here it was, finally -- a robot plucked straight out of science fiction, the closest thing to true artificial intelligence that we had ever seen. There's no doubt that Sophia is an impressive piece of engineering. It didn't take much to convince people of Sophia's apparent humanity -- many of Futurism's own articles refer to the robot as "her."

artificial intelligence, intelligence, machine learning, (17 more...)

Industry:

Media > Television (0.34)
Leisure & Entertainment (0.34)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Vision > Face Recognition (0.34)

You have no idea what artificial intelligence really does

#artificialintelligence | Oct-16-2018, 16:32:24 GMT

But as Sophia became more popular and people took a closer look, cracks emerged. It became harder to believe that Sophia was the all-encompassing artificial intelligence that we all wanted it to be. Over time, articles that might have once oohed and ahhed about Sophia's conversational skills became more focused on the fact that they were partially scripted in advance. Ben Goertzel, CEO of SingularityNET and Chief Scientist of Hanson Robotics, isn't under any illusions about what Sophia is capable of. "Sophia and the other Hanson robots are not really 'pure' as computer science research systems, because they combine so many different pieces and aspects in complex ways. They are not pure learning systems, but they do involve learning on various levels (learning in their neural net visual systems, learning in their OpenCog dialogue systems, etc.)," he told Futurism.

artificial intelligence, machine learning, mikolov, (17 more...)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.49)
