novelty of the proposed method, which addresses how to embed continuous time into a differentiable functional domain

Neural Information Processing Systems

We'd like to thank the reviewers for their careful reading and valuable comments. We apologize for the typos, grammatical mistakes, and unclear notation; they will be corrected in the final version. We also provide additional experimental results in Table 1 (values converted to percentages by multiplying by 100).


BioMNER: A Dataset for Biomedical Method Entity Recognition

Tang, Chen, Yang, Bohao, Zhao, Kun, Lv, Bo, Xiao, Chenghao, Guerin, Frank, Lin, Chenghua

arXiv.org Artificial Intelligence

Named entity recognition (NER) stands as a fundamental and pivotal task within the realm of Natural Language Processing. Particularly within the domain of Biomedical Method NER, this task presents notable challenges, stemming from the continual influx of domain-specific terminologies in scholarly literature. Current research in Biomedical Method (BioMethod) NER suffers from a scarcity of resources, primarily attributed to the intricate nature of methodological concepts, which necessitate a profound understanding for precise delineation. In this study, we propose a novel dataset for biomedical method entity recognition, employing an automated BioMethod entity recognition and information retrieval system to assist human annotation. Furthermore, we comprehensively explore a range of conventional and contemporary open-domain NER methodologies, including the utilization of cutting-edge large-scale language models (LLMs) customised for our dataset. Our empirical findings reveal that the large parameter counts of language models surprisingly inhibit the effective assimilation of entity extraction patterns pertaining to biomedical methods. Remarkably, the approach leveraging the modestly sized ALBERT model (only 11MB), in conjunction with conditional random fields (CRF), achieves state-of-the-art (SOTA) performance.


Evaluating Large Language Models in Analysing Classroom Dialogue

Long, Yun, Luo, Haifeng, Zhang, Yu

arXiv.org Artificial Intelligence

This study explores the application of Large Language Models (LLMs), specifically GPT-4, in the analysis of classroom dialogue, a crucial research task for both teaching diagnosis and quality improvement. Recognizing the knowledge-intensive and labor-intensive nature of traditional qualitative methods in educational research, this study investigates the potential of LLMs to streamline and enhance the analysis process. The study involves datasets from a middle school, encompassing classroom dialogues across mathematics and Chinese classes. These dialogues were manually coded by educational experts and then analyzed using a customised GPT-4 model. This study focuses on comparing manual annotations with the outputs of GPT-4 to evaluate its efficacy in analyzing educational dialogues. Time efficiency, inter-coder agreement, and inter-coder reliability between human coders and GPT-4 are evaluated. Results indicate substantial time savings with GPT-4 and a high degree of consistency in coding between the model and human coders, with some discrepancies in specific codes. These findings highlight the strong potential of LLMs in teaching evaluation and facilitation.


On the speed of uniform convergence in Mercer's theorem

Takhanov, Rustem

arXiv.org Artificial Intelligence

Mercer kernels play an important role in machine learning and are a mathematical basis of such techniques as kernel density estimation and spline models [14], Support Vector Machines [11], kernel principal components analysis [10], regularization of neural networks [13], and many others. According to Aronszajn's theorem, any Mercer kernel induces a reproducing kernel Hilbert space (RKHS) and vice versa, any RKHS corresponds to a kernel. The relationship between these two notions is described in the classical Mercer's theorem. The goal of this note is to refine this theorem and give some estimates on the speed of uniform convergence stated in it.
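Mercer's theorem expands a continuous positive-definite kernel as k(x, y) = Σᵢ λᵢ φᵢ(x) φᵢ(y), and the note above concerns how fast the truncated series converges uniformly to k. As an illustrative numerical sketch (not the paper's method), one can approximate the Mercer eigenvalues of a Gaussian kernel on [0, 1] via the eigendecomposition of its Gram matrix on a grid and watch the uniform error of the truncated expansion shrink; the kernel choice, grid, and bandwidth here are assumptions for demonstration.

```python
import numpy as np

def rbf_kernel(x, y, gamma=1.0):
    """Gaussian (RBF) kernel on 1-D inputs, returned as a matrix."""
    return np.exp(-gamma * (x[:, None] - y[None, :]) ** 2)

# Grid points approximating the uniform measure on [0, 1].
n = 200
x = np.linspace(0.0, 1.0, n)
K = rbf_kernel(x, x)

# Eigendecomposition of the symmetric PSD Gram matrix.  Scaling by 1/n
# makes the eigenvalues approximate the Mercer eigenvalues with respect
# to the uniform measure on [0, 1] (a Nystrom-style discretization).
eigvals, eigvecs = np.linalg.eigh(K / n)
eigvals = eigvals[::-1]          # sort eigenvalues in descending order
eigvecs = eigvecs[:, ::-1]

# Truncated Mercer expansion: keep only the top-m terms.
m = 10
K_m = (eigvecs[:, :m] * eigvals[:m]) @ eigvecs[:, :m].T * n

# The maximum entrywise error is a discrete proxy for the uniform
# (sup-norm) error whose decay rate the note studies.
err = np.max(np.abs(K - K_m))
print(f"uniform error with {m} terms: {err:.2e}")
```

For smooth kernels like the Gaussian, the eigenvalues decay very quickly, so even a small number of terms gives a tiny uniform error; rougher kernels (e.g. the Matérn family mentioned in the next abstract) have polynomially decaying eigenvalues and correspondingly slower uniform convergence.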


Uniform Generalization Bounds for Overparameterized Neural Networks

Vakili, Sattar, Bromberg, Michael, Shiu, Da-shan, Bernacchia, Alberto

arXiv.org Machine Learning

An interesting observation in artificial neural networks is their favorable generalization error despite typically being extremely overparameterized. It is well known that classical statistical learning methods often result in vacuous generalization errors in the case of overparameterized neural networks. Adopting the recently developed Neural Tangent (NT) kernel theory, we prove uniform generalization bounds for overparameterized neural networks in kernel regimes, when the true data generating model belongs to the reproducing kernel Hilbert space (RKHS) corresponding to the NT kernel. Importantly, our bounds capture the exact error rates depending on the differentiability of the activation functions. In order to establish these bounds, we propose the information gain of the NT kernel as a measure of complexity of the learning problem. Our analysis uses a Mercer decomposition of the NT kernel in the basis of spherical harmonics and the decay rate of the corresponding eigenvalues. As a byproduct of our results, we show the equivalence between the RKHS corresponding to the NT kernel and its counterpart corresponding to the Matérn family of kernels, which induces a very general class of models. We further discuss the implications of our analysis for some recent results on the regret bounds for reinforcement learning algorithms, which use overparameterized neural networks.


AI Continues DevOps Expansion

#artificialintelligence

AI gives us the potential to look through the clutter and pick out the pieces of data that really matter. It's no wonder, then, that AI is increasingly being used to target complex IT tasks, including DevOps. For instance, the Swedish company CodeScene is finding success in using machine learning to analyze source code. The company's offering, which is partly based on co-founder Adam Tornhill's book "Your Code As A Crime Scene," analyzes version control metadata over time to identify "hot spots" in the code that companies should be paying more attention to. CodeScene, which was founded in 2015, is owned by Empear AB and raised 30 million Swedish Kronor (about $3.6 million) earlier this year.


Can We Trust the Presidential-Election Polls?

The New Yorker

On October 18, 2016, the New York Times gave Hillary Clinton a ninety-one-per-cent chance of beating Donald Trump. Five days later, ABC News released a tracking poll showing her ahead of Trump by twelve points. Buoyed by the polls, Democrats--especially Democratic women--approached November 8th with a joyful sense of inevitability. The collective disbelief when Clinton lost was tinged with confusion: How could the pollsters have been so wrong? Now, with Joe Biden leading Trump by double digits in the lead-up to Election Day, according to the latest NPR/PBS NewsHour/Marist survey, the question has to be asked: Are voters hoping for a Biden victory about to fall into the same trap?