
Linguistically-Informed Neural Architectures for Lexical, Syntactic and Semantic Tasks in Sanskrit

Sandhan, Jivnesh

arXiv.org Artificial Intelligence

The primary focus of this thesis is to make Sanskrit manuscripts more accessible to end users through natural language technologies. The morphological richness, compounding, free word order, and low-resource nature of Sanskrit pose significant challenges for developing deep learning solutions. We identify four fundamental tasks that are crucial for developing robust NLP technology for Sanskrit: word segmentation, dependency parsing, compound type identification, and poetry analysis. The first task, Sanskrit Word Segmentation (SWS), is a fundamental text processing step for all downstream applications. However, it is challenging due to the sandhi phenomenon, which modifies characters at word boundaries. Similarly, existing dependency parsing approaches struggle with morphologically rich, low-resource languages like Sanskrit. Compound type identification is also challenging for Sanskrit due to the context-sensitive semantic relation between components. All these challenges result in sub-optimal performance in NLP applications such as question answering and machine translation. Finally, Sanskrit poetry has not been extensively studied in computational linguistics. In addressing these challenges, this thesis makes several contributions: (1) it proposes linguistically-informed neural architectures for these tasks; (2) we showcase the interpretability and multilingual extension of the proposed systems; (3) our proposed systems report state-of-the-art performance; (4) finally, we present SanskritShala, a neural toolkit delivered as a web-based application that provides real-time analysis of input for various NLP tasks. Overall, this thesis contributes to making Sanskrit manuscripts more accessible by developing robust NLP technology and releasing various resources, datasets, and a web-based toolkit.


$\Phi$-DVAE: Physics-Informed Dynamical Variational Autoencoders for Unstructured Data Assimilation

Glyn-Davies, Alex, Duffin, Connor, Akyildiz, Ö. Deniz, Girolami, Mark

arXiv.org Artificial Intelligence

Incorporating unstructured data into physical models is a challenging problem that is emerging in data assimilation. Traditional approaches focus on well-defined observation operators whose functional forms are typically assumed to be known. This prevents these methods from achieving a consistent model-data synthesis in configurations where the mapping from data-space to model-space is unknown. To address these shortcomings, in this paper we develop a physics-informed dynamical variational autoencoder ($\Phi$-DVAE) to embed diverse data streams into time-evolving physical systems described by differential equations. Our approach combines a standard, possibly nonlinear, filter for the latent state-space model with a VAE, in order to assimilate the unstructured data into the latent dynamical system. In our example systems, the unstructured data comes in the form of video data and velocity field measurements; however, the methodology is suitably generic to allow for arbitrary unknown observation operators. A variational Bayesian framework is used for the joint estimation of the encoding, latent states, and unknown system parameters. To demonstrate the method, we provide case studies with the Lorenz-63 ordinary differential equation, and the advection and Korteweg-de Vries partial differential equations. Our results, with synthetic data, show that $\Phi$-DVAE provides a data-efficient dynamics encoding methodology which is competitive with standard approaches. Unknown parameters are recovered with uncertainty quantification, and unseen data are accurately predicted.
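As a concrete anchor for the Lorenz-63 case study mentioned above, here is a minimal sketch of the latent dynamics that such a filter/VAE pair would assimilate against. The integrator (forward Euler) and the parameter values (the classic chaotic defaults) are illustrative assumptions, not necessarily the paper's exact setup.

```python
import numpy as np

def lorenz63_step(state, dt=0.01, sigma=10.0, rho=28.0, beta=8.0 / 3.0):
    """One forward-Euler step of the Lorenz-63 ODE: dx/dt = sigma*(y - x),
    dy/dt = x*(rho - z) - y, dz/dt = x*y - beta*z."""
    x, y, z = state
    deriv = np.array([sigma * (y - x), x * (rho - z) - y, x * y - beta * z])
    return state + dt * deriv

# Roll out a short latent trajectory; in the Phi-DVAE setting, unstructured
# observations (e.g. video frames) would be generated from states like these
# through an unknown observation operator learned by the VAE.
state = np.array([1.0, 1.0, 1.0])
traj = [state]
for _ in range(1000):
    state = lorenz63_step(state)
    traj.append(state)
traj = np.array(traj)
```

A production experiment would use a higher-order integrator (e.g. RK4), but the sketch conveys the shape of the latent system the encoder must be consistent with.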


Fast Newton method solving KLR based on Multilevel Circulant Matrix with log-linear complexity

Zhang, Junna, Zhou, Shuisheng, Fu, Cui, Ye, Feng

arXiv.org Artificial Intelligence

Kernel logistic regression (KLR) is a conventional nonlinear classifier in machine learning. With the explosive growth of data sizes, the storage and computation of large dense kernel matrices is a major challenge in scaling KLR. Even when the Nyström approximation is applied to solve KLR, it still faces a time complexity of $O(nc^2)$ and a space complexity of $O(nc)$, where $n$ is the number of training instances and $c$ is the sampling size. In this paper, we propose a fast Newton method that efficiently solves large-scale KLR problems by exploiting the storage and computing advantages of the multilevel circulant matrix (MCM). Specifically, by approximating the kernel matrix with an MCM, the storage space is reduced to $O(n)$, and by further approximating the coefficient matrix of the Newton equation as an MCM, the computational complexity of each Newton iteration is reduced to $O(n \log n)$. The proposed method runs in log-linear time per iteration because the multiplication of an MCM (or its inverse) with a vector can be implemented via the multidimensional fast Fourier transform (mFFT). Experimental results on several large-scale binary and multi-class classification problems show that the proposed method enables KLR to scale to large problems with less memory consumption and less training time, without sacrificing test accuracy.
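The log-linear cost above rests on the classical identity that a circulant matrix-vector product is a circular convolution, computable with the FFT. Below is a minimal one-level sketch (the multilevel case applies the mFFT analogously); the function name is illustrative, not from the paper.

```python
import numpy as np

def circulant_matvec(c, x):
    """Multiply the circulant matrix whose first column is c by the vector x
    in O(n log n) via the FFT, instead of O(n^2) for the dense product."""
    return np.real(np.fft.ifft(np.fft.fft(c) * np.fft.fft(x)))

# Build the explicit dense circulant matrix C[i, j] = c[(i - j) % n]
# to compare against the FFT-based product.
rng = np.random.default_rng(0)
n = 8
c = rng.standard_normal(n)
x = rng.standard_normal(n)
C = np.array([[c[(i - j) % n] for j in range(n)] for i in range(n)])
```

Solving the Newton equation with an MCM coefficient matrix amounts to applying the same identity with a division in Fourier space, which is why each iteration stays at $O(n \log n)$.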


Three ways to maximize the business value of AI

#artificialintelligence

Business interest in artificial intelligence (AI) has rocketed in recent years -- spending could reach $15.7 trillion by 2030, according to PwC. But there remain lingering concerns that businesses are failing to realize the full value from their investments. Ever since their emergence, AI, machine learning (ML), and data science have all been surrounded by hype. We've been promised technology that will solve our most complex challenges for us and automatically optimize everything from internal processes to customer experiences. Advances are being made every day that promise to transform virtually every aspect of our lives.


Fast Fair Regression via Efficient Approximations of Mutual Information

Steinberg, Daniel, Reid, Alistair, O'Callaghan, Simon, Lattimore, Finnian, McCalman, Lachlan, Caetano, Tiberio

arXiv.org Machine Learning

Most work in algorithmic fairness to date has focused on discrete outcomes, such as deciding whether or not to grant someone a loan. In these classification settings, group fairness criteria such as independence, separation and sufficiency can be measured directly by comparing rates of outcomes between subpopulations. Many important problems, however, require the prediction of a real-valued outcome, such as a risk score or insurance premium. In such regression settings, measuring group fairness criteria is computationally challenging, as it requires estimating information-theoretic divergences between conditional probability density functions. This paper introduces fast approximations of the independence, separation and sufficiency group fairness criteria for regression models from their (conditional) mutual information definitions, and uses these approximations as regularisers to enforce fairness within a regularised risk minimisation framework. Experiments on real-world datasets indicate that, in spite of its superior computational efficiency, our algorithm still displays state-of-the-art accuracy/fairness tradeoffs.
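To make the regularised-risk-minimisation framing concrete, here is a deliberately crude sketch: ridge regression with a first-moment "mean gap" penalty standing in for the paper's mutual-information approximations of the independence criterion. The penalty, all names, and the synthetic data are illustrative assumptions, not the paper's method.

```python
import numpy as np

def fair_ridge(X, y, group, lam=0.1, gamma=1.0, lr=0.01, steps=2000):
    """Gradient descent on ridge regression plus a crude independence
    penalty: the squared gap between the mean predictions of two groups.
    This moment-matching proxy merely illustrates the regularised-risk
    framing; the paper uses mutual-information approximations instead."""
    n, d = X.shape
    w = np.zeros(d)
    g0, g1 = group == 0, group == 1
    for _ in range(steps):
        pred = X @ w
        grad = X.T @ (pred - y) / n + lam * w
        gap = pred[g0].mean() - pred[g1].mean()
        grad += gamma * 2.0 * gap * (X[g0].mean(axis=0) - X[g1].mean(axis=0))
        w -= lr * grad
    return w

def group_gap(w, X, group):
    """Absolute difference of mean predictions between the two groups."""
    pred = X @ w
    return abs(pred[group == 0].mean() - pred[group == 1].mean())

# Synthetic data in which the features (and hence the target) shift with group.
rng = np.random.default_rng(0)
n = 400
group = rng.integers(0, 2, size=n)
X = rng.standard_normal((n, 3)) + group[:, None]
y = X @ np.array([1.0, -0.5, 0.2])
w_base = fair_ridge(X, y, group, gamma=0.0)
w_fair = fair_ridge(X, y, group, gamma=5.0)
```

Turning up `gamma` trades predictive fit for a smaller disparity between groups, which is the accuracy/fairness tradeoff the experiments quantify.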


Finland's airports will soon be run by AI - TechHQ

#artificialintelligence

Delays, overcrowded departure gates, disgruntled passengers -- airports are rarely stress-free or smooth sailing. But airports around the globe with savvy management are increasingly experimenting with data analytics and AI to make the process of flying more attractive for the customers who pass through their terminals. In Finland, Finavia -- the company behind all 21 of the country's airports -- teamed up with advisory firm Fourkind and agency Reaktor to take a look at its airport of Kittilä in Lapland. Finavia found that the airport wasn't able to keep up with the seasonal demands of the country's booming tourism industry. Tourists flocking to see the Northern Lights and catch a glimpse of Santa Claus were causing lengthy delays.


Linear-Time Inference for Pairwise Comparisons with Gaussian-Process Dynamics

Maystre, Lucas, Kristof, Victor, Grossglauser, Matthias

arXiv.org Machine Learning

In many competitive sports and games (such as tennis, basketball, chess and electronic sports), the most useful definition of a competitor's skill is the propensity of that competitor to win against an opponent. It is often difficult to measure this skill explicitly: take basketball for example, where a team's skill depends on the abilities of its players in terms of shooting accuracy, physical fitness and mental preparation, but also on the team's cohesion and coordination, on its strategy, on the enthusiasm of its fans, and on a number of other intangible factors. However, it is easy to observe this skill implicitly through the outcomes of matches. In this setting, probabilistic models of pairwise-comparison outcomes provide an elegant and effective approach to quantifying skill and to predicting future match outcomes given past data. These models, pioneered by Zermelo [1928] in the context of chess (and by Thurstone [1927] in the context of psychophysics), have been studied for almost a century. They posit that each competitor i (i.e., a team or player) is characterized by a latent score s_i ∈ ℝ and that the outcome probabilities of a match between i and j are a function of the difference between their scores.
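One canonical instance of such a model is Bradley-Terry, where the probability that i beats j is a logistic function of the score difference (Thurstone's variant uses a Gaussian CDF instead). A minimal, illustrative sketch:

```python
import math

def win_probability(s_i, s_j):
    """Bradley-Terry win probability: the chance that competitor i beats
    competitor j, as a logistic function of the latent score difference."""
    return 1.0 / (1.0 + math.exp(-(s_i - s_j)))
```

Skill estimation then amounts to inferring the latent scores from observed match outcomes; the paper additionally lets each score evolve over time under a Gaussian-process prior and performs inference in linear time.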


Kalman Temporal Differences

Geist, M., Pietquin, O.

Journal of Artificial Intelligence Research

Because reinforcement learning suffers from a lack of scalability, online value (and Q-) function approximation has received increasing interest over the last decade. This contribution introduces a novel approximation scheme, namely the Kalman Temporal Differences (KTD) framework, that exhibits the following features: sample efficiency, non-linear approximation, non-stationarity handling and uncertainty management. A first KTD-based algorithm is provided for deterministic Markov Decision Processes (MDPs); it produces biased estimates in the case of stochastic transitions. Then the eXtended KTD framework (XKTD), which solves stochastic MDPs, is described. Convergence is analyzed in special cases for both deterministic and stochastic transitions. The related algorithms are evaluated on classical benchmarks and compare favorably to the state of the art while exhibiting the announced features.
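To give the flavour of the framework: with a linear value function and deterministic transitions, the KTD idea amounts to running a Kalman filter over the value-function weights, treating each reward as a noisy linear observation of them. The sketch below shows only that reduced case (the full framework handles non-linear approximation via sigma-point techniques); variable names and noise settings are illustrative.

```python
import numpy as np

def ktd_update(w, P, phi_s, phi_next, r, gamma=0.95, obs_noise=1.0):
    """One Kalman-style update for a linear value function V(s) = w @ phi(s).
    For a deterministic transition s -> s' with reward r, the Bellman
    equation gives r ~= (phi(s) - gamma * phi(s')) @ w, so the reward acts
    as a noisy linear observation of the weight vector."""
    h = phi_s - gamma * phi_next
    innovation = r - h @ w
    s = h @ P @ h + obs_noise        # innovation variance
    k = P @ h / s                    # Kalman gain
    w = w + k * innovation
    P = P - np.outer(k, h @ P)       # posterior weight covariance
    return w, P

# Repeatedly observing the same deterministic transition shrinks both the
# Bellman residual and the weight uncertainty P.
w, P = np.zeros(2), np.eye(2)
phi_s, phi_next = np.array([1.0, 0.0]), np.array([0.0, 1.0])
for _ in range(50):
    w, P = ktd_update(w, P, phi_s, phi_next, r=1.0)
residual = abs(1.0 - (phi_s - 0.95 * phi_next) @ w)
```

The covariance P is what provides the uncertainty management highlighted in the abstract: it can drive exploration or flag states whose values are still poorly known.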