AITopics | spatial dataset

Tree-structured Gaussian Process Approximations

Neural Information Processing SystemsSep-30-2025, 09:41:09 GMT

Gaussian process regression can be accelerated by constructing a small pseudo-dataset to summarise the observed data. This idea sits at the heart of many approximation schemes, but such an approach requires the number of pseudo-datapoints to be scaled with the range of the input space if the accuracy of the approximation is to be maintained. This presents problems in time-series settings or in spatial datasets where large numbers of pseudo-datapoints are required since computation typically scales quadratically with the pseudo-dataset size. In this paper we devise an approximation whose complexity grows linearly with the number of pseudo-datapoints. This is achieved by imposing a tree or chain structure on the pseudo-datapoints and calibrating the approximation using a Kullback-Leibler (KL) minimisation. Inference and learning can then be performed efficiently using the Gaussian belief propagation algorithm. We demonstrate the validity of our approach on a set of challenging regression tasks including missing data imputation for audio and spatial datasets. We trace out the speed-accuracy trade-off for the new method and show that the frontier dominates those obtained from a large number of existing approximation techniques.

electronic proceedings, name change, tree-structured gaussian process approximation, (2 more...)

Neural Information Processing Systems

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.83)
Information Technology > Data Science > Data Quality (0.60)

Add feedback

Decentralized Inference for Spatial Data Using Low-Rank Models

Shi, Jianwei, Abdulah, Sameh, Sun, Ying, Genton, Marc G.

arXiv.org Machine LearningFeb-10-2025

Advancements in information technology have enabled the creation of massive spatial datasets, driving the need for scalable and efficient computational methodologies. While offering viable solutions, centralized frameworks are limited by vulnerabilities such as single-point failures and communication bottlenecks. This paper presents a decentralized framework tailored for parameter inference in spatial low-rank models to address these challenges. A key obstacle arises from the spatial dependence among observations, which prevents the log-likelihood from being expressed as a summation-a critical requirement for decentralized optimization approaches. To overcome this challenge, we propose a novel objective function leveraging the evidence lower bound, which facilitates the use of decentralized optimization techniques. Our approach employs a block descent method integrated with multi-consensus and dynamic consensus averaging for effective parameter optimization. We prove the convexity of the new objective function in the vicinity of the true parameters, ensuring the convergence of the proposed method. Additionally, we present the first theoretical results establishing the consistency and asymptotic normality of the estimator within the context of spatial low-rank models. Extensive simulations and real-world data experiments corroborate these theoretical findings, showcasing the robustness and scalability of the framework.

artificial intelligence, bayesian inference, machine learning, (21 more...)

arXiv.org Machine Learning

2502.00309

Country:

North America > United States (0.45)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Middle East > Saudi Arabia (0.04)

Genre: Research Report > New Finding (0.67)

Industry: Government (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.84)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Tree-structured Gaussian Process Approximations

Neural Information Processing SystemsJan-17-2025, 14:17:17 GMT

Gaussian process regression can be accelerated by constructing a small pseudo-dataset to summarise the observed data. This idea sits at the heart of many approximation schemes, but such an approach requires the number of pseudo-datapoints to be scaled with the range of the input space if the accuracy of the approximation is to be maintained. This presents problems in time-series settings or in spatial datasets where large numbers of pseudo-datapoints are required since computation typically scales quadratically with the pseudo-dataset size. In this paper we devise an approximation whose complexity grows linearly with the number of pseudo-datapoints. This is achieved by imposing a tree or chain structure on the pseudo-datapoints and calibrating the approximation using a Kullback-Leibler (KL) minimisation.

spatial dataset, tree-structured gaussian process approximation

Neural Information Processing Systems

Technology:

Information Technology > Modeling & Simulation (0.65)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Add feedback

Global Big Data Conference

#artificialintelligenceOct-2-2021, 16:10:17 GMT

Apple has been slowly but surely creating a name for itself in the low-code/no-code movement. This July, the Cupertino-based company announced the launch of Trinity AI, a no-code platform for complex spatial datasets. Trinity enables machine learning researchers and non-AI devs to tailor complex spatiotemporal datasets to fit deep learning models. Back in 2019, Apple revealed SwiftUI, a programming language that required much less coding than the Swift language. With the release of Trinity, Apple doubles down on its effort to significantly lower the threshold for non-devs and non-ML devs.

global big data conference, trinity, venturebeat, (8 more...)

#artificialintelligence

Genre: Press Release (0.81)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.65)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.58)

Add feedback

Apple's no-code Trinity AI platform handles complex spatial datasets

#artificialintelligenceOct-1-2021, 14:15:48 GMT

The Transform Technology Summits start October 13th with Low-Code/No Code: Enabling Enterprise Agility. Apple has been slowly but surely creating a name for itself in the low-code/no-code movement. This July, the Cupertino-based company announced the launch of Trinity AI, a no-code platform for complex spatial datasets. Trinity enables machine learning researchers and non-AI devs to tailor complex spatiotemporal datasets to fit deep learning models. Back in 2019, Apple revealed SwiftUI, a programming language that required much less coding than the Swift language.

platform, trinity, venturebeat, (10 more...)

#artificialintelligence

Genre: Press Release (0.76)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.55)

Add feedback

Modeling Massive Spatial Datasets Using a Conjugate Bayesian Linear Regression Framework

Banerjee, Sudipto

arXiv.org Machine LearningSep-9-2021

Geographic Information Systems (GIS) and related technologies have generated substantial interest among statisticians with regard to scalable methodologies for analyzing large spatial datasets. A variety of scalable spatial process models have been proposed that can be easily embedded within a hierarchical modeling framework to carry out Bayesian inference. While the focus of statistical research has mostly been directed toward innovative and more complex model development, relatively limited attention has been accorded to approaches for easily implementable scalable hierarchical models for the practicing scientist or spatial analyst. This article discusses how point-referenced spatial process models can be cast as a conjugate Bayesian linear regression that can rapidly deliver inference on spatial processes. The approach allows exact sampling directly (avoids iterative algorithms such as Markov chain Monte Carlo) from the joint posterior distribution of regression parameters, the latent process and the predictive random variables, and can be easily implemented on statistical programming environments such as R.

banerjee massive spatial data modeling, inference, sudipto banerjee, (11 more...)

arXiv.org Machine Learning

doi: 10.1016/j.spasta.2020.100417

2109.04447

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
North America > United States > Florida > Palm Beach County > Boca Raton (0.04)
North America > United States > New Jersey > Hudson County > Hoboken (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.74)

Add feedback

Spatially weighted averages in R with sf

#artificialintelligenceJul-2-2021, 21:05:47 GMT

Spatial joins allow to augment one spatial dataset with information from another spatial dataset by linking overlapping features. In this post I will provide an example showing how to augment a dataset containing school locations with socioeconomic data of their surrounding statistical region using R and the package sf (Pebesma 2018). This approach has the drawback that the surrounding statistical region doesn't reflect the actual catchment area of the school. I will present an alternative approach where the overlaps of the schools' catchment areas with the statistical regions allow to calculate the weighted average of the socioeconomic statistics. If we have no data about the actual catchment areas of the schools, we may resort to approximating these areas as circular regions or as Voronoi regions around schools.

catchment area, dataset, planning area, (15 more...)

#artificialintelligence

Country: Europe > Germany (0.05)

Industry: Education > Educational Setting (0.47)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Spatial Reasoning (1.00)

Add feedback

Tree-structured Gaussian Process Approximations

Bui, Thang D., Turner, Richard E.

Neural Information Processing SystemsFeb-14-2020, 09:26:53 GMT

Gaussian process regression can be accelerated by constructing a small pseudo-dataset to summarise the observed data. This idea sits at the heart of many approximation schemes, but such an approach requires the number of pseudo-datapoints to be scaled with the range of the input space if the accuracy of the approximation is to be maintained. This presents problems in time-series settings or in spatial datasets where large numbers of pseudo-datapoints are required since computation typically scales quadratically with the pseudo-dataset size. In this paper we devise an approximation whose complexity grows linearly with the number of pseudo-datapoints. This is achieved by imposing a tree or chain structure on the pseudo-datapoints and calibrating the approximation using a Kullback-Leibler (KL) minimisation.

spatial dataset, tree-structured gaussian process approximation

Neural Information Processing Systems

Technology:

Information Technology > Modeling & Simulation (0.65)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Add feedback

Sparse Pseudo-input Local Kriging for Large Non-stationary Spatial Datasets with Exogenous Variables

Farmanesh, Babak, Pourhabib, Arash

arXiv.org Machine LearningAug-5-2015

Gaussian process (GP) regression is a powerful tool for building predictive models for spatial systems. However, it does not scale efficiently for large datasets. Particularly, for high-dimensional spatial datasets, i.e., spatial datasets that contain exogenous variables, the performance of GP regression further deteriorates. This paper presents the Sparse Pseudo-input Local Kriging (SPLK) which approximates the full GP for spatial datasets with exogenous variables. SPLK employs orthogonal cuts which decompose the domain into smaller subdomains and then applies a sparse approximation of the full GP in each subdomain. We obtain the continuity of the global predictor by imposing continuity constraints on the boundaries of the neighboring subdomains. The domain decomposition scheme applies independent covariance structures in each region, and as a result, SPLK captures heterogeneous covariance structures. SPLK achieves computational efficiency by utilizing sparse approximation in each subdomain which enables SPLK to accommodate large subdomains that contain many data points and possess a homogenous covariance structure. We Apply the proposed method to real and simulated datasets. We conclude that the combination of orthogonal cuts and sparse approximation makes the proposed method an efficient algorithm for high-dimensional large spatial datasets.

artificial intelligence, machine learning, modeling & simulation, (19 more...)

arXiv.org Machine Learning

1508.01248

Country: North America > United States (0.93)

Genre: Research Report > New Finding (0.46)

Technology: