AITopics

2307.0561

Genre: Research Report > New Finding (0.86)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.67)

arXiv.org Artificial Intelligence May-22-2023

Approximating a RUM from Distributions on k-Slates

Chierichetti, Flavio, Giacchini, Mirko, Kumar, Ravi, Panconesi, Alessandro, Tomkins, Andrew

In this work we consider the problem of fitting Random Utility Models (RUMs) to user choices. Given the winner distributions of the subsets of size $k$ of a universe, we obtain a polynomial-time algorithm that finds the RUM that best approximates the given distribution on average. Our algorithm is based on a linear program that we solve using the ellipsoid method. Given that its corresponding separation oracle problem is NP-hard, we devise an approximate separation oracle that can be viewed as a generalization of the weighted feedback arc set problem to hypergraphs. Our theoretical result can also be made practical: we obtain a heuristic that is effective and scales to real-world datasets.
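As a rough illustration of the objects in this abstract (not the paper's algorithm), a RUM can be viewed as a probability distribution over rankings of the universe, and the winner of a slate is the slate item that the sampled ranking places first. The sketch below computes the exact winner distributions on all k-slates for a tiny RUM. The function names and the toy RUM are hypothetical and chosen only for this example.

```python
from itertools import combinations


def winner_distribution(rum, slate):
    """Probability that each item of `slate` wins under the RUM.

    `rum` maps a ranking (tuple of items, best first) to its probability.
    """
    dist = {item: 0.0 for item in slate}
    for ranking, prob in rum.items():
        # The winner is the slate item that appears earliest in the ranking.
        winner = min(slate, key=ranking.index)
        dist[winner] += prob
    return dist


def k_slate_distributions(rum, universe, k):
    """Winner distribution for every size-k subset (slate) of the universe."""
    return {
        slate: winner_distribution(rum, slate)
        for slate in combinations(sorted(universe), k)
    }


# Hypothetical toy RUM over three items (illustrative numbers only).
toy_rum = {
    ("a", "b", "c"): 0.5,
    ("b", "c", "a"): 0.25,
    ("c", "a", "b"): 0.25,
}
slates = k_slate_distributions(toy_rum, {"a", "b", "c"}, 2)
```

The fitting problem described in the abstract goes in the opposite direction: given such slate distributions, find a RUM whose induced distributions are close to them on average.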

algorithm, artificial intelligence, machine learning, (19 more...)

2305.13283

Country: North America > United States (0.67)

Genre: Research Report (0.64)

Industry:

Transportation (0.46)
Government > Regional Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.46)

arXiv.org Artificial Intelligence Dec-22-2020

Graph Autoencoders with Deconvolutional Networks

Li, Jia, Yu, Tomas, Juan, Da-Cheng, Gopalan, Arjun, Cheng, Hong, Tomkins, Andrew

Recent studies have indicated that Graph Convolutional Networks (GCNs) act as a low pass filter in spectral domain and encode smoothed node representations. In this paper, we consider their opposite, namely Graph Deconvolutional Networks (GDNs) that reconstruct graph signals from smoothed node representations. We motivate the design of Graph Deconvolutional Networks via a combination of inverse filters in spectral domain and de-noising layers in wavelet domain, as the inverse operation results in a high pass filter and may amplify the noise. Based on the proposed GDN, we further propose a graph autoencoder framework that first encodes smoothed graph representations with GCN and then decodes accurate graph signals with GDN. We demonstrate the effectiveness of the proposed method on several tasks including unsupervised graph-level representation, social recommendation and graph generation.

Autoencoders have demonstrated excellent performance on tasks such as unsupervised representation learning (Bengio, 2009) and de-noising (Vincent et al., 2010). Recently, several studies (Zeiler & Fergus, 2014; Long et al., 2015) have demonstrated that the performance of autoencoders can be further improved by encoding with Convolutional Networks and decoding with Deconvolutional Networks (Zeiler et al., 2010). Notably, Noh et al. (2015) present a novel symmetric architecture that provides a bottom-up mapping from input signals to latent hierarchical feature space with {convolution, pooling} operations and then maps the latent representation back to the input space with {deconvolution, unpooling} operations. While this architecture has been successful when processing features whose structure lies in Euclidean space (e.g., images), recently there has been a surging interest in applying such a framework to non-Euclidean data like graphs.

deep learning, neural network, representation, (17 more...)

2012.11898

Country:

Asia (0.46)
North America > United States > California (0.14)

Genre: Research Report (0.84)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

arXiv.org Machine Learning Aug-17-2020

Generative Models are Unsupervised Predictors of Page Quality: A Colossal-Scale Study

Bahri, Dara, Tay, Yi, Zheng, Che, Metzler, Donald, Brunk, Cliff, Tomkins, Andrew

Large generative language models such as GPT-2 are well-known for their ability to generate text as well as their utility in supervised downstream tasks via fine-tuning. Our work is twofold: firstly we demonstrate via human evaluation that classifiers trained to discriminate between human and machine-generated text emerge as unsupervised predictors of "page quality", able to detect low quality content without any training. This enables fast bootstrapping of quality indicators in a low-resource setting. Secondly, curious to understand the prevalence and nature of low quality pages in the wild, we conduct extensive qualitative and quantitative analysis over 500 million web articles, making this the largest-scale study ever conducted on the topic.

artificial intelligence, health & medicine, machine translation, (20 more...)

2008.13533

Country: North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)

Genre: Research Report (1.00)

Industry: Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.69)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.56)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.56)

arXiv.org Machine Learning Jul-2-2020

BusTr: Predicting Bus Travel Times from Real-Time Traffic

Barnes, Richard, Buthpitiya, Senaka, Cook, James, Fabrikant, Alex, Tomkins, Andrew, Xu, Fangzhou

... the world's public transit systems where no official real-time bus tracking is provided. We demonstrate that our neural sequence model improves over DeepTTE, the state-of-the-art baseline, both in performance (30% MAPE) and training stability. We also demonstrate significant generalization gains over simpler models, evaluated on longitudinal data to cope with a constantly evolving world.

Of these two modalities, real-time state is disproportionately important for the routine trips that dominate most people's transportation needs. Most transit users know by heart the routes connecting their home, work, and other frequent destinations, but they have a well-established need for information about real-time changes. Transit variability is a ...

ground transportation, neural network, prediction, (21 more...)

doi: 10.1145/3394486.3403376

2007.00882

Country:

North America > United States (0.68)
Asia > China (0.46)
South America > Brazil > Rio de Janeiro (0.14)

Genre: Research Report > Experimental Study (0.94)

Industry:

Transportation > Infrastructure & Services (1.00)
Transportation > Ground > Road (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Architecture > Real Time Systems (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

arXiv.org Artificial Intelligence Oct-23-2019

Preventing Adversarial Use of Datasets through Fair Core-Set Construction

Spector, Benjamin, Kumar, Ravi, Tomkins, Andrew

We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor performance on unwanted tasks. We give methods for both linear models and neural networks and demonstrate their efficacy on data.

artificial intelligence, dataset, neural network, (16 more...)

1910.10871

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.47)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.35)

arXiv.org Machine Learning Feb-13-2019

Graph-RISE: Graph-Regularized Image Semantic Embedding

Juan, Da-Cheng, Lu, Chun-Ta, Li, Zhen, Peng, Futang, Timofeev, Aleksei, Chen, Yi-Ting, Gao, Yaxi, Duerig, Tom, Tomkins, Andrew, Ravi, Sujith

Learning image representations to capture fine-grained semantics has been a challenging and important task enabling many applications such as image search and clustering. In this paper, we present Graph-Regularized Image Semantic Embedding (Graph-RISE), a large-scale neural graph learning framework that allows us to train embeddings to discriminate an unprecedented O(40M) ultra-fine-grained semantic labels. Graph-RISE outperforms state-of-the-art image embedding algorithms on several evaluation tasks, including image classification and triplet ranking. We provide case studies to demonstrate that, qualitatively, image retrieval based on Graph-RISE effectively captures semantics and, compared to the state-of-the-art, differentiates nuances at levels that are closer to human perception.

artificial intelligence, neural network, woodstock, (16 more...)

1902.10814

Country: North America > United States (0.46)

Genre: Research Report (0.82)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.95)

arXiv.org Machine Learning Apr-4-2017

Linear Additive Markov Processes

Kumar, Ravi, Raghu, Maithra, Sarlos, Tamas, Tomkins, Andrew

We introduce LAMP: the Linear Additive Markov Process. Transitions in LAMP may be influenced by states visited in the distant history of the process, but unlike higher-order Markov processes, LAMP retains an efficient parametrization. LAMP also allows the specific dependence on history to be learned efficiently from data. We characterize some theoretical properties of LAMP, including its steady-state and mixing time. We then give an algorithm based on alternating minimization to learn LAMP models from data. Finally, we perform a series of real-world experiments to show that LAMP is more powerful than first-order Markov processes, and even holds its own against deep sequential models (LSTMs) with a negligible increase in parameter complexity.
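One way to read the description above, offered as a minimal sketch and not the authors' implementation: assume the next-state distribution is a weighted mixture of a single first-order transition matrix applied to the most recently visited states. The names `lamp_next_distribution`, `P`, and `w`, the toy numbers, and the choice to renormalize the weights when the history is shorter than `w` are all assumptions made for illustration.

```python
def lamp_next_distribution(history, P, w):
    """Next-state distribution under an assumed LAMP-style mixture.

    history: list of visited state indices, most recent last.
    P:       row-stochastic transition matrix as a list of lists.
    w:       lag weights; w[0] applies to the most recent state.
    """
    # Assumption: with a short history, use only the available lags
    # and renormalize their weights so they still sum to one.
    lags = min(len(history), len(w))
    total = sum(w[:lags])
    dist = [0.0] * len(P)
    for i in range(lags):
        state = history[-1 - i]  # state visited i + 1 steps ago
        for nxt, p in enumerate(P[state]):
            dist[nxt] += (w[i] / total) * p
    return dist


# Hypothetical two-state example (illustrative numbers only).
P = [[0.9, 0.1],
     [0.2, 0.8]]
w = [0.75, 0.25]
dist = lamp_next_distribution([0, 1], P, w)
```

Under this reading, a model with n states and k lags needs only the n-by-n matrix plus k weights, whereas a full order-k Markov process needs a separate next-state distribution for every length-k history.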

deep learning, markov process, neural network, (20 more...)

1704.01255

Country:

North America > United States > California > Santa Clara County (0.14)
Oceania > Australia (0.14)

Genre: Research Report (0.82)

Industry:

Leisure & Entertainment (0.68)
Media > Music (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (1.00)