AITopics

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

WIREDNov-12-2025, 16:00:00 GMT

Waymo's Robotaxis Can Now Use the Highway, Speeding Up Longer Trips

Waymo's Robotaxis Can Now Use the Highway, Speeding Up Longer Trips The Alphabet company's self-driving cars are opening up shop in more and more cities. When Google's self-driving car project began testing in the Bay Area back in 2009, its engineers focused on highways by sending its sensor-laden vehicles cruising down Interstate 280, which runs the length of Silicon Valley's peninsula. More than 15 years later, the cars are back on the freeway--this time without drivers. On Tuesday, the project, now an Alphabet subsidiary we all know as Waymo, announced that its robotaxi service would now drive on freeways in the San Francisco Bay Area, Los Angeles, and Phoenix. The new service marks another technical leap for Waymo, whose robotaxis currently serve five US metros: Atlanta, Austin, Los Angeles, Phoenix, and the San Francisco Bay Area.

artificial intelligence, promo code, waymo, (13 more...)

WIRED

Country:

North America > United States > California > San Francisco County > San Francisco (0.48)
North America > United States > California > Los Angeles County > Los Angeles (0.46)

Industry:

Transportation > Ground > Road (1.00)
Information Technology > Robotics & Automation (1.00)
Automobiles & Trucks (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Neural Information Processing SystemsMay-27-2025, 03:43:22 GMT

QWO: Speeding Up Permutation-Based Causal Discovery in LiGAMs

Causal discovery is essential for understanding relationships among variables of interest in many scientific domains. In this paper, we focus on permutation-based methods for learning causal graphs in Linear Gaussian Acyclic Models (LiGAMs), where the permutation encodes a causal ordering of the variables. Existing methods in this setting are not scalable due to their high computational complexity. These methods are comprised of two main components: (i) constructing a specific DAG, \mathcal{G} \pi, for a given permutation \pi, which represents the best structure that can be learned from the available data while adhering to \pi, and (ii) searching over the space of permutations (i.e., causal orders) to minimize the number of edges in \mathcal{G} \pi . We introduce QWO, a novel approach that significantly enhances the efficiency of computing \mathcal{G} \pi for a given permutation \pi .

artificial intelligence, ligam, permutation-based causal discovery, (4 more...)

Technology: Information Technology > Artificial Intelligence (0.46)

arXiv.org Artificial IntelligenceNov-7-2024

SuffixDecoding: A Model-Free Approach to Speeding Up Large Language Model Inference

Oliaro, Gabriele, Jia, Zhihao, Campos, Daniel, Qiao, Aurick

We present SuffixDecoding, a novel model-free approach to accelerating large language model (LLM) inference through speculative decoding. Unlike existing methods that rely on draft models or specialized decoding heads, SuffixDecoding leverages suffix trees built from previously generated outputs to efficiently predict candidate token sequences. Our approach enables flexible tree-structured speculation without the overhead of maintaining and orchestrating additional models. SuffixDecoding builds and dynamically updates suffix trees to capture patterns in the generated text, using them to construct speculation trees through a principled scoring mechanism based on empirical token frequencies. SuffixDecoding requires only CPU memory which is plentiful and underutilized on typical LLM serving nodes. We demonstrate that SuffixDecoding achieves competitive speedups compared to model-based approaches across diverse workloads including open-domain chat, code generation, and text-to-SQL tasks. For open-ended chat and code generation tasks, SuffixDecoding achieves up to $1.4\times$ higher output throughput than SpecInfer and up to $1.1\times$ lower time-per-token (TPOT) latency. For a proprietary multi-LLM text-to-SQL application, SuffixDecoding achieves up to $2.9\times$ higher output throughput and $3\times$ lower latency than speculative decoding. Our evaluation shows that SuffixDecoding maintains high acceptance rates even with small reference corpora of 256 examples, while continuing to improve performance as more historical outputs are incorporated.

agenticsql, suffix tree, suffixdecoding, (14 more...)

2411.04975

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
North America > Canada > Ontario > Toronto (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Neural Information Processing SystemsOct-8-2024, 06:34:09 GMT

Reviews: Speeding Up Latent Variable Gaussian Graphical Model Estimation via Nonconvex Optimization

The paper considers learning the dependency structure of Gaussian graphical models where some variables are latent. Directly applying the usual assumption of sparsity in the precision matrix is difficult because variables that appear correlated might actually both depend on a common latent variable. Previously, Chandrasekaran et al. proposed estimating the model structure by decomposing the full precision matrix into the sum of of a sparse matrix and a low-rank matrix. Likelihood is maximized while the components of the sparse matrix are penalized with an l1 regularizer and the low-rank matrix is penalized with a nuclear norm. Computing the proximal operator to update the low-rank component requires performing SVD in O(d 3) time at each iteration. The authors propose replacing the low-rank component with its Cholesky decomposition ZZ T and finding Z directly.

matrix, nonconvex optimization, variable gaussian graphical model estimation, (7 more...)

Technology:

Information Technology > Artificial Intelligence > Systems & Languages (0.64)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.64)

arXiv.org Artificial IntelligenceNov-9-2023

AccEPT: An Acceleration Scheme for Speeding Up Edge Pipeline-parallel Training

Chen, Yuhao, Yan, Yuxuan, Yang, Qianqian, Shu, Yuanchao, He, Shibo, Shi, Zhiguo, Chen, Jiming

It is usually infeasible to fit and train an entire large deep neural network (DNN) model using a single edge device due to the limited resources. To facilitate intelligent applications across edge devices, researchers have proposed partitioning a large model into several sub-models, and deploying each of them to a different edge device to collaboratively train a DNN model. However, the communication overhead caused by the large amount of data transmitted from one device to another during training, as well as the sub-optimal partition point due to the inaccurate latency prediction of computation at each edge device can significantly slow down training. In this paper, we propose AccEPT, an acceleration scheme for accelerating the edge collaborative pipeline-parallel training. In particular, we propose a light-weight adaptive latency predictor to accurately estimate the computation latency of each layer at different devices, which also adapts to unseen devices through continuous learning. Therefore, the proposed latency predictor leads to better model partitioning which balances the computation loads across participating devices. Moreover, we propose a bit-level computation-efficient data compression scheme to compress the data to be transmitted between devices during training. Our numerical results demonstrate that our proposed acceleration approach is able to significantly speed up edge pipeline parallel training up to 3 times faster in the considered experimental settings.

acceleration scheme, edge pipeline-parallel training, speeding

2311.05827

Genre: Research Report (0.69)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.53)

arXiv.org Artificial IntelligenceSep-18-2023

Speeding Up Speech Synthesis In Diffusion Models By Reducing Data Distribution Recovery Steps Via Content Transfer

Ochieng, Peter

Diffusion based vocoders have been criticised for being slow due to the many steps required during sampling. Moreover, the model's loss function that is popularly implemented is designed such that the target is the original input $x_0$ or error $\epsilon_0$. For early time steps of the reverse process, this results in large prediction errors, which can lead to speech distortions and increase the learning time. We propose a setup where the targets are the different outputs of forward process time steps with a goal to reduce the magnitude of prediction errors and reduce the training time. We use the different layers of a neural network (NN) to perform denoising by training them to learn to generate representations similar to the noised outputs in the forward process of the diffusion. The NN layers learn to progressively denoise the input in the reverse process until finally the final layer estimates the clean speech. To avoid 1:1 mapping between layers of the neural network and the forward process steps, we define a skip parameter $\tau>1$ such that an NN layer is trained to cumulatively remove the noise injected in the $\tau$ steps in the forward process. This significantly reduces the number of data distribution recovery steps and, consequently, the time to generate speech. We show through extensive evaluation that the proposed technique generates high-fidelity speech in competitive time that outperforms current state-of-the-art tools. The proposed technique is also able to generalize well to unseen speech.

forward process, neural network, speech synthesis, (10 more...)

2309.09652

Country:

Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Workflow (0.87)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsApr-6-2023, 16:32:32 GMT

Speeding up the Parti-Game Algorithm

In this paper, we introduce an efficient replanning algorithm for nonde- terministic domains, namely what we believe to be the first incremental heuristic minimax search algorithm. We apply it to the dynamic dis- cretization of continuous domains, resulting in an efficient implemen- tation of the parti-game reinforcement-learning algorithm for control in high-dimensional domains.

parti-game algorithm, speeding

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.82)

#artificialintelligenceFeb-21-2023, 04:25:27 GMT

Speeding up the time to perform MRI scans with AI-assisted technology – JD Supra

It appears that when machine learning is used to reconstruct MRI images, albeit at a faster pace and with less imaging data acquisition than …

jd supra, perform mri scan, speeding

#artificialintelligence

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.76)
Media > News (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.52)

Kannout, Eyad, Nguyen, Hung Son, Grzegorowski, Marek

Speeding Up Recommender Systems Using Association Rules

arXiv.org Artificial IntelligenceNov-16-2022

Recommender systems are considered one of the most rapidly growing branches of Artificial Intelligence. The demand for finding more efficient techniques to generate recommendations becomes urgent. However, many recommendations become useless if there is a delay in generating and showing them to the user. Therefore, we focus on improving the speed of recommendation systems without impacting the accuracy. In this paper, we suggest a novel recommender system based on Factorization Machines and Association Rules (FMAR). We introduce an approach to generate association rules using two algorithms: (i) apriori and (ii) frequent pattern (FP) growth. These association rules will be utilized to reduce the number of items passed to the factorization machines recommendation model. We show that FMAR has significantly decreased the number of new items that the recommender system has to predict and hence, decreased the required time for generating the recommendations. On the other hand, while building the FMAR tool, we concentrate on making a balance between prediction time and accuracy of generated recommendations to ensure that the accuracy is not significantly impacted compared to the accuracy of using factorization machines without association rules.

artificial intelligence, association rule, expert system, (15 more...)

doi: 10.1007/978-3-031-21967-2_14

2211.08799

Country:

Europe > Poland > Masovia Province > Warsaw (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Minnesota (0.04)

Genre: Research Report > New Finding (0.69)

Industry: Health & Medicine (0.69)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)