Goto

Collaborating Authors

 India


The Download: India's AI independence, and predicting future epidemics

MIT Technology Review

Despite its status as a global tech hub, India lags far behind the likes of the US and China when it comes to homegrown AI. That gap has opened largely because India has chronically underinvested in R&D, institutions, and invention. Meanwhile, since no one native language is spoken by the majority of the population, training language models is far more complicated than it is elsewhere. So when the open-source foundation model DeepSeek-R1 suddenly outperformed many global peers, it struck a nerve. This launch by a Chinese startup prompted Indian policymakers to confront just how far behind the country was in AI infrastructure--and how urgently it needed to respond.


Inside India's scramble for AI independence

MIT Technology Review

Historically known as the global back office for the software industry, India has a tech ecosystem that evolved with a services-first mindset. Giants like Infosys and TCS built their success on efficient software delivery, but invention was neither prioritized nor rewarded. Meanwhile, India's R&D spending hovered at just 0.65% of GDP ( 25.4 billion) in 2024, far behind China's 2.68% ( 476.2 billion) and the US's 3.5% ( 962.3 billion). The muscle to invent and commercialize deep tech, from algorithms to chips, was just never built. Isolated pockets of world-class research do exist within government agencies like the DRDO (Defense Research & Development Organization) and ISRO (Indian Space Research Organization), but their breakthroughs rarely spill into civilian or commercial use.


India Is Using AI and Satellites to Map Urban Heat Vulnerability Down to the Building Level

WIRED

Zubaida starts her day at eight in the morning, sorting discarded plastics, glass, and chemicals with her bare hands, to collect items she can sell. With waste-segregation centers in this part of East Delhi currently shut down, she and other waste-pickers from the Seemapuri slum work outside by a dusty road through the hottest hours of the day, under the blazing sun. There is no fan or shade. With Delhi's heat wave season here, they are constantly exposed to intense high temperatures. On June 11, the India Meteorological Department (IMD) issued a red alert for Delhi, warning of a high risk of heat illness and heat stroke.



Microsoft-backed AI startup chatbots revealed to be human employees

Mashable

A startup promised that their AI assistant would build you an app. But the work was actually done by human engineers. Builder.ai, a startup backed by Microsoft, pitched itself as an AI-powered way to simplify app development. Clients chatted with the platform's signature AI assistant, Natasha, and received a functional, AI-generated app based on the information they provided. But instead of using AI technology to run the chatbot and create the app, the company hired 700 engineers in India to pose as Natasha in conversations with clients, and then to do the actual coding of the app.


Sensitivity in Translation Averaging

Neural Information Processing Systems

In 3D computer vision, translation averaging solves for absolute translations given a set of pairwise relative translation directions. While there has been much work on robustness to outliers and studies on the uniqueness of the solution, this paper deals with a distinctly different problem of sensitivity in translation averaging under uncertainty. We first analyze sensitivity in estimating scales corresponding to relative directions under small perturbations of the relative directions. Then, we formally define the conditioning of the translation averaging problem, which assesses the reliability of estimated translations based solely on the input directions. We give a sufficient criterion to ensure that the problem is well-conditioned. Subsequently, we provide an efficient algorithm to identify and remove combinations of directions which make the problem ill-conditioned while ensuring uniqueness of the solution. We demonstrate the utility of such analysis in global structure-frommotion pipelines for obtaining 3D reconstructions, which reveals the benefits of filtering the ill-conditioned set of directions in translation averaging in terms of reduced translation errors, a higher number of 3D points triangulated and faster convergence of bundle adjustment.


Deep Neural Networks with Inexact Matching for Person Re-Identification

Neural Information Processing Systems

Person Re-Identification is the task of matching images of a person across multiple camera views. Almost all prior approaches address this challenge by attempting to learn the possible transformations that relate the different views of a person from a training corpora. Then, they utilize these transformation patterns for matching a query image to those in a gallery image bank at test time.


Supplementary: Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking

Neural Information Processing Systems

Datasets and code files are publicly accessible at Link. Our dataset will be hosted on both the GitHub and cloud storage drive. The dataset is provided under a Creative Commons CC BY-SA 4.0 license, allowing both academics and industry to use it. The structure of the TrajImpute dataset follows a dictionary format with 8 keys, including the obs_traj, pred_traj, obs_traj_rel, pred_traj_rel and missing_mask. We have already included these details in the main paper (see Section 4 and Figure 3 of the main paper).


Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking Pravendra Singh

Neural Information Processing Systems

Pedestrian trajectory prediction is crucial for several applications such as robotics and self-driving vehicles. Significant progress has been made in the past decade thanks to the availability of pedestrian trajectory datasets, which enable trajectory prediction methods to learn from pedestrians' past movements and predict future trajectories. However, these datasets and methods typically assume that the observed trajectory sequence is complete, ignoring real-world issues such as sensor failure, occlusion, and limited fields of view that can result in missing values in observed trajectories. To address this challenge, we present TrajImpute, a pedestrian trajectory prediction dataset that simulates missing coordinates in the observed trajectory, enhancing real-world applicability. TrajImpute maintains a uniform distribution of missing data within the observed trajectories. In this work, we comprehensively examine several imputation methods to reconstruct the missing coordinates and benchmark them for imputing pedestrian trajectories. Furthermore, we provide a thorough analysis of recent trajectory prediction methods and evaluate the performance of these models on the imputed trajectories. Our experimental evaluation of the imputation and trajectory prediction methods offers several valuable insights. Our dataset provides a foundational resource for future research on imputation-aware pedestrian trajectory prediction, potentially accelerating the deployment of these methods in real-world applications.


Superposed Decoding: Multiple Generations from a Single Autoregressive Inference Pass Sarah Pratt

Neural Information Processing Systems

Many applications today provide users with multiple auto-complete drafts as they type, including GitHub's code completion, Gmail's smart compose, and Apple's messaging auto-suggestions. Under the hood, language models support this by running an autoregressive inference pass to provide a draft. Consequently, providing k drafts to the user requires running an expensive language model k times. To alleviate the computation cost of running k inference passes, we propose Superposed Decoding, a new decoding algorithm that generates k drafts at the computation cost of one autoregressive inference pass. We achieve this by feeding a superposition of the most recent token embeddings from the k drafts as input to the next decoding step of the language model.