AITopics | Ganesan, Deepak

Collaborating Authors

Ganesan, Deepak

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models

Liu, Xiao, Zhang, Lijun, Ganesan, Deepak, Guan, Hui

arXiv.org Artificial IntelligenceNov-8-2024

Vision Language Models (VLMs) are central to Visual Question Answering (VQA) systems and are typically deployed in the cloud due to their high computational demands. However, this cloud-only approach underutilizes edge computational resources and requires significant bandwidth for transmitting raw images. In this paper, we introduce an edge-cloud collaborative VQA system, called LLaVA-AlignedVQ, which features a novel Aligned Vector Quantization algorithm (AlignedVQ) that efficiently compress intermediate features without compromising accuracy to support partitioned execution. Our experiments demonstrate that LLaVA-AlignedVQ achieves approximately 1365x compression rate of intermediate features, reducing data transmission overhead by 96.8% compared to transmitting JPEG90-compressed images to the cloud. LLaVA-AlignedVQ achieves an inference speedup of 2-15x while maintaining high accuracy, remaining within -2.23% to +1.6% of the original model's accuracy performance across eight VQA datasets, compared to the cloud-only solution.

accuracy, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2411.05961

Country: North America > United States > Massachusetts (0.28)

Genre: Research Report > Promising Solution (0.34)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

GDTM: An Indoor Geospatial Tracking Dataset with Distributed Multimodal Sensors

Jeong, Ho Lyun, Wang, Ziqi, Samplawski, Colin, Wu, Jason, Fang, Shiwei, Kaplan, Lance M., Ganesan, Deepak, Marlin, Benjamin, Srivastava, Mani

arXiv.org Artificial IntelligenceFeb-21-2024

Constantly locating moving objects, i.e., geospatial tracking, is essential for autonomous building infrastructure. Accurate and robust geospatial tracking often leverages multimodal sensor fusion algorithms, which require large datasets with time-aligned, synchronized data from various sensor types. However, such datasets are not readily available. Hence, we propose GDTM, a nine-hour dataset for multimodal object tracking with distributed multimodal sensors and reconfigurable sensor node placements. Our dataset enables the exploration of several research problems, such as optimizing architectures for processing multimodal data, and investigating models' robustness to adverse sensing conditions and sensor placement variances. A GitHub repository containing the code, sample data, and checkpoints of this work is available at https://github.com/nesl/GDTM.

artificial intelligence, information fusion, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2402.14136

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.16)
North America > United States > Massachusetts > Hampshire County > Amherst (0.15)

Genre: Research Report > Experimental Study (0.46)

Industry:

Leisure & Entertainment > Sports > Motorsports (0.46)
Government (0.46)

Technology:

Information Technology > Sensing and Signal Processing (1.00)
Information Technology > Data Science > Data Integration (1.00)
Information Technology > Communications > Networks (1.00)
(4 more...)

Add feedback

Efficient IoT Inference via Context-Awareness

Rastikerdar, Mohammad Mehdi, Huang, Jin, Fang, Shiwei, Guan, Hui, Ganesan, Deepak

arXiv.org Artificial IntelligenceDec-3-2023

While existing strategies to execute deep learning-based classification on low-power platforms assume the models are trained on all classes of interest, this paper posits that adopting context-awareness i.e. narrowing down a classification task to the current deployment context consisting of only recent inference queries can substantially enhance performance in resource-constrained environments. We propose a new paradigm, CACTUS, for scalable and efficient context-aware classification where a micro-classifier recognizes a small set of classes relevant to the current context and, when context change happens (e.g., a new class comes into the scene), rapidly switches to another suitable micro-classifier. CACTUS features several innovations, including optimizing the training cost of context-aware classifiers, enabling on-the-fly context-aware switching between classifiers, and balancing context switching costs and performance gains via simple yet effective switching policies. We show that CACTUS achieves significant benefits in accuracy, latency, and compute budget across a range of datasets and IoT platforms.

artificial intelligence, classifier, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2310.19112

Country: North America > United States > New York > New York County > New York City (0.14)

Genre: Research Report (0.50)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.94)

Add feedback

Heteroskedastic Geospatial Tracking with Distributed Camera Networks

Samplawski, Colin, Fang, Shiwei, Wang, Ziqi, Ganesan, Deepak, Srivastava, Mani, Marlin, Benjamin M.

arXiv.org Artificial IntelligenceJun-4-2023

Visual object tracking has seen significant progress in recent years. However, the vast majority of this work focuses on tracking objects within the image plane of a single camera and ignores the uncertainty associated with predicted object locations. In this work, we focus on the geospatial object tracking problem using data from a distributed camera network. The goal is to predict an object's track in geospatial coordinates along with uncertainty over the object's location while respecting communication constraints that prohibit centralizing raw image data. We present a novel single-object geospatial tracking data set that includes high-accuracy ground truth object locations and video data from a network of four cameras. We present a modeling framework for addressing this task including a novel backbone model and explore how uncertainty calibration and fine-tuning through a differentiable tracker affect performance.

artificial intelligence, image understanding, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2306.02407

Country:

North America > United States > Massachusetts (0.14)
North America > United States > California (0.14)

Genre: Research Report > New Finding (0.68)

Industry: Commercial Services & Supplies > Security & Alarm Services (0.60)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision > Image Understanding (0.70)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)

Add feedback

Eulerian Phase-based Motion Magnification for High-Fidelity Vital Sign Estimation with Radar in Clinical Settings

Oshim, Md Farhan Tasnim, Surti, Toral, Carreiro, Stephanie, Ganesan, Deepak, Jayasuriya, Suren, Rahman, Tauhidur

arXiv.org Artificial IntelligenceDec-3-2022

Efficient and accurate detection of subtle motion generated from small objects in noisy environments, as needed for vital sign monitoring, is challenging, but can be substantially improved with magnification. We developed a complex Gabor filter-based decomposition method to amplify phases at different spatial wavelength levels to magnify motion and extract 1D motion signals for fundamental frequency estimation. The phase-based complex Gabor filter outputs are processed and then used to train machine learning models that predict respiration and heart rate with greater accuracy. We show that our proposed technique performs better than the conventional temporal FFT-based method in clinical settings, such as sleep laboratories and emergency departments, as well for a variety of human postures.

artificial intelligence, data quality, machine learning, (12 more...)

arXiv.org Artificial Intelligence

2212.04923

Country: North America > United States > Massachusetts (0.70)

Genre: Research Report (0.64)

Industry: Health & Medicine > Diagnostic Medicine > Vital Signs (0.73)

Technology:

Information Technology > Data Science > Data Quality > Data Transformation (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.47)

Add feedback