AITopics | Yousuf, Raquib Bin

Collaborating Authors

Yousuf, Raquib Bin

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Optimizing Product Provenance Verification using Data Valuation Methods

Yousuf, Raquib Bin, Just, Hoang Anh, Xu, Shengzhe, Mayer, Brian, Deklerck, Victor, Truszkowski, Jakub, Simeone, John C., Saunders, Jade, Lu, Chang-Tien, Jia, Ruoxi, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceMar-16-2025

Determining and Determining and verifying product provenance remains a critical verifying product provenance is a challenge in global supply chains, challenge in global supply chains, particularly as geopolitical conflicts as geopolitics and the lure of "don't ask, don't tell" with respect to and shifting borders create new incentives for misrepresentation the ecological and social cost creates incentives for misrepresentation of commodities, such as hiding the origin of illegally harvested of commodities, such as hiding the origin of illegally harvested timber or agriculture grown on illegally cleared land. Stable Isotope timber or agriculture grown on illegally cleared land. Ratio Analysis (SIRA), combined with Gaussian process regressionbased Product identification and provenance verification of traded natural isoscapes, has emerged as a powerful tool for geographic resources have emerged as promising research areas, with origin verification. However, the effectiveness of these models is often various combinations of methods used based on the specific natural constrained by data scarcity and suboptimal dataset selection. In resource sector and the level of granularity of species identification this work, we introduce a novel data valuation framework designed and origin-provenance determination. For example, for wood and to enhance the selection and utilization of training data for machine forest products, determining species identification and geographic learning models applied in SIRA. By prioritizing high-informative harvest provenance requires utilizing multiple testing methods and samples, our approach improves model robustness and predictive tools [5, 8, 20].

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2502.15177

Country:

Europe (1.00)
North America > United States > Virginia (0.30)

Genre: Research Report > New Finding (0.46)

Industry:

Health & Medicine (1.00)
Materials > Paper & Forest Products (0.57)
Food & Agriculture > Agriculture (0.44)

Technology:

Information Technology > Data Science > Data Mining (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)

Add feedback

Chasing the Timber Trail: Machine Learning to Reveal Harvest Location Misrepresentation

Sarkar, Shailik, Yousuf, Raquib Bin, Wang, Linhan, Mayer, Brian, Mortier, Thomas, Deklerck, Victor, Truszkowski, Jakub, Simeone, John C., Norman, Marigold, Saunders, Jade, Lu, Chang-Tien, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceMar-16-2025

Illegal logging poses a significant threat to global biodiversity, climate stability, and depresses international prices for legal wood harvesting and responsible forest products trade, affecting livelihoods and communities across the globe. Stable isotope ratio analysis (SIRA) is rapidly becoming an important tool for determining the harvest location of traded, organic, products. The spatial pattern in stable isotope ratio values depends on factors such as atmospheric and environmental conditions and can thus be used for geographic origin identification. We present here the results of a deployed machine learning pipeline where we leverage both isotope values and atmospheric variables to determine timber harvest location. Additionally, the pipeline incorporates uncertainty estimation to facilitate the interpretation of harvest location determination for analysts. We present our experiments on a collection of oak (Quercus spp.) tree samples from its global range. Our pipeline outperforms comparable state-of-the-art models determining geographic harvest origin of commercially traded wood products, and has been used by European enforcement agencies to identify harvest location misrepresentation. We also identify opportunities for further advancement of our framework and how it can be generalized to help identify the origin of falsely labeled organic products throughout the supply chain.

artificial intelligence, isotope ratio, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2502.14115

Country:

Europe (0.95)
Asia (0.69)
North America > United States > Virginia (0.15)

Genre: Research Report (1.00)

Industry:

Materials > Paper & Forest Products (1.00)
Law (1.00)
Government > Regional Government > North America Government > United States Government (0.68)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Add feedback

LLM Augmentations to support Analytical Reasoning over Multiple Documents

Yousuf, Raquib Bin, Defelice, Nicholas, Sharma, Mandar, Xu, Shengzhe, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceNov-25-2024

Building on their demonstrated ability to perform a variety of tasks, we investigate the application of large language models (LLMs) to enhance in-depth analytical reasoning within the context of intelligence analysis. Intelligence analysts typically work with massive dossiers to draw connections between seemingly unrelated entities, and uncover adversaries' plans and motives. We explore if and how LLMs can be helpful to analysts for this task and develop an architecture to augment the capabilities of an LLM with a memory module called dynamic evidence trees (DETs) to develop and track multiple investigation threads. Through extensive experiments on multiple datasets, we highlight how LLMs, as-is, are still inadequate to support intelligence analysts and offer recommendations to improve LLMs for such intricate reasoning applications.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2411.16116

Country: North America > United States > Virginia (0.14)

Genre: Research Report > New Finding (0.46)

Industry: Government (0.55)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.70)

Add feedback

Information Guided Regularization for Fine-tuning Language Models

Sharma, Mandar, Muralidhar, Nikhil, Xu, Shengzhe, Yousuf, Raquib Bin, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceJun-21-2024

The pretraining-fine-tuning paradigm has been the de facto strategy for transfer learning in modern language modeling. With the understanding that task adaptation in LMs is often a function of parameters shared across tasks, we argue that a more surgical approach to regularization needs to exist for smoother transfer learning. Towards this end, we investigate how the pretraining loss landscape is affected by these task-sensitive parameters through an information-theoretic lens. We then leverage the findings from our investigations to devise a novel approach to dropout for improved model regularization and better downstream generalization. This approach, named guided dropout, is both task & architecture agnostic and adds no computational overhead to the fine-tuning process. Through empirical evaluations, we showcase that our approach to regularization yields consistently better performance, even in scenarios of data paucity, compared to standardized baselines.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2406.14005

Country: North America > Canada (0.28)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.89)

Add feedback

Are LLMs Naturally Good at Synthetic Tabular Data Generation?

Xu, Shengzhe, Lee, Cho-Ting, Sharma, Mandar, Yousuf, Raquib Bin, Muralidhar, Nikhil, Ramakrishnan, Naren

arXiv.org Artificial IntelligenceJun-21-2024

Large language models (LLMs) have demonstrated their prowess in generating synthetic text and images; however, their potential for generating tabular data -- arguably the most common data type in business and scientific applications -- is largely underexplored. This paper demonstrates that LLMs, used as-is, or after traditional fine-tuning, are severely inadequate as synthetic table generators. Due to the autoregressive nature of LLMs, fine-tuning with random order permutation runs counter to the importance of modeling functional dependencies, and renders LLMs unable to model conditional mixtures of distributions (key to capturing real world constraints). We showcase how LLMs can be made to overcome some of these deficiencies by making them permutation-aware.

functional dependency, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2406.14541

Country: North America > United States (1.00)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback