AITopics | Sharma, Abhinav

ASSERTIFY: Utilizing Large Language Models to Generate Assertions for Production Code

Torkamani, Mohammad Jalili, Sharma, Abhinav, Mehrotra, Nikita, Purandare, Rahul

arXiv.org Artificial Intelligence, Nov-25-2024

Production assertions are statements embedded in the code to help developers validate their assumptions about the code. They assist developers in debugging, provide valuable documentation, and enhance code comprehension. Current research in this area primarily focuses on assertion generation for unit tests using techniques such as static analysis and deep learning. While these techniques have shown promise, they fall short when it comes to generating production assertions, which serve a different purpose. This preprint addresses the gap by introducing Assertify, an automated end-to-end tool that leverages Large Language Models (LLMs) and prompt engineering with few-shot learning to generate production assertions. By creating context-rich prompts, the tool emulates the approach developers take when creating production assertions for their code. To evaluate our approach, we compiled a dataset of 2,810 methods by scraping 22 mature Java repositories from GitHub. Our experiments demonstrate the effectiveness of few-shot learning by producing assertions with an average ROUGE-L score of 0.526, indicating reasonably high structural similarity with the assertions written by developers. This research demonstrates the potential of LLMs in automating the generation of production assertions that resemble the original assertions.
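To make the ROUGE-L figure concrete, the following is a minimal sketch of an LCS-based ROUGE-L F-score between a generated assertion and a developer-written one. The tokenizer, the balanced F-measure, and the function names are assumptions chosen for illustration; the abstract does not state which ROUGE implementation or tokenization the authors used.

```python
import re


def tokenize(text):
    # Hypothetical tokenizer: identifiers and integers become single tokens,
    # every other non-space character is a token on its own.
    return re.findall(r"[A-Za-z_]\w*|\d+|\S", text)


def lcs_length(a, b):
    # Longest common subsequence length via row-by-row dynamic programming.
    prev = [0] * (len(b) + 1)
    for x in a:
        curr = [0]
        for j, y in enumerate(b, start=1):
            if x == y:
                curr.append(prev[j - 1] + 1)
            else:
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]


def rouge_l(candidate, reference):
    # Balanced F-measure over LCS precision and recall (an assumption;
    # other ROUGE-L variants weight recall more heavily).
    cand, ref = tokenize(candidate), tokenize(reference)
    if not cand or not ref:
        return 0.0
    lcs = lcs_length(cand, ref)
    if lcs == 0:
        return 0.0
    precision = lcs / len(cand)
    recall = lcs / len(ref)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    generated = "assert size > 0;"
    written = "assert size >= 0;"
    print(round(rouge_l(generated, written), 3))
```

A score near 1 means the generated assertion shares most of its token sequence with the reference; it measures structural overlap, not semantic equivalence.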

assertion, large language model, machine learning, (20 more...)

arXiv:2411.16927

Country:

Europe (1.00)
Asia (0.68)
North America > United States > Nebraska (0.28)
Oceania > Australia > Victoria (0.28)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.91)

A Lightweight Measure of Classification Difficulty from Application Dataset Characteristics

Cao, Bryan Bo, Sharma, Abhinav, O'Gorman, Lawrence, Coss, Michael, Jain, Shubham

arXiv.org Artificial Intelligence, Apr-8-2024

Despite accuracy and computation benchmarks being widely available to help choose among neural network models, these are usually trained on datasets with many classes, and do not give a precise idea of performance for applications with few (< 10) classes. The conventional procedure to predict performance is to train and test repeatedly on the different models and dataset variations of interest. However, this is computationally expensive. We propose an efficient classification difficulty measure that is calculated from the number of classes and intra- and inter-class similarity metrics of the dataset. After a single stage of training and testing per model family, relative performance for different datasets and models of the same family can be predicted by comparing difficulty measures, without further training and testing. We show how this measure can help a practitioner select a computationally efficient model for a small dataset 6 to 29x faster than through repeated training and testing. We give an example of use of the measure for an industrial application in which options are identified to select a model 42% smaller than the baseline YOLOv5-nano model, and if class merging from 3 to 2 classes meets requirements, 85% smaller.

application, artificial intelligence, machine learning, (18 more...)

arXiv:2404.05981

Country: North America > United States > New York (0.14)

Genre: Research Report (0.64)

Industry:

Transportation (0.48)
Automobiles & Trucks (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

ACROBAT -- a multi-stain breast cancer histological whole-slide-image data set from routine diagnostics for computational pathology

Weitz, Philippe, Valkonen, Masi, Solorzano, Leslie, Carr, Circe, Kartasalo, Kimmo, Boissin, Constance, Koivukoski, Sonja, Kuusela, Aino, Rasic, Dusan, Feng, Yanbo, Pouplier, Sandra Kristiane Sinius, Sharma, Abhinav, Eriksson, Kajsa Ledesma, Latonen, Leena, Laenkholm, Anne-Vibeke, Hartman, Johan, Ruusuvuori, Pekka, Rantalainen, Mattias

arXiv.org Artificial Intelligence, Nov-24-2022

The analysis of FFPE tissue sections stained with haematoxylin and eosin (H&E) or immunohistochemistry (IHC) is an essential part of the pathologic assessment of surgically resected breast cancer specimens. IHC staining has been broadly adopted into diagnostic guidelines and routine workflows to manually assess status and scoring of several established biomarkers, including ER, PGR, HER2 and KI67. However, this is a task that can also be facilitated by computational pathology image analysis methods. The research in computational pathology has recently made numerous substantial advances, often based on publicly available whole slide image (WSI) data sets. However, the field is still considerably limited by the sparsity of public data sets. In particular, there are no large, high quality publicly available data sets with WSIs of matching IHC and H&E-stained tissue sections. Here, we publish the currently largest publicly available data set of WSIs of tissue sections from surgical resection specimens from female primary breast cancer patients with matched WSIs of corresponding H&E and IHC-stained tissue, consisting of 4,212 WSIs from 1,153 patients. The primary purpose of the data set was to facilitate the ACROBAT WSI registration challenge, aiming at accurately aligning H&E and IHC images. For research in the area of image registration, automatic quantitative feedback on registration algorithm performance remains available through the ACROBAT challenge website, based on more than 37,000 manually annotated landmark pairs from 13 annotators. Beyond registration, this data set has the potential to enable many different avenues of computational pathology research, including stain-guided learning, virtual staining, unsupervised pre-training, artefact detection and stain-independent models.

artificial intelligence, machine learning, wsis, (15 more...)

arXiv:2211.13621

Country: Europe > Finland (0.71)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.85)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Searching k-Optimal Goals for an Orienteering Problem on a Specialized Graph with Budget Constraints

Sharma, Abhinav, Deshpande, Advait, Wang, Yanming, Xu, Xinyi, Madumal, Prashan, Hou, Anbin

arXiv.org Artificial Intelligence, Nov-2-2020

We propose a novel non-randomized anytime orienteering algorithm for finding k-optimal goals that maximize reward on a specialized graph with budget constraints. This specialized graph represents a real-world scenario which is analogous to an orienteering problem of finding k-most optimal goal states.

algorithm, artificial intelligence, constraint-based reasoning, (11 more...)

arXiv:2011.00781

Country: Oceania > Australia (0.15)

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (0.52)

Predicting Infectiousness for Proactive Contact Tracing

Bengio, Yoshua, Gupta, Prateek, Maharaj, Tegan, Rahaman, Nasim, Weiss, Martin, Deleu, Tristan, Muller, Eilif, Qu, Meng, Schmidt, Victor, St-Charles, Pierre-Luc, Alsdurf, Hannah, Bilanuik, Olexa, Buckeridge, David, Caron, Gáetan Marceau, Carrier, Pierre-Luc, Ghosn, Joumana, Ortiz-Gagne, Satya, Pal, Chris, Rish, Irina, Schölkopf, Bernhard, Sharma, Abhinav, Tang, Jian, Williams, Andrew

arXiv.org Artificial Intelligence, Oct-23-2020

The COVID-19 pandemic has spread rapidly worldwide, overwhelming manual contact tracing in many countries and resulting in widespread lockdowns for emergency containment. Various digital contact tracing (DCT) methods have been proposed, each making tradeoffs between privacy, mobility restrictions, and public health. The most common approach, binary contact tracing (BCT), models infection as a binary event, informed only by an individual's test results, with corresponding binary recommendations that either all or none of the individual's contacts quarantine. BCT ignores the inherent uncertainty in contacts and the infection process, which could be used to tailor messaging to high-risk individuals, and prompt proactive testing or earlier warnings. It also does not make use of observations such as symptoms or preexisting medical conditions, which could be used to make more accurate infectiousness predictions. In this paper, we use a recently proposed COVID-19 epidemiological simulator to develop and test methods that can be deployed to a smartphone to locally and proactively predict an individual's infectiousness (risk of infecting others) based on their contact history and other information, while respecting strong privacy constraints. Predictions are used to provide personalized recommendations to the individual via an app, as well as to send anonymized messages to the individual's contacts, who use this information to better predict their own infectiousness, an approach we call proactive contact tracing (PCT). Similarly to other works, we find that compared to no tracing, all DCT methods tested are able to reduce spread of the disease and thus save lives, even at low adoption rates, strongly supporting a role for DCT methods in managing the pandemic. Further, we find a deep-learning based PCT method which improves over BCT for equivalent average mobility, suggesting PCT could help in safe reopening and second-wave prevention.
Until pharmaceutical interventions such as a vaccine become available, control of the COVID-19 pandemic relies on nonpharmaceutical interventions such as lockdown and social distancing. While these have often been successful in limiting spread of the disease in the short term, these restrictive measures have important negative social, mental health, and economic impacts. Digital contact tracing, a technique to track the spread of the virus among individuals in a population using smartphones, is an attractive potential solution to help reduce growth in the number of cases and thereby allow more economic and social activities to resume while keeping the number of cases low.
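The contrast between binary and graded recommendations described above can be illustrated with a small sketch. This is not the authors' method: the abstract's PCT predictor is a deep-learning model, which is not reproduced here. The function names, the risk values, and the thresholds are all hypothetical and serve only to show the difference in output shape.

```python
def bct_recommendations(tested_positive, contacts):
    # Binary contact tracing as described in the abstract: driven only by the
    # individual's test result, so either every contact quarantines or none do.
    return {contact: tested_positive for contact in contacts}


def graded_recommendations(risk_by_contact, thresholds=(0.25, 0.5, 0.75)):
    # Hypothetical graded alternative: map a predicted infectiousness risk in
    # [0, 1] to a recommendation level 0..len(thresholds). The thresholds are
    # illustrative assumptions, not values from the paper.
    return {
        contact: sum(risk >= t for t in thresholds)
        for contact, risk in risk_by_contact.items()
    }


if __name__ == "__main__":
    contacts = ["alice", "bob"]
    print(bct_recommendations(True, contacts))
    print(graded_recommendations({"alice": 0.1, "bob": 0.6}))
```

In the binary case both contacts receive the same instruction, whereas the graded mapping can separate a low-risk contact from a higher-risk one.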

deep learning, immunology, infectiousness, (23 more...)

arXiv:2010.12536

Country:

North America > Canada > Quebec > Montreal (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)

Genre: Research Report > New Finding (0.67)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.93)
Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty (0.93)

COVI White Paper

Alsdurf, Hannah, Belliveau, Edmond, Bengio, Yoshua, Deleu, Tristan, Gupta, Prateek, Ippolito, Daphne, Janda, Richard, Jarvie, Max, Kolody, Tyler, Krastev, Sekoul, Maharaj, Tegan, Obryk, Robert, Pilat, Dan, Pisano, Valerie, Prud'homme, Benjamin, Qu, Meng, Rahaman, Nasim, Rish, Irina, Rousseau, Jean-Francois, Sharma, Abhinav, Struck, Brooke, Tang, Jian, Weiss, Martin, Yu, Yun William

arXiv.org Artificial Intelligence, Jul-27-2020

The SARS-CoV-2 (Covid-19) pandemic has caused significant strain on public health institutions around the world. Contact tracing is an essential tool to change the course of the Covid-19 pandemic. Manual contact tracing of Covid-19 cases has significant challenges that limit the ability of public health authorities to minimize community infections. Personalized peer-to-peer contact tracing through the use of mobile apps has the potential to shift the paradigm. Some countries have deployed centralized tracking systems, but more privacy-protecting decentralized systems offer much of the same benefit without concentrating data in the hands of a state authority or for-profit corporations. Machine learning methods can circumvent some of the limitations of standard digital tracing by incorporating many clues and their uncertainty into a more graded and precise estimation of infection risk. The estimated risk can provide early risk awareness, personalized recommendations and relevant information to the user. Finally, non-identifying risk data can inform epidemiological models trained jointly with the machine learning predictor. These models can provide statistical evidence for the importance of factors involved in disease transmission. They can also be used to monitor, evaluate and optimize health policy and (de)confinement scenarios according to medical and economic productivity indicators. However, such a strategy based on mobile apps and machine learning should proactively mitigate potential ethical and privacy risks, which could have substantial impacts on society (not only impacts on health but also impacts such as stigmatization and abuse of personal data). Here, we present an overview of the rationale, design, ethical considerations and privacy strategy of 'COVI', a Covid-19 public peer-to-peer contact tracing and risk awareness mobile application developed in Canada.

immunology, information, neural network, (23 more...)

arXiv:2005.08502

Country:

North America > United States (1.00)
Asia (1.00)
North America > Canada > Quebec (0.28)
Europe > United Kingdom > England (0.27)

Genre:

Research Report > Experimental Study (1.00)
Overview (0.85)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > Canada Government (0.45)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Communications > Mobile (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)
