AITopics | Mujumdar, Shashank

Collaborating Authors

Mujumdar, Shashank

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Identifying Semantically Difficult Samples to Improve Text Classification

Mujumdar, Shashank, Mehta, Stuti, Patel, Hima, Mitra, Suman

arXiv.org Artificial IntelligenceFeb-13-2023

In this paper, we investigate the effect of addressing difficult samples from a given text dataset on the downstream text classification task. We define difficult samples as being non-obvious cases for text classification by analysing them in the semantic embedding space; specifically - (i) semantically similar samples that belong to different classes and (ii) semantically dissimilar samples that belong to the same class. We propose a penalty function to measure the overall difficulty score of every sample in the dataset. We conduct exhaustive experiments on 13 standard datasets to show a consistent improvement of up to 9% and discuss qualitative results to show effectiveness of our approach in identifying difficult samples for a text classification model.

difficult sample, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2302.06155

Genre: Research Report (0.82)

Industry: Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Classification (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Towards a Multi-modal, Multi-task Learning based Pre-training Framework for Document Representation Learning

Pramanik, Subhojeet, Mujumdar, Shashank, Patel, Hima

arXiv.org Artificial IntelligenceSep-30-2020

In this paper, we propose a multi-task learning-based framework that utilizes a combination of self-supervised and supervised pre-training tasks to learn a generic document representation. We design the network architecture and the pre-training tasks to incorporate the multi-modal document information across text, layout, and image dimensions and allow the network to work with multi-page documents. We showcase the applicability of our pre-training framework on a variety of different real-world document tasks such as document classification, document information extraction, and document retrieval. We conduct exhaustive experiments to compare performance against different ablations of our framework and state-of-the-art baselines. We discuss the current limitations and next steps for our work.

artificial intelligence, dataset, neural network, (21 more...)

arXiv.org Artificial Intelligence

2009.14457

Country: Asia (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Semantic Understanding for Contextual In-Video Advertising

Madhok, Rishi (Delhi Technological University) | Mujumdar, Shashank (IBM Research, India) | Gupta, Nitin (IBM Research, India) | Mehta, Sameep (IBM Research, India)

AAAI ConferencesFeb-8-2018

With the increasing consumer base of online video content, it is important for advertisers to understand the video context when targeting video ads to consumers. To improve the consumer experience and quality of ads, key factors need to be considered such as (i) ad relevance to video content (ii) where and how video ads are placed, and (iii) non-intrusive user experience. We propose a framework to semantically understand the video content for better ad recommendation that ensure these criteria.

artificial intelligence, video, video content, (16 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Industry: Marketing (0.35)

Technology: Information Technology > Artificial Intelligence (1.00)

Add feedback