AITopics | mitra

Collaborating Authors

mitra

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Enhancing Tabular Foundation Models

Neural Information Processing SystemsJun-15-2026, 03:42:29 GMT

Since the seminal work of TabPFN [16], research on tabular foundation models (TFMs) based on in-context learning (ICL) has challenged long-standing paradigms in machine learning. Without seeing any real-world data, models pretrained on purely synthetic datasets generalize remarkably well across diverse datasets, often using only a moderate number of in-context examples. This shifts the focus in tabular machine learning from model architecture design to the design of synthetic datasets, or, more precisely, to the prior distributions that generate them. Yet the guiding principles for prior design remain poorly understood. This work marks the first attempt to address the gap. We systematically investigate and identify key properties of synthetic priors that allow pretrained TFMs to generalize well. Based on these insights, we introduce MITRA 1, a TFM trained on a curated mixture of synthetic priors selected for their diversity, distinctiveness, and performance on real-world tabular data. MITRA consistently outperforms state-of-the-art TFMs, such as TabPFNv2 [17] and TabICL [29], across both classification and regression benchmarks, with better sample efficiency.

data mining, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry: Information Technology (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.94)
Information Technology > Data Science > Data Mining (0.92)
(3 more...)

Add feedback

Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models

Neural Information Processing SystemsJun-10-2026, 16:49:24 GMT

Since the seminal work of TabPFN, research on tabular foundation models (TFMs) based on in-context learning (ICL) has challenged long-standing paradigms in machine learning. Without seeing any real-world data, models pretrained on purely synthetic datasets generalize remarkably well across diverse datasets, often using only a moderate number of in-context examples. This shifts the focus in tabular machine learning from model architecture design to the design of synthetic datasets, or, more precisely, to the prior distributions that generate them. Yet the guiding principles for prior design remain poorly understood. This work marks the first attempt to address the gap. We systematically investigate and identify key properties of synthetic priors that allow pretrained TFMs to generalize well. Based on these insights, we introduce Mitra, a TFM trained on a curated mixture of synthetic priors selected for their diversity, distinctiveness, and performance on real-world tabular data. Mitra consistently outperforms state-of-the-art TFMs, such as TabPFNv2 and TabICL, across both classification and regression benchmarks, with better sample efficiency.

artificial intelligence, machine learning, proceedings, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.63)

Add feedback

Bipartite Stochastic Block Models with Tiny Clusters

Stefan Neumann

Neural Information Processing SystemsFeb-14-2026, 02:26:37 GMT

Discovering clusters in bipartite graphs has been researched in many different settings. However, most of these algorithms were heuristics and do not provide theoretical guarantees for the quality oftheir results.

algorithm, artificial intelligence, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > Austria > Vienna (0.15)
North America > Canada (0.04)
Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Europe > Finland > Uusimaa > Helsinki (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Add feedback

Stemming -- The Evolution and Current State with a Focus on Bangla

Paul, Abhijit, Farin, Mashiat Amin, Abdullah, Sharif Md., Kabir, Ahmedul, Masud, Zarif, Rayana, Shebuti

arXiv.org Artificial IntelligenceAug-22-2025

Bangla, the seventh most widely spoken language worldwide with 300 million native speakers, faces digital under-representation due to limited resources and lack of annotated datasets. Stemming, a critical preprocessing step in language analysis, is essential for low-resource, highly-inflectional languages like Bangla, because it can reduce the complexity of algorithms and models by significantly reducing the number of words the algorithm needs to consider. This paper conducts a comprehensive survey of stemming approaches, emphasizing the importance of handling morphological variants effectively. While exploring the landscape of Bangla stemming, it becomes evident that there is a significant gap in the existing literature. The paper highlights the discontinuity from previous research and the scarcity of accessible implementations for replication. Furthermore, it critiques the evaluation methodologies, stressing the need for more relevant metrics. In the context of Bangla's rich morphology and diverse dialects, the paper acknowledges the challenges it poses. To address these challenges, the paper suggests directions for Bangla stemmer development. It concludes by advocating for robust Bangla stemmers and continued research in the field to enhance language analysis and processing.

artificial intelligence, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2508.15711

Country:

Asia (0.47)
North America > United States (0.28)

Genre:

Research Report (1.00)
Overview (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (0.68)
Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (0.52)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.47)

Add feedback

Civil Society in the Loop: Feedback-Driven Adaptation of (L)LM-Assisted Classification in an Open-Source Telegram Monitoring Tool

Pustet, Milena, Steffen, Elisabeth, Mihaljević, Helena, Stanjek, Grischa, Illies, Yannis

arXiv.org Artificial IntelligenceJul-10-2025

The role of civil society organizations (CSOs) in monitoring harmful online content is increasingly crucial, especially as platform providers reduce their investment in content moderation. AI tools can assist in detecting and monitoring harmful content at scale. However, few open-source tools offer seamless integration of AI models and social media monitoring infrastructures. Given their thematic expertise and contextual understanding of harmful content, CSOs should be active partners in co-developing technological tools, providing feedback, helping to improve models, and ensuring alignment with stakeholder needs and values, rather than as passive 'consumers'. However, collaborations between the open source community, academia, and civil society remain rare, and research on harmful content seldom translates into practical tools usable by civil society actors. This work in progress explores how CSOs can be meaningfully involved in an AI-assisted open-source monitoring tool of anti-democratic movements on Telegram, which we are currently developing in collaboration with CSO stakeholders.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2507.06734

Country: North America > Mexico > Mexico City (0.14)

Genre: Research Report (0.40)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.48)

Add feedback

Vertically rolling ball 'challenges our basic understanding of physics'

Popular ScienceJun-2-2025, 15:02:26 GMT

Breakthroughs, discoveries, and DIY tips sent every weekday. Gravity seems like a predictable, even mundane, aspect of existence. The physics dictating one of the universe's four fundamental forces is relatively straightforward to understand and calculate (most of the time, at least). Even so, the relationships between objects with mass and energy continues to surprise physical engineers. Take recent observations made by a team at the University of Waterloo, for example.

physics, vertical rolling, vertical surface, (6 more...)

Popular Science

Genre: Research Report (0.37)

Technology: Information Technology > Artificial Intelligence > Robots (0.33)

Add feedback

Why are 'driverless' cars still hitting things? Depends on how they 'see.'

Popular ScienceDec-3-2024, 15:20:41 GMT

Late last month, a Tesla owner shared shocking dashcam footage of his Model 3 appearing to collide with and drive through a deer at high speeds. The car, which the driver says was engaged in Tesla's driver-assist Full-Self Driving (FSD) mode, never detected the deer standing in the middle of the road and didn't hit the brakes or maneuver to avoid it. That case came just a few months after a vehicle from Waymo, a leading self-driving company, reportedly ran over and killed a pet dog in a collision the company says was "unavoidable." Neither driverless cars, according to reports detailing the incidents, spotted the animals on the road fast enough to avoid them. Video is cut right before sensitive things appear on screen.

artificial intelligence, rigg, vehicle, (18 more...)

Popular Science

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
North America > United States > New York (0.04)
North America > United States > Illinois (0.04)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

Clustering Mixtures of Discrete Distributions: A Note on Mitra's Algorithm

Seif, Mohamed, Chen, Yanxi

arXiv.org Machine LearningMay-29-2024

Clustering is a critical challenge in network science, pivotal for detecting underlying patterns and structures in unlabeled data. To explore the boundaries of this challenge, stochastic block models (SBMs) have been effectively utilized as a mathematical framework to assess the performance of clustering algorithms. Specifically, an SBM is a statistical model developed to reveal the structural dynamics of networks or graphs, where nodes represent individual entities and edges symbolize the connections between them. In a typical SBM, nodes are categorized into blocks or communities according to their connectivity patterns, with the probability of an edge existing between any two nodes depending on the blocks to which they belong [3]. For example, in a social network using an SBM, nodes might be organized by attributes such as age, gender, or geographic location, with friendship probabilities determined by their block memberships [1, 6]. The Bipartite Stochastic Block Model(B-SBM)[2] extends the conventional SBM to accommodate networks comprising two distinct node types, forming a bipartite graph structure. This adaptation is particularly beneficial in contexts such as recommendation systems, where nodes represent users and products, or in particular social networks, where nodes might denote individuals and the groups or events they participate in. In B-SBMs, the connections between nodes from different sets are governed by an "affinity matrix" that specifies the likelihood of linkage based on group affiliations. This matrix is integral to capturing interaction patterns within the network, allowing for a sophisticated estimation of model parameters from observed connections.

algorithm, high probability, probability, (15 more...)

arXiv.org Machine Learning

2405.19559

Genre: Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (0.49)

Add feedback

How to Guarantee the Safety of Autonomous Vehicles

WIREDFeb-4-2024, 13:00:00 GMT

The original version of this story appeared in Quanta Magazine. Driverless cars and planes are no longer the stuff of the future. In the city of San Francisco alone, two taxi companies have collectively logged 8 million miles of autonomous driving through August 2023. And more than 850,000 autonomous aerial vehicles, or drones, are registered in the United States--not counting those owned by the military. But there are legitimate concerns about safety.

autonomous vehicle, safety, vehicle, (8 more...)

WIRED

Country:

North America > United States > California > San Francisco County > San Francisco (0.26)
North America > United States > Massachusetts (0.06)
North America > United States > Illinois > Champaign County > Urbana (0.06)

Industry: Transportation > Ground > Road (1.00)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (1.00)

Add feedback

AI, 23 new forensic standards in new CA curriculum - Telugu Bullet

#artificialintelligenceMar-22-2022, 16:11:19 GMT

The Institute of Chartered Accountants of India (ICAI) will introduce Artificial Intelligence and forensic science in its curriculum for the Chartered Accountants to detect financial fraud at a much earlier stage. In most cases, the fraud is detected only when they reach a substantial volume. This new curriculum aims to track such irregularity at a much earlier stage so that the big scams either do not happen or are detected at the initial stages. This is the first time when the institute will bring such big technological changes in their international courses. President of ICAI, Debashish Mitra, said: "We are introducing artificial intelligence, data analytics and new forensic standards in the new curriculum. The mission of ICAI is to provide a strong foundation of knowledge, skill, and professional value that enables students to grow as wholesome professionals and adapt to change throughout their professional career."

chartered accountant, curriculum, new forensic standard, (8 more...)

#artificialintelligence

Country:

Asia > India > Tripura (0.06)
Asia > India > Nagaland (0.06)
Asia > India > Mizoram (0.06)
(5 more...)

Genre: Instructional Material > Course Syllabus & Notes (0.97)

Technology: Information Technology > Artificial Intelligence (0.81)

Add feedback