Goto

Collaborating Authors

 hai


96f2d6069db8ad895c34e2285d25c0ed-Supplemental.pdf

Neural Information Processing Systems

Smooth convex optimization problems over polytopes are an important class of problems that appear in many settings, such as low-rank matrix completion [1],structured supervised learning [2,3],electrical flowsovergraphs [4],video co-localization in computer vision [5], traffic assignment problems [6], and submodular function minimization [7].


Lost without translation -- Can transformer (language models) understand mood states?

Shivaprakash, Prakrithi, Mukherjee, Diptadhi, Shukla, Lekhansh, Mukherjee, Animesh, Chand, Prabhat, Murthy, Pratima

arXiv.org Artificial Intelligence

Background: Large Language Models show promise in psychiatry but are English-centric. Their ability to understand mood states in other languages is unclear, as different languages have their own idioms of distress. Aim: To quantify the ability of language models to faithfully represent phrases (idioms of distress) of four distinct mood states (depression, euthymia, euphoric mania, dysphoric mania) expressed in Indian languages. Methods: We collected 247 unique phrases for the four mood states across 11 Indic languages. We tested seven experimental conditions, comparing k-means clustering performance on: (a) direct embeddings of native and Romanised scripts (using multilingual and Indic-specific models) and (b) embeddings of phrases translated to English and Chinese. Performance was measured using a composite score based on Adjusted Rand Index, Normalised Mutual Information, Homogeneity and Completeness. Results: Direct embedding of Indic languages failed to cluster mood states (Composite Score = 0.002). All translation-based approaches showed significant improvement. High performance was achieved using Gemini-translated English (Composite=0.60) and human-translated English (Composite=0.61) embedded with gemini-001. Surprisingly, human-translated English, further translated into Chinese and embedded with a Chinese model, performed best (Composite = 0.67). Specialised Indic models (IndicBERT and Sarvam-M) performed poorly. Conclusion: Current models cannot meaningfully represent mood states directly from Indic languages, posing a fundamental barrier to their psychiatric application for diagnostic or therapeutic purposes in India. While high-quality translation bridges this gap, reliance on proprietary models or complex translation pipelines is unsustainable. Models must first be built to understand diverse local languages to be effective in global mental health.


ERUPD -- English to Roman Urdu Parallel Dataset

Furqan, Mohammed, Khaja, Raahid Bin, Habeeb, Rayyan

arXiv.org Artificial Intelligence

Bridging linguistic gaps fosters global growth and cultural exchange. This study addresses the challenges of Roman Urdu -- a Latin-script adaptation of Urdu widely used in digital communication -- by creating a novel parallel dataset comprising 75,146 sentence pairs. Roman Urdu's lack of standardization, phonetic variability, and code-switching with English complicates language processing. We tackled this by employing a hybrid approach that combines synthetic data generated via advanced prompt engineering with real-world conversational data from personal messaging groups. We further refined the dataset through a human evaluation phase, addressing linguistic inconsistencies and ensuring accuracy in code-switching, phonetic representations, and synonym variability. The resulting dataset captures Roman Urdu's diverse linguistic features and serves as a critical resource for machine translation, sentiment analysis, and multilingual education.


Reinforcement Learning for High-Level Strategic Control in Tower Defense Games

Bergdahl, Joakim, Sestini, Alessandro, Gisslén, Linus

arXiv.org Artificial Intelligence

In strategy games, one of the most important aspects of game design is maintaining a sense of challenge for players. Many mobile titles feature quick gameplay loops that allow players to progress steadily, requiring an abundance of levels and puzzles to prevent them from reaching the end too quickly. As with any content creation, testing and validation are essential to ensure engaging gameplay mechanics, enjoyable game assets, and playable levels. In this paper, we propose an automated approach that can be leveraged for gameplay testing and validation that combines traditional scripted methods with reinforcement learning, reaping the benefits of both approaches while adapting to new situations similarly to how a human player would. We test our solution on a popular tower defense game, Plants vs. Zombies. The results show that combining a learned approach, such as reinforcement learning, with a scripted AI produces a higher-performing and more robust agent than using only heuristic AI, achieving a 57.12% success rate compared to 47.95% in a set of 40 levels. Moreover, the results demonstrate the difficulty of training a general agent for this type of puzzle-like game.


IPA Transcription of Bengali Texts

Fatema, Kanij, Haider, Fazle Dawood, Turpa, Nirzona Ferdousi, Azmal, Tanveer, Ahmed, Sourav, Hasan, Navid, Rahman, Mohammad Akhlaqur, Sarkar, Biplab Kumar, Jahin, Afrar, Hassan, Md. Rezuwan, Zihad, Md Foriduzzaman, Faruque, Rubayet Sabbir, Sushmit, Asif, Imtiaz, Mashrur, Sadeque, Farig, Rahman, Syed Shahrier

arXiv.org Artificial Intelligence

The International Phonetic Alphabet (IPA) serves to systematize phonemes in language, enabling precise textual representation of pronunciation. In Bengali phonology and phonetics, ongoing scholarly deliberations persist concerning the IPA standard and core Bengali phonemes. This work examines prior research, identifies current and potential issues, and suggests a framework for a Bengali IPA standard, facilitating linguistic analysis and NLP resource creation and downstream technology development. In this work, we present a comprehensive study of Bengali IPA transcription and introduce a novel IPA transcription framework incorporating a novel dataset with DL-based benchmarks.


AI Index Report, HAI released the Artificial Intelligence report

#artificialintelligence

The annual report keeps track, collects e displays AI-related data, to support meaningful decisions, and advance AI responsibly and ethically. The AI Index Report supports many different organizations to track progress in artificial intelligence. These organizations include: the Center for Security and Emerging Technology at Georgetown University, LinkedIn, NetBase Quid, Lightcast, and McKinsey. The AI Index Report also expanded its tracking of global AI legislation from 25 countries in 2022 to 127 in 2023. The demand for AI-related job skills is increasing in virtually all industries (in the US).


E Academy: Artificial Intelligence (AI) Benefits & Disadvantages in Hindi

#artificialintelligence

AI-powered healthcare: AI can be used to analyze medical data and assist doctors in making diagnoses and treatment plans. Smart homes and cities: AI can be used to automate household tasks and manage energy usage in smart homes, and in cities, it can be used to manage traffic flow, optimize public transportation, and enhance public safety. Autonomous vehicles: AI can power self-driving cars, trucks, and other vehicles, which can reduce traffic accidents, increase efficiency, and save time. Virtual assistants: AI-powered virtual assistants like Siri, Alexa, and Google Assistant are already widely used, but they are expected to become even more sophisticated and personalized in the future. Improved education: AI can be used to provide personalized education to students, track their progress, and identify areas where they need additional help. However, along with these advancements, AI also presents some challenges that need to be addressed.


Fellowship Programs

Stanford HAI

HAI Fellowship Programs offer opportunities to explore topics, conduct research, and collaborate across disciplines related to AI technologies, applications, or impact. The Institute for Human-Centered Artificial Intelligence (HAI) offers a 2-quarter program for Stanford Graduate Students. The goal of this program is to encourage interdisciplinary research conversations, facilitate new collaborations, and grow the HAI community of graduate scholars who are working in the area of AI, broadly defined. HAI is seeking graduate students to participate in this program. We would like to ensure the cohort is well-rounded across disciplines.


What Stanford's recent AI conference reveals about the state of AI accountability

#artificialintelligence

We are excited to bring Transform 2022 back in-person July 19 and virtually July 20 - 28. Join AI and data leaders for insightful talks and exciting networking opportunities. As AI adoption continues to ramp up exponentially, so is the discussion around -- and concern for -- accountable AI. While tech leaders and field researchers understand the importance of developing AI that is ethical, safe and inclusive, they still grapple with issues around regulatory frameworks and concepts of "ethics washing" or "ethics shirking" that diminish accountability. Perhaps most importantly, the concept is not yet clearly defined. While many sets of suggested guidelines and tools exist -- from the U.S. National Institute of Standards and Technology's Artificial Intelligence Risk Management Framework to the European Commission's Expert Group on AI, for example -- they are not cohesive and are very often vague and overly complex.