AITopics | Pune

Pune

On Importance of Layer Pruning for Smaller BERT Models and Low Resource Languages

Shirke, Mayur, Shembade, Amey, Wagh, Madhushri, Thorat, Pavan, Joshi, Raviraj

arXiv.org Artificial Intelligence, Jan-1-2025

This study explores the effectiveness of layer pruning for developing more efficient BERT models tailored to specific downstream tasks in low-resource languages. Our primary objective is to evaluate whether pruned BERT models can maintain high performance while reducing model size and complexity. We experiment with several BERT variants, including MahaBERT-v2 and Google-Muril, applying different pruning strategies and comparing their performance to smaller, scratch-trained models like MahaBERT-Small and MahaBERT-Smaller. We fine-tune these models on Marathi datasets, specifically Short Headlines Classification (SHC), Long Paragraph Classification (LPC) and Long Document Classification (LDC), to assess their classification accuracy. Our findings demonstrate that pruned models, despite having fewer layers, achieve comparable performance to their fully-layered counterparts while consistently outperforming scratch-trained models of similar size. Notably, pruning layers from the middle of the model proves to be the most effective strategy, offering performance competitive with pruning from the top and bottom. However, there is no clear winner, as different pruning strategies perform better in different model and dataset combinations. Additionally, monolingual BERT models outperform multilingual ones in these experiments. This approach, which reduces computational demands, provides a faster and more efficient alternative to training smaller models from scratch, making advanced NLP models more accessible for low-resource languages without compromising classification accuracy.
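The three pruning strategies named in the abstract (top, middle, bottom) can be illustrated with a small index-selection sketch. This is only an assumption-laden illustration, not the authors' code: the function name `layers_to_keep`, the 12-layer encoder, and the choice of a single contiguous centered block for "middle" pruning are all hypothetical, since the abstract does not state which layers were removed.

```python
def layers_to_keep(num_layers: int, num_prune: int, strategy: str) -> list[int]:
    """Return the indices of encoder layers retained after pruning.

    Hypothetical helper: assumes one contiguous block of `num_prune`
    layers is removed, and that layer 0 is the bottom (closest to the
    input embeddings) and layer num_layers - 1 is the top.
    """
    if not 0 <= num_prune < num_layers:
        raise ValueError("num_prune must be in [0, num_layers)")

    if strategy == "top":
        # Drop the last num_prune layers.
        start = num_layers - num_prune
    elif strategy == "bottom":
        # Drop the first num_prune layers.
        start = 0
    elif strategy == "middle":
        # Drop a block centered in the stack (assumed placement).
        start = (num_layers - num_prune) // 2
    else:
        raise ValueError(f"unknown strategy: {strategy!r}")

    pruned = set(range(start, start + num_prune))
    return [i for i in range(num_layers) if i not in pruned]


if __name__ == "__main__":
    for strategy in ("top", "middle", "bottom"):
        print(strategy, layers_to_keep(12, 6, strategy))
```

In practice the retained indices would be used to select layers from a pretrained encoder before fine-tuning on the downstream task; that step depends on the model library and is left out here.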

machine learning, natural language, pruning, (16 more...)

arXiv.org Artificial Intelligence

2501.00733

Country: Asia > India > Maharashtra > Pune (0.15)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages

Joshi, Ananya, Joshi, Raviraj

arXiv.org Artificial Intelligence, Oct-3-2023

In our increasingly interconnected digital world, social media platforms have emerged as powerful channels for the dissemination of hate speech and offensive content. This work delves into the domain of hate speech detection, placing specific emphasis on three low-resource Indian languages: Bengali, Assamese, and Gujarati. The challenge is framed as a text classification task, aimed at discerning whether a tweet contains offensive or non-offensive content. Leveraging the HASOC 2023 datasets, we fine-tuned pre-trained BERT and SBERT models to evaluate their effectiveness in identifying hate speech. Our findings underscore the superiority of monolingual sentence-BERT models, particularly in the Bengali language, where we achieved the highest ranking. However, the performance in Assamese and Gujarati languages signifies ongoing opportunities for enhancement. Our goal is to foster inclusive online spaces by countering hate speech proliferation.

artificial intelligence, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

2310.02249

Country:

Asia > Philippines > Luzon > National Capital Region > City of Manila (0.14)
Asia > India > Maharashtra > Pune (0.14)

Genre: Research Report > New Finding (0.54)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

L3Cube-IndicSBERT: A simple approach for learning cross-lingual sentence representations using multilingual BERT

Deode, Samruddhi, Gadre, Janhavi, Kajale, Aditi, Joshi, Ananya, Joshi, Raviraj

arXiv.org Artificial Intelligence, Apr-22-2023

Multilingual Sentence-BERT (SBERT) models map different languages to a common representation space and are useful for cross-language similarity and mining tasks. We propose a simple yet effective approach to convert vanilla multilingual BERT models into multilingual sentence BERT models using a synthetic corpus. We simply aggregate translated NLI or STS datasets of the low-resource target languages together and perform SBERT-like fine-tuning of the vanilla multilingual BERT model. We show that multilingual BERT models are inherent cross-lingual learners and that this simple baseline fine-tuning approach, without explicit cross-lingual training, yields exceptional cross-lingual properties. We show the efficacy of our approach on 10 major Indic languages and also show its applicability to the non-Indic languages German and French. Using this approach, we further present L3Cube-IndicSBERT, the first multilingual sentence representation model specifically for the Indian languages Hindi, Marathi, Kannada, Telugu, Malayalam, Tamil, Gujarati, Odia, Bengali, and Punjabi. IndicSBERT exhibits strong cross-lingual capabilities and performs significantly better than alternatives like LaBSE, LASER, and paraphrase-multilingual-mpnet-base-v2 on Indic cross-lingual and monolingual sentence similarity tasks. We also release monolingual SBERT models for each of the languages and show that IndicSBERT performs competitively with its monolingual counterparts. These models have been evaluated using embedding similarity scores and classification accuracy.
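The "embedding similarity scores" mentioned in the abstract are commonly computed as the cosine similarity between pooled sentence vectors. The sketch below assumes mean pooling over token vectors and uses made-up toy numbers; the helper names `mean_pool` and `cosine_similarity` are hypothetical, and the abstract does not confirm which pooling or similarity measure the authors used.

```python
import math


def mean_pool(token_vectors: list[list[float]]) -> list[float]:
    """Average token vectors into one sentence vector (assumed pooling)."""
    if not token_vectors:
        raise ValueError("need at least one token vector")
    dim = len(token_vectors[0])
    return [
        sum(vec[d] for vec in token_vectors) / len(token_vectors)
        for d in range(dim)
    ]


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Toy token vectors standing in for encoder outputs of two sentences,
    # e.g. a sentence and its translation; the values are illustrative only.
    sentence_a = mean_pool([[1.0, 2.0], [3.0, 4.0]])
    sentence_b = mean_pool([[2.0, 3.0], [2.0, 3.0]])
    print(cosine_similarity(sentence_a, sentence_b))
```

A cross-lingual model is doing well on this measure when a sentence and its translation score close to 1 while unrelated sentence pairs score noticeably lower.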

huggingface, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2304.11434

Country: Asia > India > Maharashtra > Pune (0.14)

Genre: Research Report > New Finding (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Data Analyst at IntegriChain - Pune, India

#artificialintelligence, Mar-17-2023, 15:08:15 GMT

IntegriChain is the data and application backbone for market access departments of Life Sciences manufacturers. We deliver the data, the applications, and the business process infrastructure for patient access and therapy commercialization. More than 250 manufacturers rely on our ICyte Platform to orchestrate their commercial and government payer contracting, patient services, and distribution channels. ICyte is the first and only platform that unites the financial, operational, and commercial data sets required to support therapy access in the era of specialty and precision medicine. With ICyte, Life Sciences innovators can digitalize their market access operations, freeing up resources to focus on more data-driven decision support.

artificial intelligence, colombia government, data mining, (10 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.43)

Genre: Press Release (0.38)

Technology:

Information Technology > Data Science > Data Mining > Big Data (0.40)
Information Technology > Artificial Intelligence (0.40)
Information Technology > Communications > Social Media (0.38)

Power BI Developer at Hitachi Solutions - Pune, India

#artificialintelligence, Mar-4-2023, 18:10:53 GMT

Our culture is defined by our values and our deep commitment to help our clients succeed. We are a division of the 38th largest company in the world and bring to bear the strength of a very large network of interconnected Hitachi companies. At the same time we remain absolutely committed to the nimble agility that helped us grow Hitachi Solutions from three founding partners to nearly 2,000 consultants, developers and support personnel all around the globe. Hitachi Solutions is a leader in providing industry solutions based on Microsoft Dynamics AX and Microsoft Dynamics CRM. Hitachi Solutions provides its customers with industry focus, software industry domain expertise, and proven tier-1 people.

artificial intelligence, asia government, japan government, (8 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.40)

Technology:

Information Technology > Software (0.40)
Information Technology > Artificial Intelligence (0.40)

Analytics Engineer at Netcentric - Pune, India

#artificialintelligence, Jan-22-2023, 00:15:34 GMT

At Netcentric, we come to work every day knowing we're part of the solution to the most complex challenges brands have ever faced: digital transformation. Consumer expectation of brands is increasing in a world that is more connected and fast-paced. Netcentric is a dynamic and innovative service provider with a unique culture. We empower our employees to use their creativity, looking beyond tools and technology to unlock the full potential of the Adobe Experience Cloud, so that we can deliver visionary digital marketing solutions for the world's most recognized brands. As part of the Cognizant Digital Business, we reap the benefits of combined expertise and access to multidisciplinary teams, forging ahead to become a leading customer experience player in Europe.

artificial intelligence, europe government, netcentric, (9 more...)

#artificialintelligence

Country:

Europe (0.98)
Asia > India > Maharashtra > Pune (0.40)

Genre: Instructional Material (0.34)

Technology: Information Technology > Artificial Intelligence (0.40)

ML Engineer - NLP at Avoma, Inc. - Pune, Maharashtra, India - Remote

#artificialintelligence, Jan-6-2023, 21:11:45 GMT

Avoma, Inc. is hiring a full-time ML Engineer - NLP (Pune, Maharashtra, India; remote), a mid-level AI/ML/Data Science role offering benefits such as career development, equity, and parental leave.

artificial intelligence, engineer, machine learning, (9 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.60)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.42)

Power BI developer (Immediate to 15 days Joiners) at CloudMoyo - Pune, India

#artificialintelligence, Dec-6-2022, 12:11:59 GMT

CloudMoyo is the partner of choice for solutions at the intersection of cloud and analytics. We help modern enterprises define their path to the Cloud and leverage the power of data driven insights. Headquartered in Bellevue, WA, with a presence in Overland Park, Kansas and an innovation center in Pune, India, CloudMoyo is set apart by the company's relentless focus on delighting customers, the strength of our smart technology accelerators, our strong business domain experience, and a deep pool of technical talent with experience in the Microsoft Cloud & Advanced Analytics.

artificial intelligence, cloudmoyo, india government, (7 more...)

#artificialintelligence

Country:

Asia > India > Maharashtra > Pune (0.66)
North America > United States > Washington > King County > Bellevue (0.30)
North America > United States > Kansas > Johnson County > Overland Park (0.30)

Technology:

Information Technology > Software (0.59)
Information Technology > Artificial Intelligence (0.40)

Data Science Courses – MKSSS AIT महर्षी कर्वे स्त्री शिक्षण संस्था Pune – Data Science Courses For Women In Pune India

#artificialintelligence, Nov-25-2022, 10:21:05 GMT

Big data, as the buzzword itself suggests, refers to data that is big and voluminous. Accommodating such data requires huge storage. In recent times, big data has reached almost every sector of the world. Even current market trends center on big data and analytics. Big data is like an ocean, with many areas to learn and earn from. Python is a general-purpose programming language that is open source, flexible, robust and simple.

artificial intelligence, machine learning, python, (7 more...)

#artificialintelligence

Country: Asia > India > Maharashtra > Pune (0.40)

Genre: Instructional Material > Course Syllabus & Notes (0.78)

Industry: Education > Curriculum > Subject-Specific Education (0.78)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.33)

Generative Adversarial Networks and Deep Learning: Theory and Applications: Raut, Roshani, D Pathak, Pranav, R Sakhare, Sachin, Patil, Sonali: 9781032068107: Amazon.com: Books

#artificialintelligence, Sep-16-2022, 16:00:44 GMT

Dr. Sachin R Sakhare is a Professor in the Department of Computer Engineering at Vishwakarma Institute of Information Technology, Pune, India. He has 26 years of experience in engineering education. He is recognised as a PhD guide by Savitribai Phule Pune University and is currently guiding 7 PhD scholars. He is a life member of CSI, ISTE and IAEngg. He has published 39 research communications in national and international journals and conferences, with around 248 citations and an H-index of 6.

denmark government, machine learning, technology education, (22 more...)

#artificialintelligence

Country:

Europe > Denmark (0.39)
Asia > India > Maharashtra > Pune (0.28)

Industry: Retail > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)
