Jharkhand
Learning to Plan for Language Modeling from Unlabeled Data
Cornille, Nathan, Moens, Marie-Francine, Mai, Florian
By training to predict the next token in an unlabeled corpus, large language models learn to perform many tasks without any labeled data. However, their next-token-prediction objective arguably limits their performance in scenarios that require planning, such as writing a coherent article. In this paper, we train a module for planning the future writing process via a self-supervised learning objective. By conditioning on generated latent plans, our model extends the successful language model formula to more abstract planning in an unsupervised way. Empirically, we demonstrate that our method improves language modeling performance in general, particularly with respect to the text structure. Because our framework uses a planner module that is unsupervised and external to the language model, new planner modules can be trained at large scale and easily be shared with the community.
Crop Disease Classification using Support Vector Machines with Green Chromatic Coordinate (GCC) and Attention based feature extraction for IoT based Smart Agricultural Applications
Jha, Shashwat, Luhach, Vishvaditya, Gupta, Gauri Shanker, Singh, Beependra
Crops hold paramount significance as they serve as the primary provider of energy, nutrition, and medicinal benefits for the human population. Plant diseases, however, can negatively affect leaves during agricultural cultivation, resulting in significant losses in crop output and economic value. Therefore, it is crucial for farmers to identify crop diseases. However, this method frequently necessitates hard work, a lot of planning, and in-depth familiarity with plant pathogens. Given these numerous obstacles, it is essential to provide solutions that can easily interface with mobile and IoT devices so that our farmers can guarantee the best possible crop development. Various machine learning (ML) as well as deep learning (DL) algorithms have been created & studied for the identification of plant disease detection, yielding substantial and promising results. This article presents a novel classification method that builds on prior work by utilising attention-based feature extraction, RGB channel-based chromatic analysis, Support Vector Machines (SVM) for improved performance, and the ability to integrate with mobile applications and IoT devices after quantization of information. Several disease classification algorithms were compared with the suggested model, and it was discovered that, in terms of accuracy, Vision Transformer-based feature extraction and additional Green Chromatic Coordinate feature with SVM classification achieved an accuracy of (GCCViT-SVM) - 99.69%, whereas after quantization for IoT device integration achieved an accuracy of - 97.41% while almost reducing 4x in size. Our findings have profound implications because they have the potential to transform how farmers identify crop illnesses with precise and fast information, thereby preserving agricultural output and ensuring food security.
A study of the impact of generative AI-based data augmentation on software metadata classification
Kumari, Tripti, Charan, Chakali Sai, Das, Ayan
This paper presents the system submitted by the team from IIT(ISM) Dhanbad in FIRE IRSE 2023 shared task 1 on the automatic usefulness prediction of code-comment pairs as well as the impact of Large Language Model(LLM) generated data on original base data towards an associated source code. We have developed a framework where we train a machine learning-based model using the neural contextual representations of the comments and their corresponding codes to predict the usefulness of code-comments pair and performance analysis with LLM-generated data with base data. In the official assessment, our system achieves a 4% increase in F1-score from baseline and the quality of generated data.
Transformer Assisted Convolutional Network for Cell Instance Segmentation
Pandey, Deepanshu, Gupta, Pradyumna, Bhattacharya, Sumit, Sinha, Aman, Agarwal, Rohit
Region proposal based methods like R-CNN and Faster R-CNN models have proven to be extremely successful in object detection and segmentation tasks. Recently, Transformers have also gained popularity in the domain of Computer Vision, and are being utilised to improve the performance of conventional models. In this paper, we present a relatively new transformer based approach to enhance the performance of the conventional convolutional feature extractor in the existing region proposal based methods. Our approach merges the convolutional feature maps with transformer-based token embeddings by applying a projection operation similar to self-attention in transformers. The results of our experiments show that transformer assisted feature extractor achieves a significant improvement in mIoU (mean Intersection over Union) scores compared to vanilla convolutional backbone.
Sapio Analytics launches 'empathetic' healthcare chatbot
Sapio Smart Healthcare, a division of Indian government advisory firm Sapio Analytics, has developed a chatbot that assists patients from rural and remote areas in India. Based on a press statement, the AKS Sapio Med Bot was designed to understand the "local and personal" concerns of a patient, while assisting them in seeking treatment before getting medical consultations. The chatbot is named after the late Dr Ashok Kumar Srivastava, a surgeon in the tribal areas of Sahibganj and Pakur in the eastern state of Jharkhand, who had demonstrated the delivery of "empathetic healthcare". The company says the chatbot was created to emulate Dr Srivastava's care style following a large-scale evaluation of his mindset, behaviour and persona. Sapio Analytics seeks to build a comprehensive and accessible healthcare system in small towns and villages across India.
Ensuring Responsible Outcomes from Technology
We attempt to make two arguments in this essay. First, through a case study of a mobile phone based voice-media service we have been running in rural central India for more than six years, we describe several implementation complexities we had to navigate towards realizing our intended vision of bringing social development through technology. Most of these complexities arose in the interface of our technology with society, and we argue that even other technology providers can create similar processes to manage this socio-technological interface and ensure intended outcomes from their technology use. We then build our second argument about how to ensure that the organizations behind both market driven technologies and those technologies that are adopted by the state, pay due attention towards responsibly managing the socio-technological interface of their innovations. We advocate for the technology engineers and researchers who work within these organizations, to take up the responsibility and ensure that their labour leads to making the world a better place especially for the poor and marginalized. We outline possible governance structures that can give more voice to the technology developers to push their organizations towards ensuring that responsible outcomes emerge from their technology. We note that the examples we use to build our arguments are limited to contemporary information and communication technology (ICT) platforms used directly by end-users to share content with one another, and hence our argument may not generalize to other ICTs in a straightforward manner.
Govt aims to harness big data, AI in agriculture sector
The government and private companies alike are taking the first steps to deploy big data analytics, artificial intelligence (AI), and the Internet of Things (IoT) to gain insights into and offer solutions to problems in India's agriculture sector. To experiment with such technology, the NITI Aayog, the government's main think-tank, will start a pilot project on "precision agriculture" using AI in 10 districts to be selected from seven states: Assam, Bihar, Jharkhand, Madhya Pradesh, Maharashtra, Rajasthan, and Uttar Pradesh. This month, the NITI Aayog signed an agreement with software firm IBM to develop a model for crop-yield predictions using AI so that farmers can be provided real-time advisories in these states. While the project is aimed at improving yields through last-mile solutions, the private sector is also wagering money on so-called smart-agriculture systems. Companies such as CropIn and Robert Bosch Engineering and Business Solutions, say they are equipped to provide a range of technologies based on AI in areas such as pest surveillance, climate control, controlled irrigation, and warehouse management.