ghosh
Elon Musk just lost another lawsuit. Will he keep fighting?
Elon Musk just lost another lawsuit. Elon Musk, the world's richest man, has not been winning in court lately. His loss on Monday in his lawsuit against OpenAI and its co-founder Sam Altman is the latest in a string of legal defeats or settlements. Late last year he agreed to settle with former Twitter executives and thousands of former employees of the social platform, which he has renamed X, after fighting for years to pay them nothing. Then in March, he lost a case brought against him by investors of Twitter, who claimed they were misled by public statements he made during the takeover.
Boundaries, drops and missed run-out chances - Ghosh's remarkable innings
Boundaries, drops and missed run-out chances - Ghosh's remarkable innings This content is not available in your location. Richa Ghosh's 94 runs off 77 balls, including 15 boundaries, helps save India's innings as they recover from 102-6 to reach 251-8 against South Africa in their ICC Women's Cricket World Cup match. Boundaries, drops and missed run-out chances - Ghosh's remarkable innings. Video, 00:03:29 Boundaries, drops and missed run-out chances - Ghosh's remarkable innings'I was asking ChatGPT is this real?' - Fraser & Tulloch on making black history. Video, 00:04:27 'I was asking ChatGPT is this real?' - Fraser & Tulloch on making black history'We've got mountains to do' - Cavallo on homophobia in football.
Towards Physics-Guided Foundation Models
Farhadloo, Majid, Sharma, Arun, Yang, Mingzhou, Jayaprakash, Bharat, Northrop, William, Shekhar, Shashi
Traditional foundation models are pre-trained on broad datasets to reduce the training resources (e.g., time, energy, labeled samples) needed for fine-tuning a wide range of downstream tasks. However, traditional foundation models struggle with out-of-distribution prediction and can produce outputs that are unrealistic and physically infeasible. We propose the notation of physics-guided foundation models (PGFM), that is, foundation models integrated with broad or general domain (e.g., scientific) physical knowledge applicable to a wide range of downstream tasks.
Spatial Distribution-Shift Aware Knowledge-Guided Machine Learning
Sharma, Arun, Farhadloo, Majid, Yang, Mingzhou, Zeng, Ruolei, Ghosh, Subhankar, Shekhar, Shashi
Given inputs of diverse soil characteristics, and climate data gathered from various regions, we aimed to build a model to predict accurate land emissions. The problem is important since accurate quantification of the carbon cycle in agroecosystems is crucial for mitigating climate change and ensuring sustainable food production. Predicting accurate land emissions is challenging due to since calibrating heterogeneous nature of soil properties, moisture, and environmental conditions is hard at decision-relevant scales. Traditional approaches do not adequately estimate land emissions due to location-independent parameters failing to leverage the spatial heterogeneity and also require large datasets. To overcome these limitations, we proposed Spatial Distribution-Shift A ware Knowledge-Guided Machine Learning (SDSA-KGML) which leverage location-dependent parameters which accounts significant spatial heterogeneity in soil moisture from multiple sites within the same region. Experimental results demonstrate that SDSA-KGML models achieve higher local accuracy for the specified states in the Midwest Region.
CoPERLex: Content Planning with Event-based Representations for Legal Case Summarization
Santosh, T. Y. S. S., Farag, Youssef, Grabmair, Matthias
Legal professionals often struggle with lengthy judgments and require efficient summarization for quick comprehension. To address this challenge, we investigate the need for structured planning in legal case summarization, particularly through event-centric representations that reflect the narrative nature of legal case documents. We propose our framework, CoPERLex, which operates in three stages: first, it performs content selection to identify crucial information from the judgment; second, the selected content is utilized to generate intermediate plans through event-centric representations modeled as Subject-Verb-Object tuples; and finally, it generates coherent summaries based on both the content and the structured plan. Our experiments on four legal summarization datasets demonstrate the effectiveness of integrating content selection and planning components, highlighting the advantages of event-centric plans over traditional entity-centric approaches in the context of legal judgements.
RELexED: Retrieval-Enhanced Legal Summarization with Exemplar Diversity
Santosh, T. Y. S. S., Jia, Chen, Goroncy, Patrick, Grabmair, Matthias
This paper addresses the task of legal summarization, which involves distilling complex legal documents into concise, coherent summaries. Current approaches often struggle with content theme deviation and inconsistent writing styles due to their reliance solely on source documents. We propose RELexED, a retrieval-augmented framework that utilizes exemplar summaries along with the source document to guide the model. RELexED employs a two-stage exemplar selection strategy, leveraging a determinantal point process to balance the trade-off between similarity of exemplars to the query and diversity among exemplars, with scores computed via influence functions. Experimental results on two legal summarization datasets demonstrate that RELexED significantly outperforms models that do not utilize exemplars and those that rely solely on similarity-based exemplar selection.
HateGPT: Unleashing GPT-3.5 Turbo to Combat Hate Speech on X
Deroy, Aniket, Maity, Subhankar
The widespread use of social media platforms like Twitter and Facebook has enabled people of all ages to share their thoughts and experiences, leading to an immense accumulation of user-generated content. However, alongside the benefits, these platforms also face the challenge of managing hate speech and offensive content, which can undermine rational discourse and threaten democratic values. As a result, there is a growing need for automated methods to detect and mitigate such content, especially given the complexity of conversations that may require contextual analysis across multiple languages, including code-mixed languages like Hinglish, German-English, and Bangla. We participated in the English task where we have to classify English tweets into two categories namely Hate and Offensive and Non Hate-Offensive. In this work, we experiment with state-of-the-art large language models like GPT-3.5 Turbo via prompting to classify tweets into Hate and Offensive or Non Hate-Offensive. In this study, we evaluate the performance of a classification model using Macro-F1 scores across three distinct runs. The Macro-F1 score, which balances precision and recall across all classes, is used as the primary metric for model evaluation. The scores obtained are 0.756 for run 1, 0.751 for run 2, and 0.754 for run 3, indicating a high level of performance with minimal variance among the runs. The results suggest that the model consistently performs well in terms of precision and recall, with run 1 showing the highest performance.
CryptoLLM: Unleashing the Power of Prompted LLMs for SmartQnA and Classification of Crypto Posts
Deroy, Aniket, Maity, Subhankar
The rapid growth of social media has resulted in an large volume of user-generated content, particularly in niche domains such as cryptocurrency. This task focuses on developing robust classification models to accurately categorize cryptocurrency-related social media posts into predefined classes, including but not limited to objective, positive, negative, etc. Additionally, the task requires participants to identify the most relevant answers from a set of posts in response to specific questions. By leveraging advanced LLMs, this research aims to enhance the understanding and filtering of cryptocurrency discourse, thereby facilitating more informed decision-making in this volatile sector. We have used a prompt-based technique to solve the classification task for reddit posts and twitter posts. Also, we have used 64-shot technique along with prompts on GPT-4-Turbo model to determine whether a answer is relevant to a question or not.