
Collaborating Authors

 Bashir, Syed Raza


VLDBench: Vision Language Models Disinformation Detection Benchmark

arXiv.org Artificial Intelligence

The rapid rise of AI-generated content has made detecting disinformation increasingly challenging. In particular, multimodal disinformation, i.e., online posts and articles that combine images and text with fabricated information, is specifically designed to deceive. While existing AI safety benchmarks primarily address bias and toxicity, multimodal disinformation detection remains largely underexplored. To address this challenge, we present the Vision-Language Disinformation Detection Benchmark (VLDBench), the first comprehensive benchmark for detecting disinformation across both unimodal (text-only) and multimodal (text and image) content, comprising 31,000 news article-image pairs spanning 13 distinct categories for robust evaluation. VLDBench features a rigorous semi-automated data curation pipeline, with 22 domain experts dedicating more than 300 hours to annotation and achieving strong inter-annotator agreement (Cohen's kappa = 0.78). We extensively evaluate state-of-the-art Large Language Models (LLMs) and Vision-Language Models (VLMs), demonstrating that integrating textual and visual cues in multimodal news posts improves disinformation detection accuracy by 5-35% compared to unimodal models. Developed in alignment with AI governance frameworks such as the EU AI Act, NIST guidelines, and the MIT AI Risk Repository 2024, VLDBench is expected to become a benchmark for detecting disinformation in online multimodal content. Our code and data will be publicly available.
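The reported inter-annotator agreement can be computed with a standard kappa statistic. The snippet below is a minimal sketch using scikit-learn on toy labels; the label arrays are placeholders, not VLDBench's actual annotation data.

```python
# Minimal sketch: inter-annotator agreement (Cohen's kappa) between two
# annotators' disinformation labels. Toy labels only, not VLDBench data.
from sklearn.metrics import cohen_kappa_score

annotator_a = ["disinfo", "credible", "disinfo", "credible", "disinfo", "credible"]
annotator_b = ["disinfo", "credible", "credible", "credible", "disinfo", "credible"]

kappa = cohen_kappa_score(annotator_a, annotator_b)
print(f"Cohen's kappa: {kappa:.2f}")  # VLDBench reports kappa = 0.78 across its 22 annotators
```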


Progress in Privacy Protection: A Review of Privacy Preserving Techniques in Recommender Systems, Edge Computing, and Cloud Computing

arXiv.org Artificial Intelligence

The digital age is marked by an extraordinary growth in connected devices, leading to a massive influx of data through the Internet [12]. This data is primarily managed by cloud infrastructures. The proliferation of smart devices such as smartphones, tablets, smartwatches, and fitness trackers has transformed them into essential aspects of daily life [8]. These devices accumulate extensive contextual information about users, encompassing their location, activities, and environmental conditions [5]. This information is crucial for applications in predicting user behavior and providing personalized experiences. Mobile crowdsourcing has emerged as a significant phenomenon, where individuals collectively contribute data through various digital channels [32]. Applications in this domain, like traffic monitoring systems, utilize crowd-sourced data to offer real-time insights. However, the process often raises concerns about the privacy of individual contributors. The transparency in data usage and the potential risk of sensitive information being accessed by unauthorized entities are issues that need addressing [11, 26].
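One widely used family of privacy-preserving techniques covered in this line of work is differential privacy, where calibrated noise is added before data leaves the device. The sketch below illustrates the idea for a crowd-sourced traffic count; the epsilon value and count are illustrative placeholders, and the review does not prescribe this exact mechanism.

```python
# Minimal sketch: Laplace mechanism for epsilon-differentially private
# reporting of a crowd-sourced count (e.g. vehicles seen at a road segment).
# Parameters and values are illustrative only.
import numpy as np

def laplace_mechanism(true_value: float, sensitivity: float, epsilon: float) -> float:
    """Add Laplace noise with scale = sensitivity / epsilon."""
    scale = sensitivity / epsilon
    return true_value + np.random.laplace(loc=0.0, scale=scale)

true_count = 42  # toy value observed by a single device
noisy_count = laplace_mechanism(true_count, sensitivity=1.0, epsilon=0.5)
print(f"reported (noisy) count: {noisy_count:.1f}")
```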


NBIAS: A Natural Language Processing Framework for Bias Identification in Text

arXiv.org Artificial Intelligence

Bias in textual data can lead to skewed interpretations and outcomes when the data is used. These biases could perpetuate stereotypes, discrimination, or other forms of unfair treatment. An algorithm trained on biased data may end up making decisions that disproportionately impact a certain group of people. Therefore, it is crucial to detect and remove these biases to ensure the fair and ethical use of data. To this end, we develop a comprehensive and robust framework, NBIAS, that consists of four main layers: data, corpus construction, model development, and an evaluation layer. The dataset is constructed by collecting diverse data from various domains, including social media, healthcare, and job hiring portals. We apply a transformer-based token classification model that is able to identify biased words/phrases through a unique named entity, BIAS. In the evaluation procedure, we incorporate a blend of quantitative and qualitative measures to gauge the effectiveness of our models. We achieve accuracy improvements ranging from 1% to 8% compared to baselines and are also able to generate a robust understanding of how the model functions. The proposed approach is applicable to a variety of biases and contributes to the fair and ethical use of textual data.
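The abstract describes a transformer-based token classifier that tags bias spans as a named entity. A minimal sketch of applying such a model at inference time is shown below; the model identifier is a hypothetical placeholder, not a released NBIAS checkpoint.

```python
# Minimal sketch: tagging biased words/phrases as a BIAS entity with a
# transformer token-classification pipeline. "your-org/nbias-token-classifier"
# is a hypothetical placeholder model id, not an actual released checkpoint.
from transformers import pipeline

tagger = pipeline(
    "token-classification",
    model="your-org/nbias-token-classifier",  # hypothetical; substitute a fine-tuned model
    aggregation_strategy="simple",            # merge sub-word tokens into spans
)

text = "Job posting: looking for young, energetic recent graduates only."
for span in tagger(text):
    if span["entity_group"] == "BIAS":
        print(span["word"], round(span["score"], 3))
```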


Fairness in Machine Learning meets with Equity in Healthcare

arXiv.org Artificial Intelligence

With the growing utilization of machine learning in healthcare, there is increasing potential to enhance healthcare outcomes. However, this also brings the risk of perpetuating biases in data and model design that can harm certain demographic groups based on factors such as age, gender, and race. This study proposes an artificial intelligence framework, grounded in software engineering principles, for identifying and mitigating biases in data and models while ensuring fairness in healthcare settings. A case study is presented to demonstrate how systematic biases in data can lead to amplified biases in model predictions, and machine learning methods are suggested to prevent such biases. Future research aims to test and validate the proposed ML framework in real-world clinical settings to evaluate its impact on promoting health equity.
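A concrete starting point for the bias-identification step is comparing model outcomes across demographic groups. The sketch below computes a simple demographic-parity gap on synthetic data; it is illustrative only and is not the paper's framework.

```python
# Minimal sketch: demographic-parity gap (difference in positive prediction
# rates across a sensitive attribute). Data is synthetic and illustrative.
import numpy as np

rng = np.random.default_rng(0)
group = rng.choice(["A", "B"], size=1000)        # e.g. a demographic attribute
y_pred = np.where(group == "A",
                  rng.random(1000) < 0.60,       # group A: ~60% positive rate
                  rng.random(1000) < 0.45)       # group B: ~45% positive rate

rate_a = y_pred[group == "A"].mean()
rate_b = y_pred[group == "B"].mean()
print(f"positive rate A={rate_a:.2f}, B={rate_b:.2f}, parity gap={abs(rate_a - rate_b):.2f}")
```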


BERT4Loc: BERT for Location -- POI Recommender System

arXiv.org Artificial Intelligence

Recommending points of interest (POIs) is a challenging task that requires extracting comprehensive location data from location-based social media platforms. To provide effective location-based recommendations, it is important to analyze users' historical behavior and preferences. In this study, we present a sophisticated location-aware recommendation system that uses Bidirectional Encoder Representations from Transformers (BERT) to offer personalized location-based suggestions. Our model combines location information and user preferences to provide more relevant recommendations than models that simply predict the next POI in a sequence. Our experiments on two benchmark datasets show that our BERT-based model outperforms various state-of-the-art sequential models. Moreover, additional experiments demonstrate the effectiveness of the proposed model in terms of recommendation quality.
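The core idea of adapting BERT to POI sequences is a cloze-style objective: mask some check-ins in a user's history and train the model to recover them. The sketch below prepares masked training examples from a toy check-in sequence; it illustrates the masking step only, not the BERT4Loc implementation.

```python
# Minimal sketch: BERT-style masking of a POI check-in sequence (cloze task).
# POI ids and the masking ratio are toy values; data preparation only.
import random

MASK_TOKEN = "[MASK]"

def mask_sequence(pois, mask_prob=0.15, seed=0):
    """Return (masked_sequence, labels): labels hold the original POI at
    masked positions and None elsewhere."""
    rng = random.Random(seed)
    masked, labels = [], []
    for poi in pois:
        if rng.random() < mask_prob:
            masked.append(MASK_TOKEN)
            labels.append(poi)       # model must recover this POI
        else:
            masked.append(poi)
            labels.append(None)      # position not used in the loss
    return masked, labels

checkins = ["cafe_12", "gym_3", "office_7", "restaurant_9", "cinema_2", "home_1"]
print(mask_sequence(checkins, mask_prob=0.3))
```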


Leveraging Foundation Models for Clinical Text Analysis

arXiv.org Artificial Intelligence

Infectious diseases are a significant public health concern globally, and extracting relevant information from scientific literature can facilitate the development of effective prevention and treatment strategies. However, the large amount of clinical data available presents a challenge for information extraction. To address this challenge, this study proposes a natural language processing (NLP) framework that uses a pre-trained transformer model fine-tuned on task-specific data to extract key information related to infectious diseases from free-text clinical data. The proposed framework includes three components: a data layer for preparing datasets from clinical texts, a foundation model layer for entity extraction, and an assessment layer for performance analysis. The results of the evaluation indicate that the proposed method outperforms standard methods, and leveraging prior knowledge through the pre-trained transformer model makes it useful for investigating other infectious diseases in the future.
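The assessment layer amounts to comparing extracted entities against gold annotations. The sketch below computes entity-level precision, recall, and F1 on toy extractions; it illustrates that layer only and uses invented entity sets, not the paper's data.

```python
# Minimal sketch: assessment-layer scoring of extracted clinical entities
# against gold annotations (entity-level precision/recall/F1). Toy sets only.
def prf(gold: set, predicted: set):
    tp = len(gold & predicted)
    precision = tp / len(predicted) if predicted else 0.0
    recall = tp / len(gold) if gold else 0.0
    f1 = 2 * precision * recall / (precision + recall) if (precision + recall) else 0.0
    return precision, recall, f1

gold = {("tuberculosis", "DISEASE"), ("rifampicin", "DRUG"), ("fever", "SYMPTOM")}
pred = {("tuberculosis", "DISEASE"), ("fever", "SYMPTOM"), ("cough", "SYMPTOM")}
p, r, f = prf(gold, pred)
print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```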


Addressing Biases in the Texts using an End-to-End Pipeline Approach

arXiv.org Artificial Intelligence

The concept of fairness is gaining popularity in academia and industry. Social media is especially vulnerable to media biases and toxic language and comments. We propose a fair ML pipeline that takes a text as input and determines whether it contains biases and toxic content. Then, based on pretrained word embeddings, it suggests a set of new words to substitute for the biased ones; the idea is to lessen the effects of those biases by replacing them with alternative words. We compare our approach to existing fairness models to determine its effectiveness. The results show that our proposed pipeline can detect, identify, and mitigate biases in social media data.
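The substitution step can be pictured as a nearest-neighbour lookup in an embedding space: for each flagged word, propose the closest alternatives. The sketch below uses tiny hand-made vectors purely for illustration; the pipeline itself relies on pretrained word embeddings.

```python
# Minimal sketch: proposing substitutes for a flagged word by cosine
# similarity. Vectors are tiny hand-made toys, not pretrained embeddings.
import numpy as np

embeddings = {                      # toy 3-d "embeddings"
    "bossy":     np.array([0.9, 0.1, 0.2]),
    "assertive": np.array([0.8, 0.2, 0.3]),
    "confident": np.array([0.7, 0.3, 0.3]),
    "banana":    np.array([0.0, 0.9, 0.1]),
}

def cosine(a, b):
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

def suggest_substitutes(flagged_word, k=2):
    scores = {w: cosine(embeddings[flagged_word], v)
              for w, v in embeddings.items() if w != flagged_word}
    return sorted(scores, key=scores.get, reverse=True)[:k]

print(suggest_substitutes("bossy"))  # e.g. ['assertive', 'confident']
```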


Detecting Fake Points of Interest from Location Data

arXiv.org Artificial Intelligence

The pervasiveness of GPS-enabled mobile devices and the widespread use of location-based services have resulted in the generation of massive amounts of geo-tagged data. In recent times, data analysis has gained access to more sources, including reviews, news, and images, which also raises questions about the reliability of Point-of-Interest (POI) data sources. While previous research attempted to detect fake POI data through various security mechanisms, the current work attempts to capture fake POI data in a much simpler way. The proposed work focuses on supervised learning methods and their capability to find hidden patterns in location-based data. The ground-truth labels are obtained from real-world data, and the fake data is generated using an API, yielding a dataset with both real and fake labels on the location data. The objective is to predict the truth about a POI using a Multi-Layer Perceptron (MLP). In the proposed work, an MLP-based classification technique is used to classify location data accurately. The proposed method is compared with traditional classification methods as well as robust, recent deep neural methods. The results show that the proposed method is better than the baseline methods.
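A minimal version of the supervised setup can be sketched with scikit-learn's MLPClassifier on synthetic location features; the features and data below are placeholders, not the paper's real/fake POI dataset.

```python
# Minimal sketch: MLP classification of real vs. fake POIs on synthetic
# location features (e.g. coordinates, review count, check-in rate).
# Data, features, and the labeling rule are placeholders.
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(42)
n = 1000
X = rng.normal(size=(n, 4))                                   # placeholder POI features
y = (X[:, 2] + 0.5 * X[:, 3] + rng.normal(scale=0.5, size=n) > 0).astype(int)  # 1 = real, 0 = fake (synthetic rule)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
clf = MLPClassifier(hidden_layer_sizes=(32, 16), max_iter=500, random_state=0)
clf.fit(X_train, y_train)
print("test accuracy:", round(accuracy_score(y_test, clf.predict(X_test)), 3))
```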