AITopics

2212.14315

Genre: Research Report > New Finding (1.00)

Industry:

Information Technology > Services (1.00)
Information Technology > Security & Privacy (1.00)
Government > Military (0.94)
Law Enforcement & Public Safety (0.92)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)

Bhattacharyya, Rupam, Henderson, Nicholas, Baladandayuthapani, Veerabhadran

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Genomic Data

arXiv.org Machine Learning Dec-28-2022

Rapid advancements in the collection, processing, and dissemination of multi-platform molecular and genomics (multi-omics, in short) data have created enormous opportunities to aggregate such data in order to understand, prevent, and treat diseases. This has catalyzed the development of integrative methods that can collectively mine multiple types and scales of multi-omics data to provide a more holistic view of human disease evolution and progression (Subramanian et al. 2020). In cancer specifically, a disease driven predominantly by agglomerations of several molecular changes (Sun et al. 2021), synthesizing information from multi-platform omics and clinical sources is even more important for understanding the cellular basis of the disease. Cellular oncological mechanisms, triggered at different molecular levels of the DNA → RNA → Protein path, can confer profound phenotypic advantages or disadvantages. While multi-omics data integration methods for unveiling such mechanisms have improved significantly, with a focus on both prognosis (Duan et al. 2021) and treatment (Finotello et al. 2020), the precise functions governing these mechanisms need detailed, data-driven de novo evaluation. Our work, in the same vein, addresses two distinct but inter-related scientific axes: (i) selecting biomarkers associated with cancer prognosis and clinical outcomes, and (ii) learning the mechanism of these biomarkers' effects on such outcomes by integrating upstream molecular information. We provide some additional scientific context below.

Classes of Integrative Omics Models

First, we briefly discuss existing integrative omics approaches in order to contextualize the need for our framework. Broadly, most existing integrative statistical methods can be classified into two categories: horizontal (meta-analysis type) and vertical (multi-omics) integration procedures (Tseng et al. 2015).

artificial intelligence, data mining, machine learning, (18 more...)

2212.14165

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
Europe > Austria > Vienna (0.14)
Europe > Middle East > Malta (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine > Therapeutic Area > Obstetrics/Gynecology (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology > Carcinoma (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (1.00)
(2 more...)

Anahideh, Hadis, Nezami, Nazanin, Asudeh, Abolfazl

Finding Representative Group Fairness Metrics Using Correlation Estimations

arXiv.org Artificial Intelligence Dec-28-2022

It is of critical importance to be aware of the historical discrimination embedded in the data and to consider a fairness measure to reduce bias throughout the predictive modeling pipeline. Given various notions of fairness defined in the literature, investigating the correlation and interaction among metrics is vital for addressing unfairness. Practitioners and data scientists should be able to comprehend each metric and examine their impact on one another given the context, use case, and regulations. Exploring the combinatorial space of different metrics for such examination is burdensome. To alleviate the burden of selecting fairness notions for consideration, we propose a framework that estimates the correlation among fairness notions. Our framework consequently identifies a set of diverse and semantically distinct metrics as representative for a given context. We propose a Monte-Carlo sampling technique for computing the correlations between fairness metrics by indirect and efficient perturbation in the model space. Using the estimated correlations, we then find a subset of representative metrics. The paper proposes a generic method that can be generalized to any arbitrary set of fairness metrics. We showcase the validity of the proposal using comprehensive experiments on real-world benchmark datasets.
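The idea of estimating correlations between fairness metrics by sampling over models can be illustrated with a minimal sketch. This is not the authors' procedure: the synthetic data, the use of random per-group decision thresholds as a stand-in for perturbing the model space, and every function name below are assumptions made for illustration only. Two common group fairness metrics (demographic parity difference and equal opportunity difference) are evaluated on many perturbed models, and their Pearson correlation is computed across the samples.

```python
import random


def demographic_parity_diff(pred, group):
    """|P(pred=1 | group=0) - P(pred=1 | group=1)|."""
    rates = []
    for g in (0, 1):
        members = [p for p, gg in zip(pred, group) if gg == g]
        rates.append(sum(members) / len(members))
    return abs(rates[0] - rates[1])


def equal_opportunity_diff(pred, label, group):
    """|TPR(group=0) - TPR(group=1)|, computed on true positives only."""
    tprs = []
    for g in (0, 1):
        pos = [p for p, y, gg in zip(pred, label, group) if gg == g and y == 1]
        tprs.append(sum(pos) / len(pos))
    return abs(tprs[0] - tprs[1])


def pearson(xs, ys):
    """Plain Pearson correlation coefficient of two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5


def estimate_correlation(n_models=200, n_points=500, seed=0):
    """Monte Carlo estimate of the correlation between the two metrics."""
    rng = random.Random(seed)
    # Synthetic data (assumption): binary group, binary label, noisy score.
    group = [rng.randint(0, 1) for _ in range(n_points)]
    label = [rng.randint(0, 1) for _ in range(n_points)]
    score = [0.5 * y + 0.2 * g + rng.gauss(0, 0.3) for y, g in zip(label, group)]

    dp_values, eo_values = [], []
    for _ in range(n_models):
        # Each sampled "model" is a random pair of per-group thresholds.
        t0, t1 = rng.uniform(0.2, 0.8), rng.uniform(0.2, 0.8)
        pred = [int(s > (t1 if g else t0)) for s, g in zip(score, group)]
        dp_values.append(demographic_parity_diff(pred, group))
        eo_values.append(equal_opportunity_diff(pred, label, group))
    return pearson(dp_values, eo_values)


if __name__ == "__main__":
    print(f"estimated correlation: {estimate_correlation():.3f}")
```

With a full matrix of such pairwise correlations, a representative subset could then be chosen by grouping highly correlated metrics and keeping one per group; that selection step is not shown here.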

data mining, fairness metric, machine learning, (14 more...)

2109.05697

Country:

North America > United States > Illinois > Cook County > Chicago (0.04)
North America > United States > New York (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.67)

Industry:

Education > Educational Setting > Higher Education (0.67)
Banking & Finance (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Data Science > Data Mining (0.93)

Journal of Artificial Intelligence Research Dec-28-2022

Data-Driven Revision of Conditional Norms in Multi-Agent Systems

Dell'Anna, Davide (Utrecht University) | Alechina, Natasha | Dalpiaz, Fabiano | Dastani, Mehdi | Logan, Brian

In multi-agent systems, norm enforcement is a mechanism for steering the behavior of individual agents in order to achieve desired system-level objectives. Due to the dynamics of multi-agent systems, however, it is hard to design norms that guarantee the achievement of the objectives in every operating context. Moreover, these objectives may change over time, making previously defined norms ineffective. In this paper, we investigate the use of system execution data to automatically synthesise and revise conditional prohibitions with deadlines, a type of norm aimed at prohibiting agents from exhibiting certain patterns of behavior. We propose DDNR (Data-Driven Norm Revision), a data-driven approach to norm revision that synthesises revised norms with respect to a data set of traces describing the behavior of the agents in the system. We evaluate DDNR using a state-of-the-art, off-the-shelf urban traffic simulator. The results show that DDNR synthesises revised norms that are significantly more accurate than the original norms in distinguishing adequate and inadequate behaviors for the achievement of the system-level objectives.
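To make the notion of a conditional prohibition with a deadline concrete, here is a minimal sketch of checking one such norm against a behavior trace. The semantics assumed here (the norm becomes active when the condition holds, is violated if the prohibited state occurs while active, and is discharged when the deadline holds) are a simplified reading of the abstract, not the paper's formal definition. The `Prohibition` type, the `violates` function, and the traffic example are all hypothetical.

```python
from typing import Callable, Dict, List, NamedTuple

State = Dict[str, float]


class Prohibition(NamedTuple):
    """A conditional prohibition with a deadline (hypothetical encoding)."""
    condition: Callable[[State], bool]   # activates the norm
    prohibited: Callable[[State], bool]  # must not hold while the norm is active
    deadline: Callable[[State], bool]    # discharges the norm


def violates(trace: List[State], norm: Prohibition) -> bool:
    """Return True if the trace violates the norm at least once."""
    active = False
    for state in trace:
        if not active and norm.condition(state):
            active = True
        if active:
            # Assumption: the prohibited check takes priority over the deadline
            # when both hold in the same state.
            if norm.prohibited(state):
                return True
            if norm.deadline(state):
                active = False
    return False


# Hypothetical traffic norm: between km 10 and km 12, do not exceed 50 km/h.
speed_limit = Prohibition(
    condition=lambda s: 10 <= s["km"] < 12,
    prohibited=lambda s: s["speed"] > 50,
    deadline=lambda s: s["km"] >= 12,
)

compliant = [
    {"km": 9, "speed": 70},
    {"km": 10, "speed": 45},
    {"km": 11, "speed": 48},
    {"km": 12, "speed": 50},
    {"km": 13, "speed": 80},
]
speeding = [{"km": 10, "speed": 45}, {"km": 11, "speed": 60}]

if __name__ == "__main__":
    print(violates(compliant, speed_limit), violates(speeding, speed_limit))
```

A data-driven revision step would then compare such violation verdicts with whether each trace actually achieved the system-level objective, and adjust the condition, prohibited state, or deadline accordingly; that search is outside the scope of this sketch.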

accuracy, objective, revision, (16 more...)

doi: 10.1613/jair.1.13683

AI Access Foundation

13683

Country:

Europe > Netherlands > South Holland > Delft (0.04)
North America > United States > Oregon (0.04)
Europe > United Kingdom > Scotland > City of Aberdeen > Aberdeen (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report > New Finding (1.00)

Industry:

Transportation (0.93)
Law (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.68)

#artificialintelligence Dec-27-2022, 02:10:05 GMT

Twitter Artificial Intelligence

How does Twitter use artificial intelligence and machine learning? Twitter uses large-scale machine learning and AI for sentiment analysis, bot analysis and detection of fake accounts, image classification, and more. From Amazon to Instagram, Sephora, Microsoft, and Twitter, AI will shape the future of speech in America and beyond. The big question is not whether they use it, but how it is being used and what impact this will have on consumer privacy in the future. For the past fifteen years, I have been a national commentator on the politics of big tech and social media platforms. Social media content decisions have become highly political, and artificial intelligence has scaled up this process. But somewhere along the way, the public was left in the dark about how large a role machine learning plays in large-scale content operations in Silicon Valley. While the national conversation on free speech focuses on high-profile executives of tech companies and how content ...

misinformation, tweet, twitter, (14 more...)

Country:

North America > United States > California (0.24)
Asia > Russia (0.14)
North America > United States > New York > Westchester County (0.04)
(3 more...)

Genre: Personal > Interview (0.93)

Industry:

Media > News (1.00)
Leisure & Entertainment (1.00)
Law > Civil Rights & Constitutional Law (1.00)
(6 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.67)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.46)

Huertas-García, Álvaro, Martín, Alejandro, Tato, Javier Huertas, Camacho, David

Countering Malicious Content Moderation Evasion in Online Social Networks: Simulation and Detection of Word Camouflage

Content moderation is the process of screening and monitoring user-generated content online. It plays a crucial role in stopping content resulting from unacceptable behaviors such as hate speech, harassment, violence against specific groups, terrorism, racism, xenophobia, homophobia, or misogyny, to name a few, on Online Social Platforms. These platforms use a plethora of tools to detect and manage malicious information; however, malicious actors also improve their skills, developing strategies to bypass these barriers and continue spreading misleading information. Twisting and camouflaging keywords are among the most used techniques for evading platform content moderation systems. In response to this ongoing issue, this paper presents an innovative approach to this linguistic trend in social networks through the simulation of different content evasion techniques and a multilingual Transformer model for content evasion detection. We share with the scientific community a multilingual public tool, named "pyleetspeak", that generates and simulates, in a customizable way, the phenomenon of content evasion through automatic word camouflage, together with a multilingual Named-Entity Recognition (NER) Transformer-based model tuned for its recognition and detection. The multilingual NER model is evaluated in different textual scenarios, detecting different types and mixtures of camouflage techniques, and achieves an overall weighted F1 score of 0.8795. This article contributes significantly to countering malicious information by developing multilingual tools to simulate and detect new methods of content evasion on social networks, making the fight against information disorders more effective.
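As a rough illustration of what simulating word camouflage can look like, the sketch below applies a small character substitution table to a string. It does not use or reproduce the pyleetspeak API; the substitution table, the `camouflage` and `decamouflage` functions, and their parameters are assumptions chosen for the example.

```python
import random

# Hypothetical substitution table; real camouflage uses many more variants.
LEET_MAP = {"a": "4", "e": "3", "i": "1", "o": "0", "s": "5", "t": "7"}


def camouflage(text, prob=1.0, seed=None):
    """Replace mapped characters with look-alike digits with probability `prob`."""
    rng = random.Random(seed)
    out = []
    for ch in text:
        sub = LEET_MAP.get(ch.lower())
        if sub is not None and rng.random() < prob:
            out.append(sub)
        else:
            out.append(ch)
    return "".join(out)


def decamouflage(text):
    """Naive inverse mapping; it cannot tell genuine digits from camouflage."""
    reverse = {v: k for k, v in LEET_MAP.items()}
    return "".join(reverse.get(ch, ch) for ch in text)


if __name__ == "__main__":
    hidden = camouflage("hate speech")
    print(hidden)                # h473 5p33ch
    print(decamouflage(hidden))  # hate speech
```

The naive inverse also rewrites legitimate digits (for example, "7 days" becomes "t days"), which shows why simple rule-based reversal is fragile compared with a model that uses context.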

artificial intelligence, machine learning, natural language, (20 more...)

2212.14727

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Europe > Spain > Galicia > Madrid (0.05)
North America > United States > New York > New York County > New York City (0.04)
(8 more...)

Genre: Research Report > New Finding (0.93)

Industry:

Media > News (1.00)
Law Enforcement & Public Safety (1.00)
Law (1.00)
(3 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Pelofske, Elijah, Liebrock, Lorie M., Urias, Vincent

A Robust Cybersecurity Topic Classification Tool

Identifying cybersecurity discussions in open forums at scale is a topic of great interest for understanding and mitigating modern cyber threats [1-3]. The challenge is that these discussions are typically quite noisy (i.e., they contain community-known synonyms, acronyms, or slang), and it is difficult to obtain labelled data with which to train resilient NLP (natural language processing) topic classifiers. Additionally, a tool that detects cybersecurity discussions in internet text sources must be scalable and offer low error rates (in particular, both low false negative rates and low false positive rates). To address the challenge of finding relevant cybersecurity labelled data, we use a technique that gathers posts or articles from different internet sources that have user-defined topic labels. We then collect and label the training text as being cybersecurity-related or not based on the subset of labels that the text source offers.
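The labelling step described above, deriving a binary training label from the topic tags a source already attaches to each post, might be sketched as follows. The tag set, function names, and sample posts are hypothetical; the abstract does not specify which sources or labels the authors used.

```python
# Hypothetical set of source tags treated as cybersecurity-related.
CYBER_TAGS = frozenset({"security", "malware", "netsec", "cryptography"})


def label_post(tags, cyber_tags=CYBER_TAGS):
    """Return 1 if any user-defined tag is in the cybersecurity subset, else 0."""
    normalized = {t.strip().lower() for t in tags}
    return int(bool(normalized & cyber_tags))


def build_training_set(posts):
    """Turn (text, tags) pairs into (text, label) pairs for a topic classifier."""
    return [(text, label_post(tags)) for text, tags in posts]


if __name__ == "__main__":
    posts = [
        ("New ransomware strain spotted in the wild", ["Malware", "news"]),
        ("Best sourdough starter tips", ["cooking"]),
    ]
    print(build_training_set(posts))
```

Because the labels come from user-chosen tags rather than manual annotation, they are cheap to collect but can be noisy, so the choice of which tags count as cybersecurity-related deserves care.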

artificial intelligence, machine learning, natural language, (18 more...)

2109.02473

Country:

North America > United States > New Mexico > Socorro County > Socorro (0.04)
North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
Asia > Singapore > Central Region > Singapore (0.04)

Genre:

Research Report (0.82)
Overview (0.67)

Industry:

Information Technology > Security & Privacy (1.00)
Government > Military > Cyberwarfare (1.00)
Government > Regional Government > North America Government > United States Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)

Huang, Yongcan, Yang, Jidong J.

Semi-supervised multiscale dual-encoding method for faulty traffic data detection

Inspired by the recent success of deep learning in multiscale information encoding, we introduce a variational autoencoder (VAE) based semi-supervised method for detection of faulty traffic data, which is cast as a classification problem. Continuous wavelet transform (CWT) is applied to the time series of traffic volume data to obtain rich features embodied in time-frequency representation, followed by a twin of VAE models to separately encode normal data and faulty data. The resulting multiscale dual encodings are concatenated and fed to an attention-based classifier, consisting of a self-attention module and a multilayer perceptron. For comparison, the proposed architecture is evaluated against five different encoding schemes, including (1) VAE with only normal data encoding, (2) VAE with only faulty data encoding, (3) VAE with both normal and faulty data encodings, but without attention module in the classifier, (4) siamese encoding, and (5) cross-vision transformer (CViT) encoding. The first four encoding schemes adopted the same convolutional neural network (CNN) architecture while the fifth encoding scheme follows the transformer architecture of CViT. Our experiments show that the proposed architecture with the dual encoding scheme, coupled with attention module, outperforms other encoding schemes and results in classification accuracy of 96.4%, precision of 95.5%, and recall of 97.7%.

artificial intelligence, deep learning, machine learning, (17 more...)

doi: 10.3934/aci.2022006

2212.13596

Country: North America > United States > Georgia (0.28)

Genre: Research Report > New Finding (0.47)

Industry:

Government > Regional Government (0.47)
Energy > Power Industry (0.47)
Transportation > Ground > Road (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Confusion Matrices and Accuracy Statistics for Binary Classifiers Using Unlabeled Data: The Diagnostic Test Approach

Evans, Richard

Sometimes it is important to know the accuracy of a classifier on unlabeled data. The labels may be delayed, as in consumer purchasing predictions, or may be cost-prohibitive to obtain. The labels may not exist at all, as for some medical conditions for which the true gold standard diagnostic test (a 100% sensitive and 100% specific classifier) would require subjects to be euthanized and autopsied to obtain labels. Epidemiologists and biostatisticians have developed statistical methods for assessing the sensitivity (Se) and specificity (Sp) of diagnostic tests when gold standard comparison tests are unavailable. In data science terms, the diagnostic test assessment data are unlabeled. In this article, I describe how to modify those diagnostic test statistical methods to estimate confusion matrices and accuracy statistics for binary classifiers.
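Once sensitivity, specificity, and prevalence have been estimated, an expected confusion matrix follows by simple arithmetic. The sketch below shows only that final step and assumes the three estimates are already available; it does not implement the estimation methods the article describes, and the function names and numbers are illustrative.

```python
def expected_confusion_matrix(se, sp, prevalence, n):
    """Expected cell counts for n cases given Se, Sp, and prevalence."""
    pos = prevalence * n   # expected number of true positives-class cases
    neg = n - pos          # expected number of negative-class cases
    tp = se * pos
    fn = pos - tp
    tn = sp * neg
    fp = neg - tn
    return {"TP": tp, "FN": fn, "FP": fp, "TN": tn}


def accuracy_stats(cm):
    """Accuracy, precision (PPV), and negative predictive value from the cells."""
    total = sum(cm.values())
    return {
        "accuracy": (cm["TP"] + cm["TN"]) / total,
        "precision": cm["TP"] / (cm["TP"] + cm["FP"]),
        "npv": cm["TN"] / (cm["TN"] + cm["FN"]),
    }


if __name__ == "__main__":
    # Illustrative values only: Se = 0.9, Sp = 0.8, prevalence = 0.25, n = 1000.
    cm = expected_confusion_matrix(0.9, 0.8, 0.25, 1000)
    print(cm)
    print(accuracy_stats(cm))
```

For these illustrative inputs the expected counts are TP = 225, FN = 25, FP = 150, and TN = 600, giving an accuracy of 0.825 and a precision of 0.6. The accuracy equals Se × prevalence + Sp × (1 − prevalence), so it depends on prevalence as well as on the classifier.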

artificial intelligence, classifier, machine learning, (16 more...)

2208.12664

Genre: Research Report (0.40)

Industry: Health & Medicine > Health Care Technology (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.69)

#artificialintelligence Dec-26-2022, 09:50:29 GMT

12 Best Online Courses for Machine Learning with Python - 2023

Python is one of the most widely used programming languages in machine learning. It has many packages and libraries tailored to specific tasks, including pandas, NumPy, scikit-learn, Matplotlib, and SciPy. If you want to learn machine learning with Python, this article lists the 12 best online courses for doing so.

learning, machine learning, python, (14 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting > Online (1.00)

Technology:

Information Technology > Enterprise Applications > Human Resources > Learning Management (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
(2 more...)