AITopics

Industry: Marketing (0.31)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.77)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.77)

USATODAY - Tech Top StoriesJul-9-2020, 23:43:07 GMT

Trade groups offering $100,000 reward after noose found at Facebook data center

The FBI and Justice Department are assisting the Altoona Police Department's investigation after a noose was found last month at a work site on the Facebook Data Center property in Altoona, Iowa. Altoona police officials say they contacted the FBI on June 19, the day the noose was found. The date coincided with Juneteenth, the annual holiday celebrating the end of slavery. Interviews are still being conducted in the investigation, according to Altoona Police Department Public Information Officer Alyssa Wilson. While federal investigators were already involved with the incident, as of Thursday, all information in the case will be filtered through the FBI's Omaha office.

artificial intelligence, information, natural language, (9 more...)

USATODAY - Tech Top Stories

Country: North America > United States > Iowa (0.71)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Government > Regional Government > North America Government > United States Government (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.64)

Wang, Hongyi, Sreenivasan, Kartik, Rajput, Shashank, Vishwakarma, Harit, Agarwal, Saurabh, Sohn, Jy-yong, Lee, Kangwook, Papailiopoulos, Dimitris

Attack of the Tails: Yes, You Really Can Backdoor Federated Learning

arXiv.org Machine LearningJul-9-2020

Due to its decentralized nature, Federated Learning (FL) lends itself to adversarial attacks in the form of backdoors during training. The goal of a backdoor is to corrupt the performance of the trained model on specific sub-tasks (e.g., by classifying green cars as frogs). A range of FL backdoor attacks have been introduced in the literature, but also methods to defend against them, and it is currently an open question whether FL systems can be tailored to be robust against backdoors. In this work, we provide evidence to the contrary. We first establish that, in the general case, robustness to backdoors implies model robustness to adversarial examples, a major open problem in itself. Furthermore, detecting the presence of a backdoor in a FL model is unlikely assuming first order oracles or polynomial time. We couple our theoretical results with a new family of backdoor attacks, which we refer to as edge-case backdoors. An edge-case backdoor forces a model to misclassify on seemingly easy inputs that are however unlikely to be part of the training, or test data, i.e., they live on the tail of the input distribution. We explain how these edge-case backdoors can lead to unsavory failures and may have serious repercussions on fairness, and exhibit that with careful tuning at the side of the adversary, one can insert them across a range of machine learning tasks (e.g., image classification, OCR, text prediction, sentiment analysis).

arxiv preprint arxiv, machine learning, natural language, (18 more...)

2007.05084

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Virginia (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

#artificialintelligenceJul-5-2020, 14:56:45 GMT

Sentiment Analysis -- from Scratch to Production (Web API)

It is stated that data scientists spend almost 70% of their time on data cleaning. It is one of the most tedious tasks. The model's performance is directly proportional to how clean your data is. Here cleaning includes removing duplicate data, unnecessary elements, and handling missing data. We will perform a couple of standard cleaning techniques before we preprocess the text.

data quality, natural language, sentiment analysis, (6 more...)

Technology:

Information Technology > Data Science > Data Quality (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.40)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.40)

#artificialintelligenceJul-1-2020

What is emotion AI and why should you care? - KDnuggets

By Natalia Modjeska, MBA, PhD, helps organizations make sense of AI/ML. Recently I had the opportunity to attend the inaugural Emotion AI Conference, organized by Seth Grimes, a leading analyst and business consultant in the areas of natural language processing (NLP), text analytics, sentiment analysis, and their business applications. The conference was attended by about 70 people (including presenters and panelists) from industry and academia in the US, Canada, and Europe. Given the conference topic, what is emotion AI, why is it relevant, and what do you need to know about it? Read on to find out (warning: this is a long-ish article), but first, some background. We humans are highly emotional beings, and emotions impact everything we do, even if we are not, for the most part, aware of it.

artificial intelligence, emotion, natural language, (13 more...)

Country:

North America > Canada (0.24)
North America > United States > Illinois > Cook County > Chicago (0.04)
Europe > Poland > Lower Silesia Province > Wroclaw (0.04)

Industry:

Information Technology (0.88)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (0.47)
Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (0.47)
Health & Medicine > Therapeutic Area > Immunology (0.47)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.71)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.70)

#artificialintelligenceJun-29-2020, 18:45:21 GMT

Sentiment Analysis Of Political Speeches Using Hugging Face's Pipeline Feature

The Github repo for this post contains a notebook and the data needed to generate some of the charts in this post, as well as a sample of the Plotly chart and CSV table of the results. The code can be easily tweaked if you wish to generate results for multiple speeches in one go. The data comprises six official speech transcripts taken from the websites of the Singapore Government as well as the Prime Minister's Office. These speeches focused on the Government's plans to deal with the challenges from Covid-19, and are set to frame the broader debate for Singapore's upcoming election. Some excessively long chunks of text were broken up into smaller paragraphs for a fairer assessment of the sentiment, but the vast majority of the speeches were analysed in their original form.

artificial intelligence, natural language, speech, (6 more...)

Country: Asia > Singapore (1.00)

Industry: Government > Regional Government > Asia Government > Singapore Government (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.53)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.53)

Kalin, Josh, Noever, David, Dozier, Gerry

Systematic Attack Surface Reduction For Deployed Sentiment Analysis Models

arXiv.org Machine LearningJun-19-2020

This work proposes a structured approach to baselining a model, identifying attack vectors, and securing the machine learning models after deployment. This method for securing each model post deployment is called the BAD (Build, Attack, and Defend) Architecture. Two implementations of the BAD architecture are evaluated to quantify the adversarial life cycle for a black box Sentiment Analysis system. As a challenging diagnostic, the Jigsaw Toxic Bias dataset is selected as the baseline in our performance tool. Each implementation of the architecture will build a baseline performance report, attack a common weakness, and defend the incoming attack. As an important note: each attack surface demonstrated in this work is detectable and preventable. The goal is to demonstrate a viable methodology for securing a machine learning model in a production setting.

category, machine learning, natural language, (18 more...)

2006.1113

Country:

North America > United States > North Carolina (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Alabama > Madison County > Huntsville (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Information Technology > Security & Privacy (1.00)
Government (0.95)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.72)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.72)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

arXiv.org Machine LearningJun-18-2020

Twitter discussions and emotions about COVID-19 pandemic: a machine learning approach

Xue, Jia, Chen, Junxiang, Hu, Ran, Chen, Chen, Zheng, ChengDa, Liu, Xiaoqian, Zhu, Tingshao

The objective of the study is to examine coronavirus disease (COVID-19) related discussions, concerns, and sentiments that emerged from tweets posted by Twitter users. We analyze 4 million Twitter messages related to the COVID-19 pandemic using a list of 25 hashtags such as "coronavirus," "COVID-19," "quarantine" from March 1 to April 21 in 2020. We use a machine learning approach, Latent Dirichlet Allocation (LDA), to identify popular unigram, bigrams, salient topics and themes, and sentiments in the collected Tweets. Popular unigrams include "virus," "lockdown," and "quarantine." Popular bigrams include "COVID-19," "stay home," "corona virus," "social distancing," and "new cases." We identify 13 discussion topics and categorize them into five different themes, such as "public health measures to slow the spread of COVID-19," "social stigma associated with COVID-19," "coronavirus news cases and deaths," "COVID-19 in the United States," and "coronavirus cases in the rest of the world". Across all identified topics, the dominant sentiments for the spread of coronavirus are anticipation that measures that can be taken, followed by a mixed feeling of trust, anger, and fear for different topics. The public reveals a significant feeling of fear when they discuss the coronavirus new cases and deaths than other topics. The study shows that Twitter data and machine learning approaches can be leveraged for infodemiology study by studying the evolving public discussions and sentiments during the COVID-19. Real-time monitoring and assessment of the Twitter discussion and concerns can be promising for public health emergency responses and planning. Already emerged pandemic fear, stigma, and mental health concerns may continue to influence public trust when there occurs a second wave of COVID-19 or a new surge of the imminent pandemic.

artificial intelligence, machine learning, natural language, (21 more...)

2005.1283

Country:

North America > Canada > Ontario > Toronto (0.15)
North America > United States > New York (0.06)
Asia > China > Hubei Province > Wuhan (0.05)
(11 more...)

Genre: Research Report > New Finding (0.94)

Industry:

Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government > Regional Government > North America Government > United States Government (0.94)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.49)
(2 more...)

Ranjan, Sakshi, Mishra, Subhankar

Comparative Sentiment Analysis of App Reviews

arXiv.org Machine LearningJun-17-2020

Google app market captures the school of thought of users via ratings and text reviews. The critique's viewpoint regarding an app is proportional to their satisfaction level. Consequently, this helps other users to gain insights before downloading or purchasing the apps. The potential information from the reviews can't be extracted manually, due to its exponential growth. Sentiment analysis, by machine learning algorithms employing NLP, is used to explicitly uncover and interpret the emotions. This study aims to perform the sentiment classification of the app reviews and identify the university students' behavior towards the app market. We applied machine learning algorithms using the TF-IDF text representation scheme and the performance was evaluated on the ensemble learning method. Our model was trained on Google reviews and tested on students' reviews. SVM recorded the maximum accuracy(93.37\%), F-score(0.88) on tri-gram + TF-IDF scheme. Bagging enhanced the performance of LR and NB with accuracy of 87.80\% and 85.5\% respectively.

artificial intelligence, machine learning, natural language, (20 more...)

2006.09739

Country:

North America > United States > New York (0.04)
North America > United States > Massachusetts > Suffolk County > Boston (0.04)
Asia > Middle East > Israel (0.04)
Asia > India > Maharashtra > Mumbai (0.04)

Genre: Research Report > New Finding (0.47)

Industry: Education > Educational Setting > Higher Education (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (1.00)
(3 more...)

Agrawal, Anumeha, George, Rosa Anil, Ravi, Selvan Sunitha, S, Sowmya Kamath, M, Anand Kumar

Leveraging Multimodal Behavioral Analytics for Automated Job Interview Performance Assessment and Feedback

arXiv.org Machine LearningJun-16-2020

Behavioral cues play a significant part in human communication and cognitive perception. In most professional domains, employee recruitment policies are framed such that both professional skills and personality traits are adequately assessed. Hiring interviews are structured to evaluate expansively a potential employee's suitability for the position - their professional qualifications, interpersonal skills, ability to perform in critical and stressful situations, in the presence of time and resource constraints, etc. Therefore, candidates need to be aware of their positive and negative attributes and be mindful of behavioral cues that might have adverse effects on their success. We propose a multimodal analytical framework that analyzes the candidate in an interview scenario and provides feedback for predefined labels such as engagement, speaking rate, eye contact, etc. We perform a comprehensive analysis that includes the interviewee's facial expressions, speech, and prosodic information, using the video, audio, and text transcripts obtained from the recorded interview. We use these multimodal data sources to construct a composite representation, which is used for training machine learning classifiers to predict the class labels. Such analysis is then used to provide constructive feedback to the interviewee for their behavioral cues and body language. Experimental validation showed that the proposed methodology achieved promising results.

artificial intelligence, machine learning, natural language, (17 more...)

2006.07909

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > District of Columbia > Washington (0.04)
Asia > India > Karnataka (0.04)

Genre: Research Report (0.64)

Industry:

Health & Medicine (0.66)
Education > Educational Technology > Educational Software > Computer Based Training (0.40)
Education > Assessment & Standards (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
(2 more...)