Oceania
Classification with Fairness Constraints: A Meta-Algorithm with Provable Guarantees
Celis, L. Elisa, Huang, Lingxiao, Keswani, Vijay, Vishnoi, Nisheeth K.
Developing classification algorithms that are fair with respect to sensitive attributes of the data has become an important problem due to the growing deployment of classification algorithms in various social contexts. Several recent works have focused on fairness with respect to a specific metric, modeled the corresponding fair classification problem as a constrained optimization problem, and developed tailored algorithms to solve them. Despite this, there still remain important metrics for which we do not have fair classifiers and many of the aforementioned algorithms do not come with theoretical guarantees; perhaps because the resulting optimization problem is non-convex. The main contribution of this paper is a new meta-algorithm for classification that takes as input a large class of fairness constraints, with respect to multiple non-disjoint sensitive attributes, and which comes with provable guarantees. This is achieved by first developing a meta-algorithm for a large family of classification problems with convex constraints, and then showing that classification problems with general types of fairness constraints can be reduced to those in this family. We present empirical results that show that our algorithm can achieve near-perfect fairness with respect to various fairness metrics, and that the loss in accuracy due to the imposed fairness constraints is often small. Overall, this work unifies several prior works on fair classification, presents a practical algorithm with theoretical guarantees, and can handle fairness metrics that were previously not possible.
Non-Negative Networks Against Adversarial Attacks
Fleshman, William, Raff, Edward, Sylvester, Jared, Forsyth, Steven, McLean, Mark
Adversarial attacks against Neural Networks are a problem of considerable importance, for which effective defenses are not yet readily available. We make progress toward this problem by showing that non-negative weight constraints can be used to improve resistance in specific scenarios. In particular, we show that they can provide an effective defense for binary classification problems with asymmetric cost, such as malware or spam detection. We also show how non-negativity can be leveraged to reduce an attacker's ability to perform targeted misclassification attacks in other domains such as image processing.
IoT Is Building Higher Levels Of Customer Engagement
Bestselling author Shep Hypken--the "Chief Amazement Officer" at Shephard Presentations--makes a rock-solid case for why customer experience has advanced to the level of 21st-century table stakes: "New research proves that consumers are expecting, if not demanding, highly personalized experiences," Hypken writes in Forbes. "And the good news for those businesses that can deliver is that customers are typically willing to spend more when they receive such custom-tailored service." Computer vision also comes into play where smart retail stores are concerned. Enter the Internet of Things (IoT), which through interconnected devices and strong data analytics makes an entirely new level of customer surprise, delight and convenience possible. What's more, the IoT brings relevant experiences and information to consumers, whether to facilitate the operation of smart homes or to provide relevant health and wellness data that can be shared with medical professionals.
'Babysitter' robot iPal gives maths lessons, tells jokes and keeps China's lonely children company
Parents in China are handing over babysitting duties to robots. The £1,050 ($1,400) 'iPal' speaks two languages, gives maths lessons, tells jokes and interacts with children through a tablet screen in its chest. Engineers designed the device to act like a four to eight-year-old, becoming an extra child in the family that also helps'relieve the burden' felt by China's busy parents. The android offers education and company for lonely children and peace of mind for adults, who can remotely talk to and monitor their child through iPal's screen. A smartphone app directly links parents to the humanoid machine, allowing them to see and hear everything in iPal's vicinity.
Nvidia will open deep learning research lab in Toronto
Nvidia today announced plans to open an AI research facility in Toronto to further explore novel approaches to deep learning. The team will be led by deep learning and computer vision expert and University of Toronto assistant professor Sanja Fidler. The lab will operate out of Nvidia's Toronto office, which will double its current headcount of 50 employees in order to triple the number of AI and deep learning researchers working there by the end of the year, according to a Nvidia blog post published today. Though hiring is underway, Nvidia does not anticipate opening the research lab until August, a company spokesperson told VentureBeat in an email. A Nvidia robotics research lab in Seattle is also scheduled to open in the coming months, senior director Dieter Fox told VentureBeat in a recent interview.
Analytics, machine learning predict World Cup scores - ITWeb Africa
South African-based data scientists at Principa are at it again; this time using predictive analytics and machine learning to foretell the results of the 2018 Football World Cup. The 2018 FIFA World Cup kicks off tomorrow in Russia with the host nation taking on Saudi Arabia in Group A. Principa has already predicted the results for all the first games in the first round of matches. The company's data scientists use different algorithms to develop models that can predict the outcome of the matches. Principa notes that as the objective of machine learning is to develop models that can retrain themselves to adapt when exposed to new data, the algorithms will be re-trained with the results of each match to improve the accuracy of the following round's generated prediction. It points out that the purpose is to see how well different predictive analytics techniques used successfully in other areas can outperform the best human-made predictions.
ServeNet: A Deep Neural Network for Web Service Classification
Yang, Yilong, Liu, Peng, Ding, Lianchao, Shen, Bingqing, Wang, Weiru
Automated service classification plays a crucial role in service management such as service discovery, selection, and composition. In recent years, machine learning techniques have been used for service classification. However, they can only predict around 10 to 20 service categories due to the quality of feature engineering and the imbalance problem of service dataset. In this paper, we present a deep neural network ServeNet with a novel dataset splitting algorithm to deal with these issues. ServeNet can automatically abstract low-level representation to high-level features, and then predict service classification based on the service datasets produced by the proposed splitting algorithm. To demonstrate the effectiveness of our approach, we conducted a comprehensive experimental study on 10,000 real-world services in 50 categories. The result shows that ServeNet can achieve higher accuracy than other machine learning methods.
Efficient sampling for Gaussian linear regression with arbitrary priors
Hahn, P. Richard, He, Jingyu, Lopes, Hedibert
This paper develops a computationally efficient posterior sampling algorithm for Bayesian linear regression models with Gaussian errors. Our new approach is motivated by the fact that existing software implementations for Bayesian linear regression do not readily handle problems with large number of observations (hundreds of thousands) and predictors (thousands). Moreover, existing sampling algorithms for popular shrinkage priors are bespoke Gibbs samplers based on case-specific latent variable representations. By contrast, the new algorithm does not rely on case-specific auxiliary variable representations, which allows for rapid prototyping of novel shrinkage priors outside the conditionally Gaussian framework. Specifically, we propose a slice-within-Gibbs sampler based on the elliptical slice sampler of Murray et al. [2010].
Improved Density-Based Spatio--Textual Clustering on Social Media
Nguyen, Minh D., Shin, Won-Yong
DBSCAN may not be sufficient when the input data type is heterogeneous in terms of textual description. When we aim to discover clusters of geo-tagged records relevant to a particular point-of-interest (POI) on social media, examining only one type of input data (e.g., the tweets relevant to a POI) may draw an incomplete picture of clusters due to noisy regions. To overcome this problem, we introduce DBSTexC, a newly defined density-based clustering algorithm using spatio--textual information. We first characterize POI-relevant and POI-irrelevant tweets as the texts that include and do not include a POI name or its semantically coherent variations, respectively. By leveraging the proportion of POI-relevant and POI-irrelevant tweets, the proposed algorithm demonstrates much higher clustering performance than the DBSCAN case in terms of $\mathcal{F}_1$ score and its variants. While DBSTexC performs exactly as DBSCAN with the textually homogeneous inputs, it far outperforms DBSCAN with the textually heterogeneous inputs. Furthermore, to further improve the clustering quality by fully capturing the geographic distribution of tweets, we present fuzzy DBSTexC (F-DBSTexC), an extension of DBSTexC, which incorporates the notion of fuzzy clustering into the DBSTexC. We then demonstrate the robustness of F-DBSTexC via intensive experiments. The computational complexity of our algorithms is also analytically and numerically shown.