Sarajevo
Privacy-Aware Recommender Systems Challenge on Twitter's Home Timeline
Belli, Luca, Ktena, Sofia Ira, Tejani, Alykhan, Lung-Yut-Fon, Alexandre, Portman, Frank, Zhu, Xiao, Xie, Yuanpu, Gupta, Akshay, Bronstein, Michael, Delić, Amra, Sottocornola, Gabriele, Anelli, Walter, Andrade, Nazareno, Smith, Jessie, Shi, Wenzhe
Recommender systems constitute the core engine of most social network platforms nowadays, aiming to maximize user satisfaction along with other key business objectives. Twitter is no exception. Despite the fact that Twitter data has been extensively used to understand socioeconomic and political phenomena and user behaviour, the implicit feedback provided by users on Tweets through their engagements on the Home Timeline has only been explored to a limited extent. At the same time, there is a lack of large-scale public social network datasets that would enable the scientific community to both benchmark and build more powerful and comprehensive models that tailor content to user interests. By releasing an original dataset of 160 million Tweets along with engagement information, Twitter aims to address exactly that. During this release, special attention is drawn on maintaining compliance with existing privacy laws. Apart from user privacy, this paper touches on the key challenges faced by researchers and professionals striving to predict user engagements. It further describes the key aspects of the RecSys 2020 Challenge that was organized by ACM RecSys in partnership with Twitter using this dataset.
Feature Selection with Evolving, Fast and Slow Using Two Parallel Genetic Algorithms
Cetin, Uzay, Gundogmus, Yunus Emre
Feature selection is one of the most challenging issues in machine learning, especially while working with high dimensional data. In this paper, we address the problem of feature selection and propose a new approach called Evolving Fast and Slow. This new approach is based on using two parallel genetic algorithms having high and low mutation rates, respectively. Evolving Fast and Slow requires a new parallel architecture combining an automatic system that evolves fast and an effortful system that evolves slow. With this architecture, exploration and exploitation can be done simultaneously and in unison. Evolving fast, with high mutation rate, can be useful to explore new unknown places in the search space with long jumps; and Evolving Slow, with low mutation rate, can be useful to exploit previously known places in the search space with short movements. Our experiments show that Evolving Fast and Slow achieves very good results in terms of both accuracy and feature elimination.
An adaptive data-driven approach to solve real-world vehicle routing problems in logistics
Zunic, Emir, Donko, Dzenana, Buza, Emir
Transportation occupies one-third of the amount in the logistics costs, and accordingly transportation systems largely influence the performance of the logistics system. This work presents an adaptive data-driven innovative modular approach for solving the real-world Vehicle Routing Problems (VRP) in the field of logistics. The work consists of two basic units: (i) an innovative multi-step algorithm for successful and entirely feasible solving of the VRP problems in logistics, (ii) an adaptive approach for adjusting and setting up parameters and constants of the proposed algorithm. The proposed algorithm combines several data transformation approaches, heuristics and Tabu search. Moreover, as the performance of the algorithm depends on the set of control parameters and constants, a predictive model that adaptively adjusts these parameters and constants according to historical data is proposed. A comparison of the acquired results has been made using the Decision Support System with predictive models: Generalized Linear Models (GLM) and Support Vector Machine (SVM). The algorithm, along with the control parameters, which using the prediction method were acquired, was incorporated into a web-based enterprise system, which is in use in several big distribution companies in Bosnia and Herzegovina. The results of the proposed algorithm were compared with a set of benchmark instances and validated over real benchmark instances as well. The successful feasibility of the given routes, in a real environment, is also presented.
A Gap Analysis of Low-Cost Outdoor Air Quality Sensor In-Field Calibration
Concas, Francesco, Mineraud, Julien, Lagerspetz, Eemil, Varjonen, Samu, Puolamäki, Kai, Nurmi, Petteri, Tarkoma, Sasu
In recent years, interest in monitoring air quality has been growing. Traditional environmental monitoring stations are very expensive, both to acquire and to maintain, therefore their deployment is generally very sparse. This is a problem when trying to generate air quality maps with a fine spatial resolution. Given the general interest in air quality monitoring, low-cost air quality sensors have become an active area of research and development. Low-cost air quality sensors can be deployed at a finer level of granularity than traditional monitoring stations. Furthermore, they can be portable and mobile. Low-cost air quality sensors, however, present some challenges: they suffer from cross-sensitivities between different ambient pollutants; they can be affected by external factors such as traffic, weather changes, and human behavior; and their accuracy degrades over time. Some promising machine learning approaches can help us obtain highly accurate measurements with low-cost air quality sensors. In this article, we present low-cost sensor technologies, and we survey and assess machine learning-based calibration techniques for their calibration. We conclude by presenting open questions and directions for future research.
News Daily: Facebook data row and NHS set for pay deal
An academic who created an app which harvested data from 50 million Facebook users says he has been made "a scapegoat" for Facebook and UK firm Cambridge Analytica. Dr Aleksandr Kogan completed work for Cambridge Analytica in 2014, but said he had no idea the data would be used to benefit Donald Trump's US presidential campaign. Facebook says Dr Kogan violated the site's policies. Last night, Alexander Nix, the chief executive of Cambridge Analytica, was suspended, having been secretly filmed by Channel 4 News appearing to suggest the company could use tactics to discredit politicians online. The company says the programme "grossly misrepresented" Mr Nix's conversation.
Enhancing Genetic Algorithms using Multi Mutations
Hassanat, Ahmad B. A., Alkafaween, Esra'a, Al-Nawaiseh, Nedal A., Abbadi, Mohammad A., Alkasassbeh, Mouhammd, Alhasanat, Mahmoud B.
Mutation is one of the most important stages of genetic algorithms because of its impact on the exploration of the search space, and in overcoming premature convergence. Since there are many types of mutations one common problem lies in selecting the appropriate type. The decision then becomes more difficult and needs more trial and error to find the best mutation to be used. This paper investigates the use of more than one mutation operator to enhance the performance of genetic algorithms. New mutation operators are proposed, in addition to two election strategies for the mutation operators. One is based on selecting the best mutation operator and the other randomly selects any operator. Several experiments were conducted on the Travelling Salesman Problem (TSP) to evaluate the proposed methods. These were compared to the well-known exchange mutation and rearrangement mutation. The results show the importance of some of the proposed methods, in addition to the significant enhancement of the genetic algorithms' performance, particularly when using more than one mutation operator.
The Modern Code Developer Challenge
The particle detectors at CERN are like cathedral-sized 3D digital cameras, capable of recording hundreds of millions of collision events per second. The detectors consist of multiple'layers' of detecting equipment, designed to recognise different types of charged particles produced by the collisions at the heart of the detector. As the charged particles fly outwards through the various layers of the detector, they leave traces, or'hits'. Tracking is the art of connecting the hits to recreate trajectories, thus helping researchers to understand more about and identify the particles. The algorithms used to reconstruct the collision events by identifying which dots belong to which charged particles can be very computationally expensive.
Are these entrepreneurs the next Jobs and Wozniak?
We apply the label of "genius" to everyone today, from our greatest achievers to exceptional children, and we even nickname our favorite musicians after it. But what, exactly, is genius? Is it someone who is crazy enough to tackle problems we are even afraid to acknowledge, or is it just someone who is better than us at living our dreams. We are the ones who marvel and wonder, longing for the salvation genius might bring. We are the ones who pay homage and obeisance.
Inside a virtual war: can video games recreate life in a conflict-ridden city?
In March 2014, a few months before the release of This War of Mine, the developers at 11 Bit Studios were discussing potential endings to their video game story of civilians trying to survive in a war-torn city. Wojciech Setlak, one of the writers, suggested they have a neighbouring country intervene, sending in troops to gain control of part of the weakened nation. A month later, in the real world, militia flying Russian flags – known to the locals as "little green men" – appeared in eastern Ukraine. "It was uncanny," says Setlak. "We had anticipated something that actually happened."