Collaborating Authors


Cybersecurity with Data Science: Implementation


Hands-on of Machine Learning in Cybersecurity Supervised and unsupervised machine learning models for cybersecurity Description Machine learning is disrupting cybersecurity to a greater extent than almost any other industry. Many problems in cyber security are well suited to the application of machine learning as they often involve some form of anomaly detection on very large volumes of data. This course deals the most found issues in cybersecurity such as malware, anomalies detection, SQL injection, credit card fraud, bots, spams and phishing. All these problems are covered in case studies.

Data science for cybersecurity: A probabilistic time series model for detecting RDP inbound brute force attacks - Microsoft Security


Our approach to time series anomaly detection is computationally efficient, automatically learns how to update probabilities and adapt to changes in data. As we describe in the next section, this approach has yielded successful attack detection at high precision. The proposed time series anomaly detection model was deployed and utilized by Microsoft Threat Experts to detect RDP brute force attacks during threat hunting activities. A list that ranks machines across enterprises with the lowest anomaly scores (indicating the likelihood of observing a value at least as large under expected conditions in all signals considered) is updated and reviewed every day. See Table 1 for an example.

An Anomaly Contribution Explainer for Cyber-Security Applications Machine Learning

--In this paper we introduce Anomaly Contribution Explainer or ACE, a tool to explain security anomaly detection models in terms of the model features through a regression framework, and its variant, ACE-KL, which highlights the important anomaly contributors. ACE and ACE-KL provide insights in diagnosing which attributes significantly contribute to an anomaly by building a specialized linear model to locally approximate the anomaly score that a black-box model generates. We conducted experiments with these anomaly detection models to detect security anomalies on both synthetic data and real data. In particular, we evaluate performance on three public data sets: CERT insider threat, netflow logs, and Android malware. The experimental results are encouraging: our methods consistently identify the correct contributing feature in the synthetic data where ground truth is available; similarly, for real data sets, our methods point a security analyst in the direction of the underlying causes of an anomaly, including in one case leading to the discovery of previously overlooked network scanning activity. We have made our source code publicly available. Cyber-security is a key concern for both private and public organizations, given the high cost of security compromises and attacks; malicious cyber-activity cost the U.S. economy between $57 billion and $109 billion in 2016 [1]. As a result, spending on security research and development, and security products and services to detect and combat cyber-attacks has been increasing [2]. Organizations produce large amounts of network, host and application data that can be used to gain insights into cyber-security threats, misconfigurations, and network operations. While security domain experts can manually sift through some amount of data to spot attacks and understand them, it is virtually impossible to do so at scale, considering that even a medium sized enterprise can produce terabytes of data in a few hours.

What Trends Are Shaping AI In Energy This Year? 9 Experts Share Their Insights - Disruptor Daily


What other trends are shaping the future of energy extraction, refinement, and consumption. These industry insiders provided their takes on the #1 trend shaping energy this year, and into the future. "Some areas where we see nascent AI is in predictive maintenance and asset monitoring. There are a few who are beginning to look at utilizing AI to analyze images from drones for surveillance and also for acoustic listening." "Advances in the'time-series' AI world (as opposed to AI for images or audio) are shaping the energy industry today. These include techniques for time series forecasting, anomaly detection, optimization etc. Specifically, probabilistic techniques and algorithms are showing significant improvements and becoming the driver of the next wave of optimization and value creation. These techniques augment today's unilateral AI predictions with additional information about the confidence in these predictions. This is not unlike the trend shaping the peer to peer transportation industry."

Anomaly Detection with Unsupervised AI in MixMode: Why Threat Intel Alone is Not Enough - MixMode


Historically, the MixMode platform has provided its users with a forensic hunting platform with intel-based Indicators and Security Events from public & proprietary sources. While these detections still have their place in the security ecosystem, the increase in state-sponsored attacks, insider threats and adversarial artificial intelligence means there are simply too many threats to your network to rely on solely intelligence-based detections or proactive hunting. Many of these threats are sophisticated enough to evade traditional threat detection or, in the case of zero-day threats, signature-based detection may not even be possible. In the face of this growing threat, the best defense is to supplement these traditional methods with anomaly detection, a term that is quickly becoming genericized as it is rapidly bandied about within the industry. Here we will discuss some of the opportunities and challenges that can arise with anomaly detection as well as MixMode's unique approach to the solution.

Detecting and preventing cyberattacks with anomaly detection and machine learning


The Gartner Security & Risk Management Summit is just a few days away, and I'm delighted to have the opportunity to chat with attendees about how anomaly detection and machine learning can help give your organization a more proactive security posture. You don't need to have been in the cybersecurity space for long to be bewildered by and unsure about vendor claims around artificial intelligence, machine learning, and analytics. At Interset (acquired by Micro Focus in February of this year), we have regular conversations with security professionals who struggle to understand which techniques and tools are effective in boosting breach defense in the real world. Ultimately, these conversations lead to an important question for us: How can you implement user and entity behavioral analytics (UEBA) in a way that will enable an efficient security operations center (SOC)? There are multiple factors that go into an effective UEBA implementation, but it's helpful to start with ensuring that the math and machine learning powering the solution are suitable for your security objectives.

Online Multivariate Anomaly Detection and Localization for High-dimensional Settings Machine Learning

This paper considers the real-time detection of anomalies in high-dimensional systems. The goal is to detect anomalies quickly and accurately so that the appropriate countermeasures could be taken in time, before the system possibly gets harmed. We propose a sequential and multivariate anomaly detection method that scales well to high-dimensional datasets. The proposed method follows a nonparametric, i.e., data-driven, and semi-supervised approach, i.e., trains only on nominal data. Thus, it is applicable to a wide range of applications and data types. Thanks to its multivariate nature, it can quickly and accurately detect challenging anomalies, such as changes in the correlation structure and stealth low-rate cyberattacks. Its asymptotic optimality and computational complexity are comprehensively analyzed. In conjunction with the detection method, an effective technique for localizing the anomalous data dimensions is also proposed. We further extend the proposed detection and localization methods to a supervised setup where an additional anomaly dataset is available, and combine the proposed semi-supervised and supervised algorithms to obtain an online learning algorithm under the semi-supervised framework. The practical use of proposed algorithms are demonstrated in DDoS attack mitigation, and their performances are evaluated using a real IoT-botnet dataset and simulations.

Cybersecurity Data Science: Minding the Growing Gap - DATAVERSITY


Click to learn more about author Scott Mongeau. Following cybersecurity Data Science best practices can help beleaguered and resource-strapped security teams transform Big Data into smart data for better anomaly detection and enterprise protection. The consequences of ignoring security challenges are rising. According to the Cisco 2018 Annual Cybersecurity Report, over half of cyberattacks resulted in damages of greater than $500K, with nearly 20 percent costing more than $2.5M. Meanwhile regulators, seeking to spur heightened oversight, have become more aggressive in levying fines and holding corporate boards accountable.

SafeRide tackles connected vehicle security with machine learning


As concerns over security risks for connected vehicles continue to build, automotive cybersecurity company SafeRide Technologies believes unsupervised machine learning will help keep threat actors out of the driver's seats. Earlier this month, SafeRide launched its vXRay technology for connected vehicles' security operations center (SOC), which uses unsupervised machine learning technology to provide behavioral profiling and anomaly detection to improve connected vehicle security. Gil Reiter, vice president of product management and marketing at SafeRide, based in Tel Aviv, Israel, said vXRay is available for OEMs and fleet managers to integrate in their vehicles' SOC. "The vXRay technology establishes the normal behavior of the vehicle without any dependencies or without any knowledge of the specific electronic control unit properties," Reiter said. "Once the behavioral baseline of the vehicle is established, the technology can accurately detect and then flag any abnormal behavior of the vehicle system and report the abnormal behavior to the connected vehicle's SOC for further analysis."

Cyber Anomaly Detection Using Graph-node Role-dynamics Machine Learning

Intrusion detection systems (IDSs) generate valuable knowledge about network security, but an abundance of false alarms and a lack of methods to capture the interdependence among alerts hampers their utility for network defense. Here, we explore a graph-based approach for fusing alerts generated by multiple IDSs (e.g., Snort, OSSEC, and Bro). Our approach generates a weighted graph of alert fields (not network topology) that makes explicit the connections between multiple alerts, IDS systems, and other cyber artifacts. We use this multi-modal graph to identify anomalous changes in the alert patterns of a network. To detect the anomalies, we apply the role-dynamics approach, which has successfully identified anomalies in social media, email, and IP communication graphs. In the cyber domain, each node (alert field) in the fused IDS alert graph is assigned a probability distribution across a small set of roles based on that node's features. A cyber attack should trigger IDS alerts and cause changes in the node features, but rather than track every feature for every alert-field node individually, roles provide a succinct, integrated summary of those feature changes. We measure changes in each node's probabilistic role assignment over time, and identify anomalies as deviations from expected roles. We test our approach using simulations including three weeks of normal background traffic, as well as cyber attacks that occur near the end of the simulations. This paper presents a novel approach to multi-modal data fusion and a novel application of role dynamics within the cyber-security domain. Our results show a drastic decrease in the false-positive rate when considering our anomaly indicator instead of the IDS alerts themselves, thereby reducing alarm fatigue and providing a promising avenue for threat intelligence in network defense.