Demystifying the Performance of Data Transfers in High-Performance Research Networks
Saeedizade, Ehsan, Zhang, Bing, Arslan, Engin
arXiv.org Artificial Intelligence
High-speed research networks are built to meet the ever-increasing needs of data-intensive distributed workflows. However, data transfers in these networks often fail to attain the promised transfer rates for several reasons, including I/O and network interference, server misconfigurations, and network anomalies. Although understanding the root causes of performance issues is critical to mitigating them and increasing the utilization of expensive network infrastructures, there is currently no available mechanism to monitor data transfers in these networks. In this paper, we present a scalable, end-to-end monitoring framework to gather and store key performance metrics for file transfers to shed light on the performance of transfers. The evaluation results show that the proposed framework can monitor up to 400 transfers per host and more than 40,000 transfers in total while collecting performance statistics at one-second precision. We also introduce a heuristic method to automatically process the gathered performance metrics and identify the root causes of performance anomalies with an F-score of 87% to 98%.
Aug-20-2023
- Country:
- North America > United States
- Texas (0.04)
- Illinois (0.04)
- Tennessee > Anderson County
- Oak Ridge (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Nevada > Washoe County
- Reno (0.04)
- California > Alameda County
- Livermore (0.04)
- Genre:
- Research Report > New Finding (0.48)
- Industry:
- Telecommunications > Networks (0.69)
- Technology:
- Information Technology
- Communications > Networks (1.00)
- Architecture (1.00)
- Data Science (0.93)
- Artificial Intelligence
- Machine Learning > Statistical Learning (0.46)
- Representation & Reasoning > Agents (0.31)