Goto

Collaborating Authors

 pypi


Which Is Better For Reducing Outdated and Vulnerable Dependencies: Pinning or Floating?

arXiv.org Artificial Intelligence

Developers consistently use version constraints to specify acceptable versions of the dependencies for their project. \emph{Pinning} dependencies can reduce the likelihood of breaking changes, but comes with a cost of manually managing the replacement of outdated and vulnerable dependencies. On the other hand, \emph{floating} can be used to automatically get bug fixes and security fixes, but comes with the risk of breaking changes. Security practitioners advocate \emph{pinning} dependencies to prevent against software supply chain attacks, e.g., malicious package updates. However, since \emph{pinning} is the tightest version constraint, \emph{pinning} is the most likely to result in outdated dependencies. Nevertheless, how the likelihood of becoming outdated or vulnerable dependencies changes across version constraint types is unknown. The goal of this study is to aid developers in making an informed dependency version constraint choice by empirically evaluating the likelihood of dependencies becoming outdated or vulnerable across version constraint types at scale. In this study, we first identify the trends in dependency version constraint usage and the patterns of version constraint type changes made by developers in the npm, PyPI, and Cargo ecosystems. We then modeled the dependency state transitions using survival analysis and estimated how the likelihood of becoming outdated or vulnerable changes when using \emph{pinning} as opposed to the rest of the version constraint types. We observe that among outdated and vulnerable dependencies, the most commonly used version constraint type is \emph{floating-minor}, with \emph{pinning} being the next most common. We also find that \emph{floating-major} is the least likely to result in outdated and \emph{floating-minor} is the least likely to result in vulnerable dependencies.


systemds ยท PyPI

#artificialintelligence

These high-level scripts are compiled into hybrid execution plans of local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle, the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a heterogeneous and nested schema.


PyTorch dependency poisoned with malicious code โ€ข The Register

#artificialintelligence

An unknown attacker used the PyPI code repository to get developers to download a compromised PyTorch dependency that included malicious code designed to steal system data. Developers who last week downloaded the nightly builds of the open source PyTorch framework also unknowingly installed a malicious version of the torchtriton dependency found in the Python Package Index, according to PyTorch's maintainers. In a blog post this week, PyTorch recommended those who installed the PyTorch nightly on Linux through pip between December 25 and December 30 to uninstall it and use the latest nightly binaries that were released after December 30. They said devs using the PyTorch stable packages were not affected by the malicious binary. However, the extent of the attack is unclear.


PyTorch Machine Learning Framework Compromised with Malicious Dependency

#artificialintelligence

The maintainers of the PyTorch package have warned users who have installed the nightly builds of the library between December 25, 2022, and December 30, 2022, to uninstall and download the latest versions following a dependency confusion attack. "PyTorch-nightly Linux packages installed via pip during that time installed a dependency, torchtriton, which was compromised on the Python Package Index (PyPI) code repository and ran a malicious binary," the PyTorch team said in an alert over the weekend. PyTorch, analogous to Keras and TensorFlow, is an open source Python-based machine learning framework that was originally developed by Meta Platforms. The PyTorch team said that it became aware of the malicious dependency on December 30, 4:40 p.m. GMT. The supply chain attack entailed uploading the malware-laced copy of a legitimate dependency named torchtriton to the Python Package Index (PyPI) code repository.


GitHub - RubensZimbres/best-of-ml-python: ๐Ÿ† A ranked list of awesome machine learning Python libraries. Updated weekly.

#artificialintelligence

A ranked list of awesome machine learning Python libraries. This curated list contains 830 awesome open-source projects with a total of 2.6M stars grouped into 32 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Discover other best-of lists or create your own.


GitHub - ml-tooling/best-of-ml-python: ๐Ÿ† A ranked list of awesome machine learning Python libraries. Updated weekly.

#artificialintelligence

A ranked list of awesome machine learning Python libraries. This curated list contains 920 awesome open-source projects with a total of 3.4M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Discover other best-of lists or create your own.


GitHub - keras-team/keras: Deep Learning for humans

#artificialintelligence

This repository hosts the development of the Keras library. Keras is a deep learning API written in Python, running on top of the machine learning platform TensorFlow. It was developed with a focus on enabling fast experimentation. Being able to go from idea to result as fast as possible is key to doing good research. TensorFlow 2 is an end-to-end, open-source machine learning platform.


A complete guide to building a Docker Image serving a Machine learning system in Production

#artificialintelligence

Building a Docker image is generally considered trivial compared to developing other components of a ML system like data pipeline, model training, serving infra, etc. But an inefficient, bulky docker image can greatly reduce performance and can even bring down the serving infra. This blog aims to focus on building an ideal Docker image and not on its concept or benefits. Most of the time a ML system will be based on Python, so it critical building any Python-based Docker image efficiently. Let us go through them.


5 Things that I learned through the School of AI -- And made me a better Machine Learning Engineer

#artificialintelligence

During the last two months of 2019, I was lucky to participate in the School of Artificial Intelligence. If you are not familiar with the School of AI, it is a first-class, hands-on training program on Artificial Intelligence hosted by Pi School, an educational project inside the startup district of Pi Campus, in Rome. The program aims at a multitude of profiles that have in common a great passion for Artificial Intelligence and at least basic stats and programming abilities. The program lasts two months, it is free for the students, and you also get a grant covering expenses such as flights and rent. There is a catch, though: it is extremely difficult to get in!


matloff/R-vs.-Python-for-Data-Science

#artificialintelligence

This Web page is aimed at shedding some light on the perennial R-vs.-Python debates in the Data Science community. As a professional computer scientist and statistician, I hope to shed some useful light on the topic. I have potential bias: I've written four R-related books, I've given a keynote talk at useR!; I currently serve as Editor-in-Chief of the R Journal; etc. But I hope this analysis will be considered fair and helpful. This is subjective, of course, but having written (and taught) in many different programming languages, I really appreciate Python's greatly reduced use of parentheses and braces: This is of particular interest to me, as an educator.