Learning Bug Context for PyTorch-to-JAX Translation with LLMs

Phan, Hung, Vu, Son Le, Jannesari, Ali

arXiv.org Artificial Intelligence

Despite recent progress of large language models (LLMs) on code translation among mainstream languages, translating PyTorch to JAX remains nontrivial. The two libraries, though both embedded in Python, differ in core design, execution semantics, and ecosystem maturity; JAX is newer and comparatively underrepresented in public code, and parallel PyTorch-JAX corpora are limited. Weaknesses in existing evaluation further complicate cross-framework benchmarking. We present T2J, a prompt-augmentation framework that strengthens LLM-based PyTorch-to-JAX translation. Our pipeline (i) assembles two PyTorch sources, the problem-solving set from TorchLeet (Aroori & Chien, 2025) and a GitHub-derived set from CodeParrot (Wolf et al., 2022), and uses GPT-4o-mini to produce initial JAX drafts; (ii) engages two professional developers to iteratively repair those drafts until functional equivalence, yielding a curated fixed-bug dataset of common errors and patches; and (iii) constructs augmented prompts that inject structured guidance from these fixes to steer lightweight LLMs (e.g., GPT-4o-mini). We also introduce three metrics tailored to PyTorch-to-JAX translation: T2J CodeTrans Score, T2J FixCost Score (an LLM-based estimate of bug-fix effort), and T2J Comparison Score (LLM-as-judge). Empirically, T2J raises GPT-4o-mini performance by up to 10% on CodeBLEU, 50% on T2J FixCost Score, 1.33 points on T2J CodeTrans Score (0-4 scale), and 100% on T2J Comparison Score; moreover, the generated code runs up to 2.5x faster than the baseline.
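The prompt-augmentation step described in (iii) can be sketched in a few lines. The record format and the `build_augmented_prompt` function below are illustrative assumptions, not T2J's actual dataset schema or API; the JAX pitfalls listed are real, common PyTorch-to-JAX translation errors.

```python
# Illustrative sketch of prompt augmentation from a fixed-bug dataset.
# The record format and function names are hypothetical, not T2J's API.

FIXED_BUGS = [
    {
        "symptom": "in-place ops like `tensor.add_()` have no JAX equivalent",
        "guidance": "JAX arrays are immutable; use `x = x + y` or `x.at[idx].set(v)`.",
    },
    {
        "symptom": "`torch.nn.Module` state is mutated during forward",
        "guidance": "pass parameters explicitly as pytrees to pure functions.",
    },
]

def build_augmented_prompt(pytorch_code: str) -> str:
    """Inject structured bug-fix guidance into a translation prompt."""
    guidance = "\n".join(
        f"- {bug['symptom']}: {bug['guidance']}" for bug in FIXED_BUGS
    )
    return (
        "Translate the following PyTorch code to JAX.\n"
        "Common pitfalls and how to avoid them:\n"
        f"{guidance}\n\n"
        f"PyTorch code:\n{pytorch_code}\n"
    )

prompt = build_augmented_prompt("x = torch.zeros(3); x.add_(1)")
print(prompt)
```

The augmented prompt is then sent to a lightweight model such as GPT-4o-mini in place of a bare "translate this" instruction.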



Reliable Estimation of KL Divergence using a Discriminator in Reproducing Kernel Hilbert Space Supplementary Material

Neural Information Processing Systems

This supplementary material is presented in a format parallel to the main paper. The section numbers and titles are consistent with the main paper. Similarly, the Theorem numbers are consistent with the main paper, but we also have several additional theorems and lemmas which were not included in the main paper. We start with the set of assumptions based on which our theory is developed. The input domains X and W are compact.
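As background for the discriminator-based estimator named in the title: such estimators typically build on the Donsker-Varadhan variational representation of KL divergence, with the discriminator restricted to a reproducing kernel Hilbert space. This is the standard formulation, not a claim about this paper's specific construction:

```latex
% Donsker--Varadhan variational representation of KL divergence;
% the supremum is over discriminators T, here restricted to an RKHS.
\mathrm{KL}(P \,\|\, Q)
  = \sup_{T} \; \mathbb{E}_{x \sim P}\!\left[ T(x) \right]
    - \log \mathbb{E}_{x \sim Q}\!\left[ e^{T(x)} \right]
```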


Introduction to Lightning Fabric

#artificialintelligence

Lightning Fabric is a new, open-source library that allows you to quickly and easily scale models while maintaining full control over your training loop. In the past, getting PyTorch code to run efficiently on GPUs and scaling it up to many machines and large datasets was possible with PyTorch Lightning. As time went on, however, we became aware of the need to provide a scaling option that landed somewhere between a raw deep learning framework like PyTorch on the one hand, and a high-level, feature-rich framework like PyTorch Lightning on the other. Lightning Fabric is just that. While PyTorch Lightning provides many features to save time and improve readability and collaboration, there are complex use cases where full control over the training loop is needed.


Does Zero-Shot Reinforcement Learning Exist?

Touati, Ahmed, Rapin, Jérémy, Ollivier, Yann

arXiv.org Artificial Intelligence

A zero-shot RL agent is an agent that can solve any RL task in a given environment, instantly with no additional planning or learning, after an initial reward-free learning phase. This marks a shift from the reward-centric RL paradigm towards "controllable" agents that can follow arbitrary instructions in an environment. Current RL agents can solve families of related tasks at best, or require planning anew for each task. Strategies for approximate zero-shot RL have been suggested using successor features (SFs) [BBQ+18] or forward-backward (FB) representations [TO21], but testing has been limited. After clarifying the relationships between these schemes, we introduce improved losses and new SF models, and test the viability of zero-shot RL schemes systematically on tasks from the Unsupervised RL benchmark [LYL+21]. To disentangle universal representation learning from exploration, we work in an offline setting and repeat the tests on several existing replay buffers. SFs appear to suffer from the choice of the elementary state features. SFs with Laplacian eigenfunctions do well, while SFs based on auto-encoders, inverse curiosity, transition models, low-rank transition matrix, contrastive learning, or diversity (APS), perform inconsistently. In contrast, FB representations jointly learn the elementary and successor features from a single, principled criterion. They perform best and consistently across the board, reaching 85% of supervised RL performance with a good replay buffer, in a zero-shot manner.
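The successor-features idea the abstract refers to can be illustrated in the tabular case. The sketch below is a toy two-state example, not the paper's code: it iterates the fixed-point identity psi(s) = phi(s) + gamma * sum_s' P[s][s'] * psi(s'), after which any reward of the form r(s) = phi(s) . w can be evaluated zero-shot as V(s) = psi(s) . w, with no re-planning per task.

```python
# Tabular successor features (SFs) on a toy 2-state Markov chain.
GAMMA = 0.9
P = [[0.5, 0.5], [0.2, 0.8]]      # transitions under a fixed policy
PHI = [[1.0, 0.0], [0.0, 1.0]]    # elementary state features (one-hot)

def successor_features(P, phi, gamma, iters=1000):
    """Fixed-point iteration: psi = phi + gamma * P @ psi."""
    n, d = len(P), len(phi[0])
    psi = [[0.0] * d for _ in range(n)]
    for _ in range(iters):
        psi = [
            [phi[s][k] + gamma * sum(P[s][t] * psi[t][k] for t in range(n))
             for k in range(d)]
            for s in range(n)
        ]
    return psi

psi = successor_features(P, PHI, GAMMA)

# Zero-shot evaluation for a *new* reward r = phi . w, with no re-learning:
w = [1.0, 0.0]                    # reward only in state 0
values = [sum(psi[s][k] * w[k] for k in range(2)) for s in range(2)]
print(values)
```

Swapping in a different `w` gives the value of a different task from the same precomputed `psi`, which is the zero-shot property under discussion; the hard part, per the abstract, is learning good features `phi` in non-tabular settings.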


GitHub - rentruewang/koila: Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code.

#artificialintelligence

Koila solves the `CUDA error: out of memory` error painlessly. Fix it with just one line of code, and forget it. To use it, download release v0.1.1. Koila automatically accumulates gradients when batch sizes are too large, and lazily evaluates PyTorch code to save computing power.
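The gradient-accumulation trick Koila applies automatically can be shown with a toy model. This is a conceptual sketch, not Koila's implementation: a scalar parameter `w` with per-example loss (w - x)**2 and gradient 2*(w - x); processing the batch in memory-sized chunks and summing the chunk gradients recovers exactly the full-batch gradient.

```python
# Conceptual sketch of gradient accumulation (not Koila's actual code).
def grad(w, x):
    # Gradient of the per-example loss (w - x)**2 w.r.t. w.
    return 2.0 * (w - x)

def full_batch_grad(w, batch):
    return sum(grad(w, x) for x in batch) / len(batch)

def accumulated_grad(w, batch, chunk_size):
    """Process the batch in chunks small enough to fit in memory,
    accumulating gradients; the result equals the full-batch gradient."""
    total = 0.0
    for i in range(0, len(batch), chunk_size):
        chunk = batch[i:i + chunk_size]
        total += sum(grad(w, x) for x in chunk)   # one backward pass per chunk
    return total / len(batch)

batch = [0.5, 1.5, 2.0, 4.0, -1.0, 3.0]
print(full_batch_grad(1.0, batch), accumulated_grad(1.0, batch, chunk_size=2))
```

Because gradients are additive across examples, only one chunk's activations need to live in GPU memory at a time, which is why this sidesteps the out-of-memory error.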


Learn PyTorch for Deep Learning – Free 26-Hour Course

#artificialintelligence

My comprehensive PyTorch course is now live on freeCodeCamp.org. The best way to learn is by doing, and that's just what we'll do in the Learn PyTorch for Deep Learning: Zero to Mastery course. If you're new to data science and machine learning, consider the course a momentum builder. By the end, you'll be comfortable navigating the PyTorch documentation, reading PyTorch code, writing PyTorch code, searching for things you don't understand, and building your own machine learning projects.


Learn PyTorch in a day. Literally.

#artificialintelligence

My latest YouTube video is 25 hours, 36 minutes and 57 seconds long (actually a little over a day). And its sole purpose is to be a momentum builder, to help you learn PyTorch for deep learning. In each section, we'll get hands-on and learn important machine learning concepts by writing PyTorch code together, apprenticeship style. It still amazes me to think how much the field of machine learning is progressing. There are things in the video that weren't possible 10 years ago.


Scale your PyTorch code with LightningLite

#artificialintelligence

LightningLite comes with distributed utilities and basic built-in properties that you can rely on to ensure your distributed training works as expected. LightningLite provides you only with the tools to scale your training. Several significant challenges lie ahead of you, but they are already resolved in PyTorch Lightning. If you are facing one of them, you have reached the limits of LightningLite. We recommend you convert to Lightning, so you never have to worry about them.