probability velocity
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Europe > Slovenia > Drava > Municipality of Maribor > Maribor (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- (3 more...)
- Education (0.92)
- Government > Regional Government > Asia Government > North Korea Government (0.46)
- Government > Regional Government > North America Government > United States Government (0.45)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Vision (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Europe > Slovenia > Drava > Municipality of Maribor > Maribor (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- (4 more...)
- Education (0.92)
- Government > Regional Government > Asia Government > North Korea Government (0.46)
- Government > Regional Government > North America Government > United States Government (0.45)
- Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
- Information Technology > Artificial Intelligence > Natural Language (1.00)
- Information Technology > Artificial Intelligence > Vision (0.68)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Discrete Flow Matching
Gat, Itai, Remez, Tal, Shaul, Neta, Kreuk, Felix, Chen, Ricky T. Q., Synnaeve, Gabriel, Adi, Yossi, Lipman, Yaron
Despite Flow Matching and diffusion models having emerged as powerful generative paradigms for continuous variables such as images and videos, their application to high-dimensional discrete data, such as language, is still limited. In this work, we present Discrete Flow Matching, a novel discrete flow paradigm designed specifically for generating discrete data. Discrete Flow Matching offers several key contributions: (i) it works with a general family of probability paths interpolating between source and target distributions; (ii) it allows for a generic formula for sampling from these probability paths using learned posteriors such as the probability denoiser ($x$-prediction) and noise-prediction ($\epsilon$-prediction); (iii) practically, focusing on specific probability paths defined with different schedulers considerably improves generative perplexity compared to previous discrete diffusion and flow models; and (iv) by scaling Discrete Flow Matching models up to 1.7B parameters, we reach 6.7% Pass@1 and 13.4% Pass@10 on HumanEval and 6.7% Pass@1 and 20.6% Pass@10 on 1-shot MBPP coding benchmarks. Our approach is capable of generating high-quality discrete data in a non-autoregressive fashion, significantly closing the gap between autoregressive models and discrete flow models.
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Europe > Slovenia > Drava > Municipality of Maribor > Maribor (0.04)
- North America > United States > Georgia > Fulton County > Atlanta (0.04)
- (3 more...)
- Education (0.92)
- Government > Regional Government > Asia Government > North Korea Government (0.46)
- Government > Regional Government > North America Government > United States Government (0.45)