Psaltis, Demetri
Training Hybrid Neural Networks with Multimode Optical Nonlinearities Using Digital Twins
Oguz, Ilker, Suter, Louis J. E., Hsieh, Jih-Liang, Yildirim, Mustafa, Dinc, Niyazi Ulas, Moser, Christophe, Psaltis, Demetri
The ability to train ever-larger neural networks brings artificial intelligence to the forefront of scientific and technical discovery. However, their exponentially increasing size creates a proportionally greater demand for energy and computational hardware. Incorporating complex physical events into networks as fixed, efficient computation modules can address this demand by decreasing the complexity of trainable layers. Here, we utilize ultrashort pulse propagation in multimode fibers, which perform large-scale nonlinear transformations, for this purpose. Training the hybrid architecture is achieved through a neural model that differentiably approximates the optical system. The training algorithm updates the neural simulator and backpropagates the error signal through this proxy to optimize the layers preceding the optical one. Our experiments achieve state-of-the-art image classification accuracy and simulation fidelity. Moreover, the framework demonstrates exceptional resilience to experimental drifts. By integrating low-energy physical systems into neural networks, this approach enables scalable, energy-efficient AI models with significantly reduced computational demands.
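A minimal PyTorch sketch of the training scheme described above (not the authors' code): a trainable encoder feeds a black-box nonlinear transform standing in for the multimode fiber, a small "digital twin" network is fitted online to that transform, and the classification error is backpropagated through the twin to update the layers around the optics. The toy optical_system, layer sizes, and optimizers are assumptions made purely for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
_FIBER = torch.randn(64, 64) / 8.0  # fixed stand-in for the unknown fiber transform

def optical_system(x):
    """Black-box stand-in for the physical nonlinear optical transform (no gradients)."""
    with torch.no_grad():
        return torch.tanh(x @ _FIBER) ** 2

encoder = nn.Sequential(nn.Linear(28 * 28, 64), nn.Tanh())   # trainable layer before the optics
readout = nn.Linear(64, 10)                                  # trainable digital readout
twin = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 64))  # differentiable proxy

opt_model = torch.optim.Adam(list(encoder.parameters()) + list(readout.parameters()), lr=1e-3)
opt_twin = torch.optim.Adam(twin.parameters(), lr=1e-3)

def train_step(images, labels):
    x = encoder(images.flatten(1))
    y_phys = optical_system(x)                     # measurement from the (simulated) optics
    # 1) keep the twin an accurate, differentiable simulator of the optics
    twin_loss = nn.functional.mse_loss(twin(x.detach()), y_phys)
    opt_twin.zero_grad(); twin_loss.backward(); opt_twin.step()
    # 2) use the physical output in the forward pass, route the gradient through the twin
    sim = twin(x)
    y = sim + (y_phys - sim).detach()              # value equals y_phys, gradient flows via twin
    loss = nn.functional.cross_entropy(readout(y), labels)
    opt_model.zero_grad(); loss.backward(); opt_model.step()
    return loss.item()
```

The line `y = sim + (y_phys - sim).detach()` keeps the measured output in the forward pass while routing gradients through the twin, which is one simple way to realize the proxy-backpropagation idea sketched in the abstract.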
Optical Diffusion Models for Image Generation
Oguz, Ilker, Dinc, Niyazi Ulas, Yildirim, Mustafa, Ke, Junjie, Yoo, Innfarn, Wang, Qifei, Yang, Feng, Moser, Christophe, Psaltis, Demetri
Diffusion models generate new samples by progressively removing noise from an initial random distribution. This inference procedure generally invokes a trained neural network many times to obtain the final output, creating significant latency and energy consumption on digital electronic hardware such as GPUs. In this study, we demonstrate that the propagation of a light beam through a semi-transparent medium can be programmed to implement a denoising diffusion model on image samples. The framework projects noisy image patterns through passive diffractive optical layers that collectively transmit only the predicted noise term in the image. The transparent optical layers are trained with an online approach that backpropagates the error through an analytical model of the system; once trained, they remain passive and are reused unchanged across the different denoising steps. Hence, this method enables high-speed image generation with minimal power consumption, benefiting from the bandwidth and energy efficiency of optical information processing.
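To make the role of the fixed optical layers concrete, here is a hedged NumPy sketch of a standard DDPM-style reverse loop in which the noise predictor is a single fixed operator reused at every step, mirroring the passive diffractive stack described above. The optical_noise_predictor placeholder, the noise schedule, and the image size are assumptions, not the paper's parameters.

```python
import numpy as np

rng = np.random.default_rng(0)
T = 200
betas = np.linspace(1e-4, 0.02, T)
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

D = 28 * 28
W = rng.standard_normal((D, D)) / np.sqrt(D)     # stands in for the trained optical stack

def optical_noise_predictor(x_t, t):
    """Stand-in for the passive layers: the same fixed operator at every step (t unused)."""
    return np.tanh(x_t @ W)

x = rng.standard_normal(D)                       # start from pure noise
for t in reversed(range(T)):
    eps = optical_noise_predictor(x, t)
    coef = (1.0 - alphas[t]) / np.sqrt(1.0 - alpha_bars[t])
    x = (x - coef * eps) / np.sqrt(alphas[t])    # DDPM mean update
    if t > 0:
        x += np.sqrt(betas[t]) * rng.standard_normal(D)  # stochastic term, skipped at the last step
# x now holds the generated (flattened) pattern; reshape to 28x28 to display.
```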
Training of Physical Neural Networks
Momeni, Ali, Rahmani, Babak, Scellier, Benjamin, Wright, Logan G., McMahon, Peter L., Wanjura, Clara C., Li, Yuhang, Skalli, Anas, Berloff, Natalia G., Onodera, Tatsuhiro, Oguz, Ilker, Morichetti, Francesco, del Hougne, Philipp, Le Gallo, Manuel, Sebastian, Abu, Mirhoseini, Azalia, Zhang, Cheng, Marković, Danijela, Brunner, Daniel, Moser, Christophe, Gigan, Sylvain, Marquardt, Florian, Ozcan, Aydogan, Grollier, Julie, Liu, Andrea J., Psaltis, Demetri, Alù, Andrea, Fleury, Romain
Physical neural networks (PNNs) are a class of neural-like networks that leverage the properties of physical systems to perform computation. While PNNs are so far a niche research area with small-scale laboratory demonstrations, they are arguably one of the most underappreciated opportunities in modern AI. Could we train AI models 1000x larger than current ones? Could we do this and also have them perform inference locally and privately on edge devices, such as smartphones or sensors? Research over the past few years has shown that the answer to all these questions is likely "yes, with enough research": PNNs could one day radically change what is possible and practical for AI systems. Doing so, however, will require rethinking both how AI models work and how they are trained, primarily by considering these problems through the constraints of the underlying hardware physics. To train PNNs at large scale, many methods, including backpropagation-based and backpropagation-free approaches, are now being explored. These methods have various trade-offs, and so far no method has been shown to match the scale and performance of the backpropagation algorithm widely used in deep learning today. However, this is rapidly changing, and a diverse ecosystem of training techniques provides clues for how PNNs may one day be used both to create more efficient realizations of current-scale AI models and to enable models of unprecedented scale.
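As one concrete example of the backpropagation-free family mentioned above, the sketch below applies simultaneous-perturbation stochastic approximation (SPSA) to a black-box "physical layer": parameters are updated from two loss measurements per step, with no gradient access to the hardware. The stand-in physical_layer, the toy data, and the step sizes are illustrative assumptions only, not a method endorsed by the paper.

```python
import numpy as np

rng = np.random.default_rng(1)

def physical_layer(x, theta):
    """Black-box stand-in for a physical system with programmable parameters theta."""
    return np.tanh(x @ theta.reshape(x.shape[-1], -1))

def loss(theta, x, y):
    return ((physical_layer(x, theta) - y) ** 2).mean()

x = rng.standard_normal((32, 8))        # toy inputs
y = rng.standard_normal((32, 4))        # toy targets
theta = 0.1 * rng.standard_normal(8 * 4)

a, c = 0.05, 0.01                       # step size and perturbation size (illustrative)
for step in range(200):
    delta = rng.choice([-1.0, 1.0], size=theta.shape)   # random +/-1 perturbation direction
    g_hat = (loss(theta + c * delta, x, y) - loss(theta - c * delta, x, y)) / (2 * c) * delta
    theta -= a * g_hat                  # two loss evaluations per update, no gradients required
print("final loss:", loss(theta, x, y))
```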
Forward-Forward Training of an Optical Neural Network
Oguz, Ilker, Ke, Junjie, Wang, Qifei, Yang, Feng, Yildirim, Mustafa, Dinc, Niyazi Ulas, Hsieh, Jih-Liang, Moser, Christophe, Psaltis, Demetri
Neural networks (NNs) have demonstrated remarkable capabilities in various tasks, but their computation-intensive nature demands faster and more energy-efficient hardware implementations. Optics-based platforms, using technologies such as silicon photonics and spatial light modulators, offer promising avenues for achieving this goal. However, training multiple trainable layers in tandem with these physical systems poses challenges, as the systems are difficult to fully characterize and describe with differentiable functions, hindering the use of the error backpropagation algorithm. The recently introduced Forward-Forward Algorithm (FFA) eliminates the need for perfect characterization of the learning system and shows promise for efficient training with large numbers of programmable parameters. The FFA does not require backpropagating an error signal to update the weights; rather, the weights are updated by sending information in one direction only. The local loss function for each set of trainable weights enables low-power analog hardware implementations without resorting to metaheuristic algorithms or reinforcement learning. In this paper, we present an experiment that uses multimode nonlinear wave propagation in an optical fiber to demonstrate the feasibility of the FFA with an optical system. The results show that incorporating optical transforms in multilayer NN architectures trained with the FFA can lead to performance improvements, even with a relatively small number of trainable weights. The proposed method offers a new path to the challenge of training optical NNs and provides insights into leveraging physical transformations for enhancing NN performance.
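A hedged PyTorch sketch of the idea (not the authors' implementation): each trainable layer is optimized with a local "goodness" objective on positive and negative samples, and a non-differentiable optical transform can sit between layers because no error signal ever has to cross it. The optical_transform stand-in, layer widths, and threshold are assumptions for illustration.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
_FIBER = torch.randn(128, 128) / 12.0

def optical_transform(x):
    """Non-differentiable stand-in for the nonlinear multimode-fiber mapping."""
    with torch.no_grad():
        return torch.abs(torch.tanh(x @ _FIBER))

class FFLayer(nn.Module):
    def __init__(self, d_in, d_out, lr=1e-3, threshold=2.0):
        super().__init__()
        self.lin = nn.Linear(d_in, d_out)
        self.opt = torch.optim.Adam(self.lin.parameters(), lr=lr)
        self.threshold = threshold

    def goodness(self, x):
        return torch.relu(self.lin(x)).pow(2).mean(dim=1)   # per-sample "goodness"

    def train_step(self, x_pos, x_neg):
        # Local objective: goodness above threshold for positive samples (label embedded
        # correctly in the input), below it for negative samples. No gradient leaves this layer.
        logits = torch.cat([self.goodness(x_pos) - self.threshold,
                            self.threshold - self.goodness(x_neg)])
        loss = nn.functional.softplus(-logits).mean()
        self.opt.zero_grad(); loss.backward(); self.opt.step()
        return torch.relu(self.lin(x_pos)).detach(), torch.relu(self.lin(x_neg)).detach()

layer1, layer2 = FFLayer(128, 128), FFLayer(128, 128)

def train_batch(x_pos, x_neg):
    h_pos, h_neg = layer1.train_step(x_pos, x_neg)
    # The fixed optical transform sits between the two trainable layers; because
    # training is purely local, it never needs to be differentiated or modeled.
    layer2.train_step(optical_transform(h_pos), optical_transform(h_neg))
```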
Nonlinear Processing with Linear Optics
Yildirim, Mustafa, Dinc, Niyazi Ulas, Oguz, Ilker, Psaltis, Demetri, Moser, Christophe
Deep neural networks have achieved remarkable breakthroughs by leveraging multiple layers of data processing to extract hidden representations, albeit at the cost of large electronic computing power. To enhance energy efficiency and speed, optical implementations of neural networks aim to harness the advantages of optical bandwidth and the energy efficiency of optical interconnections. In the absence of low-power optical nonlinearities, the challenge in implementing multilayer optical networks lies in realizing multiple optical layers without resorting to electronic components. In this study, we present a framework that uses multiple scattering to synthesize programmable linear and nonlinear transformations concurrently at low optical power, leveraging the nonlinear relationship between the scattering potential, represented by the data, and the scattered field. Theoretical and experimental investigations show that repeatedly encountering the data through multiple scattering enables nonlinear optical computing at low power with continuous-wave light.
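A small numerical sketch of the data-repetition idea, under simplifying assumptions (amplitude rather than phase encoding, field rather than intensity readout, random mixing matrices): because the same data modulates the field at several planes separated by fixed linear mixing, the map from data to output is polynomial in the data even though propagation is linear in the field.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 64

def mixing(n):
    """Random complex matrix standing in for fixed linear scattering between planes."""
    return (rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))) / np.sqrt(2 * n)

P1, P2 = mixing(N), mixing(N)
E_in = np.ones(N, dtype=complex)

def repeat_encode(data):
    """The same data modulates the field at two planes separated by fixed mixing."""
    field = data * E_in      # first encoding: field is linear in the data
    field = P1 @ field       # fixed linear scattering / propagation
    field = data * field     # second encoding of the same data
    return P2 @ field        # output field is quadratic in the data

a, b = rng.standard_normal(N), rng.standard_normal(N)
lhs = repeat_encode(a + b)
deviation = np.linalg.norm(lhs - repeat_encode(a) - repeat_encode(b)) / np.linalg.norm(lhs)
print("relative deviation from linearity:", deviation)   # nonzero: data -> output is nonlinear
```

In the experiment described above the data would enter as a phase pattern and the camera would record intensities; the amplitude version here only isolates how repetition raises the polynomial order of the data-to-output map.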
Asymptotic slowing down of the nearest-neighbor classifier
Snapp, Robert R., Psaltis, Demetri, Venkatesh, Santosh S.
If patterns are drawn from an n-dimensional feature space according to a probability distribution that obeys a weak smoothness criterion, we show that the probability that a random input pattern is misclassified by a nearest-neighbor classifier using M random reference patterns asymptotically satisfies P_M(error) ≃ P_∞(error) + a/M^(2/n) for sufficiently large values of M. Here, P_∞(error) denotes the probability of error in the infinite-sample limit, and is at most twice the error of a Bayes classifier. Although the value of the coefficient a depends upon the underlying probability distributions, the exponent of M is largely distribution free. We thus obtain a concise relation between a classifier's ability to generalize from a finite reference sample and the dimensionality of the feature space, as well as an analytic validation of Bellman's well-known "curse of dimensionality."

1 INTRODUCTION
One of the primary tasks assigned to neural networks is pattern classification. Common applications include recognition problems dealing with speech, handwritten characters, DNA sequences, military targets, and (in this conference) sexual identity. Two fundamental concepts associated with pattern classification are generalization (how well does a classifier respond to input data it has never encountered before?) and scalability (how are a classifier's processing and training requirements affected by increasing the number of features that describe the input patterns?).
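A hedged numerical illustration of the asymptotic relation (not from the paper): a brute-force 1-NN classifier on a toy two-class Gaussian problem in which only the first coordinate is informative, so the limiting error is the same for every dimension n; the estimated error then approaches that limit more slowly as n grows.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample(num, n):
    """Two Gaussian classes in R^n; only the first coordinate carries class information."""
    labels = rng.integers(0, 2, size=num)
    x = rng.normal(size=(num, n))
    x[:, 0] += np.where(labels == 1, 1.0, -1.0)
    return x, labels

def nn_error(M, n, num_test=1000):
    """Monte-Carlo estimate of the 1-NN test error with M reference patterns."""
    ref_x, ref_y = sample(M, n)
    test_x, test_y = sample(num_test, n)
    d2 = ((test_x[:, None, :] - ref_x[None, :, :]) ** 2).sum(axis=-1)   # (num_test, M)
    return (ref_y[d2.argmin(axis=1)] != test_y).mean()

for n in (2, 8):
    print(f"n={n}:", [round(nn_error(M, n), 3) for M in (50, 200, 800)])
# The limiting error is the same for both n, but the approach to it is slower
# for the larger n, qualitatively consistent with the a/M^(2/n) term above.
```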