AITopics | subspace learning

L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization

Neural Information Processing SystemsApr-25-2026, 17:33:37 GMT

Silicon-photonics-based optical neural network (ONN) is a promising hardware platform that could represent a paradigm shift in efficient AI with its CMOScompatibility, flexibility, ultra-low execution latency, and high energy efficiency. In-situ training on the online programmable photonic chips is appealing but still encounters challenging issues in on-chip implementability, scalability, and efficiency. In this work, we propose a closed-loop ONN on-chip learning framework L2ight to enable scalable ONN mapping and efficient in-situ learning. L2ightadopts a three-stage learning flow that first calibrates the complicated photonic circuit states under challenging physical constraints, then performs photonic core mapping via combined analytical solving and zeroth-order optimization. A subspace learning procedure with multi-level sparsity is integrated into L2ightto enable in-situ gradient evaluation and fast adaptation, unleashing the power of optics for real on-chip intelligence. Extensive experiments demonstrate our proposed L2ightoutperforms prior ONN training protocols with 3-order-of-magnitude higher scalability and over 30 better efficiency, when benchmarked on various models and learning tasks. This synergistic framework is the first scalable on-chip learning solution that pushes this emerging field from intractable to scalable and further to efficient for next-generation self-learnable photonic neural chips. From a co-design perspective, L2ightalso provides essential insights for hardware-restricted unitary subspace optimization and efficient sparse training.

artificial intelligence, machine learning, neural network, (14 more...)

Neural Information Processing Systems

Industry:

Energy (0.66)
Education (0.54)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning

Neural Information Processing SystemsMar-21-2026, 08:33:10 GMT

Subspace learning is a critical endeavor in contemporary machine learning, particularly given the vast dimensions of modern datasets.

artificial intelligence, machine learning, proceedings, (5 more...)

Neural Information Processing Systems

Genre: Research Report > New Finding (0.37)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.77)

Add feedback

ExploringthePreciseDynamicsofSingle-LayerGAN Models: LeveragingMulti-FeatureDiscriminatorsfor High-DimensionalSubspaceLearning

Neural Information Processing SystemsFeb-16-2026, 04:54:09 GMT

Subspace learning is acritical endeavor in contemporary machine learning, particularly given the vast dimensions of modern datasets.

artificial intelligence, discriminator, machine learning, (17 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre: Research Report > New Finding (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

48aedb8880cab8c45637abc7493ecddd-Paper.pdf

Neural Information Processing SystemsFeb-8-2026, 12:05:06 GMT

accuracy, neural network, proc, (15 more...)

Neural Information Processing Systems

Country:

North America > United States > Texas > Travis County > Austin (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

On the Sample Complexity of Subspace Learning

Neural Information Processing SystemsSep-30-2025, 12:43:24 GMT

A large number of algorithms in machine learning, from principal component analysis (PCA), and its non-linear (kernel) extensions, to more recent spectral embedding and support estimation methods, rely on estimating a linear subspace from samples. In this paper we introduce a general formulation of this problem and derive novel learning error estimates. Our results rely on natural assumptions on the spectral properties of the covariance operator associated to the data distribution, and hold for a wide class of metrics between subspaces. As special cases, we discuss sharp error estimates for the reconstruction properties of PCA and spectral support estimation. Key to our analysis is an operator theoretic approach that has broad applicability to spectral learning methods.

error estimate, sample complexity, subspace learning, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.31)

Add feedback

DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Zhang, Haojie

arXiv.org Artificial IntelligenceAug-26-2025

LoRA-based large model parameter-efficient fine-tuning (PEFT) methods use low-rank de- composition to approximate updates to model parameters. However, compared to full- parameter fine-tuning, low-rank updates often lead to a performance gap in downstream tasks. To address this, we introduce DropLoRA, a novel pruning-based approach that focuses on pruning the rank dimension. Unlike conven- tional methods that attempt to overcome the low-rank bottleneck, DropLoRA innovatively integrates a pruning module between the two low-rank matrices in LoRA to simulate dy- namic subspace learning. This dynamic low- rank subspace learning allows DropLoRA to overcome the limitations of traditional LoRA, which operates within a static subspace. By continuously adapting the learning subspace, DropLoRA significantly boosts performance without incurring additional training or infer- ence costs. Our experimental results demon- strate that DropLoRA consistently outperforms LoRA in fine-tuning the LLaMA series across a wide range of large language model gener- ation tasks, including commonsense reason- ing, mathematical reasoning, code generation, and instruction-following. Our code is avail- able at https://github.com/TayeeChang/DropLoRA.

droplora, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2508.17337

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

8198901c33d24a391b4c6b3f30f253ce-Paper-Conference.pdf

Neural Information Processing SystemsAug-20-2025, 20:29:05 GMT

assumption, discriminator, subspace, (17 more...)

Neural Information Processing Systems

Country: Asia > Middle East > Jordan (0.04)

Genre:

Research Report > Experimental Study (1.00)
Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning

Bond, Andrew, Dogan, Zafer

arXiv.org Machine LearningNov-1-2024

Subspace learning is a critical endeavor in contemporary machine learning, particularly given the vast dimensions of modern datasets. In this study, we delve into the training dynamics of a single-layer GAN model from the perspective of subspace learning, framing these GANs as a novel approach to this fundamental task. Through a rigorous scaling limit analysis, we offer insights into the behavior of this model. Extending beyond prior research that primarily focused on sequential feature learning, we investigate the non-sequential scenario, emphasizing the pivotal role of inter-feature interactions in expediting training and enhancing performance, particularly with an uninformed initialization strategy. Our investigation encompasses both synthetic and real-world datasets, such as MNIST and Olivetti Faces, demonstrating the robustness and applicability of our findings to practical scenarios. By bridging our analysis to the realm of subspace learning, we systematically compare the efficacy of GAN-based methods against conventional approaches, both theoretically and empirically. Notably, our results unveil that while all methodologies successfully capture the underlying subspace, GANs exhibit a remarkable capability to acquire a more informative basis, owing to their intrinsic ability to generate new data samples. This elucidates the unique advantage of GAN-based approaches in subspace learning tasks.

artificial intelligence, machine learning, subspace, (17 more...)

arXiv.org Machine Learning

2411.00498

Genre: Research Report > New Finding (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.46)

Add feedback

bdb106a0560c4e46ccc488ef010af787-Reviews.html

Neural Information Processing SystemsMar-13-2024, 20:01:13 GMT

The key result shows for n samples drawn from some underlying distribution, the quality of subspace estimation improves at a rate O(n -r), where r related to the decay rate of the spectrum of the underlying distribution. Review: I am not familiar with the previous literature on PAC-style analysis of subspace learning, or if properties of the spectrum of the covariance was previously considered for subspace learning; so assuming that the work is novel, I believe authors have done a good job in relating these concepts. I do have a few suggestions that the authors should consider adding to the current text: Although authors have focused on the theoretical aspects of subspace learning, it would be nice to see how well the condition of'polynomial decay' holds on real world data. This would help with the significance of this work to the larger machine learning audience. Going a step further, it would be very instructive to see what the rates look like when the covariance C is unknown.

literature, short review period, subspace learning, (3 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.38)

Add feedback

On the Sample Complexity of Subspace Learning

Neural Information Processing SystemsApr-6-2023, 11:59:05 GMT

A large number of algorithms in machine learning, from principal component analysis (PCA), and its non-linear (kernel) extensions, to more recent spectral embedding and support estimation methods, rely on estimating a linear subspace from samples. In this paper we introduce a general formulation of this problem and derive novel learning error estimates. Our results rely on natural assumptions on the spectral properties of the covariance operator associated to the data distribution, and hold for a wide class of metrics between subspaces. As special cases, we discuss sharp error estimates for the reconstruction properties of PCA and spectral support estimation. Key to our analysis is an operator theoretic approach that has broad applicability to spectral learning methods.

error estimate, sample complexity, subspace learning, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Principal Component Analysis (0.31)

Add feedback

Filters

Collaborating Authors

subspace learning

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

L2ight: Enabling On-Chip Learning for Optical Neural Networks via Efficient in-situ Subspace Optimization

Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning

ExploringthePreciseDynamicsofSingle-LayerGAN Models: LeveragingMulti-FeatureDiscriminatorsfor High-DimensionalSubspaceLearning

48aedb8880cab8c45637abc7493ecddd-Paper.pdf

On the Sample Complexity of Subspace Learning

DropLoRA: Sparse Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

8198901c33d24a391b4c6b3f30f253ce-Paper-Conference.pdf

Exploring the Precise Dynamics of Single-Layer GAN Models: Leveraging Multi-Feature Discriminators for High-Dimensional Subspace Learning

bdb106a0560c4e46ccc488ef010af787-Reviews.html

On the Sample Complexity of Subspace Learning