moonlight
Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
Anschel, Oron, Shoshan, Alon, Botach, Adam, Hakimi, Shunit Haviv, Gendler, Asaf, Baruch, Emanuel Ben, Bhonker, Nadav, Kviatkovsky, Igor, Aggarwal, Manoj, Medioni, Gerard
Large Language Models (LLMs) often suffer from mode collapse, repeatedly generating the same few completions even when many valid answers exist, which limits their diversity across a wide range of tasks. We introduce Group-Aware Policy Optimization (GAPO), a simple extension of the recent and popular Group Relative Policy Optimization (GRPO) that computes rewards over the group as a whole. GAPO enables learning from group-level properties such as diversity and coverage. We demonstrate GAPO with a frequency-aware reward function that encourages uniform sampling over valid LLM completions, and show that GAPO-trained models produce responses that are both valid and more diverse. Beyond this setup, GAPO generalizes to open-ended prompts and improves response diversity without compromising accuracy on standard LLM benchmarks (GSM8K, MATH, HumanEval, MMLU-Pro). Our code will be made publicly available.
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Spain (0.04)
- Asia > Japan (0.04)
- (3 more...)
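The frequency-aware group reward described in the GAPO abstract above can be sketched as follows. This is an illustrative reading, not the paper's exact formulation: the validity checker, the repeat penalty, and the GRPO-style group normalization are all assumptions.

```python
from collections import Counter

def gapo_group_advantages(completions, is_valid):
    """Group-level, frequency-aware reward sketch: a valid completion
    that is rare within the sampled group earns a higher reward than a
    repeated one, pushing the policy toward uniform coverage of valid
    answers. Invalid completions receive zero reward."""
    n = len(completions)
    counts = Counter(completions)
    rewards = [
        (1.0 - counts[c] / n) if is_valid(c) else 0.0
        for c in completions
    ]
    # GRPO-style normalization: advantage = (r - mean) / std over the group
    mean = sum(rewards) / n
    std = (sum((r - mean) ** 2 for r in rewards) / n) ** 0.5
    return [(r - mean) / (std or 1.0) for r in rewards]
```

On a group where "a" is sampled twice and "b" once, the unique completion "b" receives the larger advantage, which is exactly the pressure toward uniform sampling over valid completions that the abstract describes.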
Supplementary Materials: FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
We first present details on the attribute taxonomy and statistics in Section A, then introduce additional details on dataset construction in Section B. Finally, we discuss the limitations and future work of the project in Section D. We visualize the rough distribution of visual attributes and subjects on the left, along with the attribute alignment accuracy measured via human validation. Due to space limitations, only 15 sub-subjects are listed for each major subject. The result shows that Image 4 exhibits inconsistencies, with the reasons provided.
MARS-M: When Variance Reduction Meets Matrices
Liu, Yifeng, Yuan, Angela, Gu, Quanquan
Matrix-based preconditioned optimizers, such as Muon, have recently been shown to be more efficient than scalar-based optimizers for training large-scale neural networks, including large language models (LLMs). On the other hand, recent benchmarks on optimizers for LLM pre-training have demonstrated that variance-reduction techniques such as MARS can achieve substantial speedups over standard optimizers that do not employ variance reduction. In this paper, to achieve the best of both worlds, we introduce MARS-M, a new optimizer that integrates the variance reduction technique in MARS with Muon. Under standard regularity conditions, we prove that MARS-M converges to a first-order stationary point at a rate of $\tilde{\mathcal{O}}(T^{-1/3})$, which improves upon the $\tilde{\mathcal{O}}(T^{-1/4})$ rate attained by Muon. Our empirical results on language modeling and computer vision tasks demonstrate that MARS-M consistently yields lower losses and improved performance across various downstream benchmarks. The implementation of MARS-M is available at https://github.com/AGI-Arena/MARS/tree/main/MARS_M.
- North America > United States > California > Los Angeles County > Los Angeles (0.28)
- Asia > Middle East > Jordan (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (7 more...)
- Research Report (0.64)
- Workflow (0.46)
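A minimal sketch of the combination the MARS-M abstract describes: a MARS-style variance-reduced gradient correction feeding a Muon-style orthogonalized momentum update. The constants, the clipping step, and the Newton-Schulz coefficients are assumptions for illustration, not the paper's exact algorithm.

```python
import numpy as np

def newton_schulz_orth(M, steps=5):
    """Approximately orthogonalize a matrix via a quintic Newton-Schulz
    iteration, as used in Muon-style optimizers (coefficients assumed)."""
    a, b, c = 3.4445, -4.7750, 2.0315
    X = M / (np.linalg.norm(M) + 1e-7)
    for _ in range(steps):
        A = X @ X.T
        X = a * X + (b * A + c * (A @ A)) @ X
    return X

def mars_m_step(W, grad, grad_prev, momentum,
                lr=0.02, beta=0.95, gamma=0.025, wd=0.1):
    """One sketched MARS-M step: MARS variance-reduced gradient
    correction, momentum accumulation, Muon orthogonalized update,
    and decoupled weight decay."""
    corr = grad + gamma * (beta / (1.0 - beta)) * (grad - grad_prev)
    corr = corr / max(1.0, np.linalg.norm(corr))   # clip corrected gradient
    momentum = beta * momentum + (1.0 - beta) * corr
    update = newton_schulz_orth(momentum)
    W = W * (1.0 - lr * wd) - lr * update
    return W, momentum
```

The variance-reduction term reuses the previous gradient to cancel stochastic noise before the matrix-level orthogonalization is applied, which is the "best of both worlds" pairing the abstract claims.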
Supplementary Materials: FiVA: Fine-grained Visual Attribute Dataset for Text-to-Image Diffusion Models
- Law (0.68)
- Media > Photography (0.46)
Muon is Scalable for LLM Training
Liu, Jingyuan, Su, Jianlin, Yao, Xingcheng, Jiang, Zhejun, Lai, Guokun, Du, Yulun, Qin, Yidao, Xu, Weixin, Lu, Enzhe, Yan, Junjie, Chen, Yanru, Zheng, Huabin, Liu, Yibo, Liu, Shaowei, Yin, Bohong, He, Weiran, Zhu, Han, Wang, Yuzhi, Wang, Jianzhou, Dong, Mengnan, Zhang, Zheng, Kang, Yongsheng, Zhang, Hao, Xu, Xinran, Zhang, Yutao, Wu, Yuxin, Zhou, Xinyu, Yang, Zhilin
Recently, the Muon optimizer, based on matrix orthogonalization, has demonstrated strong results in training small-scale language models, but its scalability to larger models had not been proven. We identify two crucial techniques for scaling up Muon: (1) adding weight decay and (2) carefully adjusting the per-parameter update scale. These techniques allow Muon to work out-of-the-box on large-scale training without the need for hyper-parameter tuning. Scaling law experiments indicate that Muon achieves roughly $2\times$ the computational efficiency of AdamW under compute-optimal training. Based on these improvements, we introduce Moonlight, a 3B/16B-parameter Mixture-of-Experts (MoE) model trained with 5.7T tokens using Muon. Our model improves the current Pareto frontier, achieving better performance with far fewer training FLOPs than prior models. We open-source our distributed Muon implementation, which is memory-optimal and communication-efficient. We also release the pretrained, instruction-tuned, and intermediate checkpoints to support future research.
- Asia > Middle East > Jordan (0.05)
- North America > United States > California > San Diego County > San Diego (0.04)
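The two scaling techniques named in the Moonlight abstract, weight decay and a per-parameter update scale, can be sketched as below. Scaling the orthogonalized update by a factor proportional to $\sqrt{\max(A, B)}$ for an $A \times B$ weight matrix, so its RMS roughly matches AdamW's, follows common descriptions of the recipe; the exact constants here are assumptions.

```python
import math
import numpy as np

def scaled_muon_update(W, orth_update, lr=1e-3, wd=0.1, rms_target=0.2):
    """Apply a Muon update with the two fixes from the abstract:
    (1) decoupled weight decay, and (2) a per-parameter scale
    proportional to sqrt(max(fan_out, fan_in)) so the update RMS
    roughly matches AdamW's. `orth_update` is the already
    orthogonalized momentum matrix. All constants are illustrative."""
    a, b = W.shape
    scale = rms_target * math.sqrt(max(a, b))
    return W * (1.0 - lr * wd) - lr * scale * orth_update
```

Because the scale depends only on the matrix shape, the same base learning rate can be shared across layers of different sizes, which is what lets Muon run "out-of-the-box" without per-layer hyper-parameter tuning.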
ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction
Duan, Zhongjie, Zhao, Qianyi, Chen, Cen, Chen, Daoyuan, Zhou, Wenmeng, Li, Yaliang, Chen, Yingda
The emergence of diffusion models has significantly advanced image synthesis. Recent studies of model interaction and self-corrective reasoning approaches in large language models offer new insights for enhancing text-to-image models. Inspired by these studies, we propose ArtAug, a novel method for enhancing text-to-image models. To the best of our knowledge, ArtAug is the first method to improve image synthesis models via interactions with understanding models. In these interactions, we leverage human preferences implicitly learned by image understanding models to provide fine-grained suggestions for image synthesis models. The interactions can modify the image content to make it more aesthetically pleasing, for example by adjusting exposure, changing shooting angles, and adding atmospheric effects. The enhancements brought by the interaction are iteratively fused into the synthesis model itself through an additional enhancement module, enabling the synthesis model to directly produce aesthetically pleasing images without any extra computational cost. In our experiments, we train the ArtAug enhancement module on existing text-to-image models. Various evaluation metrics consistently demonstrate that ArtAug enhances the generative capabilities of text-to-image models without incurring additional computational costs. The source code and models will be released publicly.
Chinese Traditional Poetry Generating System Based on Deep Learning
Chinese traditional poetry is an important intangible cultural heritage of China and an artistic carrier of thought, culture, spirit, and emotion. However, because of the strict rules of ancient poetry, writing poetry by machine is very difficult. This paper proposes an automatic generation method for Chinese traditional poetry based on deep learning: keywords are extracted from each poem and matched with the preceding text so that the poem conforms to the theme, and when a user inputs a paragraph of text, the machine infers the theme and generates the poem sentence by sentence. The classic word2vec model serves as the preprocessing step, transforming Chinese characters, which the computer cannot interpret directly, into matrices for processing. A Bi-directional Long Short-Term Memory network generates Chinese characters one by one, keeping their meaning as accurate as possible, while TF-IDF and TextRank are used to extract keywords. An attention-based encoder-decoder model strengthens important long-distance information, so the system grasps the key points without losing important information. For emotion judgment, a Long Short-Term Memory network is used. The final results show that the system produces good poetry outputs according to the user's input text.
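The keyword-extraction step in the pipeline above can be illustrated with a minimal TF-IDF scorer. This is a sketch only: the paper also uses TextRank, and the tokenization and smoothing choices here are assumptions.

```python
import math
from collections import Counter

def tfidf_keywords(doc_tokens, corpus, k=3):
    """Rank the tokens of one document by TF-IDF against a corpus of
    tokenized documents and return the top-k as theme keywords
    (illustrative sketch of the keyword-extraction stage)."""
    n = len(corpus)
    df = Counter()                 # document frequency per token
    for doc in corpus:
        df.update(set(doc))
    tf = Counter(doc_tokens)       # term frequency in this document
    scores = {
        w: (tf[w] / len(doc_tokens)) * math.log((n + 1) / (df[w] + 1))
        for w in tf
    }
    return [w for w, _ in sorted(scores.items(), key=lambda x: -x[1])[:k]]
```

A token that is frequent in the input but rare across the corpus scores highest, which is how the generator keeps each poem line anchored to the user's theme.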
European Space Agency reveals ambitious plans to build sat-nav around the moon
The European Space Agency (ESA) has launched an ambitious new project to build a sat-nav and communication satellite network in orbit around the moon. This new infrastructure could one day turn our natural satellite into the 'eighth continent' as humanity spreads its wings and builds cities on the lunar surface. ESA says the project, known as Moonlight, will support the Lunar Gateway space station, multiple agencies working on moon missions and human exploration. In what will be the world's first commercial service of its kind, a number of British firms have won contracts to investigate how it might work, worth over £2 million. 'We are entering a new phase - the systematic exploration of our "eighth continent", the Moon,' ESA's David Parker told BBC News.
- Europe (0.95)
- North America > United States > Florida > Brevard County (0.15)
- Government > Space Agency (1.00)
- Government > Regional Government > North America Government > United States Government (0.33)
'The Women's Balcony,' 'Moonlight' and more critics' picks, March 3-9
Arrival Amy Adams stars in this elegant, involving science-fiction drama that is simultaneously old and new, revisiting many alien-invasion conventions but with unexpected intelligence, visual style and heart. Elle Paul Verhoeven's brilliantly booby-trapped thriller starring Isabelle Huppert is a gripping whodunit, a tour de force of psychological suspense and a wickedly droll comedy of manners. The Founder Michael Keaton gives a performance of ratty, reptilian brilliance as Ray Kroc, the American salesman who turned a California burger stand into the global fast-food behemoth that is McDonald's, in John Lee Hancock's shrewd and satisfyingly fat-free biopic. I Am Not Your Negro As directed by the gifted Raoul Peck, this documentary on James Baldwin uses the entire spectrum of movie effects, not only spoken language but also sound, music, editing and all manner of visuals, to create a cinematic essay that is powerful and painfully relevant. La La Land Starring a well-paired Ryan Gosling and Emma Stone, writer-director Damien Chazelle's tuneful tribute to classic movie musicals is often stronger in concept than execution, but it's lovely and transporting all the same.
- North America > United States > California (0.26)
- Asia > Middle East > Israel > Jerusalem District > Jerusalem (0.06)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)
- Health & Medicine (1.00)