AITopics | Cheng, Qixiang

Collaborating Authors

Cheng, Qixiang

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI

Xing, Sizhe, Sun, Aolong, Wang, Chengxi, Wang, Yizhi, Dong, Boyu, Hu, Junhui, Deng, Xuyu, Yan, An, Liu, Yingjun, Hu, Fangchen, Li, Zhongya, Huang, Ouhan, Zhao, Junhao, Zhou, Yingjun, Li, Ziwei, Shi, Jianyang, Xiao, Xi, Penty, Richard, Cheng, Qixiang, Chi, Nan, Zhang, Junwen

arXiv.org Artificial IntelligenceDec-4-2024

The rapid advancement of generative artificial intelligence (AI) in recent years has profoundly reshaped modern lifestyles, necessitating a revolutionary architecture to support the growing demands for computational power. Cloud computing has become the driving force behind this transformation. However, it consumes significant power and faces computation security risks due to the reliance on extensive data centers and servers in the cloud. Reducing power consumption while enhancing computational scale remains persistent challenges in cloud computing. Here, we propose and experimentally demonstrate an optical cloud computing system that can be seamlessly deployed across edge-metro network. By modulating inputs and models into light, a wide range of edge nodes can directly access the optical computing center via the edge-metro network. The experimental validations show an energy efficiency of 118.6 mW/TOPs (tera operations per second), reducing energy consumption by two orders of magnitude compared to traditional electronic-based cloud computing solutions. Furthermore, it is experimentally validated that this architecture can perform various complex generative AI models through parallel computing to achieve image generation tasks.

cloud computing, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2412.12126

Country: Asia > China (0.47)

Genre: Research Report > New Finding (0.46)

Industry:

Energy (1.00)
Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.91)

Add feedback

Asymmetrical estimator for training grey-box deep photonic neural networks

Wang, Yizhi, Chen, Minjia, Yao, Chunhui, Ma, Jie, Yan, Ting, Penty, Richard, Cheng, Qixiang

arXiv.org Artificial IntelligenceMay-28-2024

Physical neural networks (PNNs) are emerging paradigms for neural network acceleration due to their high-bandwidth, in-propagation analogue processing. Despite the advantages of PNN for inference, training remains a challenge. The imperfect information of the physical transformation means the failure of conventional gradient-based updates from backpropagation (BP). Here, we present the asymmetrical training (AT) method, which treats the PNN structure as a grey box. AT performs training while only knowing the last layer output and neuron topological connectivity of a deep neural network structure, not requiring information about the physical control-transformation mapping. We experimentally demonstrated the AT method on deep grey-box PNNs implemented by uncalibrated photonic integrated circuits (PICs), improving the classification accuracy of Iris flower and modified MNIST hand-written digits from random guessing to near theoretical maximum. We also showcased the consistently enhanced performance of AT over BP for different datasets, including MNIST, fashion-MNIST, and Kuzushiji-MNIST. The AT method demonstrated successful training with minimal hardware overhead and reduced computational overhead, serving as a robust light-weight training alternative to fully explore the advantages of physical computation.

artificial intelligence, information, machine learning, (20 more...)

arXiv.org Artificial Intelligence

2405.18458

Country: Europe > United Kingdom (0.14)

Genre: Research Report (0.50)

Industry: Energy > Oil & Gas (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.49)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.48)

Add feedback