bingsheng
ML-Asset Management: Curation, Discovery, and Utilization
Wang, Mengying, Duan, Moming, Huang, Yicong, Li, Chen, He, Bingsheng, Wu, Yinghui
Machine learning (ML) assets, such as models, datasets, and metadata, are central to modern ML workflows. Despite their explosive growth in practice, these assets are often underutilized due to fragmented documentation, siloed storage, inconsistent licensing, and lack of unified discovery mechanisms, making ML-asset management an urgent challenge. This tutorial offers a comprehensive overview of ML-asset management activities across its lifecycle, including curation, discovery, and utilization. We provide a categorization of ML assets, and major management issues, survey state-of-the-art techniques, and identify emerging opportunities at each stage. We further highlight system-level challenges related to scalability, lineage, and unified indexing. Through live demonstrations of systems, this tutorial equips both researchers and practitioners with actionable insights and practical tools for advancing ML-asset management in real-world and domain-specific settings.
- North America > United States > California > Orange County > Irvine (0.14)
- North America > United States > Ohio > Cuyahoga County > Cleveland (0.04)
- Asia > Singapore > Central Region > Singapore (0.04)
- Asia > Middle East > Jordan (0.04)
- Law (1.00)
- Banking & Finance > Trading (1.00)
RGL: A Graph-Centric, Modular Framework for Efficient Retrieval-Augmented Generation on Graphs
Li, Yuan, Hu, Jun, Jiang, Jiaxin, Liu, Zemin, Hooi, Bryan, He, Bingsheng
Recent advances in graph learning have paved the way for innovative retrieval-augmented generation (RAG) systems that leverage the inherent relational structures in graph data. However, many existing approaches suffer from rigid, fixed settings and significant engineering overhead, limiting their adaptability and scalability. Additionally, the RAG community has largely overlooked the decades of research in the graph database community regarding the efficient retrieval of interesting substructures on large-scale graphs. In this work, we introduce the RAG-on-Graphs Library (RGL), a modular framework that seamlessly integrates the complete RAG pipeline-from efficient graph indexing and dynamic node retrieval to subgraph construction, tokenization, and final generation-into a unified system. RGL addresses key challenges by supporting a variety of graph formats and integrating optimized implementations for essential components, achieving speedups of up to 143x compared to conventional methods. Moreover, its flexible utilities, such as dynamic node filtering, allow for rapid extraction of pertinent subgraphs while reducing token consumption. Our extensive evaluations demonstrate that RGL not only accelerates the prototyping process but also enhances the performance and applicability of graph-based RAG systems across a range of tasks.
- Asia > Singapore > Central Region > Singapore (0.05)
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
- Asia > China (0.04)
PyGDA: A Python Library for Graph Domain Adaptation
Zhang, Zhen, Liu, Meihan, He, Bingsheng
Graph domain adaptation has emerged as a promising approach to facilitate knowledge transfer across different domains. Recently, numerous models have been proposed to enhance their generalization capabilities in this field. However, there is still no unified library that brings together existing techniques and simplifies their implementation. To fill this gap, we introduce PyGDA, an open-source Python library tailored for graph domain adaptation. As the first comprehensive library in this area, PyGDA covers more than 20 widely used graph domain adaptation methods together with different types of graph datasets. Specifically, PyGDA offers modular components, enabling users to seamlessly build custom models with a variety of commonly used utility functions. To handle large-scale graphs, PyGDA includes support for features such as sampling and mini-batch processing, ensuring efficient computation. In addition, PyGDA also includes comprehensive performance benchmarks and well-documented user-friendly API for both researchers and practitioners. To foster convenient accessibility, PyGDA is released under the MIT license at https://github.com/pygda-team/pygda, and the API documentation is https://pygda.readthedocs.io/en/stable/.
- North America > United States > New Mexico > Los Alamos County > Los Alamos (0.05)
- Asia > Singapore > Central Region > Singapore (0.04)
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- Asia > China > Zhejiang Province > Hangzhou (0.04)
Welcome
Welcome to the special section highlighting cutting-edge research and innovation emerging from East Asia and Oceania. Our region encompasses Southeast Asia, Oceania, and Asia-Pacific countries, including Japan and Korea. The articles in this section--designated as "Hot Topics" and "Big Trends"--aim to not only showcase technological advancements from this region, but also to strengthen research collaboration and communication with regions worldwide. This special section brings together some of the most innovative research in computer science and technology from this flourishing region. The articles cover a wide range of topics, from state-of-the-art developments in learning analytics, AI and machine learning, education, Big Data, neuromorphic computing, and blockchain technology, to applications in disease prediction and assistive devices.
- Asia > East Asia (0.29)
- Asia > Southeast Asia (0.26)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.17)
- (6 more...)