AITopics | Gong, Ziqin

Gong, Ziqin

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Wang, Jun, Fang, Meng, Wan, Ziyu, Wen, Muning, Zhu, Jiachen, Liu, Anjie, Gong, Ziqin, Song, Yan, Chen, Lei, Ni, Lionel M., Yang, Linyi, Wen, Ying, Zhang, Weinan

arXiv.org Artificial Intelligence, Oct-12-2024

In this technical report, we introduce OpenR, an open-source framework designed to integrate key components for enhancing the reasoning capabilities of large language models (LLMs). OpenR unifies data acquisition, reinforcement learning training (both online and offline), and non-autoregressive decoding into a cohesive software platform. Our goal is to establish an open-source platform and community to accelerate the development of LLM reasoning. Inspired by the success of OpenAI's o1 model, which demonstrated improved reasoning abilities through step-by-step reasoning and reinforcement learning, OpenR integrates test-time compute, reinforcement learning, and process supervision to improve reasoning in LLMs. Our work is the first to provide an open-source framework that explores the core techniques of OpenAI's o1 model with reinforcement learning, achieving advanced reasoning capabilities beyond traditional autoregressive methods. We demonstrate the efficacy of OpenR by evaluating it on the MATH dataset, utilising publicly available data and search methods. Our initial experiments confirm substantial gains, with relative improvements in reasoning and performance driven by test-time computation and reinforcement learning through process reward models. The OpenR framework, including code, models, and datasets, is accessible at https://openreasoner.github.io.

large language model, machine learning, natural language, (18 more...)

arXiv: 2410.09671

Country: Asia > China (0.28)

Genre:

Workflow (1.00)
Research Report (0.64)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.45)

Looking Ahead to Avoid Being Late: Solving Hard-Constrained Traveling Salesman Problem

Chen, Jingxiao, Gong, Ziqin, Liu, Minghuan, Wang, Jun, Yu, Yong, Zhang, Weinan

arXiv.org Artificial Intelligence, Mar-8-2024

Many real-world problems can be formulated as a constrained Traveling Salesman Problem (TSP). However, the constraints are often complex and numerous, making these TSPs challenging to solve. As the number of complicated constraints grows, traditional heuristic algorithms spend increasing time avoiding illegitimate outcomes. Learning-based methods offer an alternative that solves TSPs in a soft manner and supports GPU acceleration to generate solutions quickly. Nevertheless, the soft manner makes it difficult for learning algorithms to solve hard-constrained problems, and the conflict between legality and optimality may substantially degrade solution quality. To overcome this problem and handle hard constraints effectively, we propose MUSLA, a novel learning-based method that uses looking-ahead information as a feature to improve the legality of solutions to the TSP with Time Windows (TSPTW). In addition, we construct TSPTW datasets with hard constraints to accurately evaluate and benchmark the statistical performance of various approaches, which can serve the community in future research. In comprehensive experiments on diverse datasets, MUSLA outperforms existing baselines and shows generalizability potential.

artificial intelligence, constraint-based reasoning, dataset, (16 more...)

arXiv: 2403.05318

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)

RITA: Boost Driving Simulators with Realistic Interactive Traffic Flow

Zhu, Zhengbang, Zhang, Shenyu, Zhuang, Yuzheng, Liu, Yuecheng, Liu, Minghuan, Mao, Liyuan, Gong, Ziqin, Kai, Shixiong, Gu, Qiang, Wang, Bin, Cheng, Siyuan, Wang, Xinyu, Hao, Jianye, Yu, Yong

arXiv.org Artificial Intelligence, Dec-7-2023

High-quality traffic flow generation is the core module in building simulators for autonomous driving. However, most available simulators cannot replicate traffic patterns that accurately reflect the various features of real-world data while also simulating human-like reactive responses to the tested autopilot driving strategies. As a step toward addressing this problem, we propose Realistic Interactive TrAffic flow (RITA) as an integrated component of existing driving simulators to provide high-quality traffic flow for the evaluation and optimization of the tested driving strategies. RITA is developed with three key features in mind, namely fidelity, diversity, and controllability, and consists of two core modules, RITABackend and RITAKit. RITABackend supports vehicle-wise control and provides traffic generation models from real-world datasets, while RITAKit offers easy-to-use interfaces for controllable traffic generation via RITABackend. We demonstrate RITA's capacity to create diversified and high-fidelity traffic simulations in several highly interactive highway scenarios. The experimental findings show that the generated RITA traffic flows exhibit all three key features, thereby enhancing the completeness of driving strategy evaluation. Moreover, we showcase the possibility of further improving baseline strategies through online fine-tuning with RITA traffic flows.

artificial intelligence, machine learning, traffic flow, (18 more...)

doi: 10.1145/3627676.3627681

arXiv: 2211.03408

Country:

Asia > China (0.14)
North America > United States (0.14)
Europe > Italy (0.14)

Genre: Research Report (0.69)

Industry:

Consumer Products & Services > Travel (1.00)
Transportation > Ground > Road (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.89)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)
