AITopics | Zong, Zeshun

Collaborating Authors

Zong, Zeshun

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

GRIP: A General Robotic Incremental Potential Contact Simulation Dataset for Unified Deformable-Rigid Coupled Grasping

Ma, Siyu, Du, Wenxin, Yu, Chang, Jiang, Ying, Zong, Zeshun, Xie, Tianyi, Chen, Yunuo, Yang, Yin, Han, Xuchen, Jiang, Chenfanfu

arXiv.org Artificial IntelligenceMar-6-2025

Grasping is fundamental to robotic manipulation, and recent advances in large-scale grasping datasets have provided essential training data and evaluation benchmarks, accelerating the development of learning-based methods for robust object grasping. However, most existing datasets exclude deformable bodies due to the lack of scalable, robust simulation pipelines, limiting the development of generalizable models for compliant grippers and soft manipulands. To address these challenges, we present GRIP, a General Robotic Incremental Potential contact simulation dataset for universal grasping. GRIP leverages an optimized Incremental Potential Contact (IPC)-based simulator for multi-environment data generation, achieving up to 48x speedup while ensuring efficient, intersection- and inversion-free simulations for compliant grippers and deformable objects. Our fully automated pipeline generates and evaluates diverse grasp interactions across 1,200 objects and 100,000 grasp poses, incorporating both soft and rigid grippers. The GRIP dataset enables applications such as neural grasp generation and stress field prediction.

artificial intelligence, dataset, simulation, (15 more...)

arXiv.org Artificial Intelligence

2503.0502

Country: Europe > Switzerland > Zürich > Zürich (0.14)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.46)

Add feedback

A Convex Formulation of Material Points and Rigid Bodies with GPU-Accelerated Async-Coupling for Interactive Simulation

Yu, Chang, Du, Wenxin, Zong, Zeshun, Castro, Alejandro, Jiang, Chenfanfu, Han, Xuchen

arXiv.org Artificial IntelligenceMar-6-2025

We present a novel convex formulation that weakly couples the Material Point Method (MPM) with rigid body dynamics through frictional contact, optimized for efficient GPU parallelization. Our approach features an asynchronous time-splitting scheme to integrate MPM and rigid body dynamics under different time step sizes. We develop a globally convergent quasi-Newton solver tailored for massive parallelization, achieving up to 500x speedup over previous convex formulations without sacrificing stability. Our method enables interactive-rate simulations of robotic manipulation tasks with diverse deformable objects including granular materials and cloth, with strong convergence guarantees. We detail key implementation strategies to maximize performance and validate our approach through rigorous experiments, demonstrating superior speed, accuracy, and stability compared to state-of-the-art MPM simulators for robotics. We make our method available in the open-source robotics toolkit, Drake.

artificial intelligence, parallelization, simulation, (18 more...)

arXiv.org Artificial Intelligence

2503.05046

Country: North America > United States (0.28)

Genre: Research Report (0.40)

Technology:

Information Technology > Graphics (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Embedded IPC: Fast and Intersection-free Simulation in Reduced Subspace for Robot Manipulation

Du, Wenxin, Yu, Chang, Ma, Siyu, Jiang, Ying, Zong, Zeshun, Yang, Yin, Masterjohn, Joe, Castro, Alejandro, Han, Xuchen, Jiang, Chenfanfu

arXiv.org Artificial IntelligenceSep-24-2024

Physics-based simulation is essential for developing and evaluating robot manipulation policies, particularly in scenarios involving deformable objects and complex contact interactions. However, existing simulators often struggle to balance computational efficiency with numerical accuracy, especially when modeling deformable materials with frictional contact constraints. We introduce an efficient subspace representation for the Incremental Potential Contact (IPC) method, leveraging model reduction to decrease the number of degrees of freedom. Our approach decouples simulation complexity from the resolution of the input model by representing elasticity in a low-resolution subspace while maintaining collision constraints on an embedded high-resolution surface. Our barrier formulation ensures intersection-free trajectories and configurations regardless of material stiffness, time step size, or contact severity. We validate our simulator through quantitative experiments with a soft bubble gripper grasping and qualitative demonstrations of placing a plate on a dish rack. The results demonstrate our simulator's efficiency, physical accuracy, computational stability, and robust handling of frictional contact, making it well-suited for generating demonstration data and evaluating downstream robot training applications.

artificial intelligence, gripper, simulation, (16 more...)

arXiv.org Artificial Intelligence

2409.16385

Country: North America > United States (0.28)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Robots > Manipulation (0.70)

Add feedback

VideoPhy: Evaluating Physical Commonsense for Video Generation

Bansal, Hritik, Lin, Zongyu, Xie, Tianyi, Zong, Zeshun, Yarom, Michal, Bitton, Yonatan, Jiang, Chenfanfu, Sun, Yizhou, Chang, Kai-Wei, Grover, Aditya

arXiv.org Artificial IntelligenceJun-5-2024

Recent advances in internet-scale video data pretraining have led to the development of text-to-video generative models that can create high-quality videos across a broad range of visual concepts and styles. Due to their ability to synthesize realistic motions and render complex objects, these generative models have the potential to become general-purpose simulators of the physical world. However, it is unclear how far we are from this goal with the existing text-to-video generative models. To this end, we present VideoPhy, a benchmark designed to assess whether the generated videos follow physical commonsense for real-world activities (e.g. marbles will roll down when placed on a slanted surface). Specifically, we curate a list of 688 captions that involve interactions between various material types in the physical world (e.g., solid-solid, solid-fluid, fluid-fluid). We then generate videos conditioned on these captions from diverse state-of-the-art text-to-video generative models, including open models (e.g., VideoCrafter2) and closed models (e.g., Lumiere from Google, Pika). Further, our human evaluation reveals that the existing models severely lack the ability to generate videos adhering to the given text prompts, while also lack physical commonsense. Specifically, the best performing model, Pika, generates videos that adhere to the caption and physical laws for only 19.7% of the instances. VideoPhy thus highlights that the video generative models are far from accurately simulating the physical world. Finally, we also supplement the dataset with an auto-evaluator, VideoCon-Physics, to assess semantic adherence and physical commonsense at scale.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2406.0352

Country: North America > United States > California (0.14)

Genre: Research Report > New Finding (0.67)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

Chen, Yunuo, Xie, Tianyi, Zong, Zeshun, Li, Xuan, Gao, Feng, Yang, Yin, Wu, Ying Nian, Jiang, Chenfanfu

arXiv.org Artificial IntelligenceMay-28-2024

Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to maintain balance when placed in physics-based simulations or 3D printed. This balance is crucial for satisfying user design intentions in interactive gaming, embodied AI, and robotics, where stable models are needed for reliable interaction. Additionally, stable models ensure that 3D-printed objects, such as figurines for home decoration, can stand on their own without requiring additional supports. To fill this gap, we introduce Atlas3D, an automatic and easy-to-implement method that enhances existing Score Distillation Sampling (SDS)-based text-to-3D tools. Atlas3D ensures the generation of self-supporting 3D models that adhere to physical laws of stability under gravity, contact, and friction. Our approach combines a novel differentiable simulation-based loss function with physically inspired regularization, serving as either a refinement or a post-processing module for existing frameworks. We verify Atlas3D's efficacy through extensive generation tasks and validate the resulting 3D models in both simulated and real-world environments.

arxiv preprint arxiv, machine learning, natural language, (14 more...)

arXiv.org Artificial Intelligence

2405.18515

Country: North America > United States > California (0.14)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

A Convex Formulation of Frictional Contact for the Material Point Method and Rigid Bodies

Zong, Zeshun, Jiang, Chenfanfu, Han, Xuchen

arXiv.org Artificial IntelligenceMar-22-2024

Since then, MPM has been applied to many other compliant contact approach, [42] builds on top of [43] and engineering problems such as the simulation of landslide [44] to formulate an unconstrained convex optimization problem [10], terramechanics [11], and avalanche [12]. The adoption for frictional contact and proposes the Semi-Analytical and development of MPM have significantly accelerated following Primal (SAP) solver that guarantees global convergence. This its introduction to the computer graphics community method is extended by [45] to support deformable bodies [13]. This has led to its application in simulating a variety modeled with Finite Element Method (FEM). In this work, of phenomena, especially those characterized by elastoplastic we propose a further extension to incorporate MPM, enabling behaviors, by employing specialized elastoplasticity constitutive the robust simulation of a broader spectrum of materials and models. This includes the dynamics of non-Newtonian further enriching the domain of robotics simulation.

artificial intelligence, mpm, simulation, (14 more...)

arXiv.org Artificial Intelligence

2403.13783

Country: North America > United States > California (0.14)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas > Upstream (0.46)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Gaussian Splashing: Dynamic Fluid Synthesis with Gaussian Splatting

Feng, Yutao, Feng, Xiang, Shang, Yintong, Jiang, Ying, Yu, Chang, Zong, Zeshun, Shao, Tianjia, Wu, Hongzhi, Zhou, Kun, Jiang, Chenfanfu, Yang, Yin

arXiv.org Artificial IntelligenceJan-27-2024

We demonstrate the feasibility of integrating physics-based animations of solids and fluids with 3D Gaussian Splatting (3DGS) to create novel effects in virtual scenes reconstructed using 3DGS. Leveraging the coherence of the Gaussian splatting and position-based dynamics (PBD) in the underlying representation, we manage rendering, view synthesis, and the dynamics of solids and fluids in a cohesive manner. Similar to Gaussian shader, we enhance each Gaussian kernel with an added normal, aligning the kernel's orientation with the surface normal to refine the PBD simulation. This approach effectively eliminates spiky noises that arise from rotational deformation in solids. It also allows us to integrate physically based rendering to augment the dynamic surface reflections on fluids. Consequently, our framework is capable of realistically reproducing surface highlights on dynamic fluids and facilitating interactions between scene objects and fluids from new views. For more information, please visit our project page at \url{https://amysteriouscat.github.io/GaussianSplashing/}.

artificial intelligence, human computer interaction, kernel, (17 more...)

arXiv.org Artificial Intelligence

2401.15318

Country:

North America > United States > Utah (0.15)
Asia > China > Zhejiang Province (0.15)
North America > United States > California (0.14)
(2 more...)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Human Computer Interaction > Interfaces > Virtual Reality (0.34)

Add feedback

PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Xie, Tianyi, Zong, Zeshun, Qiu, Yuxing, Li, Xuan, Feng, Yutao, Yang, Yin, Jiang, Chenfanfu

arXiv.org Artificial IntelligenceNov-22-2023

We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principles. A defining characteristic of our method is the seamless integration between physical simulation and visual rendering: both components utilize the same 3D Gaussian kernels as their discrete representations. This negates the necessity for triangle/tetrahedron meshing, marching cubes, "cage meshes," or any other geometry embedding, highlighting the principle of "what you see is what you simulate (WS$^2$)." Our method demonstrates exceptional versatility across a wide variety of materials--including elastic entities, metals, non-Newtonian fluids, and granular materials--showcasing its strong capabilities in creating diverse visual content with novel viewpoints and movements. Our project page is at: https://xpandora.github.io/PhysGaussian/

artificial intelligence, gaussian, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2311.12198

Country: Europe > United Kingdom (0.14)

Genre: Research Report (0.64)

Industry: Energy > Oil & Gas > Upstream (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Graphics (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Add feedback

Neural Stress Fields for Reduced-order Elastoplasticity and Fracture

Zong, Zeshun, Li, Xuan, Li, Minchen, Chiaramonte, Maurizio M., Matusik, Wojciech, Grinspun, Eitan, Carlberg, Kevin, Jiang, Chenfanfu, Chen, Peter Yichen

arXiv.org Artificial IntelligenceOct-26-2023

We propose a hybrid neural network and physics framework for reduced-order modeling of elastoplasticity and fracture. State-of-the-art scientific computing models like the Material Point Method (MPM) faithfully simulate large-deformation elastoplasticity and fracture mechanics. However, their long runtime and large memory consumption render them unsuitable for applications constrained by computation time and memory usage, e.g., virtual reality. To overcome these barriers, we propose a reduced-order framework. Our key innovation is training a low-dimensional manifold for the Kirchhoff stress field via an implicit neural representation. This low-dimensional neural stress field (NSF) enables efficient evaluations of stress values and, correspondingly, internal forces at arbitrary spatial locations. In addition, we also train neural deformation and affine fields to build low-dimensional manifolds for the deformation and affine momentum fields. These neural stress, deformation, and affine fields share the same low-dimensional latent space, which uniquely embeds the high-dimensional simulation state. After training, we run new simulations by evolving in this single latent space, which drastically reduces the computation time and memory consumption. Our general continuum-mechanics-based reduced-order framework is applicable to any phenomena governed by the elastodynamics equation. To showcase the versatility of our framework, we simulate a wide range of material behaviors, including elastica, sand, metal, non-Newtonian fluids, fracture, contact, and collision. We demonstrate dimension reduction by up to 100,000X and time savings by up to 10X.

artificial intelligence, machine learning, reduced-order elastoplasticity and fracture, (1 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3610548.3618207

2310.1779

Genre: Research Report (0.40)

Industry: Energy > Oil & Gas > Upstream (0.53)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.53)

Add feedback