AITopics | Wang, Zihe

Collaborating Authors

Wang, Zihe

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation

Feng, Zhenyang, Wang, Zihe, Bueno, Saul Ibaven, Frelek, Tomasz, Ramesh, Advikaa, Bai, Jingyan, Wang, Lemeng, Huang, Zanming, Gu, Jianyang, Yoo, Jinsu, Pan, Tai-Yu, Chowdhury, Arpita, Ramirez, Michelle, Campolongo, Elizabeth G., Thompson, Matthew J., Lawrence, Christopher G., Record, Sydne, Rosser, Neil, Karpatne, Anuj, Rubenstein, Daniel, Lapp, Hilmar, Stewart, Charles V., Berger-Wolf, Tanya, Su, Yu, Chao, Wei-Lun

arXiv.org Artificial IntelligenceJan-12-2025

We study image segmentation in the biological domain, particularly trait and part segmentation from specimen images (e.g., butterfly wing stripes or beetle body parts). This is a crucial, fine-grained task that aids in understanding the biology of organisms. The conventional approach involves hand-labeling masks, often for hundreds of images per species, and training a segmentation model to generalize these labels to other images, which can be exceedingly laborious. We present a label-efficient method named Static Segmentation by Tracking (SST). SST is built upon the insight: while specimens of the same species have inherent variations, the traits and parts we aim to segment show up consistently. This motivates us to concatenate specimen images into a ``pseudo-video'' and reframe trait and part segmentation as a tracking problem. Concretely, SST generates masks for unlabeled images by propagating annotated or predicted masks from the ``pseudo-preceding'' images. Powered by Segment Anything Model 2 (SAM~2) initially developed for video segmentation, we show that SST can achieve high-quality trait and part segmentation with merely one labeled image per species -- a breakthrough for analyzing specimen images. We further develop a cycle-consistent loss to fine-tune the model, again using one labeled image. Additionally, we highlight the broader potential of SST, including one-shot instance segmentation on images taken in the wild and trait-based image retrieval.

machine learning, natural language, segmentation, (16 more...)

arXiv.org Artificial Intelligence

2501.06749

Country: North America > United States > Wisconsin (0.14)

Genre: Research Report > Promising Solution (0.34)

Industry: Health & Medicine (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Are High-Degree Representations Really Unnecessary in Equivariant Graph Neural Networks?

Cen, Jiacheng, Li, Anyi, Lin, Ning, Ren, Yuxiang, Wang, Zihe, Huang, Wenbing

arXiv.org Artificial IntelligenceOct-30-2024

Equivariant Graph Neural Networks (GNNs) that incorporate E(3) symmetry have achieved significant success in various scientific applications. As one of the most successful models, EGNN leverages a simple scalarization technique to perform equivariant message passing over only Cartesian vectors (i.e., 1st-degree steerable vectors), enjoying greater efficiency and efficacy compared to equivariant GNNs using higher-degree steerable vectors. This success suggests that higher-degree representations might be unnecessary. In this paper, we disprove this hypothesis by exploring the expressivity of equivariant GNNs on symmetric structures, including $k$-fold rotations and regular polyhedra. We theoretically demonstrate that equivariant GNNs will always degenerate to a zero function if the degree of the output representations is fixed to 1 or other specific values. Based on this theoretical insight, we propose HEGNN, a high-degree version of EGNN to increase the expressivity by incorporating high-degree steerable vectors while maintaining EGNN's efficiency through the scalarization trick. Our extensive experiments demonstrate that HEGNN not only aligns with our theoretical analyses on toy datasets consisting of symmetric structures, but also shows substantial improvements on more complicated datasets such as $N$-body and MD17. Our theoretical findings and empirical results potentially open up new possibilities for the research of equivariant GNNs.

artificial intelligence, international conference, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2410.11443

Country: Asia > China (0.46)

Genre: Research Report (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A Survey of Geometric Graph Neural Networks: Data Structures, Models and Applications

Han, Jiaqi, Cen, Jiacheng, Wu, Liming, Li, Zongzhao, Kong, Xiangzhe, Jiao, Rui, Yu, Ziyang, Xu, Tingyang, Wu, Fandi, Wang, Zihe, Xu, Hongteng, Wei, Zhewei, Liu, Yang, Rong, Yu, Huang, Wenbing

arXiv.org Artificial IntelligenceMar-1-2024

Geometric graph is a special kind of graph with geometric features, which is vital to model many scientific problems. Unlike generic graphs, geometric graphs often exhibit physical symmetries of translations, rotations, and reflections, making them ineffectively processed by current Graph Neural Networks (GNNs). To tackle this issue, researchers proposed a variety of Geometric Graph Neural Networks equipped with invariant/equivariant properties to better characterize the geometry and topology of geometric graphs. Given the current progress in this field, it is imperative to conduct a comprehensive survey of data structures, models, and applications related to geometric GNNs. In this paper, based on the necessary but concise mathematical preliminaries, we provide a unified view of existing models from the geometric message passing perspective. Additionally, we summarize the applications as well as the related datasets to facilitate later research for methodology development and experimental evaluation. We also discuss the challenges and future potential directions of Geometric GNNs at the end of this survey.

artificial intelligence, conference, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2403.00485

Country:

North America > United States (0.45)
Europe (0.27)

Genre: Overview (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Energy (0.67)
Materials (0.67)
(2 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation

Wei, Yake, Feng, Ruoxuan, Wang, Zihe, Hu, Di

arXiv.org Artificial IntelligenceNov-21-2023

One primary topic of multi-modal learning is to jointly incorporate heterogeneous information from different modalities. However, most models often suffer from unsatisfactory multi-modal cooperation, which could not jointly utilize all modalities well. Some methods are proposed to identify and enhance the worse learnt modality, but are often hard to provide the fine-grained observation of multi-modal cooperation at sample-level with theoretical support. Hence, it is essential to reasonably observe and improve the fine-grained cooperation between modalities, especially when facing realistic scenarios where the modality discrepancy could vary across different samples. To this end, we introduce a fine-grained modality valuation metric to evaluate the contribution of each modality at sample-level. Via modality valuation, we regretfully observe that the multi-modal model tends to rely on one specific modality, resulting in other modalities being low-contributing. We further analyze this issue and improve cooperation between modalities by enhancing the discriminative ability of low-contributing modalities in a targeted manner. Overall, our methods reasonably observe the fine-grained uni-modal contribution at sample-level and achieve considerable improvement on different multi-modal models.

artificial intelligence, machine learning, modality, (17 more...)

arXiv.org Artificial Intelligence

2309.06255

Country:

Asia > China (0.14)
Europe > North Macedonia (0.14)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (0.95)
Information Technology > Data Science (0.68)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.46)

Add feedback

Computational Issues in Time-Inconsistent Planning

Tang, Pingzhong (Tsinghua University) | Teng, Yifeng (University of Wisconsin-Madison) | Wang, Zihe (Shanghai University of Finance and Economics) | Xiao, Shenke (Tsinghua University) | Xu, Yichong (Carnegie Mellon University)

AAAI ConferencesFeb-14-2017

Time-inconsistency refers to a paradox in decision making where agents exhibit inconsistent behaviors over time. Examples are procrastination where agents tend to postpone easy tasks, and abandonments where agents start a plan and quit in the middle. To capture such behaviors and to quantify inefficiency caused by such behaviors, Kleinberg and Oren (2014) propose a graph model with a certain cost structure and initiate the study of several interesting computation problems: 1) cost ratio: the worst ratio between the actual cost of the agent and the optimal cost, over all the graph instances; 2) motivating subgraph: how to motivate the agent to reach the goal by deleting nodes and edges; 3) Intermediate rewards: how to incentivize agents to reach the goal by placing intermediate rewards. Kleinberg and Oren give partial answers to these questions, but the main problems are open. In this paper, we give answers to all three open problems. First, we show a tight upper bound of cost ratio for graphs, and confirm the conjecture by Kleinberg and Oren that Akerlof’s structure is indeed the worst case for cost ratio. Second, we prove that finding a motivating subgraph is NP-hard, showing that it is generally inefficient to motivate agents by deleting nodes and edges in the graph. Last but not least, we show that computing a strategy to place minimum amount of total reward is also NP-hard and we provide a 2n- approximation algorithm.

agent, artificial intelligence, subgraph, (16 more...)

AAAI Conferences

Thirty-First AAAI Conference on Artificial Intelligence

Country: North America > United States > Wisconsin (0.14)

Industry: Leisure & Entertainment (0.46)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning (0.48)

Add feedback

Optimal Auctions for Partially Rational Bidders

Wang, Zihe (Tsinghua University) | Tang, Pingzhong (Tsinghua University)

AAAI ConferencesJul-15-2015

We investigate the problem of revenue optimal mechanism design [Myerson, 1981] under the context of the partial rationality model, where buyers randomize between two modes: rational and irrational. When a buyer is irrational (can be thought of as lazy), he acts according to certain fixed strategies, such as bidding his true valuation. The seller cannot observe the buyer’s valuation, or his rationality mode, but treat them as random variables from known distributions. The seller’s goal is to design a single-shot auction that maximizes her expected revenue. A minor generalization as it may seem, our findings are in sharp contrast to Myerson’s theory on the standard rational bidder case. In particular, we show that, even for the simplest setting with one buyer, direct value revelation loses generality. However, we do show that, in terms of revenue, the optimal value-revelation and type-revelation mechanisms are equivalent. In addition, the posted-price mechanism is no longer optimal. In fact, the more complicated the mechanism, the higher the revenue. For the case where there are multiple bidders with IID uniform valuations, we show that when the irrational buyers are truthful, first price auction yields more revenue than second price auction.

artificial intelligence, bidder, game theory, (18 more...)

AAAI Conferences

Twenty-Fourth International Joint Conference on Artificial Intelligence

Country:

Asia > China (0.14)
North America > United States (0.14)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

Add feedback