Xinjian Zhao (赵鑫鉴)

The Chinese University of Hong Kong, Shenzhen

I am a third-year PhD student in Computer Science at The Chinese University of Hong Kong, Shenzhen, and I am fortunate to be advised by Prof. Tianshu Yu. During my PhD, I also spent time as a visiting student at the Institute of Automation, Chinese Academy of Sciences, hosted by Prof. Shu Wu, and as a research intern at Shanghai Artificial Intelligence Laboratory. Before my PhD, I completed an M.S. in Data Science at City University of Hong Kong under the supervision of Dr. Ruocheng Guo and a B.S. in Computer Science at Shandong University under the supervision of Prof. Xuemeng Song. I am interested in graph learning, graph-augmented intelligence, and impactful applications in science and industry.

Selected Papers

denotes equal contribution.

Manufacturing MLLMs

Fine-grained multimodal evaluation in industrial scenarios
  • Manufacturing
  • MLLM
  • Benchmark

1 / 9

FORGE: Fine-grained Multimodal Evaluation for Manufacturing Scenarios

Xiangru Jian, Hao Xu, Wei Pang, Xinjian Zhao, Chengyu Tao, Qixin Zhang, Xikun Zhang, Chao Zhang, Guanzhi Deng, Alex Xue, Juan Du, Tianshu Yu, Garth Tarr, Linqi Song, Qiuzhuang Sun, Dacheng Tao

Preprint, 2026

We introduce FORGE, a fine-grained multimodal benchmark for manufacturing that combines real-world 2D images and 3D point clouds with domain-specific semantic annotations. Evaluating 18 state-of-the-art MLLMs across workpiece verification, surface inspection, and assembly verification reveals that domain knowledge, rather than visual grounding alone, is the key bottleneck.

Key Insight

In manufacturing MLLMs, domain-specific knowledge is often a more critical bottleneck than raw visual grounding.

Synthesis Layer

Unifying trends across vision and graph learning
  • Survey
  • Vision
  • Graph ML

2 / 9

When Vision Meets Graphs: A Survey on Graph Reasoning and Learning

Xinjian Zhao, Wei Pang, Zhixuan Yu, Xiangru Jian, Xiaozhuang Song, Yaoyao Xu, Zhongkai Xue, Dingshuo Chen, Shu Wu, Philip Torr, Tianshu Yu

The 35th International Joint Conference on Artificial Intelligence (IJCAI), 2026

This survey provides a first systematic consolidation of the emerging 'When Vision Meets Graphs' area, where graph depictions are treated as first-class inputs. Using the RPI lens and a three-thread taxonomy, it organizes fragmented literature and outlines concrete future directions toward graph foundation models.

Key Insight

Treating visualized graphs as computational objects, not just illustrations, opens a coherent new research agenda.

Structure via Pixels

How far vision models can reason about graph structure
  • Vision Models
  • Structural Reasoning
  • NeurIPS

3 / 9

The Underappreciated Power of Vision Models for Graph Structural Understanding

Xinjian Zhao*, Wei Pang*, Zhongkai Xue*, Xiangru Jian*, Lei Zhang, Yaoyao Xu, Xiaozhuang Song, Shu Wu, Tianshu Yu

Conference on Neural Information Processing Systems (NeurIPS), 2025

For the first time, we conduct a controlled comparison between pure vision backbones and GNNs for graph structural understanding, and introduce GraphAbstract to isolate topology reasoning. Pure vision models achieve comparable overall performance and outperform GNNs on holistic structure tasks with stronger cross-scale generalization.

Key Insight

For global topology abstraction, a global-first visual inductive bias can be stronger than local message passing.

Benchmark Taxonomy

Graph-theoretic capabilities of LLM systems
  • LLM
  • Graph Reasoning
  • Benchmark

4 / 9

GraphOmni: A Comprehensive and Extendable Benchmark Framework for Large Language Models on Graph-theoretic Tasks

Hao Xu*, Xiangru Jian*, Xinjian Zhao*, Wei Pang*, Chao Zhang, Suyucheng Wang, Qixin Zhang, Zhengyuan Dong, Joao Monteiro, Bang Liu, Qiuzhuang Sun, Tianshu Yu

International Conference on Learning Representations (ICLR), 2026

We introduce GraphOmni, a modular benchmark that factorizes graph-theoretic reasoning by graph type, serialization format, and prompting strategy. Large-scale evaluation reveals strong interactions among these factors and substantial capability gaps that are hidden by single-score comparisons.

Key Insight

Graph reasoning is conditional on representation and prompting choices, so evaluation must be factorized rather than leaderboard-style.

Distributional Geometry

Edge layout distributions as structural priors
  • Graph Representation
  • Robustness
  • KDD

5 / 9

Graph Learning with Distributional Edge Layouts

Xinjian Zhao, Chaolong Ying, Yaoyao Xu, Tianshu Yu

ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2025

We propose Distributional Edge Layouts (DEL), modeling graph topology as a distribution by globally sampling edge layouts with Langevin dynamics under Boltzmann energy. As a model-agnostic preprocessing module, DEL consistently improves diverse GNN backbones and reaches strong state-of-the-art performance on multiple datasets.

Key Insight

Modeling topology as a distribution, rather than a fixed graph, is a practical path to robustness and extra expressivity.

Spectral Views

Augmentation choices in graph contrastive learning
  • Graph SSL
  • Spectral Methods
  • TMLR

6 / 9

Rethinking Spectral Augmentation for Contrast-based Graph Self-Supervised Learning

Xiangru Jian*, Xinjian Zhao*, Wei Pang*, Chaolong Ying, Yimu Wang, Yaoyao Xu, Tianshu Yu

Transactions on Machine Learning Research (TMLR), 2025

We re-examine spectral augmentation in contrastive graph self-supervised learning via broad empirical studies and theoretical analysis of InfoNCE bounds. Simple edge perturbations (edge dropping for node-level, edge adding for graph-level) often match or outperform spectral methods with much lower computational cost.

Key Insight

In graph SSL, choosing invariances well matters more than increasingly complex spectral engineering.

AI4Science Transfer

Hard negative design for protein modeling
  • AI4Science
  • Protein LM
  • ECML-PKDD

7 / 9

Boosting Protein Language Models with Negative Sample Mining

Yaoyao Xu*, Xinjian Zhao*, Xiaozhuang Song, Benyou Wang, Tianshu Yu

European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD), 2024

We introduce negative sample mining for protein language models by constructing hard negatives from protein pairs across distinct categories. The approach improves downstream performance and shows better alignment with biological mechanisms such as protein-protein interaction.

Key Insight

Domain-aware hard negatives can steer protein LMs away from shortcut co-evolution cues and toward biologically meaningful structure.

Topological Priors

Persistent homology for graph pooling
  • Topology
  • Graph Pooling
  • NeurIPS

8 / 9

Boosting Graph Pooling with Persistent Homology

Chaolong Ying, Xinjian Zhao, Tianshu Yu

Conference on Neural Information Processing Systems (NeurIPS), 2024

We propose a graph pooling mechanism that injects persistent homology directly into coarsening, motivated by the alignment between PH filtration and cutoff-style pooling. The method is plug-and-play across pooling families and yields consistent, substantial gains on standard benchmarks.

Key Insight

Topological priors are most effective when embedded in pooling itself, rather than appended as auxiliary features.

Curriculum Robustness

Adversarial scheduling for graph contrastive learning
  • Contrastive Learning
  • Robustness
  • Preprint

9 / 9

Adversarial Curriculum Graph Contrastive Learning with Pair-wise Augmentation

Xinjian Zhao, Liang Zhang, Yang Liu, Ruocheng Guo, Xiangyu Zhao

Preprint, 2024

We propose ACGCL, combining pair-wise augmentation with controllable similarity and subgraph contrastive learning for graph-level representation learning. An adversarial curriculum progressively increases sample difficulty and focuses optimization on harder examples, outperforming strong baselines on six benchmarks.

Key Insight

Graph contrastive learning becomes more reliable when similarity and sample difficulty are explicitly controlled during training.

Other Works

Trajectory

Core Research Trunk: Structure-Aware Intelligence

Core question: how can models learn and understand graph structure, and how can graph structure in turn improve general-purpose models and intelligence?

Mentorship

Co-supervision with Prof. Tianshu Yu.

2 students mentored

Zhongkai Xue 2024.12-Present

B.S. in Financial Engineering, CUHK-Shenzhen

Achievement: NeurIPS 2025; Research Intern at ByteDance

Next Position: CS PhD student at UCAS

Zhixuan Yu 2025.11-Present

B.S. in Electronic and Computer Engineering, CUHK-Shenzhen

Contact

The fastest way to reach me is via my institutional email listed in CV.