2025
- NeutronTask: Scalable and Efficient Multi-GPU GNN Training with Task Parallelism (CCF A) [Code]
Zhenbo Fu, Xin Ai, Qiange Wang, Yanfeng Zhang, Shizhan Lu, Chaoyi Chen, Chunyu Cao, Hao Yuan, Zhewei Wei, Yu Gu, Yingyou Wen, Ge Yu
Proceedings of the VLDB Endowment (PVLDB 2025), London, United Kingdom, 2025.
- SRDC: Semantics-based Ransomware Detection and Classification with LLM-assisted Pre-training (CCF A)
Ce Zhou, Yilun Liu, Weibin Meng, Shimin Tao, Weinan Tian, Feiyu Yao, Xiaochun Li, Tao Han, Boxing Chen, Hao Yang.
Proceedings of the AAAI Conference on Artificial Intelligence (AAAI 2025), Philadelphia, Pennsylvania, USA, 2025.
- MassBFT: Fast and Scalable Geo-Distributed Byzantine Fault-Tolerant Consensus (CCF A) [Code]
Zeshun Peng, Yanfeng Zhang, Tinghao Feng, Weixing Zhou, Xiaohua Li, Ge Yu.
IEEE International Conference on Data Engineering (ICDE 2025).
2024
- NeutronSketch: An in-depth exploration of redundancy in large-scale graph neural network training (SCI Zone 1) [Code]
Yajiong Liu, Yanfeng Zhang, Qiange Wang, Hao Yuan, Xin Ai and Ge Yu.
Knowledge-Based Systems (KBS).
- NeutronTP: Load-Balanced Distributed Full-Graph GNN Training with Tensor Parallelism (CCF A) [Code]
Xin Ai, Hao Yuan, Zeyu Ling, Qiange Wang, Yanfeng Zhang, Zhenbo Fu, Chaoyi Chen, Yu Gu, Ge Yu
Proceedings of the VLDB Endowment (PVLDB 2025), London, United Kingdom, 2025.
- Towards Efficient Graph Processing in Geo-Distributed Data Centers (CCF A)
Feng Yao, Qian Tao, Shengyuan Lin, Yanfeng Zhang, Wenyuan Yu, Shufeng Gong, Qiange Wang, Ge Yu, and Jingren Zhou.
IEEE Transactions on Parallel and Distributed Systems (TPDS), 2024, 35(11): 2147-2160.
- GastCoCo: Graph Storage and Coroutine-Based Prefetch Co-Design for Dynamic Graph Processing (CCF A)[Code]
Hongfu Li, Qian Tao, Song Yu, Shufeng Gong, Yanfeng Zhang, Feng Yao, Wenyuan Yu, Ge Yu, and Jingren Zhou
Proceedings of the VLDB Endowment (PVLDB 2025), London, United Kingdom, 2025.
- LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-level CSR (CCF A)
Song Yu, Shufeng Gong, Qian Tao, Sijie Shen, Yanfeng Zhang, Wenyuan Yu, Pengxi Liu, Zhixin Zhang, Hongfu Li, Xiaojian Luo, Ge Yu, and Jingren Zhou.
Proceedings of the International Conference on Management of Data (SIGMOD 2025), Berlin, Germany, 2025.
- DynaHB: A Communication-Avoiding Asynchronous Distributed Framework with Hybrid Batches for Dynamic GNN Training (CCF A)
Zhen Song, Yu Gu, Qing Sun, Tianyi Li, Yanfeng Zhang, Yushuai Li, Christian S. Jensen, and Ge Yu
Proceedings of the VLDB Endowment (PVLDB 2024), Guangzhou, China, 2024.
- Accelerating Topic-Sensitive PageRank by Exploiting the Query History (CCF B) [Slides]
Shufeng Gong, Zhixin Zhang, Jing Lu, Yanfeng Zhang, Cong Fu, and Ge Yu
Proceedings of the International Computing and Combinatorics Conference (COCOON 2024), Shanghai, China, 2024.
- Hammer: A General Blockchain Evaluation Framework (CCF B) [Slides]
Gang Wang, Yanfeng Zhang, Chenhao Ying, Xiaohua Li and Ge Yu.
Proceedings of the International Conference on Distributed Computing Systems (ICDCS 2024), Jersey City, USA, 2024.
- Towards Transaction as a Service
Yanfeng Zhang, Weixing Zhou, Yang Ren, Sihao Li, Guoliang Li, and Ge Yu
arXiv preprint arXiv:2311.07874, 2024.
- NeutronOrch: Rethinking Sample-based GNN Training under CPU-GPU Heterogeneous Environments (CCF A)[Code, Slides]
Xin Ai, Qiange Wang, Chunyu Cao, Yanfeng Zhang, Chaoyi Chen, Hao Yuan, Yu Gu, Ge Yu.
Proceedings of the VLDB Endowment (PVLDB 2024), Guangzhou, China (and hybrid), 2024.
- Fast Iterative Graph Computing with Updated Neighbor States (CCF A) [Code, Slides]
Yijie Zhou, Shufeng Gong, Feng Yao, Hanzhang Chen, Song Yu, Pengxi Liu, Yanfeng Zhang, Ge Yu, Jeffrey Xu Yu
IEEE International Conference on Data Engineering (ICDE 2024), Utrecht, Netherlands, 2024.
- Ingress: an automated incremental graph processing system (CCF A) [Code]
Shufeng Gong, Chao Tian, Qiang Yin, Zhengdong Wang, Song Yu, Yanfeng Zhang, Wenyuan Yu, Liang Geng, Chong Fu, Ge Yu, Jingren Zhou
The VLDB Journal (VLDBJ), 2024.
- Comprehensive Evaluation of GNN Training Systems: A Data Management Perspective (CCF A)[Code, Slides]
Hao Yuan, Yajiong Liu, Yanfeng Zhang, Xin Ai, Qiange Wang, Chaoyi Chen, Yu Gu, Ge Yu.
Proceedings of the VLDB Endowment (PVLDB 2024), Guangzhou, China (and hybrid), 2024.
- RAGraph: A Region-Aware Framework for Geo-Distributed Graph Processing (CCF A) [Code, Slides]
Feng Yao, Qian Tao, Wenyuan Yu, Yanfeng Zhang, Shufeng Gong, Qiange Wang, Ge Yu, and Jingren Zhou
Proceedings of the VLDB Endowment (PVLDB 2024), 17(3), 264-277, Guangzhou, China (and hybrid), 2024.
- NeutronStream: A Dynamic GNN Training Framework with Sliding Window for Graph Streams (CCF A) [Code, Slides]
Chaoyi Chen, Dechao Gao, Yanfeng Zhang, Qiange Wang, Zhenbo Fu, Xuecang Zhang, Junhua Zhu, Yu Gu, Ge Yu
Proceedings of the VLDB Endowment (PVLDB 2024), Guangzhou, China (and hybrid), 2024.
- ADGNN: Towards Scalable GNN Training with Aggregation-Difference Aware Sampling (CCF A)
Zhen Song, Yu Gu, Tianyi Li, Qing Sun, Yanfeng Zhang, Christian S. Jensen, and Ge Yu
Proceedings of the International Conference on Management of Data (SIGMOD 2024), Santiago, Chile, 2024.
2023
- GeoGauss: Strongly Consistent and Light-Coordinated OLTP for Geo-Replicated SQL Database (CCF A) [Code, Slides, Video]
Weixing Zhou, Qi Peng, Zijie Zhang, Yanfeng Zhang, Yang Ren, Sihao Li, Guo Fu, Yulong Cui, Qiang Li, Caiyi Wu, Shangjun Han, Shengyi Wang, Guoliang Li, Ge Yu
Proceedings of the International Conference on Management of Data (SIGMOD 2023), Seattle, USA, 2023.
- Layph: Making Change Propagation Constraint in Incremental Graph Processing by Layering Graph (CCF A) [Slides, Video]
Song Yu, Shufeng Gong, Yanfeng Zhang, Wenyuan Yu, Qiang Yin, Chao Tian, Qian Tao, Yongze Yan, Ge Yu, Jingren Zhou
Proceedings of IEEE International Conference on Data Engineering (ICDE 2023), Anaheim, USA, 2023.
- HyTGraph: GPU-Accelerated Graph Processing with Hybrid Transfer Management (CCF A) [Code, Slides, Video]
Qiange Wang, Xin Ai, Yanfeng Zhang, Jing Chen, and Ge Yu
Proceedings of IEEE International Conference on Data Engineering (ICDE 2023), Anaheim, USA, 2023.
- Improving Density Peaks Clustering through GPU acceleration (JCR Q1)
Zhuojin Liu, Shufeng Gong, Yuxuan Su, Changyi Wan, Yanfeng Zhang, Ge Yu
Future Generation Computer Systems (FGCS), 2023.
2022
- NeuChain: A Fast Permissioned Blockchain System with Deterministic Ordering (CCF A) [Code, Slides]
Zeshun Peng, Yanfeng Zhang, Qian Xu, Haixu Liu, Yuxiao Gao, Xiaohua Li, Ge Yu
Proceedings of the VLDB Endowment (PVLDB 2022), Sydney, Australia (and hybrid), 2022.
- NeutronStar: Distributed GNN Training with Hybrid Dependency Management (CCF A) [Code]
Qiange Wang, Yanfeng Zhang, Hao Wang, Chaoyi Chen, Xiaodong Zhang, Ge Yu
Proceedings of the International Conference on Management of Data (SIGMOD 2022), Philadelphia, USA, 2022.
2021
- Automating Incremental Graph Processing with Flexible Memoization (CCF A) [Code, Slides, Video]
Shufeng Gong, Chao Tian, Qiang Yin, Wenyuan Yu, Yanfeng Zhang, Liang Geng, Song Yu, Ge Yu, Jingren Zhou
Proceedings of the VLDB Endowment (PVLDB 2021), Copenhagen, Denmark, 2021.
- Accelerating Large-Scale Prioritized Graph Computations by Hotness Balanced Partition (CCF A) [Code]
Shufeng Gong, Yanfeng Zhang, Ge Yu
IEEE Transactions on Parallel and Distributed Systems (TPDS), 32(4), Apr, 2021, pp. 746-759.
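- A Survey of Large-Scale Graph Neural Network Systems (CCF Chinese A)
Gang Zhao, Qiange Wang, Feng Yao, Yanfeng Zhang, Ge Yu
Journal of Software, 2021 (published online).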
- What Have We Learned from OpenReview? [Code] Best Student Paper Runner-up
Gang Wang, Qi Peng, Yanfeng Zhang, Mingyang Zhang
APWeb-WAIM 2021, Guangzhou, China, 2021.
- DragDL: An Easy-to-Use Visual Construction System for Deep Learning Models [Code]
Shizheng Tang, Yanfeng Zhang
Computer Science, 2021.
2020
- Automating Incremental and Asynchronous Evaluation for Recursive Aggregate Data Processing (CCF A) [Code, Slides]
Qiange Wang, Yanfeng Zhang, Hao Wang, Liang Geng, Rubao Lee, Xiaodong Zhang, Ge Yu.
ACM International Conference on Management of Data (SIGMOD 2020), Portland, USA, 2020.
- HBP: Hotness Balanced Partition for Prioritized Iterative Graph Computations (CCF A) [Code]
Shufeng Gong, Yanfeng Zhang, and Ge Yu
IEEE International Conference on Data Engineering (ICDE 2020), Dallas, USA, 2020.
- GDPC: A GPU-Accelerated Density Peaks Clustering Algorithm (CCF B)
Yuxuan Su, Yanfeng Zhang, Changyi Wan, and Ge Yu
International Conference on Database Systems for Advanced Applications (DASFAA 2020), Jeju, South Korea, 2020.
- MxPool: Multiplex Pooling for Hierarchical Graph Representation Learning [Code]
Yanyan Liang, Yanfeng Zhang, Dechao Gao, Qian Xu
arXiv preprint arXiv:2004.06846, 2020.
- Distributed Graph Processing: Techniques and Systems (Invited Tutorial)
Yanfeng Zhang, Qiange Wang, Shufeng Gong.
APWeb-WAIM International Joint Conference on Web and Big Data (APWeb-WAIM 2020), Tianjin, China, 2020.
- A Fair Comparison of Message Queuing Systems
Guo Fu, Yanfeng Zhang, Ge Yu
IEEE Access, 9, Dec, 2020, pp. 421-432.
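- A Survey of New Blockchain Techniques: DAG-Based Blockchains and Sharding-Based Blockchains
Changgui Zhang, Yanfeng Zhang, Xiaohua Li, Tiezheng Nie, Ge Yu
Computer Science, 2020.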
2019
- SEP-Graph: Finding Shortest Execution Paths for Graph Processing under a Hybrid Framework on GPU (CCF A) [Code]
Hao Wang, Liang Geng, Rubao Lee, Kaixi Hou, Yanfeng Zhang, and Xiaodong Zhang
Conference on Principles and Practice of Parallel Programming (PPoPP 2019), Washington DC, USA, 2019.
- PowerHash: a Hybrid Grouping Scheme by Leveraging Power-Law Properties of Data [Code]
Xun Wei, Xiaowang Kong, Yanfeng Zhang, and Ge Yu
International Journal of Data Science and Analytics (JDSA), 2019.
- GANCoder: An Automatic Natural Language-to-Programming Language Translation Approach Based on GAN (CCF C)
Yabing Zhu, Yanfeng Zhang, Huili Yang, and Fangjing Wang
The 8th CCF International Conference on Natural Language Processing and Chinese Computing (NLPCC 2019), Dunhuang, China, 2019.
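- Distributed Data Management Technology in Blockchain Systems: Challenges and Prospects (CCF Chinese A)
Ge Yu, Tiezheng Nie, Xiaohua Li, Yanfeng Zhang, Derong Shen, Yubin Bao
Chinese Journal of Computers, 2019.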
2018
- Clustering Stream Data by Exploring the Evolution of Density Mountain (CCF A) [Code]
Shufeng Gong, Yanfeng Zhang, Ge Yu
Proceedings of the VLDB Endowment (PVLDB), 11(4):393-405, 2018.
- A Low-cost Disk Solution Enabling LSM-tree to Achieve High Performance for Mixed Read/Write Workloads (CCF A)
Dejun Teng, Lei Guo, Rubao Lee, Feng Chen, Yanfeng Zhang, Siyuan Ma, and Xiaodong Zhang
ACM Transactions on Storage (TOS), 14(2), Apr, 2018, pp. 15:1-15:26.
- Accelerating distributed Expectation-Maximization algorithms with frequent updates (CCF B)
Jiangtao Yin, Yanfeng Zhang, Lixin Gao
Journal of Parallel and Distributed Computing (JPDC), 111, 2018, pp.65-75.
- SQLoop: High Performance Iterative Processing in Data Management (CCF B)
Sofoklis Floratos, Yanfeng Zhang, Yuan Yuan, Rubao Lee, and Xiaodong Zhang
38th IEEE International Conference on Distributed Computing Systems (ICDCS 2018), Vienna, Austria, 2018.
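- An Asynchronous Graph Processing Framework for Stream Processing (CCF Chinese A)
Jinji Li, Yanfeng Zhang, Shufeng Gong, Ge Yu, Lixin Gao
Journal of Software, 29(3), March, 2018, pp.528-544.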
- PowerHash: A Hybrid Grouping Scheme by Leveraging Power-Law Properties of Data Best Paper Award Nominee
Xun Wei, Xiaowang Kong, and Yanfeng Zhang
6th CCF BigData Conference (CCF BigData 2018), Xi'an, China, 2018.
2017
- LSbM-tree: Re-Enabling Buffer Caching in Data Management for Mixed Reads and Writes (CCF B)
Dejun Teng, Lei Guo, Rubao Lee, Feng Chen, Siyuan Ma, Yanfeng Zhang, Xiaodong Zhang
37th IEEE International Conference on Distributed Computing Systems (ICDCS 2017), Atlanta, USA, 2017.
- bHash: An I/O Efficient External Hash Grouping Scheme Best Student Paper Award
Xiaowang Kong, Yanfeng Zhang, Ge Yu
5th CCF BigData Conference (CCF BigData 2017), Shenzhen, China, 2017.
- Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce (Extended Abstract) (TKDE Poster)
Yanfeng Zhang, Shimin Chen, Ge Yu
33rd IEEE International Conference on Data Engineering (ICDE 2017), San Diego, USA, 2017.
2016
- Efficient Distributed Density Peaks for Clustering Large Data Sets in MapReduce (CCF A)
Yanfeng Zhang, Shimin Chen, Ge Yu
IEEE Transactions on Knowledge and Data Engineering (TKDE), 28(12), December, 2016, pp. 3218-3230.
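- EDDPC: An Efficient Distributed Density Peaks Clustering Algorithm (CCF Chinese A)
Shufeng Gong, Yanfeng Zhang
Journal of Computer Research and Development, 53(6), June, 2016, pp.1567-1579.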
- i2MapReduce: Incremental MapReduce for Mining Evolving Big Data (Extended Abstract) (TKDE Poster)
Yanfeng Zhang, Shimin Chen, Qiang Wang, Ge Yu
32nd IEEE International Conference on Data Engineering (ICDE 2016), Helsinki, Finland, 2016.
2015
- i2MapReduce: Incremental MapReduce for Mining Evolving Big Data [technical report] (CCF A)
Yanfeng Zhang, Shimin Chen, Qiang Wang, Ge Yu
IEEE Transactions on Knowledge and Data Engineering (TKDE), 27(7), July, 2015, pp.1906-1919.
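- Asyn-SimRank: A Large-Scale SimRank Algorithm Supporting Asynchronous Execution (CCF Chinese A)
Chunlei Wang, Yanfeng Zhang, Yubin Bao, Changkuan Zhao, Ge Yu, Lixin Gao
Journal of Computer Research and Development, 52(7), July, 2015, pp.1906-1919.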
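- EDDPC: An Efficient Distributed Density Peaks Clustering Algorithm Best Paper Award
Shufeng Gong, Yanfeng Zhang
32nd National Database Conference of China (NDBC 2015), Chengdu, China, 2015.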
2014
- Maiter: An Asynchronous Graph Processing Framework for Delta-based Accumulative Iterative Computation [Supplemental File] (CCF A) [Code]
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
IEEE Transactions on Parallel and Distributed Systems (TPDS), 25(8), August, 2014, pp.2091-2100.
- Asynchronous Computation Model for Large-Scale Iterative Computations
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
In Xiaolin Li, Judy Qiu (Eds.), Cloud Computing for Data-Intensive Applications, Springer Press, 2014, pp.303-329.
- Extending MapReduce for Iterative Processing
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
In Sherif Sakr, Mohamed Gaber (Eds.), Large Scale and Big Data: Processing and Management, CRC Press, 2014, pp.107-126.
- Mux-Kmeans: Multiplex Kmeans for Clustering Large-scale Data Set
Chen Li, Yanfeng Zhang, Minghai Jiao, Ge Yu
Proc. 5th Workshop on Scientific Cloud Computing (ScienceCloud 2014), Vancouver, Canada, June 2014.
- MaiterStore: A Hot-aware, High-Performance Key-Value Store for Graph Processing
Dong Chang, Yanfeng Zhang, Ge Yu
Proc. 2nd Workshop on Big Data Management and Analytics (BDMA 2014), Bali, Indonesia, April 2014.
2013
- PrIter: A Distributed Framework for Prioritized Iterative Computations [Supplemental File] (CCF A)
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
IEEE Transactions on Parallel and Distributed Systems (TPDS), 24(9), September, 2013, pp. 1884-1893.
- i2MapReduce: Incremental Iterative MapReduce [slides]
Yanfeng Zhang, Shimin Chen
2nd International Workshop on Cloud Intelligence (Cloud-I 2013), Riva del Garda, Italy, August 2013.
2012
- iMapReduce: A Distributed Computing Framework for Iterative Computation (CCF C)
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
Journal of Grid Computing, 10(1), March, 2012, pp.47-68.
- Accelerating Expectation-Maximization Algorithms with Frequent Updates (CCF B)
Jiangtao Yin, Yanfeng Zhang, Lixin Gao
Proc. IEEE CLUSTER 2012, Beijing, China, September 2012.
- Accelerate Large-Scale Iterative Computation through Asynchronous Accumulative Updates [slides]
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
Proc. 3rd Workshop on Scientific Cloud Computing (ScienceCloud 2012), Delft, Netherlands, June 2012.
2011
- PrIter: A Distributed Framework for Prioritized Iterative Computations [slides] Honored as Paper of Distinction (CCF B)
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
Proc. ACM Symposium on Cloud Computing (SOCC 2011), Cascais, Portugal, October 2011.
- iMapReduce: A Distributed Computing Framework for Iterative Computation [slides]
Yanfeng Zhang, Qixin Gao, Lixin Gao, Cuirong Wang
Proc. International Workshop on Data Intensive Computing in the Cloud (DataCloud 2011), Alaska, USA, May 2011.
Before 2011
- MultiNet: Multiple Virtual Networks for a Reliable Live Streaming Service (CCF C)
Yanfeng Zhang, Lixin Gao, Cuirong Wang
Proc. IEEE Global Communications Conference (GLOBECOM '09), Hawaii, USA, November 2009.
- Weighted Size-Aware Packet Distribution for Multipath Live Streaming (CCF C)
Yanfeng Zhang, Cuirong Wang, Yuan Gao
Proc. IEEE International Conference on Communications (ICC '09), Dresden, Germany, June 2009.