Email: yuz9 [AT] illinois [DOT] edu
Office: Room 1117, Siebel Center for Computer Science, 201 N. Goodwin Ave, Urbana, IL 61801

About Me

I am a Ph.D. student in the Data Mining Group at the University of Illinois at Urbana-Champaign, advised by Prof. Jiawei Han. I completed my M.Sc. in the same group in 2019. My research interests are text mining, text-rich network mining, and their applications to bioinformatics.

Prior to UIUC, I received my B.Sc. degree in Computer Science from Peking University in 2017, advised by Prof. Yan Zhang.

In summer 2021, I will intern at Microsoft Research, Redmond (virtually), hosted by Dr. Iris Shen.

In summer 2020, I interned at Microsoft Research, Redmond (virtually), working with Dr. Iris Shen and Dr. Yuxiao Dong.

In summer 2016, I visited Carnegie Mellon University, working with Prof. Kathleen M. Carley.

For further information, please see my CV.

What’s New [What’s Not New…]

2021-04 Invited to be a reviewer of NeurIPS 2021.

2021-03 Invited to be a PC member of EMNLP 2021.

2021-03-08 Attended WSDM 2021 virtually to present our work.

2021-01 Invited to be a PC member of ACL 2021.

2021-01-15 My summer internship work on Multi-Label Academic Paper Classification was accepted by WWW 2021! The acceptance rate is 20.6% (357/1736).

2020-12-15 Our survey paper on Heterogeneous Network Representation Learning was accepted by IEEE TKDE!

2020-10 Invited to be a PC member of NAACL-HLT 2021.

2020-10-15 Our paper on Hierarchical Metadata-Aware Document Categorization was accepted by WSDM 2021! The acceptance rate is 18.6% (112/603).

2020-07-26 Attended SIGIR 2020 virtually to present our work. Served as a conference volunteer.

2020-07 Invited to be a reviewer of ICLR 2021.

2020-05-18 Started my summer internship at Microsoft Research, Redmond, working with Dr. Iris Shen and Dr. Yuxiao Dong.

2020-05-15 Our paper on Hierarchical Topic Mining was accepted by KDD 2020 research track! The acceptance rate is 16.9% (216/1279).

2020-04-22 Our paper on Classifying Text with Metadata under Weak Supervision was accepted by SIGIR 2020 as a full paper! The acceptance rate is 26.5% (147/555).

Selected Publications [Google Scholar]

(* indicates equal contribution. Unless otherwise specified, each paper was accepted as a research-track long/regular paper.)


MATCH: Metadata-Aware Text Classification in A Large Hierarchy [PDF] [arXiv] [code]
Yu Zhang, Zhihong Shen, Yuxiao Dong, Kuansan Wang, Jiawei Han.
WWW 2021. Ljubljana, Slovenia.

Hierarchical Metadata-Aware Document Categorization under Weak Supervision [PDF] [arXiv] [code]
Yu Zhang, Xiusi Chen, Yu Meng, Jiawei Han.
WSDM 2021. Jerusalem, Israel.


Minimally Supervised Categorization of Text with Metadata [PDF] [arXiv] [code]
Yu Zhang*, Yu Meng*, Jiaxin Huang, Frank F. Xu, Xuan Wang, Jiawei Han.
SIGIR 2020. Xi’an, China.

Heterogeneous Network Representation Learning: A Unified Framework with Survey and Benchmark [PDF] [arXiv] [code]
Carl Yang*, Yuxin Xiao*, Yu Zhang*, Yizhou Sun, Jiawei Han.
IEEE TKDE. Accepted.

Hierarchical Topic Mining via Joint Spherical Tree and Text Embedding [PDF] [arXiv] [code]
Y. Meng, Y. Zhang, J. Huang, Y. Zhang, C. Zhang, J. Han.
KDD 2020. San Diego, CA, USA.

Discriminative Topic Mining via Category-Name Guided Text Embedding [PDF] [arXiv] [code]
Y. Meng, J. Huang, G. Wang, Z. Wang, C. Zhang, Y. Zhang, J. Han.
WWW 2020. Taipei.


HiGitClass: Keyword-Driven Hierarchical Classification of GitHub Repositories [PDF] [arXiv] [code]
Yu Zhang, Frank F. Xu, Sha Li, Yu Meng, Xuan Wang, Qi Li, Jiawei Han.
ICDM 2019. Beijing, China.

Diversifying Seeds and Audience in Social Influence Maximization [PDF] [arXiv]
Yu Zhang.
ASONAM 2019. Vancouver, Canada. (Short)

Cross-type Biomedical Named Entity Recognition with Deep Multi-Task Learning [PDF] [arXiv] [bioRxiv] [code]
Xuan Wang, Yu Zhang, Xiang Ren, Yuhao Zhang, Marinka Zitnik, Jingbo Shang, Curtis Langlotz, Jiawei Han.
Bioinformatics. Oxford Academic. Volume 35, Issue 10.

Integrating Local Context and Global Cohesiveness for Open Information Extraction [PDF] [arXiv] [code]
Q. Zhu, X. Ren, J. Shang, Y. Zhang, A. El-Kishky, J. Han.
WSDM 2019. Melbourne, VIC, Australia.


Weakly-supervised Relation Extraction by Pattern-enhanced Embedding Learning [PDF] [arXiv] [code]
Meng Qu, Xiang Ren, Yu Zhang, Jiawei Han.
WWW 2018. Lyon, France.

Open Information Extraction with Global Structure Constraints [PDF] [code]
Q. Zhu, X. Ren, J. Shang, Y. Zhang, F. F. Xu, J. Han.
WWW 2018. Lyon, France. (Poster, Best Poster Award Runner-up)


RATE: Overcoming Noise and Sparsity of Textual Features in Real-Time Location Estimation [PDF] [code]
Yu Zhang, Wei Wei, Binxuan Huang, Kathleen M. Carley, Yan Zhang.
CIKM 2017. Singapore. (Short)

Top-K Influential Nodes in Social Networks: A Game Perspective [PDF] [code]
Yu Zhang, Yan Zhang.
SIGIR 2017. Shinjuku, Tokyo, Japan. (Short)

Honors and Awards

2021 WWW 2021 Student Scholarship
2021 WSDM 2021 Student Travel Grant
2020 SIGIR 2020 Student Travel Grant
2018 WWW 2018 Best Poster Award Runner-up
2017 Best Undergraduate Thesis Award, School of EECS, Peking University (10/310+)
2017 Outstanding Graduates, Peking University
2017 SIGIR 2017 Student Travel Grant
2016 Kwang-Hua Scholarship
2015 May 4th Scholarship
2014 National Scholarship (Top 1%)
2011/2012 First Prize, National Olympiad in Informatics in Provinces


Conference Program Committee
ACL 2021; NeurIPS 2021; ICLR 2021; EMNLP 2020-2021; NAACL-HLT 2021

Journal Reviewer
ACM Transactions on Knowledge Discovery from Data (TKDD)
IEEE Transactions on Big Data (TBD)

Student Volunteer
SIGIR 2020


I was born and raised in Shanghai.

I like taking MOOCs when I want to learn a subject as a beginner. Some MOOCs I have finished: Networks (2015-04), Model Thinking (2015-05), Probability (2015-05), Economics (2018-08).

I played bridge during my high school and undergraduate years.