profile pic

I am a PhD student in the Department of Statistics of University of California, Los Angeles (UCLA), under supervision of Dr. Jingyi Jessica Li. I obtained Master of Science in Statistics from Zhejiang University (ZJU), under supervision of Dr. Yi Zhang. Prior to that, I received Bachelor of Science in Mathematics and Bachelor of Economics from Shandong University.

My research interests lie in developing new statistical methods for understanding the real-world data. Specific research topics include

  1. Statistical Bioinformatics

    1. Statistical methods for analyzing high-dimensional single-cell and spatial omics data

    2. Using synthetic data to enhance the statistical rigor in single-cell and spatial omics data analysis

  2. General Statistical Methodologies: High-dimensional model inference and variable selection

  3. Statistics in Education: Statistical method for promoting education equity (reported by Forbes and this podcast)

📣 News

Publications

author*: equal distribution

Preprints

G. Yan, J.J. Li and M. Biggin (2024). Question-Score Identity Detection (Q-SID): A statistical algorithm to detect collusion groups with error quantification from exam question scores. arXiv. (Under review at Journal of the American Statistical Association) [Website] [Forbes article] [Podcast]

Z. Li, Z. M. Patel, D. Song, G. Yan, J. J. Li and L. Pinello (2023). Benchmarking computational methods to identify spatially variable genes and peaks. bioRxiv. (Under review at Nature Methods)

Peer reviewed

G. Yan, S. Hua and J.J. Li (2025). Categorization of 34 computational methods to detect spatially variable genes from spatially resolved transcriptomics data. Nature Communications, 16, 1141.

J. Zhao, F. Lao, G. Yan and Y. Zhang (2024). How data heterogeneity affects innovating knowledge and information in gene identification: A statistical learning perspective. Journal of Innovation & Knowledge, 9-3.

G. Yan, D. Song and J.J. Li (2023). scReadSim: a single-cell RNA-seq and ATAC-seq read simulator. Nature Communications, 14(1), 7428. [Software] [Website]

D. Song, Q. Wang, G. Yan, T. Liu and J.J. Li (2023). scDesign3 generates realistic in silico data for multimodal single-cell and spatial omics. Nature Biotechnology, 1-6. [Software]

S. Tang, H. Wang, G. Yan, L. Zhang (2022). Empirical likelihood based tests for detecting the presence of significant predictors in marginal quantile regression. Metrika, 1-31.

S. Chen*, G. Yan*, W. Zhang, J. Li, R. Jiang and Z. Lin (2021). RA3 is a reference-guided approach for epigenetic characterization of single cells. Nature Communications, 12(1), 1-13. [Software]

J. Zhao, G. Yan and Y. Zhang (2021). Robust estimation and shrinkage in ultrahigh dimensional expectile regression with heavy tails and variance heterogeneity. Statistical Papers, 1-28.

J. Zhao*, G. Yan* and Y. Zhang (2019). Semiparametric expectile regression for high-dimensional heavy-tailed and heterogeneous data. arXiv. (Accepted at Applied Mathematics-A Journal of Chinese Universities)

Patents

Mark Douglas Biggin, Jingyi Li, Guan'ao Yan. Systems and methods for detecting collusion in student testing using graded scores or answers for individual questions (Serial No. 17/450,984; US Patent 11,915,615 B2, filed Feburary 17, 2024)

Presentations

Oral Presentations

Poster Presentations

Teaching

Teaching assistant

Workshop instructor

Guest lecturer

Undergraduate student mentor



If we all worked on the assumption that what is accepted as true is really true, there would be little hope of advance.

- Orville Wright