Associate Professor Pengyi Yang
People_

Associate Professor Pengyi Yang

University of Sydney Robinson Fellow
Head, Computational Systems Biology
Children's Medical Research Institute
Charles Perkins Centre
Associate Professor, School of Mathematics &
Statistics
Address
F07 - Carslaw Building
The University of Sydney
Details
Websites
Associate Professor Pengyi Yang

Pengyi Yang, Ph.D.,is an Associate Professor and a Robinson Fellow at the School of Mathematics and Statistics.He heads the Computational Trans-Regulatory Biology group at Charles Perkins Centre (CPC), the University of Sydney, and holds a conjoint appointment as Unit Head of Computational Systems Biologyat Children's Medical Research Institute (CMRI), at the Westmead Research Hub.

Cells are the fundamental building blocks of all life on earth. In multicellular organisms, these cellular building blocks come in a plethora of types and serve distinct functions to coordinate and support the homeostasis of an organism. Intriguingly, all cell types of an organism share the same genetic code and originate from the same group of pluripotentstem cellsthat have undergone different cellular fates during development.

Research from the last few decades has established that cell identity and cell-fate decisions are determined by underlying.trans-regulatory networks (TRNs)comprised of cell signalling, transcriptional, and epigenetic regulations, and the intra- and extra-cellular networks they form. Understanding how TRNs regulate cell identity and interact through cell-cell communications in a spatial-temporal manner is critical for gaining insight into the complex nature of multicellular development. To this end, our research takes a holistic approach and seeks to reconstruct the TRNs, that cut across multiple molecular programs, for modelling cell identity and controlling cell-fate decisions.

Leveraging on recent advances in single-cell omics technologies and our experitise in machine learning, deep learning, and statistical modelling, we develop computational models to reconstruct TRNs in stem cells and model their differentiation to specialised cell types. By employing a multidisiciplinary approach that combines 'dry' (computation) and 'wet' (laboratory) studies at the systems level, we contribute to the following research aims:

  • Develop computational methods to reconstruct TRNs for modelling cell identity.
  • Develop computational methods to guide TRN modulation for controlling cell fate-decisions.
  • Generate knowledge on how do different layers of TRNs coordinately regulate cell identity and cell fate-decisions.
  • Develop computational framework to assess fidelity of cell conversion for applications such as tissue engineering and stem cell therapy.

1. Single-cell transcriptomics and its application to cell fate decisions.

2. Deep learning and its application to systems biology.

3. Reconstruct trans-regulatory networks using multi-omics profiling and data integration.

Please contact me if you are interested in any of the above projects. Projects can be tailored to fit students research plan and interest.

  • 2024 The University of Sydney – National University of Singapore Ignition Grant
  • 2023 – 2024 Tianqiao and Chrissy Chen Institute (TCCI) support of development and mentorship of the Early Career Editorial Board of Stem Cell Reports, USA
  • 2023 Sydney Research Accelerator (SOAR) prize, The University of Sydney, Australia
  • 2021 Outstanding Mid-career Researcher Award, Australian Bioinformatics and Computational Biology Society
  • 2021 Metcalf Prize, National Stem Cell Foundation, Australia
  • 2020 – 2024 Investigator (Emerging Leadership Level 2), National Health and Medical Research Council
  • 2019 Charles Perkins Centre Exceptional Contribution Award, Australia
  • 2018 Robinson Fellowship (2025 – 2028, DVC Research), The University of Sydney, Australia
  • 2017 J G Russell Award, Australian Academy of Science, Australia (presented to 4 top-ranked DECRAs nationwide in 2017)
  • 2017 Finalist of Eureka Prize for Outstanding Early Career Researcher, Australian Museum
  • 2017 – 2019 Discovery Early Career Researcher Award (DECRA), Australian Research Council (ARC)
  • 2017 Paper of the Year Award, National Institute of Environmental Health Sciences, National Institutes of Health, USA
  • 2015 – 2017 University of Sydney Fellowship (DVC Research), The University of Sydney, Australia
  • 2015 Fellows Award for Research Excellence (2015), National Institutes of Health, USA
  • 2014 Paper of the Year Award, National Institute of Environmental Health Sciences, National Institutes of Health, USA
  • 2014 Fellows Award for Research Excellence (2014), National Institutes of Health, USA
  • 2012 – 2016 Research Fellowship, National Institutes of Health, USA
  • 2009 – 2012 National ICT Australia (NICTA) Research Project Award (NRPA), Australia
  • 2009 – 2012 National ICT Australia (NICTA) International Postgraduate Award (NIPA), Australia
  • 2009 Best Thesis Award on Master of Engineering, Chongqing Education Commission, China
  • 2008 Student Travel Award, International Conference on Pattern Recognition in Bioinformatics (PRIB 2008), Melbourne, Australia
  • 2005 – 2008 National Scholarship Award on Master of Engineering, Ministry of Education of the People’s Republic of China
Project titleResearch student
Interpretable deep learning-based approaches for precision medicineManoj M WAGLE
Developing Robust clustering methods for scRNA-seq data.Elijah WILLIE

Publications

Book Chapters

  • Yang, P., Zhou, B., Yang, J., Zomaya, A. (2014). Stability of Feature Selection Algorithms and Ensemble Feature Selection Methods in Bioinformatics. In Mourad Elloumi, Albert Y. Zomaya (Eds.), Biological Knowledge Discovery Handbook: Preprocessing, Mining and Postprocessing of Biological Data, (pp. 333-352). New Jersey: John Wiley & Sons. [More Information]

Journals

  • Shen, S., Werner, T., Lukowski, S., Andersen, S., Sun, Y., Shim, W., Mizikovsky, D., Kobayashi, S., Outhwaite, J., Yang, P., et al (2025). Atlas of multilineage stem cell differentiation reveals TMEM88 as a developmental regulator of blood pressure. Nature Communications, 16(1), 1356-1-1356-19. [More Information]
  • Chen, C., Kim, H., Yang, P. (2024). Evaluating spatially variable gene detection methods for spatial transcriptomics data. Genome Biology, 25 Open Access(1), Article 18 -1-Article 18 - 21. [More Information]
  • Wagle, M., Long, S., Chen, C., Liu, C., Yang, P. (2024). Interpretable deep learning in single-cell omics. Bioinformatics, 40(5), 8 pages. [More Information]

Conferences

  • Tang, T., Wu, H., Bao, W., Yang, P., Yuan, D., Zhou, B. (2019). New parallel algorithms for all pairwise computation on large HPC clusters. 20th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2019), : IEEE Computer Society. [More Information]
  • Yang, P., Liu, W., Yang, J. (2017). Positive Unlabeled Learning via Wrapper-Based Adaptive Sampling. 26th International Joint Conference on Artificial Intelligence (IJCAI 2017), Melbourne: International Joint Conferences on Artificial Intelligence. [More Information]
  • Yang, P., Liu, W., Zhou, B., Chawla, S., Zomaya, A. (2013). Ensemble-Based Wrapper Methods for Feature Selection and Class Imbalance Learning. 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), Berlin: Springer. [More Information]

2025

  • Shen, S., Werner, T., Lukowski, S., Andersen, S., Sun, Y., Shim, W., Mizikovsky, D., Kobayashi, S., Outhwaite, J., Yang, P., et al (2025). Atlas of multilineage stem cell differentiation reveals TMEM88 as a developmental regulator of blood pressure. Nature Communications, 16(1), 1356-1-1356-19. [More Information]

2024

  • Chen, C., Kim, H., Yang, P. (2024). Evaluating spatially variable gene detection methods for spatial transcriptomics data. Genome Biology, 25 Open Access(1), Article 18 -1-Article 18 - 21. [More Information]
  • Wagle, M., Long, S., Chen, C., Liu, C., Yang, P. (2024). Interpretable deep learning in single-cell omics. Bioinformatics, 40(5), 8 pages. [More Information]
  • Wu, Z., Zhao, Q., Kim, D., Yang, P., Hill, T., Jones, A., Fairlie, D., Pébay, A., Hewitt, A., Tam, P., et al (2024). Wnt dose escalation during the exit from pluripotency identifies tranilast as a regulator of cardiac mesoderm. Developmental Cell, 59(6), 705-722.e8. [More Information]

2023

  • Cao, Y., Ghazanfar, S., Yang, P., Yang, J. (2023). Benchmarking of analytical combinations for COVID-19 outcome prediction using single-cell RNA sequencing data. Briefings in Bioinformatics, 24(3). [More Information]
  • Kim, H., O'Hara-Wright, M., Kim, D., Loi, T., Lim, B., Jamieson, R., Gonzalez Cordero, A., Yang, P. (2023). Comprehensive characterization of fetal and mature retinal cell identity to assess the fidelity of retinal organoids. Stem Cell Reports, 18(1), 175-189. [More Information]
  • Xiao, D., Chen, C., Yang, P. (2023). Computational systems approach towards phosphoproteomics and their downstream regulation. Proteomics, 23(3 - 4), Article no. 2200068. [More Information]

2022

  • Yu, L., Cao, Y., Yang, J., Yang, P. (2022). Benchmarking clustering algorithms on estimating the number of cell types from single-cell RNA-sequencing data. Genome Biology, 23(1). [More Information]
  • Tran, A., Yang, P., Yang, J., Ormerod, J. (2022). Computational approaches for direct cell reprogramming: from the bulk omics era to the single cell era. Briefings in Functional Genomics, 21(4), 270-279. [More Information]
  • Fernando, M., Lee, S., Wark, J., Xiao, D., Lim, B., O'Hara-Wright, M., Kim, H., Smith, G., Wong, T., Teber, E., Yang, P., Graham, M., Gonzalez Cordero, A., et al (2022). Differentiation of brain and retinal organoids from confluent cultures of pluripotent stem cells connected by nerve-like axonal projections of optic origin. Stem Cell Reports, 17(6), 1476-1492. [More Information]

2021

  • Cao, Y., Yang, P., Yang, J. (2021). A benchmark study of simulation methods for single-cell RNA sequencing data. Nature Communications, 12(1), 6911. [More Information]
  • Kim, T., Tang, O., Vernon, S., Kott, K., Koay, Y., Park, J., James, D., Grieve, S., Speed, T., Yang, P., Figtree, G., John, O., Yang, J. (2021). A hierarchical approach to removal of unwanted variation for large-scale metabolomics data. Nature Communications, 12(1), 4992-1-4992-10. [More Information]
  • Kearney, A., Norris, D., Ghomlaghi, M., Wong, M., Humphrey, S., Carroll, L., Yang, G., Cooke, K., Yang, P., Geddes, T., James, D., Burchfield, J., et al (2021). Akt phosphorylates insulin receptor substrate to limit PI3K-mediated PIP3 synthesis. eLife, 10, 1-32. [More Information]

2020

  • Kim, H., Lin, Y., Geddes, T., Yang, J., Yang, P. (2020). CiteFuse enables multi-modal analysis of CITE-seq data. Bioinformatics, 36(14), 4137-4143. [More Information]
  • Azimi, A., Yang, P., Ali, M., Howard, V., Mann, G., Alexander, K., Fernandez Penas, P. (2020). Data Independent Acquisition Proteomic Analysis Can Discriminate between Actinic Keratosis, Bowen's Disease, and Cutaneous Squamous Cell Carcinoma. Journal of Investigative Dermatology, 140(1), 212-222. [More Information]
  • Cao, Y., Geddes, T., Yang, J., Yang, P. (2020). Ensemble deep learning in bioinformatics. Nature Machine Intelligence, 2(September 2020), 500-508. [More Information]

2019

  • Yang, P., Ormerod, J., Liu, W., Ma, C., Zomaya, A., Yang, J. (2019). AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications. IEEE Transactions on Cybernetics, 49(5), 1932-1943. [More Information]
  • Parker, B., Calkin, A., Seldin, M., Keating, M., Tarling, E., Yang, P., Moody, S., Liu, Y., Zerenturk, E., Needham, E., James, D., et al (2019). An integrative systems genetic analysis of mammalian lipid metabolism. Nature, 567(7747), 187-193. [More Information]
  • De Ridder, M., Klein, K., Yang, J., Yang, P., Lagopoulos, J., Hickie, I., Bennett, M., Kim, J. (2019). An Uncertainty Visual Analytics Framework for fMRI Functional Connectivity. Neuroinformatics, 17(2), 211-223. [More Information]

2018

  • Fazakerley, D., Chaudhuri, R., Yang, P., Maghazal, G., Cooke, K., Krycer, J., Humphrey, S., Parker, B., Fisher-Wellman, K., Meoli, C., Hoffman, N., Diskin, C., Burchfield, J., Yang, J., James, D., et al (2018). Mitochondrial CoQ deficiency is a common driver of mitochondrial oxidants and insulin resistance. eLife, 7(Article number e32111), 1-38. [More Information]

2017

  • Norris, D., Yang, P., Krycer, J., Fazakerley, D., James, D., Burchfield, J. (2017). An improved Akt reporter reveals intra- and inter-cellular heterogeneity and oscillations in signal transduction. Journal of Cell Science, 130, 2757-2766. [More Information]
  • Yang, P., Oldfield, A., Kim, T., Yang, A., Yang, J., Ho, J. (2017). Integrative analysis identifies co-dependent gene expression regulation of BRG1 and CHD7 at distal regulatory sites in embryonic stem cells. Bioinformatics, 33(13), 1916-1920. [More Information]
  • Cinghu, S., Yang, P., Kosak, J., Conway, A., Kumar, D., Oldfield, A., Adelman, K., Jothi, R. (2017). Intragenic Enhancers Attenuate Host Gene Expression. Molecular Cell, 68(1), 104-117. [More Information]

2016

  • Zheng, X., Yang, P., Lackford, B., Bennett, B., Wang, L., Li, H., Wang, Y., Miao, Y., Foley, J., Fargo, D., et al (2016). CNOT3-Dependent mRNA Deadenylation Safeguards the Pluripotent State. Stem Cell Reports, 7(5), 897-910. [More Information]
  • Yang, P., Patrick, E., Humphrey, S., Ghazanfar, S., James, D., Jothi, R., Yang, J. (2016). KinasePA: Phosphoproteomics data annotation using hypothesis driven kinase perturbation analysis. Proteomics, 16(13), 1868-1871. [More Information]
  • Minard, A., Tan, S., Yang, P., Fazakerley, D., Domanova, W., Parker, B., Humphrey, S., Jothi, R., Stoeckli, J., James, D. (2016). mTORC1 Is a Major Regulatory Node in the FGF21 Signaling Network in Adipocytes. Cell Reports, 17(1), 29-36. [More Information]

2015

  • Pathania, R., Ramachandran, S., Elangovan, S., Padia, R., Yang, P., Cinghu, S., Veeranan-Karmegam, R., Arjunan, P., Gnana-Prakasam, J., Sadanand, F., et al (2015). DNMT1 is essential for mammary and cancer stem cell maintenance and tumorigenesis. Nature Communications, 6, 1-11. [More Information]
  • Hoffman, N., Parker, B., Chaudhuri, R., Fisher-Wellman, K., Kleinert, M., Humphrey, S., Yang, P., Holliday, M., Trefely, S., Fazakerley, D., Stoeckli, J., Burchfield, J., James, D., et al (2015). Global Phosphoproteomic Analysis of Human Skeletal Muscle Reveals a Network of Exercise-Regulated Kinases and AMPK Substrates. Cell Metabolism, 22(5), 922-935. [More Information]
  • Yang, P., Zheng, X., Jayaswal, V., Hu, G., Yang, J., Jothi, R. (2015). Knowledge-Based Analysis for Detecting Key Signaling Events from Time-Series Phosphoproteomics Data. PLoS Computational Biology, 11(8), 1-18. [More Information]

2014

  • Yang, P., Patrick, E., Tan, S., Fazakerley, D., Burchfield, J., Gribben, C., Prior, M., James, D., Yang, J. (2014). Direction pathway analysis of large-scale proteomics data reveals novel features of the insulin action pathway. Bioinformatics, 30(6), 808-814. [More Information]
  • Lackford, B., Yao, C., Charles, G., Weng, L., Zheng, X., Choi, E., Xie, X., Wan, J., Xing, Y., Freudenberg, J., et al (2014). Fip1 regulates mRNA alternative polyadenylation to promote stem cell self-renewal. EMBO Journal, 33(8), 878-889. [More Information]
  • Oldfield, A., Yang, P., Conway, A., Cinghu, S., Freudenberg, J., Yellaboina, S., Jothi, R. (2014). Histone-fold domain protein NF-Y promotes chromatin accessibility for cell type-specific master transcription factors. Molecular Cell, 55(5), 708-722. [More Information]

2013

  • Humphrey, S., Yang, G., Yang, P., Fazakerley, D., Stockli, J., Yang, J., James, D. (2013). Dynamic adipocyte phosphoproteome reveals that akt directly regulates mTORC2. Cell Metabolism, 17(6), 1009-1020. [More Information]
  • Yang, P., Liu, W., Zhou, B., Chawla, S., Zomaya, A. (2013). Ensemble-Based Wrapper Methods for Feature Selection and Class Imbalance Learning. 17th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD 2013), Berlin: Springer. [More Information]

2012

  • Yang, P., Ma, J., Wang, P., Zhu, Y., Zhou, B., Yang, J. (2012). Improving X!Tandem on peptide identification from mass spectrometry by self-boosted percolator. IEEE - ACM Transactions on Computational Biology and Bioinformatics, 9(5), 1273-1280. [More Information]
  • Wang, P., Yang, P., Yang, J. (2012). OCAP: an open comprehensive analysis pipeline for iTRAQ. Bioinformatics, 28(10), 1404-1405. [More Information]
  • Yang, P., Humphrey, S., Fazakerley, D., Prior, M., Yang, G., James, D., Yang, J. (2012). Re-Fraction: A machine learning approach for deterministic identification of protein homologues and splice variants in large-scale MS-based proteomics. Journal of Proteome Research, 11(5), 3035-3045. [More Information]

2011

  • Yang, P., Ho, J., Yang, J., Zhou, B. (2011). Gene-gene interaction filtering with ensemble of filters. BMC Bioinformatics, 12(Supp 1: S10), 1-10. [More Information]
  • Yang, P., Zhang, Z., Zhou, B., Zomaya, A. (2011). Sample Subset Optimization for Classifying Imbalanced Biological Data. 15th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining, PAKDD 2011, Heidelberg, Germany: Springer. [More Information]

2010

  • Yang, P., Zhang, Z., Zhou, B., Zomaya, A. (2010). A clustering based hybrid system for biomarker selection and sample classification of mass spectrometry data. Neurocomputing, 73(13-15), 2317-2331. [More Information]
  • Wang, P., Yang, P., Arthur, J., Yang, J. (2010). A dynamic wavelet-based algorithm for pre-processing tandem mass spectrometry data. Bioinformatics, 26(18), 2242-2249. [More Information]
  • Yang, P., Ho, J., Zomaya, A., Zhou, B. (2010). A genetic ensemble approach for gene-gene interaction identification. BMC Bioinformatics, 11(524), 1-15. [More Information]

2009

  • Yang, P., Xu, L., Zhou, B., Zhang, Z., Zomaya, A. (2009). A particle swarm based hybrid system for imbalanced medical data sampling. BMC Genomics, 10(Suppl 3), S34-1-S34-14. [More Information]
  • Zhang, Z., Yang, P., Wu, X., Zhang, C. (2009). An Agent-Based Hybrid System for Microarray Data Analysis. IEEE Intelligent Systems, 24(5), 53-63. [More Information]
  • Yang, P., Zhang, Z. (2009). An embedded two-layer feature selection approach for microarray data analysis. IEEE Intelligent Informatics Bulletin, 10(1), 24-32.

Selected Grants

2024

  • Integrative statistical learning in trans-regulatory networks of stem and progenitor cells and disease, Yang P, DVC Research/Robinson Fellowship
  • Towards generating dopaminergic neuron containing stem cell-derived brain organoids, Sun X, Yang P, Long S, Zyner K, Chen C, Office of Global Engagement/Ignition Grants
  • Assembling the building blocks in the blueprint of the embryonic head, Tam P, Yang P, Australian Research Council (ARC)/Discovery Projects (DP)

2023

  • Evaluating human blastoids for modelling early human development, Yang P, DVC Research/Bridging Support Grant
  • Purification and cryopreservation of an allogeneic stem cell-derived photoreceptor cell product, Gonzalez Cordero A, Kim H, Tam P, Elwood N, Yang P, Department of Health and Aged Care (Federal)/MRFF - Stem Cell Therapies
  • Member of International Society for Stem Cell Research (ISSCR) Momentum Award selection committee (2024)
  • Member of Australian Bioinformatics and Computational Biology Society (ABACBS) committee (2024-present)
  • Early Career Editor of Stem Cell Reports (2023-present)
  • Member of Organising Committee of Sydney Bioinformatics Research Symposium (2022, 2023), Sydney, Australia
  • Member of Program Committee of GIW/ISMB-Asia conference (2022), Tainan, Taiwan
  • Theme chair, Oz Single Cell conference (2022), Gold Coast, Australia
  • Member of Program Committee of ABACBS conference (2021), Melbourne, Australia
  • Co-chair of Program Committee of ABACBS conference (2019), Sydney, Australia
  • Member of Postgraduate Research Excellence Award (PREA) review panel, Faculty of Science,The University of Sydney (2019)
  • Session chair of Australasian Genomic Technologies Association (AGTA) conference (2017), Hobart, Australia
  • Member of Program Committee of Pattern Recognition in Bioinformatics (PRIB) conference (2014), Stockholm, Sweden