The Wayback Machine - https://web.archive.org/web/20220619125438/https://github.com/yzhao062
Skip to content
Avatar
๐Ÿ’œ
Limited Bandwidth
๐Ÿ’œ
Limited Bandwidth

Highlights

  • Pro

Organizations

@pyg-team @pygod-team @Open-Source-ML
Block or Report

Block or report yzhao062

Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
yzhao062/README.md

Hi there, I'm Yue ZHAO (่ตต่ถŠ in Chinese)! ๐Ÿ‘‹

๐ŸŒฑ Short Bio: My name is Yue ZHAO (่ตต่ถŠ in Chinese). I am a third-year Ph.D. student at Carnegie Mellon University (CMU). Before joining CMU, I earned my Master degree from University of Toronto (2016), and worked as a senior consultant at PwC Canada (2019). I am an expert on anomaly detection (a.k.a outlier detection) algorithms, systems, and its applications in security, healthcare, and Finance, with more than 7 year professional experience and 20+ papers (in JMLR, TKDE, NeurIPS, etc.). My research is partly supported by Norton Labs Graduate Fellowship. See my homepage and CV for more information.

Contributions to outlier detection systems and applications: I build automated, scalable, and accelerated machine learning systems (MLSys) to support large-scale, real-world outlier detection applications in security, finance, and healthcare with millions of downloads. I designed CPU-based (PyOD), GPU-based (TOD), distributed detection systems (SUOD) for tabular (PyOD), time-series (TODS), and graph data (PyGOD). My work has been widely used by thousands of projects, including leading firms like IBM, Morgan Stanley, and Tesla. See more applications.

๐Ÿ”ญ Research outcomes (related to outlier detection if not specified):

Primary field Secondary Method Year Venue Lead author
large-scale Benchmark anomaly detection ADBench 2022 Preprint Y
machine learning systems PyOD 2019 JMLR Y
machine learning systems time series TODS 2020 AAAI
machine learning systems benchmark TODS 2021 NeurIPS
machine learning systems SUOD 2021 MLSys Y
machine learning systems distributed systems TOD 2022 Preprint Y
machine learning systems graph neural networks PyGOD 2022 Preprint Y
ensemble learning semi-supervised XGBOD 2018 IJCNN Y
ensemble learning LSCP 2019 SDM Y
ensemble learning machine learning systems combo 2020 AAAI Y
ensemble learning interpretable ML COPOD 2020 ICDM Y
ensemble learning interpretable ML ECOD 2022 TKDE Y
automated machine learning graph mining AutoAudit 2022 BigData
automated machine learning MetaOD 2021 NeurIPS Y
graph neural networks contrastive learning CONAD 2022 PAKDD
AI x Science large-scale Benchmark HR manage. 2018 Intellisys Y
AI x Science CIBS 2020 BIBM
AI x Science PyHealth 2020 Preprint Y
AI x Science large-scale Benchmark TDC 2021 NeurIPS

At CMU, I work with Prof. Leman Akoglu (DATA Lab), Prof. Zhihao Jia (Catalyst), and Prof. George H. Chen. Externally, I collaborate with Prof. Jure Leskovec at Stanford University and Prof. Xia "Ben" Hu at Rice University.


โšก Open-source Contribution: I have led or contributed as a core member to more than 10 ML open-source initiatives, receiving 13,000 GitHub stars (top 0.002%: ranked 800 out of 40M GitHub users) and >10,000,000 total downloads. Popular ones:

  • PyOD: A Python Toolbox for Scalable Outlier Detection (Anomaly Detection).
  • ADBench: The most comprehensive tabular anomaly detection benchmark (30 anomaly detection algorithms on 55 benchmark datasets).
  • TOD: Tensor-based outlier detection--First large-scale GPU-based system for acceleration!
  • SUOD: An Acceleration System for Large-scale Heterogeneous Outlier Detection.
  • anomaly-detection-resources: The most starred resources (books, courses, etc.)!
  • PyTorch Geometric (PyG): Graph Neural Network Library for PyTorch. Contributed to profiler & benchmarking, and heterogeneous data transformation, as a member of the PyG team.
  • Python Graph Outlier Detection (PyGOD): A Python Library for Graph Outlier Detection.
  • Therapeutics Data Commons (TDC): Machine learning for drug discovery.
  • combo: A Python Toolbox for ML Model Combination (Ensemble Learning).
  • TODS: Time-series Outlier Detection. Contributed to core detection models.
  • MetaOD: Automatic Unsupervised Outlier Model Selection (AutoML).

๐Ÿ“ซ Contact me by:


๐Ÿ’ฌ News & Travel:

  • Jun 2022: We just released a 36-page, the most comprehensive anomaly detection benchmark paper. The fully open-sourced ADBench compares 30 anomaly detection algorithms on 55 benchmark datasets. Please star, fork, and follow for the latest update! See paper here!

  • Jun 2022: Have a new system out TOD: GPU-accelerated Outlier Detection via Tensor Operations. with George H. Chen and Zhihao Jia. Preprint, Code being released

  • TOD is the first fast, comprehensive, GPU-based outlier detection system.

    • ๐ŸŒŸ on average it is 11 times faster than PyOD!
    • ๐ŸŒŸ it supports various OD algorithms, e,g., kNN, LOF, ABOD, HBOS, etc.
  • Jun 2022: ๐ŸŒŸ Reached 900 citations on Google Scholar!

  • May 2022: Invited to present at Morgan Stanley for automated outlier detection!

  • Apr 2022: PyGOD (Python Graph Outlier Detection) received 400+ stars in a week! We released PyGOD (Python Graph Outlier Detection). With PyGOD, you could do anomaly detection with the latest graph neural networks in 5 lines! See paper here!


Yue's github stats Top Langs

Pinned

  1. pyod Public

    A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)

    Python 5.7k 1.1k

  2. Official Implement of "ADBench: Anomaly Detection Benchmark".

    Python 128 12

  3. Anomaly detection related books, papers, videos, and toolboxes

    Python 6k 1.5k

  4. A Python Library for Graph Outlier Detection (Anomaly Detection)

    Python 536 43

  5. Graph Neural Network Library for PyTorch

    Python 14.8k 2.7k

  6. pytod Public

    TOD: Tensor-based Outlier Detection

    Python 72 11

369 contributions in the last year

Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun Mon Wed Fri

Contribution activity

June 2022

Created 3 commits in 2 repositories
Created 1 repository