Hi there, I'm Yue ZHAO (赵越 in Chinese)! 👋
I consider myself as an expert in outlier detection systems (ODSys)---I build automated, scalable ODSys to support real-world applications in security and finance with millions of downloads. I designed CPU-based (PyOD), GPU-based (TOD), distributed detection systems (SUOD) for tabular, time-series (TODS), and graph data (PyGOD).
Community: I am organizing the largest open-source community of outlier detection, including (1) the most popular detection system PyOD (6M downloads, top 10 data mining projects on GitHub) (2) the most watched and starred Anomaly Detection Resources with hundreds of books, tutorial, and papers (5k+ stars) (3) the discussion groups with hundreds of outlier detection researchers and practitioners from MIT, Meta, and more (see contact section below for joining).
- Outlier detection systems (JMLR'19, AAAI'21, MLsys'21, NeurIPS'21)
- Outlier detection algorithms (IJCNN'18, SDM'19, BigData'20, ICDM'20, TKDE'22)
- Automated outlier detection (NeuIPS'21)
- AI x Science (AAAI'20, NeuIPS'21)
At CMU, I work with Prof. Leman Akoglu from DATA Lab on outlier detection, Prof. Zhihao Jia from Catalyst on machine learning systems, and Prof. George H. Chen on general ML and statistic methods. Externally, I am also fortunate to visit and collaborate with Prof. Jure Leskovec at Stanford University.
- [JMLR] PyOD: A Python Toolbox for Scalable Outlier Detection (Anomaly Detection).
- TOD: Tensor-based outlier detection--First large-scale GPU-based system for acceleration!
- [MLSys] SUOD: An Acceleration System for Large-scale Heterogeneous Outlier Detection.
- PyTorch Geometric (PyG): Graph Neural Network Library for PyTorch. Contributed to profiler & benchmarking, and heterogeneous data transformation, as a member of the PyG team.
- [NeurIPS] Therapeutics Data Commons (TDC): An extensive machine learning data hub for drug discovery.
- [AAAI] combo: A Python Toolbox for ML Model Combination (Ensemble Learning).
- [NeurIPS, AAAI] TODS: Time-series Outlier Detection. Contributed to core detection models.
- [NeurIPS] MetaOD: Automatic Unsupervised Outlier Model Selection (AutoML).
- collaboration opportunities (anytime & anywhere & any type)
- paper review, tutorial, workshop, and talk opportunities
-
Apr 2022: We released PyGOD (Python Graph Outlier Detection). With PyGOD, you could do anomaly detection with the latest graph neural networks in 5 lines!
-
Mar 2022: Invited to present at Morgan Stanley for large-scale anomaly detection systems!
-
Mar 2022:
🎉 I received the prestigious 2022 Norton Labs Graduate Fellowship (one of the two graduate students worldwide). Thanks to the selection committee and my advisors! -
Mar 2022: ECOD: Unsupervised Outlier Detection Using Empirical Cumulative Distribution Functions is accepted to IEEE Transactions on Knowledge and Data Engineering (TKDE)! ECOD is a simple yet effective detection algorithm with extremely fast O(nd) runtime.
-
Feb 2022:
🌟 Reached 700 citations on Google Scholar! -
Feb 2022: Invited talk at Tesla for large-scale anomaly detection.
-
Feb 2022: Propose a new initiative called Detected AI (detected.ai) for large-scale anomaly detection applications. It is still too early to tell, but it will be exciting!
-
Feb 2021: Have new system out TOD: GPU-accelerated Outlier Detection via Tensor Operations*. with George H. Chen and Zhihao Jia. Preprint, Code being released
- TOD is the first fast, comprehensive, GPU-based outlier detection system.
🌟 on average it is 11 times faster than PyOD!🌟 it supports various OD algorithms, e,g., kNN, LOF, ABOD, HBOS, etc.

Formed in 2009, the Archive Team (not to be confused with the archive.org Archive-It Team) is a rogue archivist collective dedicated to saving copies of rapidly dying or deleted websites for the sake of history and digital heritage. The group is 100% composed of volunteers and interested parties, and has expanded into a large amount of related projects for saving online and digital history.
