Downloads
ClinVar provides files for download in several formats and with different degrees of coverage.
XML files
- FTP directory for XML files
- XML files represent the complete public data set, including all variants in ClinVar and all data types for each variant
- XML is available aggregated by variant (VCV files) or aggregated by variant-condition pairs (RCV files)
- Updated weekly; only the release on the first Thursday of the month is archived
- More information is available in the README for ClinVar's XML files
VCF
- FTP directory for VCF files based on GRCh37
- FTP directory for VCF files based on GRCh38
- Limited to variants with a precise location
- variants with imprecise start and stop, such as exon deletions and CNVs detected by microarray, are not available in VCF files
- Limited to summary-level data for each variant
- Updated weekly; only the release on the first Thursday of the month is archived
- More information is available in the README for ClinVar's VCF files
ClinVar TSV files
- FTP directory for TSV files
- TSV files provide several slices of summary-level data for variants, genes, and submitting organizations
- Files are comprehensive for all variants in ClinVar
- Updated weekly; see the README file for which TSV files are archived monthly
Other TSV files
- disease names lists the preferred names and database identifiers used in GTR and ClinVar for diseases
- gene_condition_source_id lists gene-disease relationships used in ClinVar, Gene, GTR and MedGen
- Files are comprehensive for all diseases in ClinVar
- Updated daily
More information is available about ClinVar's release cycle and about ClinVar's FTP files.