Data Packages

Downloadable DNA Barcoding Datasets

This BOLD Data Package module makes barcoding data discoverable and accessible to researchers, policy makers, and industry. Data is served using frictionless standards to maximize uptake and reuse.

Three classes of packages are available to serve a broad range of use cases:

  • Recent Data: Weekly snapshots of all public data for regular database synchronization. These data packages are impermanent and will only be hosted for one year.
  • Historical Data: Quarterly and yearly snapshots to be used as citable reference libraries. The provided data packages will persist in perpetuity.
  • Project Packages: Datasets from completed international, national or thematic projects.



Recent Data

The two most recent weekly snapshots of Public data on BOLD will be available below.

Snapshot Date Specimens Sequences Description
22-MAY-2026 23,526,890 23,909,032 BOLD DNA Barcode Reference Library snapshot taken on May 22, 2026. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download
15-MAY-2026 23,485,847 23,867,791 BOLD DNA Barcode Reference Library snapshot taken on May 15, 2026. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download

* Data partners looking for snapshot perma-links should contact the BOLD Support (support@boldsystems.org)



Historical Data

Quarterly snapshots of the Public data on BOLD over the past year.

Snapshot Date Specimens Sequences Description
27-MAR-2026 22,605,991 22,985,766 BOLD DNA Barcode Reference Library snapshot taken on Mar 27, 2026. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download
26-DEC-2025 22,098,815 22,461,407 BOLD DNA Barcode Reference Library snapshot taken on Dec 26, 2025. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download
26-SEP-2025 20,614,947 20,965,204 BOLD DNA Barcode Reference Library snapshot taken on Sep 26, 2025. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download
27-JUN-2025 19,757,886 20,103,547 BOLD DNA Barcode Reference Library snapshot taken on Jun 27, 2025. Data is provided in TSV and FASTA formats along with metadata files in JSON format following BCDM. Download


Project Specimens Sequences Description
CBG.R5.02-Mar-2026 277,853 277,853 This is the fifth data release from the Centre for Biodiversity Genomics (CBG). It marks the continuation of a strict data release policy and its commitment to open science. This dataset encompasses records generated over a 6 month period. The records originate from over 180 countries and represent 7K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. CBG releases this data to support global biodiversity research and advance collaboration within the biodiversity science community. Download
CBG.R4.01-Sep-2025 803,360 803,360 This is the fourth data release from the Centre for Biodiversity Genomics (CBG). It marks the continuation of a strict data release policy and its commitment to open science. This dataset encompasses records generated over a 6 month period. The records originate from over 200 countries and represent 24K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. CBG releases this data to support global biodiversity research and advance collaboration within the biodiversity science community. Download
CBG.R3.01-Mar-2025 783,225 783,225 This is the third data release from the Centre for Biodiversity Genomics (CBG). It marks the continuation of a strict data release policy and its commitment to open science. This dataset encompasses records generated over the year. The records originate from over 25 countries and represent 4K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. CBG releases this data to support global biodiversity research and advance collaboration within the biodiversity science community. Download
CBG.R2.13-Dec-2024 1,145,004 1,145,004 This is the second data release from the Centre for Biodiversity Genomics (CBG). It marks the continuation of a strict data release policy and its commitment to open science. This dataset encompasses records generated over the past three years. The records originate from over 100 countries and represent 12K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. CBG releases this data to support global biodiversity research and advance collaboration within the biodiversity science community. Download
CBG.R1.21-Mar-2024 4,183,458 4,183,458 This is the initial data release from the Centre for Biodiversity Genomics (CBG). It marks the adoption of a strict data release policy and signals its commitment to open science. This dataset, likely the largest and most diverse DNA barcode dataset released, encompasses records generated over a 15-year period, with the majority produced in the past three years. The records originate from over 180 countries and represent 351K species. All records underwent validation, though errors may persist. Efforts were made to ensure validity at least to the family level. CBG aims to support global biodiversity research and advance collaborative efforts across biodiversity science community by releasing this data. Download
iBOLD.31-Dec-2016 2,787,799 2,799,047 BARCODE 500K program was the inaugural program of the International Barcode of Life (iBOL) consortium. It delivered DNA barcodes for five hundred thousand species sourced from a global network of collections. The program was initiated in 2009 and concluded in 2015. This dataset includes barcodes from reference museum specimens as well as new collections generated from environmental samples. Download
CBN.31-Dec-2008 94,318 102,707 The Canadian Barcode of Life Network project's goal was dramatically advance the inventory of Canadian biodiversity. The project was initiated 2005 and concluded in 2009. This dataset includes barcodes from groups of particular economic and social interest in Canada as well as samples from a wide range of other species. Download



Copyright BOLD © 2014-2026