In v0.9.0 we started offering --keep_intermediate_avro_files. Avro files are a great candidate for long term storage of variant data for several reasons, mainly because they can be loaded into BigQuery fast and free.
The only missing data is information in VCF header. Storing them with AVRO files will enable us to recov
Collection of simple C scripts for parsing vcf or bam files using the htslib C library. These scripts can be used as the starting point for more complex scripts
Accelerate the generation of personalized proteomes from a Variant calling format (VCF) file and a reference proteome using graphical processing units (GPUs).
VCF: Variant Call Format files are difficult to understand at first, I already had to dive into them in order to learn how to edit them, so you can use my scripts. Some of these functions already exist in vcftools, but it seems it stopped being updated. so you can use these scripts to do various tasks with them
In v0.9.0 we started offering
--keep_intermediate_avro_files. Avro files are a great candidate for long term storage of variant data for several reasons, mainly because they can be loaded into BigQuery fast and free.The only missing data is information in VCF header. Storing them with AVRO files will enable us to recov