This repository contains data indexes from NIST’s Genome in a Bottle (GIAB) project. The indexes for sequences and alignments are also available on the NIST GIAB FTP site.
Submissions are validated using grz-cli v0.1.4.
grz-cli validate: PASS
QC pipeline: UNKNOWN

Garvan NA12878 HG001 HiSeq Exome
Panel FASTQ files were created from the panel gene list CSV, .bed, and .bam files.
Generate Panel BED File:
Run the script ./extract_panel_bed_file.sh to create the panel .bed file.
Intersect BAM with Panel BED:
Use bedtools intersect to filter the input .bam file with the panel .bed file:
bedtools intersect -abam input.bam -b panel.bed > panel.bam
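A minimal sketch of how the intersect step above could be wrapped with basic sanity checks. The function name intersect_panel_bam and its argument order are hypothetical and not part of this repository; it assumes bedtools and samtools are on the PATH.

```shell
# Hypothetical helper (not part of this repository): filter a BAM to the
# panel regions and report how many alignment records were kept.
# Usage: intersect_panel_bam <input.bam> <panel.bed> <output.bam>
intersect_panel_bam() {
    local in_bam="$1"
    local panel_bed="$2"
    local out_bam="$3"
    local f total kept

    # Fail early if either input is missing or empty.
    for f in "$in_bam" "$panel_bed"; do
        [ -s "$f" ] || { echo "missing or empty file: $f" >&2; return 1; }
    done

    # Same command as in the step above, with explicit file arguments.
    bedtools intersect -abam "$in_bam" -b "$panel_bed" > "$out_bam" || return 1

    # samtools view -c prints the number of alignment records in a file.
    total=$(samtools view -c "$in_bam") || return 1
    kept=$(samtools view -c "$out_bam") || return 1
    echo "kept ${kept} of ${total} alignments"
}
```

Calling intersect_panel_bam input.bam panel.bed panel.bam would then write panel.bam and print a one-line summary. A kept count of zero usually points to a chromosome naming mismatch between the BAM and the BED (for example chr1 versus 1).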
Run the convert_bam_to_fastq.sh script to convert the panel BAMs to paired-end FASTQ files:
./convert_bam_to_fastq.sh <BAM_FILE_PREFIX> <OUT_FASTQ_FILE_PREFIX>
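The contents of convert_bam_to_fastq.sh are not shown here, so the following is only a sketch of one common way to do this conversion with samtools, not the repository's actual script. It assumes that <BAM_FILE_PREFIX> is the BAM path without the .bam extension and that the outputs are named with _R1/_R2 suffixes; both are assumptions made for illustration.

```shell
# Hypothetical stand-in for convert_bam_to_fastq.sh; the real script may differ.
# Usage: bam_to_paired_fastq <BAM_FILE_PREFIX> <OUT_FASTQ_FILE_PREFIX>
bam_to_paired_fastq() {
    local bam_prefix="$1"   # assumed: path without the .bam extension
    local out_prefix="$2"   # assumed: outputs get _R1/_R2 suffixes

    [ -s "${bam_prefix}.bam" ] || {
        echo "missing or empty BAM: ${bam_prefix}.bam" >&2
        return 1
    }

    # collate groups alignments by read name so that mates are adjacent;
    # fastq then writes first mates to -1 and second mates to -2.
    # Reads without a mate (-s) and unflagged reads (-0) are discarded.
    samtools collate -u -O "${bam_prefix}.bam" |
        samtools fastq -n \
            -1 "${out_prefix}_R1.fastq.gz" \
            -2 "${out_prefix}_R2.fastq.gz" \
            -0 /dev/null -s /dev/null -
}
```

Because bedtools intersect filters individual alignments, a read whose mate falls outside the panel regions can end up without a partner in the panel BAM. This sketch drops such singletons so that the R1 and R2 files stay in sync.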
Use bedtools intersect to filter the input .vcf file with the panel .bed file:
bedtools intersect -header -a input.vcf -b panel.bed > panel.vcf
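As a rough illustration, the VCF filtering step can be followed by a check that the header survived and a count of the records kept. The helper names count_vcf_records and intersect_panel_vcf are hypothetical, and the sketch assumes an uncompressed VCF.

```shell
# Count data records (non-empty lines that do not start with '#')
# in an uncompressed VCF. grep -c exits non-zero when the count is 0,
# so the exit status is deliberately ignored.
count_vcf_records() {
    grep -c '^[^#]' "$1" || true
}

# Hypothetical helper (not part of this repository): filter a VCF to the
# panel regions and report how many records were kept.
# Usage: intersect_panel_vcf <input.vcf> <panel.bed> <output.vcf>
intersect_panel_vcf() {
    local in_vcf="$1"
    local panel_bed="$2"
    local out_vcf="$3"

    # Same command as in the step above; -header copies the VCF header.
    bedtools intersect -header -a "$in_vcf" -b "$panel_bed" > "$out_vcf" || return 1

    # A valid VCF starts with a ##fileformat line; warn if it is absent.
    head -n 1 "$out_vcf" | grep -q '^##fileformat=VCF' ||
        echo "warning: VCF header missing in $out_vcf" >&2

    echo "kept $(count_vcf_records "$out_vcf") of $(count_vcf_records "$in_vcf") records"
}
```

If panel.bed contains overlapping intervals, a variant can be reported once per overlapping interval. Adding the bedtools intersect option -u, which reports each entry in -a at most once, is one way to avoid such duplicates.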
grz-cli validate: PASS
QC pipeline: FAIL (Low coverage of data)

Garvan NA12878 HG001 HiSeq Exome
grz-cli validate: FAIL
QC pipeline: UNKNOWN
grz-cli validate: PASS
QC pipeline: UNKNOWN

HG008 Liss Lab BCM Illumina-WGS (2024-03-13)
grz-cli validate: FAIL
QC pipeline: UNKNOWN