@torinkwok
Last active July 5, 2019 06:22
Retrieve data for doing cancer calling evaluation using synthetic dataset 3
#!/bin/bash
set -eu -o pipefail
# Retrieve data for evaluating cancer variant calling with synthetic dataset 3
# from the ICGC-TCGA DREAM challenge:
# https://www.synapse.org/#!Synapse:syn312572/wiki/62018
# ==============================
# Exome only data
# ------------------------------
# -f makes curl exit non-zero on HTTP errors, so `set -e` aborts the script
# instead of silently saving an error page under the .fq.gz name.
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/synthetic_challenge_set3_normal_NGv3_1.fq.gz
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/synthetic_challenge_set3_normal_NGv3_2.fq.gz
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/synthetic_challenge_set3_tumor_NGv3_1.fq.gz
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/synthetic_challenge_set3_tumor_NGv3_2.fq.gz
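# ------------------------------
# Optional integrity check (a sketch, not part of the original gist).
# The FASTQ archives are large, and a truncated download would otherwise only
# surface later, in whatever tool reads them. check_gzip_integrity is a
# hypothetical helper name; it runs `gzip -t` on each file it is given,
# skips arguments that do not exist (for example an unmatched glob pattern),
# and returns non-zero if any archive fails the test.
check_gzip_integrity() {
  local status=0 f
  for f in "$@"; do
    [ -e "$f" ] || continue          # nothing to check for this argument
    if gzip -t "$f" 2>/dev/null; then
      echo "OK       $f"
    else
      echo "CORRUPT  $f" >&2
      status=1
    fi
  done
  return "$status"
}
# Under `set -e`, a failed check stops the script here.
check_gzip_integrity synthetic_challenge_set3_*.fq.gz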
# ==============================
# Evaluation data and BED files
# ------------------------------
TUMOR_20PCTMASKED_TRUTH="synthetic_challenge_set3_tumor_20pctmasked_truth.tar.gz"
# -f: fail on HTTP errors rather than saving the error body to disk.
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/"$TUMOR_20PCTMASKED_TRUTH"
tar -xzvpf "$TUMOR_20PCTMASKED_TRUTH"
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/dream/refseq-merged.bed.gz
curl -fsSO https://s3.amazonaws.com/bcbio_nextgen/NGv3.bed.gz
gunzip *.bed.gz