TXGP RNAseq analysis

From Marcotte Lab
Revision as of 14:26, 14 June 2011 by Taejoon (Talk | contribs)

Scripts for BFAST

  • Prepare csfastq files: csfasta + qual --> fastq
$ solid2fastq -o reads foobar.csfasta foobar_QV.qual
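  • Prepare the database (reference) sequences: convert the FASTA file to BFAST's binary reference format in both color space (-A 1) and nucleotide space, then build a single color-space index. Multiple indexes are not used yet.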
$ bfast fasta2brg -f mygenome.fa -A 1
$ bfast fasta2brg -f mygenome.fa
$ bfast index -f mygenome.fa -m 1111111111111111111111 -d 1 -w 14 -A 1
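The three preparation commands above can be wrapped in a small helper so they can be previewed before running. This is only a sketch: prepare_reference, run, and DRY_RUN are hypothetical names that are not part of BFAST, and the bfast arguments are copied unchanged from the commands above.

```shell
#!/bin/bash
# Sketch only: prepare_reference, run and DRY_RUN are hypothetical names.

# Print the command when DRY_RUN=1, otherwise execute it.
run() {
  if [ "${DRY_RUN:-0}" = 1 ]; then
    echo "$@"
  else
    "$@"
  fi
}

# Usage: prepare_reference <genome.fa> [mask]
prepare_reference() {
  fasta=$1
  # Default mask is the one used in the index command above.
  mask=${2:-1111111111111111111111}
  run bfast fasta2brg -f "$fasta" -A 1    # color-space reference
  run bfast fasta2brg -f "$fasta"         # nucleotide-space reference
  run bfast index -f "$fasta" -m "$mask" -d 1 -w 14 -A 1
}
```

Calling `DRY_RUN=1 prepare_reference mygenome.fa` prints the three bfast commands without running them. Dropping DRY_RUN executes them, which assumes bfast is on the PATH.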
  • run-bfast-match.sh: a script to map every csfastq file in ../fastq/ to a reference FASTA file.
#!/bin/bash
# Map every ../fastq/*.fastq.gz file to the reference FASTA given as $1.
# -A 1 selects color space; -n 4 runs four threads.
FASTA=$1
if [[ -f "$FASTA" ]]
then
  echo "File: $FASTA"
  for FASTQ in ../fastq/*.fastq.gz
  do
    # Skip the literal pattern when the glob matches no files.
    [[ -e "$FASTQ" ]] || continue
    BASENAME=$(basename "$FASTQ" .fastq.gz)
    BMF="$BASENAME.bmf"
    BAF="$BASENAME.baf"
    SAM="$BASENAME.sam"
    echo "$FASTQ -- $FASTA --> $SAM"
    bfast match -A 1 -n 4 -f "$FASTA" -r "$FASTQ" -z > "$BMF"
    bfast localalign -A 1 -n 4 -f "$FASTA" -m "$BMF" > "$BAF"
    bfast postprocess -A 1 -n 4 -f "$FASTA" -i "$BAF" > "$SAM"
  done
else
  echo "Usage: run-bfast-match.sh <DB fasta file>" >&2
  exit 1
fi
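
The script writes three files per input, named after the FASTQ basename: .bmf from match, .baf from localalign, and .sam from postprocess. The naming can be checked without running BFAST. This is a minimal sketch in which output_names is a hypothetical helper and sample1 is a made-up file name.

```shell
#!/bin/bash
# Sketch only: output_names is a hypothetical helper, not part of BFAST.
# Prints the .bmf, .baf and .sam names derived from one FASTQ path.
output_names() {
  base=$(basename "$1" .fastq.gz)   # drop directory and .fastq.gz suffix
  echo "$base.bmf $base.baf $base.sam"
}
```

For example, `output_names ../fastq/sample1.fastq.gz` prints `sample1.bmf sample1.baf sample1.sam`. All outputs land in the current directory, not next to the input files.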

See also