1 research outputs found

    Python script for filtering .SAM formatted mapping files aligned with BWA mem

    No full text
    The filter.py script works on sorted .SAM formatted mapping files from BWA mem alignment. For every pair of mapped forward and reverse reads, it parses out the CIGAR field (column 6 of the SAM file) and the MD tag to calculate the number of insertions, deletions, and mismatches. If a pair of reads have mismatches less than or equal to five and insertion/deletions less than or equal to two, then the pair is kept and printed to linux standard output. SAM header lines are ignored by the parser but also printed to standard output for compatible down-stream analysis
    corecore