Amplification Bias - 2


This post takes a further look at the paper "Calibrating genomic and allelic coverage bias in single-cell sequencing" by Zhang et al.

In particular, we reproduce Figure 2a, cited in our earlier post. The data set can be retrieved as a BAM file from SRA under accession SRR1779331. The coverage data can be extracted for chromosome 1 (the only contig used for the graph) with samtools depth.
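As a rough sketch of the extraction step: samtools depth writes tab-separated lines of contig, 1-based position, and depth, and by default skips positions with zero coverage. Something like `samtools depth -r 1 SRR1779331.bam > chr1.depth` would produce such a file, but the file names here are hypothetical, and the contig may be called `1` or `chr1` depending on the reference used for alignment. The helper below (`read_depth` is our own name, not part of samtools) reads that output for one contig, using a made-up sample in place of the real file.

```python
import io


def read_depth(handle, contig):
    """Collect positions and depths for one contig from samtools depth output.

    Each input line is expected to hold three tab-separated fields:
    contig name, 1-based position, depth.
    """
    positions, depths = [], []
    for line in handle:
        line = line.rstrip("\n")
        if not line:
            continue  # tolerate blank lines
        name, pos, depth = line.split("\t")[:3]
        if name == contig:
            positions.append(int(pos))
            depths.append(int(depth))
    return positions, depths


# Made-up sample standing in for the real depth file (assumption, not real data).
sample = io.StringIO("1\t10001\t3\n1\t10002\t4\n1\t10005\t1\n2\t500\t7\n")
positions, depths = read_depth(sample, "1")
```

Because uncovered positions are omitted, the result is naturally a sparse representation of the coverage track: only the covered positions and their depths are stored.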

To compute the autocorrelation, we considered several techniques and settled on the sparse matrix library from SciPy. That approach is described in this post.
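To give a flavor of the idea, here is a minimal sketch of a lagged autocorrelation computed on a sparse coverage vector. It is not necessarily the method from the linked post or the exact normalization used in the paper: it assumes a plain Pearson correlation between the track and a shifted copy of itself, and the function name `sparse_autocorr` and the demo values are ours. The point is that the sums needed for the correlation can be taken over the sparse data directly, without first subtracting the mean (which would fill in all the zeros).

```python
import numpy as np
from scipy import sparse


def sparse_autocorr(positions, depths, length, lags):
    """Pearson correlation between a sparse coverage track and itself at each lag.

    positions are 1-based; positions not listed are treated as zero depth.
    """
    cols = np.asarray(positions) - 1  # convert to 0-based column indices
    rows = np.zeros(len(cols), dtype=int)
    x = sparse.csr_matrix(
        (np.asarray(depths, dtype=float), (rows, cols)), shape=(1, length)
    )
    result = []
    for k in lags:
        m = length - k             # number of overlapping positions
        a = x[:, :m]               # track without its last k positions
        b = x[:, k:]               # track shifted by k positions
        mean_a = a.sum() / m
        mean_b = b.sum() / m
        # Moments computed from sparse products only; zeros contribute nothing.
        cov = a.multiply(b).sum() / m - mean_a * mean_b
        var_a = a.multiply(a).sum() / m - mean_a ** 2
        var_b = b.multiply(b).sum() / m - mean_b ** 2
        result.append(cov / np.sqrt(var_a * var_b))
    return result


# Made-up demo track of length 10 (assumption, not real coverage data).
demo_positions = [3, 4, 7, 9, 10]
demo_depths = [3, 4, 1, 2, 5]
demo_lags = [0, 1, 2]
demo_result = sparse_autocorr(demo_positions, demo_depths, 10, demo_lags)
```

On a small input like this the values can be checked against `np.corrcoef` applied to the dense arrays. On a whole chromosome the sparse form avoids allocating a dense shifted copy for every lag.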

The figure we obtained does in fact reproduce Zhang's Figure 2a:
