-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Valid V3 reads ending up in the unmapped files #228
Comments
I was unable to reproduce this when re-running this sample locally.
Using master branch on my workstation (commit 67566cd) I obtained the following:
|
I just checked out release 6.8.1 on my workstation (commit aa76593) and still could not reproduce this. Hopefully it is not stochastic. I will repeat to check. |
On Don's suggestion, copying the required sample files to a temp folder on cluster and running 6.8 from there. It may be a samtools or bowtie2 versioning issue. |
Reproduced issue on cluster. Either |
I ran the development pipeline with |
samtools-1.1 failed to remedy this issue. Now checking if it is a bowtie2 versioning problem. |
Running |
Calling |
I just pulled the |
I just reproduced the problem on my workstation. I ran the code from the v6.8.1 tag with bowtie2 2.2.1 and samtools 0.1.18. I saw 136199 unmapped reads in sample 63249A-V3-3. |
Funny - I've reverted back to |
I ran it once on my workstation with standalone Micall 7.0 and bowtie2 2.2.1, and only saw 2800 unmapped reads, so that's a good sign. However, when I ran it with the data that had been censored for high error rates in the phiX data, I saw 108352 unmapped reads. |
Going back to the pipeline |
I ran the 63249A sample with Micall 7.0, and looked at all the variants that were successfully called in the
It looks like the censoring is affecting the X4 reads at the same rate as the other reads, so the only remaining question is why the Micall results disagree with the 454 results. According to Conan, the 454 called 23% of the reads X4. When I calculated the same totals for the |
After a few bug fixes, I reran the 63249A sample with Micall 7.0 as above. I counted the following:
|
See sample 63249A-V3.3 from run 150529, it has too many unmapped reads. When it was processed in production under Micall 6.8, it had 136199 unmapped reads. When Art processed it on his workstation, it only had 43125 unmapped reads. Try to see what the difference is, and fix the problem.
Updated Summary
There were several bugs in the G2P clipping, so we fixed those. The Micall 7.0 pipeline now calls 19% X4 compared to the 454 calling 23% in this sample.
is_ruby_compatible
flag fromsam_g2p.py
.The text was updated successfully, but these errors were encountered: