Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

analyse multiple samples from the same individual together to increase power #129

Closed
rosaranli opened this issue Jun 6, 2023 · 7 comments

Comments

@rosaranli
Copy link

Thanks again for this wonderful tool. We had multiple samples from he same tumour and did single cell RNA sequencing. From other analysis, we knew that the samples from the same tumour share some, if not all, tumour clones. I was thinking of combining the multiple samples together to infer the CNV clones to increase power as some of the tumour samples may not have lots of cells. To do this, I combined the {sample}_allele_counts.tsv.gz from different samples and run run_numbet function but I got the error "'Inconsistent SNP genotypes; Are cells from two different individuals mixed together". I realised the error was caused that the same SNP may have different phased genotype. Just wondering since the tumour samples are from the same tumour, if they have different phased genotypes, does that mean they have different somatic mutations? If so, I would assume the cells from the same tumour sample could also have this problems as different cells may have different somatic mutations. But I never saw this happen using cells from the same tumour sample. Just wondering if you could give suggestions how to get around with this problem and analysis the cells from multiple samples together to increase the power?

@teng-gao
Copy link
Collaborator

teng-gao commented Jun 6, 2023

Have you tried the multi-sample mode of pileup_and_phase? You can provide BAMs and other inputs in comma delimited list.
https://kharchenkolab.github.io/numbat/articles/numbat.html#preparing-data

@rosaranli
Copy link
Author

Thank you very much for your quick reply. I thought that would generate multiple {sample}_allele_counts.tsv.gz files and cause the same problem. I will try it to see if it can solve the problem.

@rosaranli
Copy link
Author

Problem solved by running tumour samples together and combine the df files. Thank you very much for that. Most of the samples, I seem to see more clones than expected and some of the clones look false positive to me. Is it possible to specify the number of clones detected or how to adjust the parameters if I want to reduce these false positives? Many thanks!

@teng-gao
Copy link
Collaborator

Hi @rosaranli ,
You can adjust the number of clones after running Numbat:
https://kharchenkolab.github.io/numbat/articles/results.html#refine-subclones-on-the-phylogeny

@rosaranli
Copy link
Author

sorry for keep posting questions on the closed issue. I didn't express clearly. I was thinking how to change the number of clones in the bulk_clones_final.png so I could associate the clones with the allele frequency profiles.

@rosaranli
Copy link
Author

For example, in the bulk_clones_final.png in one of my analysis, most clones have the chr1 loss except one. But the one that show chr1 neural seems to have lower expression than 0 in terms of logFC, which indicate it may have lost chr1 or at least some of the cells in this clone. I was thinking of reducing the clone number to reevaluate the allele frequency profile.

@teng-gao
Copy link
Collaborator

@rosaranli Mind opening up a separate issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants