Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

issues with assigning GPSCs #311

Open
johnlees opened this issue May 21, 2024 · 4 comments
Open

issues with assigning GPSCs #311

johnlees opened this issue May 21, 2024 · 4 comments

Comments

@johnlees
Copy link
Member

          I am also having issues with assigning GPSCs -- the external clustering file labels every strain as "NA" even though the GPSC of most of these strains have been previously established. My cytoscape network also produces a lot more distinct clusters than I'd expect given how closely related the strains are, so I wonder if these issues are related at all to the previous one.

Originally posted by @sydelstan in #309 (comment)

@johnlees johnlees changed the title issues with assigning GPSCs -- the external clustering file labels every strain as "NA" even though the GPSC of most of these strains have been previously established. My cytoscape network also produces a lot more distinct clusters than I'd expect given how closely related the strains are, so I wonder if these issues are related at all to the previous one. issues with assigning GPSCs May 21, 2024
@johnlees
Copy link
Member Author

@sydelstan opening new issue to track this

I'd require more information to diagnose this: command and ideally a minimal reproducible example (e.g. a genome that causes this issue)

@sydelstan
Copy link

Hello,

I used the code

poppunk_assign --db GPS_v8_ref --external-clustering GPS_v8_external_clusters.csv --query qfile.txt --output

based on these instructions:

https://www.pneumogen.net/gps/#/training

These are some of the accession numbers I used:

ERR884352
ERR884330
ERR1788221
ERR025290
SRR3661255
ERR884389

@johnlees
Copy link
Member Author

I have checked with my colleague @HarryHung in the GPS project who says the following:

Checking ERR884352
It is one of the GPS samples with:
Public Name: GPS_PL_1533_10
This sample has been re-analysed by the GPS Pipeline just few days ago, and its GPSC is correctly assigned to GPSC 12 with v8_ref database and PopPUNK v2.6.3 (via Docker image staphb/poppunk:2.6.3)

I am not sure what is going on their side. I wonder if they are using raw reads as input (in my pipeline, pre-processing and de novo assembly were done before passing the assemblies to PopPUNK)? Or it could be some installation issues (might try out the Docker image)?

Are you using reads or an assembly as input?

@sydelstan
Copy link

sydelstan commented May 31, 2024

Using assemblies and not reads fixed the issue, thanks

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants