Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Proteomes from transcriptomic data #5

Open
qhelleu opened this issue Jan 13, 2025 · 1 comment
Open

Proteomes from transcriptomic data #5

qhelleu opened this issue Jan 13, 2025 · 1 comment

Comments

@qhelleu
Copy link

qhelleu commented Jan 13, 2025

Hi,
Thanks for the very interesting soft/pipeline. I had a question regarding the input dataset. How does the program handles the presence of multiple isoforms (typical from transcriptomes) in the input proteome?
Thanks a lot.

@pskvins
Copy link
Collaborator

pskvins commented Jan 16, 2025

Hi,

Currently, we do not handle anything with isoforms,
so if multiple isoforms with similar sequences and lengths exist in the input proteome, unicore will consider those as multiple copies of genes instead of a single copy.
If the isoforms have big differences in sequence or length, unicore will capture those isoforms as different types of protein with a high chance, which will not be a problem when using unicore.

Our current recommendation on this problem is to select one isoform from multiple isoforms and make that the input proteome.

Please let us know if you have any further questions regarding it :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants