Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data model - database #18

Open
skrakau opened this issue Dec 24, 2020 · 5 comments
Open

Data model - database #18

skrakau opened this issue Dec 24, 2020 · 5 comments
Assignees
Milestone

Comments

@skrakau
Copy link
Collaborator

skrakau commented Dec 24, 2020

No description provided.

@skrakau skrakau added the enhancement New feature or request label Dec 24, 2020
@skrakau skrakau changed the title Data structure Data model - database Dec 24, 2020
@skrakau skrakau added this to the 1.0.0 milestone Dec 24, 2020
@skrakau skrakau added core functionality and removed enhancement New feature or request labels Dec 24, 2020
@lkuchenb
Copy link
Collaborator

lkuchenb commented Dec 24, 2020

Current draft proposal

metapep

@skrakau skrakau pinned this issue Dec 24, 2020
@skrakau
Copy link
Collaborator Author

skrakau commented Jan 6, 2021

prediction_id is not needed ...

@skrakau
Copy link
Collaborator Author

skrakau commented Jan 8, 2021

Regarding the size of the peptide tables and memory, for my current datasets:

# proteins:
1,855,616

# non-unique peptides (9mers) across proteins (multiple occurrences within one protein not counted!):
552,451,599

# unique peptides:
392,722,935

Peak memory for generate_peptides:
peak_vmem=176,900,708

@lkuchenb
Copy link
Collaborator

lkuchenb commented Jan 17, 2021

New model containing entities as an additional link between microbiomes and proteins, modelling the linking entity aka taxa, MAGs/bins or assembly contigs

metapep

New color coding:

Orange -> provided or pre-computed entities
Gray -> associations
Purple -> Pipeline output

@skrakau
Copy link
Collaborator Author

skrakau commented Jan 18, 2021

protein_orig_id ist missing

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants