Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can I use this program to assemble the fungal mitochondrial genome? #25

Closed
OmonkeyGOD opened this issue Oct 9, 2019 · 12 comments
Closed

Comments

@OmonkeyGOD
Copy link

Hi,
Can I use this program to assemble the fungal mitochondrial genome?

@linzhi2013
Copy link
Owner

Dear OmonkeyGOD,

I have not tried MitoZ with fungal data.

Two things I can confirm,

  1. the assemble command can do assembly, generating the nucl + mt assembly.

  2. but it may not be able to find out the fungal mitochondrial sequences, since MitoZ does not include HMM profiles of fungi.

However, you can try the assemble command, and find the nucl + mt assembly file in the tmp/XXX.assembly directory, and filename looks like *.ScafSeq.

Then you can blast this *.ScafSeq file to some true fungi mitochondrial genomes (e.g download from Genbank by yourself).

Hope this helps!

@OmonkeyGOD
Copy link
Author

Thank you very much. I will try this assembler.

@OmonkeyGOD
Copy link
Author

Hi,
When it comes annotation...
If I am only interested in one or two species of fungal, and there are only two or three GenBank files of this species in NCBI, do I need to add more related species' GenBank files to build the database for annotation? If so, does that mean the bigger the database the better of the results?
On the other hand, If there is no complete Genbank file for that species, but some fragments of the mitogenome could be found in NCBI, can anyone use these fragments to build the annotation database?
Thank you very much!

@linzhi2013
Copy link
Owner

linzhi2013 commented Oct 23, 2019

Hi,
When it comes annotation...
If I am only interested in one or two species of fungal, and there are only two or three GenBank files of this species in NCBI, do I need to add more related species' GenBank files to build the database for annotation? If so, does that mean the bigger the database the better of the results?
On the other hand, If there is no complete Genbank file for that species, but some fragments of the mitogenome could be found in NCBI, can anyone use these fragments to build the annotation database?
Thank you very much!

Hi OmenkyGOD,

During building the database for annotation, we deal with each gene one by one. Thus you do not have to have a complete Genbank file for that species.

But building the database for annotation by yourself can be a little overhead, and this is due that I have not a chance to document the procedures.

Therefore, I would suggest you use http://mitos.bioinf.uni-leipzig.de/index.py for annotating your fungi species currently.

Cheers

@OmonkeyGOD
Copy link
Author

Hi,
Thank you very much for your quick reply.
I just tried one of my assemblies with MITOS Web Server and I find the result is not very good. Because it only has genetic code 04 related to fungi, and I believe it is quite different from the species I am studying. For example, it did not find gene atp8. But when I blast the atp8 sequence with my contig, it shows a 100% match. Thus, I still want to build a very small database with only several species information related to my study. If so, it will be easier and convenient for my future studies.
I found your advice for annotating other species in question #8 changing MitoZ's annotating database to annotate other types of mitogenome, and I think I will try it first.

Thanks.

@Prunoideae
Copy link

Dear linzhi2013,
In the version v2.4-alpha, it seems that there are already *.fa files other than Chordata and Arthropoda inside, but it seems that the --clade option only accepts the Chordata and Arthropoda, as the MitoZ.py has choices=["Chordata", "Arthropoda"] to limit it.
Is that a miss on script programming, or is the *.fa files of other phylogenic classes are invalid or 'untuned' for annotation?

@linzhi2013
Copy link
Owner

linzhi2013 commented Oct 31, 2019

Dear linzhi2013,
In the version v2.4-alpha, it seems that there are already *.fa files other than Chordata and Arthropoda inside, but it seems that the --clade option only accepts the Chordata and Arthropoda, as the MitoZ.py has choices=["Chordata", "Arthropoda"] to limit it.
Is that a miss on script programming, or is the *.fa files of other phylogenic classes are invalid or 'untuned' for annotation?

I am not quite sure what kind of clade that your interested species belong to, but I have actually built some HMM profiles for other clades in the latest version of MitoZ. However, since we still don’t have enough time and material to test their efficiency, I intendedly make them not available from the command lines currently.

If your interested species happens to be in the clades as below, the easiest way to work around is:
(1) Find the main script “MitoZ.py”
(2) Then edit the “search_and_annot_mito_parser.add_argument” to be below (e.g. with a text editor like vi command, or Sublime Text, or notepad++, but never use Windows’s text editor):

 
search_and_annot_mito_parser.add_argument("--clade", default="Arthropoda",
        choices=["Chordata", "Arthropoda", "Echinodermata", "Annelida-segmented-worms", "Bryozoa", "Mollusca", "Nematoda", "Nemertea-ribbon-worms", "Porifera-sponges"],
        help="which clade does your species belong to? [%(default)s]")

@OmonkeyGOD
Copy link
Author

Hi,

Sorry to bother you again. I feel frustrated when I try to annotate my sample with new annotation files and I really need your help.
I have just generated the .hmm profile with several isolates' sequences of my interested species and .fasta file of proteins sequences. Then I put the profiles in the profiles subdirectories where they should be and
• replace the Mitoz.py script with
search_and_annot_mito_parser.add_argument("--clade", default="Cryptococcus", choices=["Chordata", "Arthropoda","Porifera-sponges", "Cryptococcus"], # "Echinodermata", "Annelida-segmented-worms", "Bryozoa", "Mollusca", "Nematoda", "Nemertea-ribbon-worms", "Porifera-sponges"], help="which clade does your species belong to? [%(default)s]")
• change the product_dict of the script cds_ft_v2.py.
• genbank_gene_stat_v2.py
But I still get this error :
Traceback (most recent call last):
File "/home/yue/release_MitoZ_v2.4-alpha/bin/annotate/prepare_CDSposition.py", line 47, in
fh_in = open(in_f, 'r')
FileNotFoundError: [Errno 2] No such file or directory: 'ERR1810384_mitoscaf.fa.solar.genewise.gff.cds.position.cds'.
Please give me some advice.
Thank you very much!

@OmonkeyGOD
Copy link
Author

Hi,

I just solved the problem by resetting the environment.

@Prunoideae
Copy link

I am not quite sure what kind of clade that your interested species belong to, but I have actually built some HMM profiles for other clades in the latest version of MitoZ. However, since we still don’t have enough time and material to test their efficiency, I intendedly make them not available from the command lines currently.

Dear linzhi2013,
Thank you for your quick reply!
I'm using MitoZ to assemble and annotate the mitogenome of Platyhelminthes, which already has a .fa file in the directory, but it seems that the .hmm file is missing.

@linzhi2013
Copy link
Owner

I am not quite sure what kind of clade that your interested species belong to, but I have actually built some HMM profiles for other clades in the latest version of MitoZ. However, since we still don’t have enough time and material to test their efficiency, I intendedly make them not available from the command lines currently.

Dear linzhi2013,
Thank you for your quick reply!
I'm using MitoZ to assemble and annotate the mitogenome of Platyhelminthes, which already has a .fa file in the directory, but it seems that the .hmm file is missing.

Do not worry about the HMM. you may just copy the Arthropoda or chordate's hmm file then rename it to
the clade (--clade option) which your species belong to.

HMM files are just to search for PCGs, and I think HMM model has enough power to handle distant species.

@linzhi2013
Copy link
Owner

Hi,

I just solved the problem by resetting the environment.

That's great!

cgjosephlee added a commit to cgjosephlee/MitoZ_codebase that referenced this issue Dec 23, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants