Difference between revisions of "Applications/Cegma"
From HPC
(Created page with "__TOC__ ==Application Details== *Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virt...") |
m |
||
(5 intermediate revisions by the same user not shown) | |||
Line 2: | Line 2: | ||
==Application Details== | ==Application Details== | ||
− | *Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually | + | *Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome. |
− | any eukaryotic genome. | ||
*Version: 2.5.0 | *Version: 2.5.0 | ||
*Module: cegma/2.5.0 | *Module: cegma/2.5.0 | ||
− | *Licence: | + | *Licence: Open source, free to academic research |
==Usage Examples== | ==Usage Examples== | ||
Line 25: | Line 24: | ||
===Module=== | ===Module=== | ||
+ | |||
+ | |||
<pre style="background-color: #000000; color: white; border: 2px solid black; font-family: monospace, sans-serif;"> | <pre style="background-color: #000000; color: white; border: 2px solid black; font-family: monospace, sans-serif;"> | ||
[username@login01 ~]$ module add cegma/2.5.0 | [username@login01 ~]$ module add cegma/2.5.0 | ||
Line 30: | Line 31: | ||
</pre> | </pre> | ||
− | |||
− | |||
− | |||
− | + | ==Further Information== | |
− | |||
− | |||
− | |||
− | |||
+ | * [http://korflab.ucdavis.edu/datasets/cegma/ http://korflab.ucdavis.edu/datasets/cegma/] | ||
+ | ==Navigation== | ||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
− | |||
* [[Main_Page|Home]] | * [[Main_Page|Home]] | ||
− | * [[Applications|Application support]] | + | * [[Applications|Application support]] * |
* [[General|General]] | * [[General|General]] | ||
− | |||
* [[Programming|Programming support]] | * [[Programming|Programming support]] | ||
− | |||
− |
Latest revision as of 11:16, 24 May 2019
Application Details
- Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome.
- Version: 2.5.0
- Module: cegma/2.5.0
- Licence: Open source, free to academic research
Usage Examples
CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome. The strategy relies on a simple fact: some highly conserved proteins are encoded in essentially all eukaryotic genomes. We use the KOGs database to build a set of these highly conserved ubiquitous proteins. We define a set of 458 core proteins, and the protocol, CEGMA, to find orthologs of the core proteins in new genomes and to determine their exon-intron structures.
It has the following commands:
- cegma
- completeness
- geneid-train
- genome_map
- hmm_select
- local_map
- make_paramfile parsewise
and is invoke by loading the cegma module within Viper.
Module
[username@login01 ~]$ module add cegma/2.5.0 [username@login01 ~]$ cegma --genome sample.dna --protein sample.prot -o sample