Difference between revisions of "Applications/Cegma"

From HPC
Jump to: navigation , search
(Created page with "__TOC__ ==Application Details== *Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virt...")
 
m
 
(5 intermediate revisions by the same user not shown)
Line 2: Line 2:
  
 
==Application Details==
 
==Application Details==
*Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually
+
*Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome.  
any eukaryotic genome.  
 
 
*Version: 2.5.0
 
*Version: 2.5.0
 
*Module: cegma/2.5.0
 
*Module: cegma/2.5.0
*Licence: GNU
+
*Licence: Open source, free to academic research
  
 
==Usage Examples==
 
==Usage Examples==
Line 25: Line 24:
  
 
===Module===
 
===Module===
 +
 +
 
<pre style="background-color: #000000; color: white; border: 2px solid black; font-family: monospace, sans-serif;">
 
<pre style="background-color: #000000; color: white; border: 2px solid black; font-family: monospace, sans-serif;">
 
[username@login01 ~]$ module add cegma/2.5.0
 
[username@login01 ~]$ module add cegma/2.5.0
Line 30: Line 31:
 
</pre>
 
</pre>
  
===Compilation===
 
  
''boost'' is a library for use either with a compiled binary that requires it, or it could be required at compilation time also.
 
  
A typical code snippet requiring is shown below:
 
  
<pre style="background-color: #f5f5dc; color: black; font-family: monospace, sans-serif;">
+
==Further Information==
 
 
 
 
 
 
</pre>
 
  
 +
* [http://korflab.ucdavis.edu/datasets/cegma/ http://korflab.ucdavis.edu/datasets/cegma/]
  
 +
==Navigation==
  
==Further Information==
 
 
* [https://support.hdfgroup.org/HDF5/ https://support.hdfgroup.org/HDF5/]
 
 
{|
 
|style="width:5%; border-width: 0" | [[File:icon_home.png]]
 
|style="width:95%; border-width: 0" |
 
 
* [[Main_Page|Home]]
 
* [[Main_Page|Home]]
* [[Applications|Application support]]
+
* [[Applications|Application support]] *
 
* [[General|General]]
 
* [[General|General]]
* [[Training|Training]]
 
 
* [[Programming|Programming support]]
 
* [[Programming|Programming support]]
|-
 
|}
 

Latest revision as of 11:16, 24 May 2019

Application Details

  • Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome.
  • Version: 2.5.0
  • Module: cegma/2.5.0
  • Licence: Open source, free to academic research

Usage Examples

CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a set of high reliable set of gene annotations in virtually any eukaryotic genome. The strategy relies on a simple fact: some highly conserved proteins are encoded in essentially all eukaryotic genomes. We use the KOGs database to build a set of these highly conserved ubiquitous proteins. We define a set of 458 core proteins, and the protocol, CEGMA, to find orthologs of the core proteins in new genomes and to determine their exon-intron structures.

It has the following commands:

  • cegma
  • completeness
  • geneid-train
  • genome_map
  • hmm_select
  • local_map
  • make_paramfile parsewise

and is invoke by loading the cegma module within Viper.

Module

[username@login01 ~]$ module add cegma/2.5.0
[username@login01 ~]$ cegma --genome sample.dna --protein sample.prot -o sample



Further Information

Navigation