Applications/Cegma

Application Details

Description: CEGMA (Core Eukaryotic Genes Mapping Approach) is a pipeline for building a highly reliable set of gene annotations in virtually any eukaryotic genome.

Version: 2.5.0
Module: cegma/2.5.0
Licence: GNU

Usage Examples

CEGMA's strategy relies on a simple fact: some highly conserved proteins are encoded in essentially all eukaryotic genomes. It uses the KOGs database to build a set of these highly conserved, ubiquitous proteins. From this it defines a set of 458 core proteins and a protocol to find orthologs of those core proteins in new genomes and to determine their exon-intron structures.

It has the following commands:

cegma
completeness
geneid-train
genome_map
hmm_select
local_map
make_paramfile
parsewise

and is invoked by loading the cegma module on Viper.
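Once the module has been loaded, a quick way to confirm that the commands listed above are available is to look each one up on the PATH. The following is a minimal sketch; the helper name missing_tools is hypothetical and not part of CEGMA.

```shell
#!/bin/bash
# CEGMA commands as listed in this section.
cegma_tools="cegma completeness geneid-train genome_map hmm_select local_map make_paramfile parsewise"

# Hypothetical helper: print the name of every argument that is not
# found on the PATH. Prints nothing when all tools are available.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}

# Word splitting on $cegma_tools is intentional here.
# After 'module add cegma/2.5.0' this is expected to print nothing.
missing_tools $cegma_tools
```

Any name that is printed was not found, which usually means the module has not been loaded in the current shell.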

Module

[username@login01 ~]$ module add cegma/2.5.0
[username@login01 ~]$ cegma --genome sample.dna --protein sample.prot -o sample
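Because a CEGMA run can take some time, it can be worth checking the input files before starting it. The sketch below wraps the command shown above in a small function; run_cegma and the CEGMA environment variable are hypothetical names introduced here for illustration (the variable only allows another program, such as echo, to be substituted for a dry run). Only the options used above (--genome, --protein and -o) are assumed.

```shell
#!/bin/bash
# Hypothetical wrapper (not part of CEGMA): check that both input files
# exist and are non-empty, then run cegma with the options shown above.
run_cegma() {
    genome=$1
    protein=$2
    prefix=$3

    for f in "$genome" "$protein"; do
        if [ ! -s "$f" ]; then
            echo "error: '$f' is missing or empty" >&2
            return 1
        fi
    done

    # CEGMA is an assumed override for dry runs; it defaults to 'cegma'.
    "${CEGMA:-cegma}" --genome "$genome" --protein "$protein" -o "$prefix"
}
```

With the module loaded, run_cegma sample.dna sample.prot sample is intended to behave like the command above, while CEGMA=echo run_cegma sample.dna sample.prot sample only prints the arguments that would be passed.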

Further Information

https://github.com/KorfLab/CEGMA_v2
