Commands and Auxiliary Programs
The core elements of the Hybran pipeline can be run separately, although these standalone commands are still being added incrementally. Currently available:
onegene: Unify names of gene duplicates.
onegene is used to process the reference annotations to avoid inconsistent matching to different instances of (near-)identical genes.
It solves the problem by assigning a single generic name to all instances, while storing the original name in each feature's
The unified generic name propagates to the input samples' final annotations, but if this is not desired, you can use
hybran standardize on the output annotations to restore the original names.
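The renaming idea described above can be sketched in a few lines of Python. This is only an illustration of the concept, not Hybran's implementation: the in-memory feature dictionaries, the `unify_duplicate_names` function, the `GENERIC` name prefix, and the `original_name` key are all hypothetical stand-ins chosen for this example, and the duplicate groups are assumed to have been determined already.

```python
def unify_duplicate_names(features, duplicate_groups, prefix="GENERIC"):
    """Give every member of a duplicate group one shared generic name.

    features: list of dicts with a "gene" key (hypothetical in-memory model).
    duplicate_groups: list of sets of gene names judged (near-)identical.
    prefix: made-up placeholder; the real generic name format may differ.
    """
    # Map each original gene name to the generic name of its group.
    generic_for = {}
    for index, group in enumerate(duplicate_groups, start=1):
        generic = f"{prefix}{index:04d}"
        for name in group:
            generic_for[name] = generic

    unified = []
    for feature in features:
        updated = dict(feature)  # copy, so the input is left untouched
        original = feature["gene"]
        if original in generic_for:
            updated["gene"] = generic_for[original]
            # Keep the original name so it can be restored later.
            # "original_name" is a hypothetical key used only in this sketch.
            updated["original_name"] = original
        unified.append(updated)
    return unified


reference = [{"gene": "geneA"}, {"gene": "geneB"}, {"gene": "geneC"}]
unified = unify_duplicate_names(reference, [{"geneA", "geneB"}])
```

In this toy input, `geneA` and `geneB` end up sharing one generic name while each keeps a record of its original name, and `geneC` is left unchanged.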
Hybran also comes with associated functionality that can be run independently. These tools are either not part of the pipeline at all or not pipeline components in their own right. They are:
hybran standardize: Remove generic gene names.
hybran leaves no gene without a name and only assigns names from genes in your provided references.
Genes that only had ab initio-predicted gene names have them relegated to the
gene_synonym field to avoid propagation during the hybran pipeline.
The result is that if no reference gene name could be assigned, or if a gene is duplicated in the reference genome, a generic name is generated and assigned.
Generic gene names allow easy matching of hybran-identified homologs within and between your samples.
For the purposes of uploading your annotation to a public database, however, these names should not be kept.
hybran standardize removes all of these generic names and replaces each one, wherever possible, with either a reference name (for reference duplicates that had been grouped under a generic name) or the ab initio-predicted name.
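The replacement order described above (reference name first, then the ab initio-predicted name, otherwise no name) can be sketched as follows. This is a conceptual illustration under stated assumptions, not the code behind hybran standardize: the `restore_names` function, the `is_generic` predicate, the `GENERIC` prefix, and the `original_name` key are hypothetical, and the ab initio name is assumed to be the first entry of a `gene_synonym` list.

```python
def restore_names(features, is_generic):
    """Replace generic gene names with the best available real name.

    features: list of dicts modelling annotated genes (hypothetical model).
    is_generic: callable deciding whether a name is a generic placeholder.
    """
    restored = []
    for feature in features:
        updated = dict(feature)  # copy, so the input is left untouched
        if is_generic(feature.get("gene", "")):
            if feature.get("original_name"):
                # Reference duplicate: bring back its reference name.
                updated["gene"] = feature["original_name"]
            elif feature.get("gene_synonym"):
                # Otherwise fall back to the ab initio-predicted name,
                # assumed here to be the first gene_synonym entry.
                updated["gene"] = feature["gene_synonym"][0]
            else:
                # No real name is available: drop the generic one.
                updated.pop("gene", None)
        restored.append(updated)
    return restored


annotations = [
    {"gene": "GENERIC0001", "original_name": "geneA"},
    {"gene": "GENERIC0002", "gene_synonym": ["abiX"]},
    {"gene": "GENERIC0003"},
    {"gene": "geneC"},
]
restored = restore_names(annotations, lambda name: name.startswith("GENERIC"))
```

Whether a promoted ab initio name is also removed from the synonym list is not specified in the text above, so this sketch leaves that field as it was.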