GALLO: Genomic Annotation in Livestock for positional candidate LOci

The Genomic Annotation in Livestock for positional candidate LOci (GALLO) is an R package designed to provide a straightforward environment for gene and QTL annotation, as well as data integration from multiple data sources. The QTL enrichment analyses can additionally be performed directly by GALLO using the output obtained from the QTL annotation step. In addition, GALLO also provide a set of functions for graphical visualization for the annotation, comparison, integration and QTL enrichment results. Consequently, GALLO is a useful package for the annotation, identification of hidden pattern across datasets, data mining of previous reported associations, as well as the efficient scrutinization of the genetic architecture of complex traits.

Requirements

Depends: R (>= 4.0.0)

Imports: circlize, DT, data.table, doParallel, dplyr, dynamicTreeCut, ggplot2, graphics, grDevices, foreach, lattice , parallel, RColorBrewer, rtracklayer, stats, stringr, unbalhaar, utils, igraph, visNetwork

License: GPL-3

Usage

.gtf files for gene annotation can de found, for example, in ensembl FTP website: https://www.ensembl.org/info/data/ftp/index.html

.gtf files for QTL annotation can be found in Animal QTLdb: https://www.animalgenome.org/QTLdb/

To install the package, the following command line can be use in R:

install.packages("GALLO")

Functions description

  1. import_gff_gtf(): Takes a .gft or .gff file and import into a dataframe

  2. find_genes_qtls_around_markers: Takes a dataframe with candidate markers and/or regions (haplotypes, windows, CNVs, etc) and search for genes or QTLs in a specified interval

  3. overlapping_among_groups: Takes a dataframe with a column for genes, QTLs (or any other data) and a grouping column and create matrices with the ovelapping information

  4. plot_overlapping: Takes the output from overlapping_amoung_groups function and creates a heatmap with the overlapping between groups

  5. plot_qtl_info: Takes the output from find_genes_qtls_around_markers and create plots for the frequency of each QTL type and trait

  6. qtl_enrich: Takes the output from find_genes_qtls_around_markers and perform a QTL enrichment analysis

  7. QTLenrich_plot: Takes the output from _find_genes_qtls_around_markers function and creates a heatmap with the overlapping between groups

  8. relationship_plot: Takes the output from find_genes_qtls_around_markers function and creates a chord plot with the relationship between groups

  9. gene_pval: Estimate a gene-level p-value using Weighted Z-score approach and Meta-analysis with LD correlation coefficients approach

  10. NetCen: Compute the centrality metrics for the nodes composing the network generated by the NetVis function

  11. NetVis: Create a dynamic network representing the relationship between two groups of variables

  12. Nmarkers_SimpleM: Estimate the number of effective markers in a chromosome based on an adapted version of the simpleM methodology

  13. Nseg_chr: Estimate the number of independent segments in a chromosome based on the effective population size

  14. PleioChiTest: Compute a multi-trait test statistic for pleiotropic effects using summary statistics from association tests

A tutorial for GALLO usage can be found at:

https://rpubs.com/pablo_bio/GALLO_vignette

Contact

For more information, suggestions, discussions, and bug reports, contact pfonseca@uoguelph.ca