Software | Genome Sciences Centre

The rapid evolution of DNA sequencing technologies over the past 20 years has made it possible to generate enormous amounts of data, which in turn has spurred the development of the computational tools needed to assemble complete genomes and to analyze genomic, transcriptomic and proteomic data. The GSC collaborates with and supports the wider research community. Our extensive collection of software packages developed in-house is available for download here and through GitHub.

https://github.com/bcgsc/

ABySS

Assembly By Short Sequences - a de novo, parallel, paired-end sequence assembler

ABySS-Explorer

A sequence assembly visualization tool

Adapter Trimming for Small RNA Sequencing

Removes the 3' adapter from Illumina small RNA sequencing reads when the read length exceeds the length of the RNA
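
As a rough illustration of the idea (not the GSC tool's actual implementation), 3' adapter removal can be sketched in Python as follows. The function name, the `min_overlap` parameter, and the adapter and read sequences used below are assumptions chosen for this example only.

```python
def trim_3prime_adapter(read: str, adapter: str, min_overlap: int = 5) -> str:
    """Return the read with a full or partial 3' adapter removed.

    Hypothetical helper for illustration; exact matching only, no mismatches.
    """
    # Case 1: the whole adapter occurs in the read, so cut at its start.
    pos = read.find(adapter)
    if pos != -1:
        return read[:pos]
    # Case 2: the read ends partway through the adapter, so only a prefix of
    # the adapter is present at the 3' end. Try the longest overlap first.
    longest = min(len(adapter) - 1, len(read))
    for k in range(longest, min_overlap - 1, -1):
        if read.endswith(adapter[:k]):
            return read[:-k]
    # No adapter detected: return the read unchanged.
    return read


# Example sequences assumed for illustration.
ADAPTER = "TGGAATTCTCGGGTGCCAAGG"
SMALL_RNA = "TAGCTTATCAGACTGATGTTGA"

if __name__ == "__main__":
    read = SMALL_RNA + ADAPTER[:14]  # read runs 14 bases into the adapter
    print(trim_3prime_adapter(read, ADAPTER))
```

A production trimmer would typically also tolerate sequencing errors in the adapter; this sketch deliberately leaves that out.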

ALEA

ALEA is a computational toolbox for allele-specific (AS) epigenomics analysis

AMPlify

Attentive deep learning model for antimicrobial peptide prediction

Anchor

Post-processing tools for de novo assemblies

ARCS/ARKS

Genome assembly scaffolder with linked and long reads

Barnacle

A pipeline for detecting and characterizing chimeric transcripts from long RNA sequences

BioBloomTools

BioBloom Tools (BBT) is a fast, general-purpose sequence categorization tool that uses Bloom filters
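
To give a feel for how a Bloom filter can support sequence categorization, here is a minimal Python sketch. It is not BBT's implementation: the class, the function names, the filter size, and the k-mer length are all assumptions made for illustration. The idea is to store every k-mer of a reference in the filter and then score a read by the fraction of its k-mers that the filter reports as present.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, num_bits: int = 1 << 16, num_hashes: int = 3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8 + 1)

    def _positions(self, item: str):
        # Derive several bit positions by prefixing the item with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item: str) -> None:
        for p in self._positions(item):
            self.bits[p // 8] |= 1 << (p % 8)

    def __contains__(self, item: str) -> bool:
        return all(self.bits[p // 8] & (1 << (p % 8)) for p in self._positions(item))


def kmers(seq: str, k: int):
    """Yield every overlapping k-mer of seq."""
    return (seq[i:i + k] for i in range(len(seq) - k + 1))


def build_filter(reference: str, k: int) -> BloomFilter:
    """Insert all k-mers of a reference sequence into a new filter."""
    bf = BloomFilter()
    for km in kmers(reference, k):
        bf.add(km)
    return bf


def hit_fraction(read: str, bf: BloomFilter, k: int) -> float:
    """Fraction of the read's k-mers reported as present in the filter."""
    ks = list(kmers(read, k))
    if not ks:
        return 0.0
    return sum(km in bf for km in ks) / len(ks)


if __name__ == "__main__":
    reference = "ACGTTGCATGCCGATAGCTAGGCTTACGATCGATCGGCTA"  # made-up sequence
    bf = build_filter(reference, k=8)
    print(hit_fraction(reference[5:30], bf, k=8))
```

A read would then be assigned to the reference when its hit fraction exceeds a chosen threshold; the threshold trades sensitivity against the filter's false-positive rate.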

BLISS

Batch anaLysIS Suite (BLISS)

btllib

A common code library with efficient code and wrappers for many common bioinformatics operations

Chinook

Chinook is a peer-to-peer (P2P) bioinformatics service

ChopStitch

Exon annotation and splice graph reconstruction using transcriptome assembly and whole genome sequencing data

Circos

Visualize comparative genomic data such as alignments, conservation, homology, synteny and other positional n-tuples in an attractive and informative circular layout

CORAL

Contig Ordering Algorithm

DIDA

DIDA is a novel framework that performs large-scale alignment tasks by distributing the indexing and alignment stages into smaller subtasks over a cluster of compute nodes

DiscoverySpace

DiscoverySpace is a graphical software application intended to free the biologist from the micro-level, syntactic detail of the underlying data structures, so that they can concentrate on the "big picture" and the meaning of experimental results

FASSI

Fingerprint and Assembly Incorporation

FindPeaks

FindPeaks was developed to analyze ChIP-Seq experiments

GoldRush

A linear-time de novo long-read assembler

GraphNER

GraphNER is a named entity recognizer that uses graph propagation and improves on the BANNER and BANNER-ChemDNER systems. Data are available for the gene mention detection task

HLAminer

Derivation of HLA class I and II predictions from shotgun sequence data sets

Internet Contig Explorer (iCE)

iCE is used for viewing fingerprint maps and associated data

JAGuaR

Junction Alignments to Genome for RNA-seq Reads

KLEAT

c(K)LEavage site Analysis of Transcriptomes (KLEAT) identifies 3' UTR ends of transcripts in de novo RNA-Seq assemblies

Kollector

Targeted de novo assembler

Konnector

Connecting Paired-end Reads Using a Bloom Filter de Bruijn Graph

LaneRuler

LaneRuler identifies lanes in a gel image. The core module is a command-line C program whose results can be reviewed and corrected using a Java interface

LongStitch

LongStitch is a de novo genome assembly correction and scaffolding pipeline. LongStitch runs in up to three stages: an initial assembly correction (Tigmint-long), followed by two incremental scaffolding stages (ntLink and ARKS-long).

MAVIS

A Python command-line tool for the post-processing of structural variant calls

MiRNA Profiling

Profile the content of a miRNA sequencing run

MSSS

Sampling with Minimum Sum of Squared Similarities for Nystrom-Based Large Scale Spectral Clustering

NanoSim

Nanopore sequence read simulator based on statistical characterization

ntCard

ntCard: a streaming algorithm for cardinality estimation in genomics data

ntEdit

Scalable genome sequence polishing

ntHash

ntHash: recursive nucleotide hashing

ntJoin

Fast and lightweight assembly-guided scaffolding using minimizer graphs

ntLink

ntLink is a lightweight de novo genome assembly scaffolder using long reads and minimizers.

ntRoot

ntRoot is an alignment-free, computationally lightweight method for inferring human super-population-level global and local ancestry from whole genome assemblies or raw sequencing data types.

ntSynt

ntSynt detects multi-genome synteny blocks using minimizer graph mappings.

ORegAnno

Open Regulatory Annotation

PASsiT

Post Alignment SNV Tools

PAVFinder

Post-Assembly Variant Finder (PAVFinder) - Structural variant caller on sequence assembly

PAVFinder_transcriptome

Structural and splice variant detection from transcriptome assembly

Physlr

Constructing a physical map from linked reads. The physical map can then be used to scaffold draft genome assemblies

Raw Quant

A Python package for extracting scan metadata and quantification values from Thermo .raw files

RNA-Bloom

Reference-free transcriptome assembly for short and long reads

SAM - Sequence Assembly Manager

SAM is a Whole Genome Assembly (WGA) Management and Visualization Tool. It provides a generic platform for manipulating, analyzing and viewing WGA data, regardless of input type

Satellog

A database for the identification and prioritization of satellite repeats in disease association studies

Sealer

Scalable gap-filler for finishing large genomes

Slider

Maximum use of probability information for alignment of short sequence reads and SNP detection

SliderII

High-quality SNP calling using Illumina data at minimal coverage

SNVMix

Detecting single nucleotide variants from next-generation sequencing data

Sockeye

Genome visualization application

Spark

NOTE: This software is now being distributed via www.sparkinsight.org - please see that site for the latest release

SSAKE

De novo genome assembly with short DNA sequence reads

Straglr

Short tandem repeat genotyping using long reads

TASR

Targeted Assembler of Short Sequence Reads

THOR

THOR, the Targeted High-throughput Ortholog Reconstructor, is a Java application designed to assemble target genomic sequence orthologs in low-coverage genomes

Tigmint

Correct misassemblies in genome assembly drafts using linked or long DNA sequencing reads

Trans-ABySS

De novo assembly of RNA-Seq data using ABySS

TreeBuilder3D

TreeBuilder3D is an interactive viewer that allows the organization of SAGE and other types of gene expression data into hierarchical dendrograms, or phenetic networks