Welcome to PanBGC-DB

A comprehensive analysis pipeline for Gene cluster Families

PanBGC-DB is a user-friendly web tool to explore biosynthetic gene clusters (BGCs) diversity within Gene Cluster Families (GCFs). It allows researchers to investigate the diversity, conservation, and structure of BGCs in specific GCFs. Users can access and analyze GCFs containing BGCs from the antiSMASH and MIBiG database, generate visualizations of custom GCFs, and connect newly discovered BGCs to related families present in the database.

This platform adapts established analysis of pangenomes on a Biosynthetic gene cluster level.

Database

This database comprises Gene Cluster Families derived from biosynthetic gene clusters (BGCs) of the antiSMASH-DB. To build the GCFs the combnination of BiG-SLICE and BiG-SCAPE was used.

It features precomputed analyses of individual GCFs, including core and pan-BGC composition, GCF openness, phylogenetic relationships, and structural diversity of BGCs.

Explore Database

Query

Upload your own GenBank files and analyze them against our database. Our query tool uses cblaster to link your BGC to GCFs present in our database.

Simply upload your .gbk file, and our pipeline will process your data, showing a list of candidate GCFs and a presence / absence map to get a visual overview of which family is the closest.

Start Querying

Visualization

Generate a intuitive visual representations of your own defined gene cluster family. Our easy to use visualization pipeline allows you to create the visualisation of one or multiple GCFs by using a python script pipeline.

The scripts generate diagrams, heatmaps, phylogenetic trees as well as structural comparisons of the BGCs to better understand the natural occuring diversity of this family.

View Visualizations