Bacteria and archaea have developed a wide variety of CRISPR-Cas systems to protect themselves from harmful mobile genetic elements (MGEs), such as phages and plasmids. In response, MGEs evolved a series of potent inhibitors, known as anti-CRISPRs (Acrs), to counteract host CRISPR-Cas defence systems. Except for their universally short sequences, Acrs have little in common with each other. With very low sequence and structural similarity, at least 50 distinct Acr families have been identified across both bacterial and archaeal domains of life where they each use different molecular mechanisms to inhibit CRISPR-Cas systems. Outside the confined environment of a microbial cell, Acrs have inspired a number of downstream applications, from gene editing technologies and protein engineering to phage therapy, applications that are only limited by the relatively small number of known anti-CRISPR systems compared to the thousands hidden in sequenced genomes.
We therefore aim to design and implement an all-in-one, user-friendly solution to better assist biologists with extracting the data they need to help design experiments and formulate interesting hypotheses. We cataloged a comprehensive list of experimentally validated and predicted Acrs, each annotated with their relationships to known Acrs. We further developed a novel machine learning based anti-CRISPR predictor and additionally integrated existing state-of-the-art predictors to allow wet experimentalists to predict homologous and novel Acrs of their interests. To facilitate relationship analysis between known and potential Acrs, we developed three analytical modules: a BLAST-based similarity analysis, a multiple sequence alignment based phylogenetic analysis, and a homology-based network analysis. These tools can either work independently or within the platform pipeline to facilitate downstream relationship analysis of a newly predicted Acr and thereby shorten the gap between prediction, functional characterisation, and eventual experimental validation.