Pathogens
Select a pathogen to browse pre-computed CRISPR targets or load its FM-index for live search
| # | Sequence (23-mer) | Gene | Position | PAM | Coverage | Score | Actions |
|---|
Select a pathogen above to view its CRISPR targets
Select a pathogen, load its FM-index, then search any DNA sequence for exact matches across all genomes
PubMed Literature Scanner
Search NCBI PubMed for published research on any CRISPR target sequence. Identify which targets are novel (no prior publications) and which have existing literature — helping prioritize targets for new papers.
Enter a sequence or gene name to search published CRISPR literature, or use "Quick scan" to check all targets for a selected pathogen
Disease & Epidemiological Context
Biomedical ontology data linking each pathogen to standardized disease classifications, WHO surveillance context, gene annotations, and diagnostic landscape. All data is bundled — works fully offline.
Select a pathogen to view disease context and ontology data
NCBI RefSeq Viral
Complete reference sequences for all known viral genomes. The primary source for our viral pathogen indexes.
- Dataset
- RefSeq Viral Complete Genomes
- Accessed
- March 2026
- Sequences
- 703,000+
- License
- Public domain (US Government work)
- URL
- ncbi.nlm.nih.gov/datasets
NCBI RefSeq Bacterial
Reference genomes for bacterial pathogens including M. tuberculosis and V. cholerae.
- Dataset
- RefSeq Bacterial Genomes (selected pathogens)
- Accessed
- March 2026
- Pathogens
- Cholera, Tuberculosis
- License
- Public domain (US Government work)
- URL
- ncbi.nlm.nih.gov/datasets
Human Reference Genome
GRCh38.p14 (Genome Reference Consortium Human Build 38) used for off-target analysis — ensuring CRISPR guides don't match human sequences.
- Assembly
- GRCh38.p14 (GCF_000001405.40)
- Accessed
- March 2026
- Size
- ~3.1 Gbp
- License
- Public domain
- URL
- GRCh38.p14 at NCBI
LOOM FM-Index Engine
The search engine powering this tool. A Burrows-Wheeler Transform (BWT) based FM-index compiled to 195 KB of WebAssembly, enabling sub-millisecond exact-match search in the browser.
- Library
brenda(Rust crate)- Binary
- 195 KB WASM
- Method
- FM-index with suffix array sampling
- License
- Open source
PubMed / NCBI E-utilities
Literature searches use NCBI's public E-utilities API to query PubMed for published CRISPR research related to target sequences.
- API
- NCBI E-utilities (esearch + esummary)
- Rate limit
- 3 requests/sec (unauthenticated)
- Data
- PubMed article metadata
- License
- Public access (NLM Terms of Service)
- URL
- E-utilities documentation
NCBI Taxonomy
Standardized taxonomic classification for every pathogen — species name, lineage, genome type, and transmission mode. Powers the Disease Context taxonomy chips.
- Dataset
- NCBI Taxonomy Database
- Accessed
- March 2026
- Pathogens
- 12 (all indexed species)
- License
- Public domain (US Government work)
- URL
- ncbi.nlm.nih.gov/taxonomy
Disease Ontology
Standardized disease definitions, synonyms, and cross-references for each pathogen's primary disease. Provides the "What is it?" descriptions and alternative names in the Disease Context tab.
- Dataset
- Disease Ontology (DO)
- Accessed
- March 2026
- Terms
- 12 disease terms (DOID mapped)
- License
- CC0 1.0 (Public Domain)
- URL
- disease-ontology.org
MONDO Disease Ontology
Cross-ontology disease mappings linking Disease Ontology, OMIM, Orphanet, and other vocabularies. Provides additional cross-references for each pathogen's disease.
- Dataset
- Monarch Disease Ontology (MONDO)
- Accessed
- March 2026
- Terms
- 12 disease terms (MONDO mapped)
- License
- CC BY 4.0
- URL
- mondo.monarchinitiative.org
WHO Disease Surveillance
Epidemiological context from the World Health Organization — case fatality rates, annual case/death estimates, geographic spread, diagnostic landscape, and CRISPR diagnostic status.
- Dataset
- WHO Disease Outbreak News & fact sheets
- Accessed
- March 2026
- Data
- Epi stats for 12 pathogens
- License
- CC BY-NC-SA 3.0 IGO
- URL
- who.int/disease-outbreak-news
NCBI Gene / Datasets V2
Complete gene annotations for each pathogen's reference genome — gene symbols, names, positions, and types. Powers the Gene Map section with 8,283 annotated genes across all 12 pathogens.
- API
- NCBI Datasets V2 (annotation_report)
- Accessed
- March 2026
- Genes
- 8,283 across 12 pathogens
- License
- Public domain (US Government work)
- URL
- NCBI Datasets V2 API
Methods (Copy for Your Paper)
Copy this methods paragraph into your manuscript's Materials & Methods section:
CRISPR diagnostic target candidates were identified using LOOM CRISPR Search
(https://calm-mushroom-0185d800f.4.azurestaticapps.net), a BWT/FM-index based pangenomic scanning tool.
For each pathogen, all available genome assemblies were downloaded from NCBI
RefSeq (accessed March 2026) and concatenated into a single corpus. A 23-mer
sliding window (20 bp guide + 3 bp PAM context) was applied to extract all
candidate target sequences. PAM classification identified NGG (SpCas9) and
TTTN (Cas12a/Cpf1) compatible sites. Targets were ranked by genome conservation
(occurrence count across all assemblies). Guide quality scores were computed
based on GC content (optimal 40-70%), seed region GC (last 12 nt), poly-T
terminator absence, homopolymer run length, and self-complementarity.
Off-target specificity was assessed by searching each candidate against 7
host reference genomes (human GRCh38, pig, bat, chicken, cow, camel, mouse)
using exact-match FM-index queries. Literature coverage was assessed by
automated PubMed scanning with ontology-enhanced synonym expansion (NCBI
Taxonomy, Disease Ontology, MONDO). Drug-resistance region overlap was
annotated using coordinates from WHO mutation catalogs and Stanford HIVDB.
How To Cite This Tool
If you use LOOM CRISPR Search or data from this tool in your research, please cite:
LOOM CRISPR Search: Open-science CRISPR target discovery tool.
https://calm-mushroom-0185d800f.4.azurestaticapps.net (2026).
Genome data: NCBI RefSeq Viral & Bacterial, GRCh38.p14.
Disease context: Disease Ontology (CC0), MONDO (CC BY 4.0),
WHO Disease Surveillance, NCBI Taxonomy, NCBI Gene.
Search engine: brenda FM-index (195 KB WASM).
CRISPR Glossary
Key terms used throughout this tool
...ATCGATCGATCGATCGATCGAGGTTTGATCGATCGATCGATCGATCG...Multiplexed Diagnostic Panel Designer
Select pathogens to build a syndromic diagnostic panel. The algorithm finds the minimum set of non-cross-reactive NGG guides that uniquely identifies each pathogen.