InterProScan
InterPro is a database that brings together predictive information on protein function from multiple partner resources. It provides an integrated view of the families, domains and functional sites to which a given protein belongs.
InterProScan is the command-line tool designed to scan protein or nucleotide sequences against the InterPro member-database signatures. Researchers with novel sequences can use InterProScan to annotate their data with family classifications, domain architectures and site predictions.
InterProScan 6 is the new implementation of InterProScan, built using Nextflow, a workflow system for creating scalable, portable, and reproducible pipelines.

Get started
To get a first successful run:
- Install Nextflow 25.10.4 or later.
- Install a supported container runtime such as Docker, SingularityCE, or Apptainer.
- Run the following command:
nextflow run ebi-pf-team/interproscan6 \
-r 6.0.1 \
-profile docker,test \
--datadir data \
--interpro latest
The options in this command are:
- The -r option specifies the version of InterProScan to run. We strongly recommend always specifying a version to ensure consistent and reproducible results.
- The -profile option selects configuration profiles: docker executes tasks in Docker containers, and test uses a small test FASTA file included in the workflow.
- The --datadir option sets the data directory as the location to store InterPro and member database files. The directory is created automatically if it doesn't exist, and the required files are downloaded into it.
- The --interpro latest setting uses the latest available InterPro data release.
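Before launching the command, it can help to confirm that both prerequisites are in place. The following is a minimal sketch, not part of InterProScan: the helper names version_ge, find_runtime and check_prereqs are hypothetical, the version comparison assumes a sort that supports -V (as GNU coreutils does), and the parsing assumes that nextflow -version prints a line containing "version X.Y.Z".

```shell
# Hypothetical helper: succeeds if version $1 is greater than or equal to version $2.
# Assumes `sort -V` (version sort) is available.
version_ge() {
    [ "$(printf '%s\n%s\n' "$1" "$2" | sort -V | head -n 1)" = "$2" ]
}

# Hypothetical helper: print the first container runtime command found on PATH.
# SingularityCE installs `singularity`; Apptainer installs `apptainer`.
find_runtime() {
    for rt in docker singularity apptainer; do
        if command -v "$rt" >/dev/null 2>&1; then
            echo "$rt"
            return 0
        fi
    done
    return 1
}

# Hypothetical helper: check the Nextflow version and the container runtime.
# Assumes `nextflow -version` prints a line such as "version 25.10.4 build ...".
check_prereqs() {
    required="25.10.4"
    installed="$(nextflow -version 2>/dev/null \
        | sed -n 's/.*version \([0-9][0-9.]*\).*/\1/p' | head -n 1)"
    if [ -z "$installed" ]; then
        echo "Nextflow not found or version not recognised" >&2
        return 1
    fi
    if ! version_ge "$installed" "$required"; then
        echo "Nextflow $installed is older than $required" >&2
        return 1
    fi
    if ! runtime="$(find_runtime)"; then
        echo "No container runtime (docker, singularity, apptainer) found" >&2
        return 1
    fi
    echo "Nextflow $installed and $runtime are available"
}
```

After sourcing these functions, calling check_prereqs prints a one-line summary on success and returns a non-zero status with a message on standard error otherwise.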
After the test run, you should see five new files in your current working directory:
- test.faa.gff3
- test.faa.json
- test.faa.jsonl
- test.faa.tsv
- test.faa.xml
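A quick way to confirm that the test run produced all five files is to check for each expected extension. This is a small sketch under stated assumptions: missing_outputs is a hypothetical helper, not an InterProScan command, and it only tests that the files exist, not that their contents are valid.

```shell
# Hypothetical helper: list the expected output files that are missing.
# $1 is the output prefix (test.faa for the test profile).
# $2 is the directory to look in (defaults to the current directory).
missing_outputs() {
    prefix="$1"
    dir="${2:-.}"
    for ext in gff3 json jsonl tsv xml; do
        # Print the name of every expected file that does not exist.
        [ -f "$dir/$prefix.$ext" ] || echo "$prefix.$ext"
    done
}
```

Running missing_outputs test.faa in the working directory prints nothing when all five files are present, and otherwise prints one missing file name per line.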
Once the test succeeds, move on to Running InterProScan and annotate your own sequences.
Support
For further assistance, please create an issue or contact us.
License
InterProScan 6 is released under the Apache 2.0 License.
Citations
If you use InterProScan in published work, please cite:
- Blum M, Andreeva A, Florentino LC, Chuguransky SR, Grego T, Hobbs E, Pinto BL, Orr A, Paysan-Lafosse T, Ponamareva I, Salazar GA, Bordin N, Bork P, Bridge A, Colwell L, Gough J, Haft DH, Letunic I, Llinares-López F, Marchler-Bauer A, Meng-Papaxanthos L, Mi H, Natale DA, Orengo CA, Pandurangan AP, Piovesan D, Rivoire C, Sigrist CJA, Thanki N, Thibaud-Nissen F, Thomas PD, Tosatto SCE, Wu CH, Bateman A. InterPro: the protein sequence classification resource in 2025. Nucleic Acids Res. 2025 Jan;53(D1):D444-D456. doi: 10.1093/nar/gkae1082.