UniProt

UniProt features

How to submit data

UniProt submission guidelines. Submit protein sequences.

You can submit full or partial protein sequences, sequences determined at the protein level, and associated functional annotation to UniProtKB.

  • Sequences determined by Edman degradation or by manual interpretation of MS/MS spectra should be submitted via SPIN.
  • Translated nucleotide sequences should be submitted to ENA.
  • MS/MS data should be submitted to PRIDE.

Non-Personal vs Personal data

Non-personal data (non-identifiable).

Access to data

Open access.

Embargo

Possible. There are three options for the release date:

  • Data may be published without further notice.
  • Data must be kept confidential until publication.
  • Data may be made public after the specified date.
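The three release options can be modelled in a local record before the submission form is filled in. The following is a minimal sketch only: the names `ReleaseOption` and `Confidentiality` are hypothetical and are not part of SPIN or any UniProt API. It assumes that a date is needed only for the third option.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum
from typing import Optional


class ReleaseOption(Enum):
    """Hypothetical labels for the three release options listed above."""
    IMMEDIATE = "published without further notice"
    UNTIL_PUBLICATION = "confidential until publication"
    AFTER_DATE = "public after the specified date"


@dataclass(frozen=True)
class Confidentiality:
    option: ReleaseOption
    release_date: Optional[date] = None  # assumed to apply only to AFTER_DATE

    def __post_init__(self) -> None:
        # A date-based embargo without a date cannot be honoured.
        if self.option is ReleaseOption.AFTER_DATE and self.release_date is None:
            raise ValueError("AFTER_DATE requires a release date")
        # A date on any other option would be ambiguous, so reject it.
        if self.option is not ReleaseOption.AFTER_DATE and self.release_date is not None:
            raise ValueError("a release date only applies to AFTER_DATE")
```

For example, `Confidentiality(ReleaseOption.UNTIL_PUBLICATION)` is accepted, while `Confidentiality(ReleaseOption.AFTER_DATE)` without a date raises `ValueError`.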

Data licence

UniProt applies the Creative Commons Attribution (CC BY 4.0) License to all copyrightable parts of its databases. It cannot provide unrestricted permission regarding the use of the data, as some data may be covered by patents or other rights (UniProt License & disclaimer).

Data/Experiments types

  • Edman degradation.
  • Manual interpretation of MS/MS data.
  • Mascot or similar search algorithms.
  • Translation of a nucleotide sequence.

Metadata

Required:

  • Protein name.
  • Sequencing method.
  • Organism.
  • Sequence.
  • Citations.
  • Confidentiality (see Embargo, above).

Optional:

  • Properties of the protein: Mass spectrometry; Function; Tissue specificity; Similarity; EC number; Catalytic activity; Cofactor; Pathway; Enzyme regulation; Vmax; KM data; Quaternary structure; Allergenicity; Subcellular location; Post-translational modification; Induction; Developmental stage; Optimum temperature; Optimum pH; Redox potential; Absorption; 2D-PAGE results; Miscellaneous.
  • Sequence features of the protein: Uncertainty in the sequence; Post-translationally modified residue; Disulfide bond; Active site; Glycosylation site; Metal ion-binding site; Nucleotide-binding region; DNA-binding region; Other binding site; Transmembrane region; Other domain of interest; Other site of interest; Sequence variation.

A complete description of the metadata requirements can be found at the SPIN submission guidelines.
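Before opening the submission form, it can help to check that every required item listed above is at hand. The sketch below is illustrative only: the dictionary keys and the function `missing_required` are hypothetical names chosen for this example, not field names used by SPIN.

```python
# Hypothetical keys mirroring the "Required" metadata list above.
REQUIRED_FIELDS = (
    "protein_name",
    "sequencing_method",
    "organism",
    "sequence",
    "citations",
    "confidentiality",
)


def missing_required(record: dict) -> list:
    """Return the required keys that are absent or empty in `record`."""
    return [name for name in REQUIRED_FIELDS if not record.get(name)]


draft = {
    "protein_name": "Example protein",
    "sequencing_method": "Edman degradation",
    "organism": "Homo sapiens",
    "sequence": "MKTAYIAK",
    "citations": [],  # empty, so it is reported as missing
}
print(missing_required(draft))
```

An empty list means all required items are present; optional properties and sequence features are deliberately not checked.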

Ontology

UniProt has its own RDF schema ontology (http://ontologies.berkeleybop.org/mi.owl).

Data documentation

UniProt does not allow a README file to be uploaded. All relevant information about the data (metadata) needs to be provided in the designated fields. See the “Metadata” section above.

File format(s)

The UniProt submission system (SPIN) accepts sequences in single-letter amino acid code. A complete description of the UniProtKB/Swiss-Prot format is given here.

If you are familiar with the UniProtKB/Swiss-Prot format, you are encouraged to submit your updates and/or corrections in that format, but this is not a prerequisite. A longer textual description of what you are submitting is preferred over spending time fitting your data to the UniProt format.
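A sequence can be checked locally for stray characters before it is pasted into the form. This is a minimal sketch under a stated assumption: it accepts only the 20 standard amino acid letters by default, and whether SPIN also accepts ambiguity or non-standard codes (such as X, B, Z, U or O) is not stated here, so the allowed set is left configurable. The function name `invalid_residues` is hypothetical.

```python
# The 20 standard amino acids in single-letter code.
STANDARD_AMINO_ACIDS = frozenset("ACDEFGHIKLMNPQRSTVWY")


def invalid_residues(sequence: str, allowed=STANDARD_AMINO_ACIDS) -> list:
    """Return (position, character) pairs for characters outside `allowed`.

    Whitespace is removed and letters are upper-cased first;
    positions are 1-based and refer to the cleaned sequence.
    """
    cleaned = "".join(sequence.split()).upper()
    return [(pos, ch) for pos, ch in enumerate(cleaned, start=1) if ch not in allowed]


print(invalid_residues("MKT AYI\nAKQ"))  # whitespace and line breaks are ignored
print(invalid_residues("MKB1"))          # reports the characters B and 1
```

To permit additional codes, pass a larger set, for example `allowed=STANDARD_AMINO_ACIDS | {"X"}`.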

Data volume and costs

No limit for data volume. No costs.

Data quality

UniProtKB/Swiss-Prot is a curated database, not an unedited archive or repository. UniProt does not guarantee that all submitted data will be represented in a given entry.

Identifiers

An accession number is minted once the submission has been processed by a curator.

Guide and manual

UniProt Knowledgebase User Manual: https://www.uniprot.org/docs/userman.htm