Canonical sequence

A canonical sequence is a sequence of DNA, RNA, or amino acids that reflects the most common choice of base or amino acid at each position. Many databases use or only give the canonical sequence. The UniProtKB/Swiss-Prot policy for example describes all the protein products encoded by one gene and uses the following criteria for the entry of a canonical sequence:

  • It is the most prevalent.
  • It is the most similar to orthologous sequences found in other species.
  • By virtue of its length or amino acid composition, it allows the clearest description of domains, isoforms, polymorphisms, post-translational modifications, etc.
  • In the absence of any information, we choose the longest sequence.
  • See also

  • Homology (biology)
  • Consensus sequence

  • Podcasts:

    PLAYLIST TIME:

    Latest News for: canonical sequence

    Edit

    Hide and seek

    Astronomy 14 Mar 2025
    ... taken with a Canon mirrorless camera and 600mm lens.
    • 1
    ×