IUB/IUPAC code
Standard IUB/IUPAC amino acid and nucleic acid codes
Standard Codes
For all the tools running in this BioPortale, sequences are expected to be represented in the standard IUB/IUPAC amino acid and nucleic acid codes
For DNA
A - Adenine
C - Cytosine
G - Guanine
T - Thymine
U - Uracil
M - A or C (amino)
R - A or G (purine)
W - A or T (weak)
S - C or G (strong)
Y - C or T (pyrimidine)
K - G or T (keto)
V - A or C or G
H - A or C or T
D - A or G or T
B - C or G or T
N - A or G or C or T (any)
- gap of indeterminate length
For Proteins
A - Alanine - Ala
B - Aspartic acid or Asparagine - Asx
C - Cysteine - Cys
D - Aspartic acid - Asp
E - Glutamic acid - Glu
F - Phenylalanine - Phe
G - Glycine - Gly
H - Histidine - His
I - Isoleucine - Ile
K - Lysine - Lys
L - Leucine - Leu
M - Methionine - Met
N - Asparagine - Asn
P - Proline - Pro
Q - Glutamine - Gln
R - Arginine - Arg
S - Serine - Ser
T - Threonine - Thr
U - Selenocysteine - Sec
V - Valine - Val
W - Tryptophan - Trp
X - Unknown - Xxx
Y - Tyrosine - Tyr
Z - Glutamic acid or Glutamine - Glx
* translation stop
- gap of inderteminate length