PROTEINOGENIC AMINO ACIDS are amino acids that are incorporated biosynthetically into proteins during translation . The word "proteinogenic" means "protein creating". Throughout known life , there are 22 genetically encoded (proteinogenic) amino acids, 20 in the standard genetic code and an additional 2 that can be incorporated by special translation mechanisms.
In contrast, non-proteinogenic amino acids are amino acids that are
either not incorporated into proteins (like GABA ,
Both eukaryotes and prokaryotes can incorporate selenocysteine into
their proteins via a nucleotide sequence known as a
In eukaryotes, there are only 21 proteinogenic amino acids, the 20 of the standard genetic code, plus selenocysteine. Humans can synthesize 12 of these from each other or from other molecules of intermediary metabolism. The other nine must be consumed (usually as their protein derivatives), and so they are called essential amino acids . The essential amino acids are histidine , isoleucine , leucine , lysine , methionine , phenylalanine , threonine , tryptophan , and valine (i.e. H, I, L, K, M, F, T, W, V).
The proteinogenic amino acids have been found to be related to the set of amino acids that can be recognized by ribozyme autoaminoacylation systems. Thus, non-proteinogenic amino acids would have been excluded by the contingent evolutionary success of nucleotide-based life forms. Other reasons have been offered to explain why certain specific non-proteinogenic amino acids are not generally incorporated into proteins; for example, ornithine and homoserine cyclize against the peptide backbone and fragment the protein with relatively short half-lives , while others are toxic because they can be mistakenly incorporated into proteins, such as the arginine analog canavanine .
* 1 Structures
* 2 Chemical properties
* 2.1 Side chain properties
* 2.2 Gene expression and biochemistry
The following illustrates the structures and abbreviations of the 21 amino acids that are directly encoded for protein synthesis by the genetic code of eukaryotes. The structures given below are standard chemical structures, not the typical zwitterion forms that exist in aqueous solutions. Grouped table of 21 amino acids' structures, nomenclature, and their side groups' pKa values
Following is a table listing the one-letter symbols, the three-letter symbols, and the chemical properties of the side chains of the standard amino acids. The masses listed are based on weighted averages of the elemental isotopes at their natural abundances . Forming a peptide bond results in elimination of a molecule of water , so the mass of an amino acid unit within a protein chain is reduced by 18.01524 Da.
General chemical properties
AMINO ACID SHORT ABBREV. AVG. MASS (DA ) PI pK 1 (α-COOH) pK2 (α-+NH3)
ALANINE A Ala 89.09404 6.01 2.35 9.87
CYSTEINE C Cys 121.15404 5.05 1.92 10.70
ASPARTIC ACID D Asp 133.10384 2.85 1.99 9.90
GLUTAMIC ACID E Glu 147.13074 3.15 2.10 9.47
PHENYLALANINE F Phe 165.19184 5.49 2.20 9.31
GLYCINE G Gly 75.06714 6.06 2.35 9.78
HISTIDINE H His 155.15634 7.60 1.80 9.33
ISOLEUCINE I Ile 131.17464 6.05 2.32 9.76
LYSINE K Lys 146.18934 9.60 2.16 9.06
LEUCINE L Leu 131.17464 6.01 2.33 9.74
METHIONINE M Met 149.20784 5.74 2.13 9.28
ASPARAGINE N Asn 132.11904 5.41 2.14 8.72
PYRROLYSINE O Pyl 255.31
PROLINE P Pro 115.13194 6.30 1.95 10.64
GLUTAMINE Q Gln 146.14594 5.65 2.17 9.13
ARGININE R Arg 174.20274 10.76 1.82 8.99
SERINE S Ser 105.09344 5.68 2.19 9.21
THREONINE T Thr 119.12034 5.60 2.09 9.10
SELENOCYSTEINE U Sec 168.053 5.47 1.91 10
VALINE V Val 117.14784 6.00 2.39 9.74
TRYPTOPHAN W Trp 204.22844 5.89 2.46 9.41
TYROSINE Y Tyr 181.19124 5.64 2.20 9.21
SIDE CHAIN PROPERTIES
phobic PKA §
ALANINE A Ala -CH3 X - - - X X Aliphatic 67
CYSTEINE C Cys -CH2SH X 8.55 - acidic X X - 86
ASPARTIC ACID D Asp -CH2COOH - 3.67 X acidic X - - 91
GLUTAMIC ACID E Glu -CH2CH2COOH - 4.25 X acidic - - - 109
PHENYLALANINE F Phe -CH2C6H5 X - - - - - Aromatic 135
GLYCINE G Gly -H X - - - X X - 48
HISTIDINE H His -CH2-C3H3N2 - 6.54 X weak basic - - Aromatic 118
ISOLEUCINE I Ile -CH(CH3)CH2CH3 X - - - - - Aliphatic 124
LYSINE K Lys -(CH2)4NH2 - 10.40 X basic - - - 135
LEUCINE L Leu -CH2CH(CH3)2 X - - - - - Aliphatic 124
METHIONINE M Met -CH2CH2S CH3 X - - - - - Aliphatic 124
ASPARAGINE N Asn -CH2CONH2 - - X - X - - 96
PYRROLYSINE O Pyl -(CH2)4NHCOC4H5N CH3 - N.D. X weak basic - - -
PROLINE P Pro -CH2CH2CH2- X - - - X - - 90
GLUTAMINE Q Gln -CH2CH2CONH2 - - X - - - - 114
ARGININE R Arg -(CH2)3NH-C(NH)NH2 - 12.3 X strongly basic - - - 148
SERINE S Ser -CH2OH - - X - X X - 73
THREONINE T Thr -CH(OH)CH3 - - X - X - - 93
SELENOCYSTEINE U Sec -CH2SeH - 5.43 - acidic X X -
VALINE V Val -CH(CH3)2 X - - - X - Aliphatic 105
TRYPTOPHAN W Trp -CH2C8H6N - - X - - - Aromatic 163
TYROSINE Y Tyr -CH2-C6H4OH - 9.84 X weak acidic - - Aromatic 141
§: Values for Asp, Cys, Glu, His, Lys & Tyr were determined using the amino acid residue placed centrally in an alanine pentapeptide. The value for Arg is from Pace et al. (2009). The value for Sec is from Byun & Kang (2011).
N.D.: The pKa value of
Note: The pKa value of an amino-acid residue in a small peptide is
typically slightly different when it is inside a protein.
GENE EXPRESSION AND BIOCHEMISTRY
AMINO ACID SHORT ABBREV. CODON (S) Occurrence
in Archaean proteins
in Bacteria proteins
(%)& Occurrence in human proteins (%)& ESSENTIAL‡ IN HUMANS
ALANINE A Ala GCU, GCC, GCA, GCG 8.2 10.06 7.63 7.01 No
CYSTEINE C Cys UGU, UGC 0.98 0.94 1.76 2.3 Conditionally
ASPARTIC ACID D Asp GAU, GAC 6.21 5.59 5.4 4.73 No
GLUTAMIC ACID E Glu GAA, GAG 7.69 6.15 6.42 7.09 Conditionally
PHENYLALANINE F Phe UUU, UUC 3.86 3.89 3.87 3.65 Yes
GLYCINE G Gly GGU, GGC, GGA, GGG 7.58 7.76 6.33 6.58 Conditionally
HISTIDINE H His CAU, CAC 1.77 2.06 2.44 2.63 Yes
ISOLEUCINE I Ile AUU, AUC, AUA 7.03 5.89 5.1 4.33 Yes
LYSINE K Lys AAA, AAG 5.27 4.68 5.64 5.72 Yes
LEUCINE L Leu UUA, UUG, CUU, CUC, CUA, CUG 9.31 10.09 9.29 9.97 Yes
METHIONINE M Met AUG 2.35 2.38 2.25 2.13 Yes
ASPARAGINE N Asn AAU, AAC 3.68 3.58 4.28 3.58 No
PYRROLYSINE O Pyl UAG* 0 0 0 0 No
PROLINE P Pro CCU, CCC, CCA, CCG 4.26 4.61 5.41 6.31 No
GLUTAMINE Q Gln CAA, CAG 2.38 3.58 4.21 4.77 No
ARGININE R Arg CGU, CGC, CGA, CGG, AGA, AGG 5.51 5.88 5.71 5.64 Conditionally
SERINE S Ser UCU, UCC, UCA, UCG, AGU, AGC 6.17 5.85 8.34 8.33 No
THREONINE T Thr ACU, ACC, ACA, ACG 5.44 5.52 5.56 5.36 Yes
SELENOCYSTEINE U Sec UGA** 0 0 0 >0 No
VALINE V Val GUU, GUC, GUA, GUG 7.8 7.27 6.2 5.96 Yes
TRYPTOPHAN W Trp UGG 1.03 1.27 1.24 1.22 Yes
TYROSINE Y Tyr UAU, UAC 3.35 2.94 2.87 2.66 Conditionally
STOP CODON† - Term UAA, UAG, UGA††
* UAG is normally the amber stop codon , but encodes pyrrolysine if a
PYLIS element is present.
** UGA is normally the opal (or umber) stop codon, but encodes
selenocysteine if a
AMINO ACID SHORT ABBREV. FORMULA MON. MASS§ (DA ) AVG. MASS (DA )
ALANINE A Ala C3H5NO 71.03711 71.0779
CYSTEINE C Cys C3H5NOS 103.00919 103.1429
ASPARTIC ACID D Asp C4H5NO3 115.02694 115.0874
GLUTAMIC ACID E Glu C5H7NO3 129.04259 129.1140
PHENYLALANINE F Phe C9H9NO 147.06841 147.1739
GLYCINE G Gly C2H3NO 57.02146 57.0513
HISTIDINE H His C6H7N3O 137.05891 137.1393
ISOLEUCINE I Ile C6H11NO 113.08406 113.1576
LYSINE K Lys C6H12N2O 128.09496 128.1723
LEUCINE L Leu C6H11NO 113.08406 113.1576
METHIONINE M Met C5H9NOS 131.04049 131.1961
ASPARAGINE N Asn C4H6N2O2 114.04293 114.1026
PYRROLYSINE O Pyl C12H19N3O2 237.14773 237.2982
PROLINE P Pro C5H7NO 97.05276 97.1152
GLUTAMINE Q Gln C5H8N2O2 128.05858 128.1292
ARGININE R Arg C6H12N4O 156.10111 156.1857
SERINE S Ser C3H5NO2 87.03203 87.0773
THREONINE T Thr C4H7NO2 101.04768 101.1039
SELENOCYSTEINE U Sec C3H5NOSe 150.95364 150.0489
VALINE V Val C5H9NO 99.06841 99.1311
TRYPTOPHAN W Trp C11H10N2O 186.07931 186.2099
TYROSINE Y Tyr C9H9NO2 163.06333 163.1733
STOICHIOMETRY AND METABOLIC COST IN CELL
The table below lists the abundance of amino acids in E.coli cells and the metabolic cost (ATP) for synthesis the amino acids. Negative numbers indicate the metabolic processes are energy favorable and do not cost net ATP of the cell. The abundance of amino acids includes amino acids in free form and in polymerization form (proteins).
AMINO ACID Abundance (# of molecules (×108) per E. coli cell) ATP cost in synthesis under aerobic condition ATP cost in synthesis under anaerobic condition
ALANINE 2.9 -1 1
CYSTEINE 0.52 11 15
ASPARTIC ACID 1.4 0 2
GLUTAMIC ACID 1.5 -7 -1
PHENYLALANINE 1.1 -6 2
GLYCINE 3.5 -2 2
HISTIDINE 0.54 1 7
ISOLEUCINE 1.7 7 11
LYSINE 2.0 5 9
LEUCINE 2.6 -9 1
METHIONINE 0.88 21 23
ASPARAGINE 1.4 3 5
PROLINE 1.3 -2 4
GLUTAMINE 1.5 -6 0
ARGININE 1.7 5 13
SERINE 1.2 -2 2
THREONINE 1.5 6 8
TRYPTOPHAN 0.33 -7 7
TYROSINE 0.79 -8 2
VALINE 2.4 -2 2
AMINO ACID ABBREV. REMARKS
ALANINE A Ala Very abundant and very versatile, it is more stiff than glycine, but small enough to pose only small steric limits for the protein conformation. It behaves fairly neutrally, and can be located in both hydrophilic regions on the protein outside and the hydrophobic areas inside.
ASPARAGINE OR ASPARTIC ACID B Asx A placeholder when either amino acid may occupy a position
CYSTEINE C Cys The sulfur atom bonds readily to heavy metal ions. Under oxidizing conditions, two cysteines can join together in a disulfide bond to form the amino acid cystine . When cystines are part of a protein, insulin for example, the tertiary structure is stabilized, which makes the protein more resistant to denaturation ; therefore, disulfide bonds are common in proteins that have to function in harsh environments including digestive enzymes (e.g., pepsin and chymotrypsin ) and structural proteins (e.g., keratin ). Disulfides are also found in peptides too small to hold a stable shape on their own (e.g. insulin ).
ASPARTIC ACID D Asp Asp behaves similarly to glutamic acid, and carries a hydrophilic acidic group with strong negative charge. Usually, it is located on the outer surface of the protein, making it water-soluble. It binds to positively charged molecules and ions, and is often used in enzymes to fix the metal ion. When located inside of the protein, aspartate and glutamate are usually paired with arginine and lysine.
GLUTAMIC ACID E Glu Glu behaves similarly to aspartic acid, and has a longer, slightly more flexible side chain.
Essential for humans, phenylalanine, tyrosine, and tryptophan
contain a large, rigid aromatic group on the side chain. These are the
biggest amino acids. Like isoleucine, leucine, and valine, these are
hydrophobic and tend to orient towards the interior of the folded
GLYCINE G Gly Because of the two hydrogen atoms at the α carbon, glycine is not optically active . It is the smallest amino acid, rotates easily, and adds flexibility to the protein chain. It is able to fit into the tightest spaces, e.g., the triple helix of collagen . As too much flexibility is usually not desired, as a structural component, it is less common than alanine.
HISTIDINE H His His is essential for humans. In even slightly acidic conditions, protonation of the nitrogen occurs, changing the properties of histidine and the polypeptide as a whole. It is used by many proteins as a regulatory mechanism, changing the conformation and behavior of the polypeptide in acidic regions such as the late endosome or lysosome , enforcing conformation change in enzymes. However, only a few histidines are needed for this, so it is comparatively scarce.
ISOLEUCINE I Ile Ile is essential for humans. Isoleucine, leucine, and valine have large aliphatic hydrophobic side chains. Their molecules are rigid, and their mutual hydrophobic interactions are important for the correct folding of proteins, as these chains tend to be located inside of the protein molecule.
LEUCINE OR ISOLEUCINE J Xle A placeholder when either amino acid may occupy a position
LYSINE K Lys Lys is essential for humans, and behaves similarly to arginine. It contains a long, flexible side chain with a positively charged end. The flexibility of the chain makes lysine and arginine suitable for binding to molecules with many negative charges on their surfaces. E.g., DNA -binding proteins have their active regions rich with arginine and lysine. The strong charge makes these two amino acids prone to be located on the outer hydrophilic surfaces of the proteins; when they are found inside, they are usually paired with a corresponding negatively charged amino acid, e.g., aspartate or glutamate.
LEUCINE L Leu Leu is essential for humans, and behaves similarly to isoleucine and valine.
METHIONINE M Met Met is essential for humans. Always the first amino acid to be incorporated into a protein, it is sometimes removed after translation. Like cysteine, it contains sulfur, but with a methyl group instead of hydrogen. This methyl group can be activated, and is used in many reactions where a new carbon atom is being added to another molecule.
ASPARAGINE N Asn Similar to aspartic acid, Asn contains an amide group where Asp has a carboxyl .
PYRROLYSINE O Pyl Similar to lysine , but it has a pyrroline ring attached.
PROLINE P Pro Pro contains an unusual ring to the N-end amine group, which forces the CO-NH amide sequence into a fixed conformation. It can disrupt protein folding structures like α helix or β sheet , forcing the desired kink in the protein chain. Common in collagen , it often undergoes a post-translational modification to hydroxyproline .
GLUTAMINE Q Gln Similar to glutamic acid, Gln contains an amide group where Glu has a carboxyl . Used in proteins and as a storage for ammonia , it is the most abundant amino acid in the body.
ARGININE R Arg Functionally similar to lysine.
THREONINE T Thr Essential for humans, Thr behaves similarly to serine.
SELENOCYSTEINE U Sec The selenium analog of cysteine, in which selenium replaces the sulfur atom.
VALINE V Val Essential for humans, Val behaves similarly to isoleucine and leucine.
TRYPTOPHAN W Trp Essential for humans, Trp behaves similarly to phenylalanine and tyrosine. It is a precursor of serotonin and is naturally fluorescent .
UNKNOWN X Xaa Placeholder when the amino acid is unknown or unimportant.
TYROSINE Y Tyr Tyr behaves similarly to phenylalanine (precursor to tyrosine) and tryptophan, and is a precursor of melanin , epinephrine , and thyroid hormones . Naturally fluorescent , its fluorescence is usually quenched by energy transfer to tryptophans.
GLUTAMIC ACID OR GLUTAMINE Z Glx A placeholder when either amino acid may occupy a position
Amino acids can be classified according to the properties of their main products as either of:
* Glucogenic, with the products having the ability to form glucose by gluconeogenesis * Ketogenic, with the products not having the ability to form glucose: These products may still be used for ketogenesis or lipid synthesis . * Amino acids catabolized into both glucogenic and ketogenic products.
LIFE BASED ON ALTERNATIVE PROTEINOGENIC SETS
The proteinogenic set used by known life on Earth appears to be arbitrarily selected by evolution, according to current knowledge, from many hundreds of possible alpha-type amino acids. Xenobiology studies hypothetical life forms that could be constructed using alternative sets using expanded genetic codes . Miller -type experiments on artificial abiogenesis show that alpha-type amino acids predominate in water-based 'primordial soups', but beta-type amino acids dominate when less water is present. Both alpha- and beta-based sets could form the basis for alternative protein constructions and life forms.
* ^ Ambrogelly A, Palioura S, Söll D (Jan 2007). "Natural
expansion of the genetic code". Nat Chem Biol. 3 (1): 29–35. PMID
17173027 . doi :10.1038/nchembio847 .
* ^ Lobanov, AV.; Turanov, AA.; Hatfield, DL.; Gladyshev, VN.
(2010). "Dual functions of codons in the genetic code." . Crit Rev
Biochem Mol Biol. 45 (4): 257–65. PMC 3311535 . PMID 20446809 .
doi :10.3109/10409231003786094 .
* ^ Young VR (1994). "Adult amino acid requirements: the case for a
major revision in current recommendations" (PDF). J. Nutr. 124 (8
Suppl): 1517S–1523S. PMID 8064412 .
* ^ Erives A (2011). "A Model of Proto-Anti-
* Nelson, David L.; Cox, Michael M. (2000). Lehninger Principles of Biochemistry (3rd ed.). Worth Publishers. ISBN 1-57259-153-6 . * Kyte, J.; Doolittle, R. F. (1982). "A simple method for displaying the hydropathic character of a protein". J. Mol. Biol. 157 (1): 105–132. PMID 7108955 . doi :10.1016/0022-2836(82)90515-0 . * Meierhenrich, Uwe J. (2008). Amino acids and the asymmetry of life (1st ed.). Springer. ISBN 978-3-540-76885-2 . * Biochemistry, Harpers (2015). Harpers Illustrated Biochemistry (30st ed.). Lange. ISBN 978-0-07-182534-4 .
Wikimedia Commons has media