Structure & Reactivity in Chemistry

Introduction to Biomolecules

IB2. Intermolecular Forces in Biology: Proteins (contributed by Henry Jakubowski)

Proteins are a large biological molecules that have molecular weights ranging from the thousand to the millions.  Humans have about 24,000 different proteins which catalyze chemical reactions, recognize foreign molecules and pathogens, allow cellular and organism movement, and regulate cell response, including cell division and death. 


Proteins are polymers consisting of monomers call amino acids.  There are twenty different naturally occurring amino acids that differ in one of the four groups connected to a central carbon atom.  In an amino acid, the central (alpha) carbon has an amine group (RNH2, RNH3+), a carboxylic acid group (RCOOH, RCOO-), an H, and one of twenty different R groups (also called side chains) attached to it. 

The R groups are classified as generally nonpolar, polar charged, or polar uncharged.  The smallest amino acid is glycine (Gly) which has a hydrogen atom as its R group.  All of the other 19 naturally-occurring amino acids have one stereocenter at the carbon containing the amine and carboxyl groups and can exist as two possible enantiomers of which only one form occurs in proteins.  All amino acids in proteins have the absolute configuration shown above.  With the exception of the amino acid cysteine (Cys) with a -CH2SH for an R group (which happens to have an R stereocenter), all of the remaining amino acids found in proteins have an S stereocenter.  

Amino acids form polymers when an amino group of an amino acid is covalently attached to the carbonyl carbon (C=O) of the carboxyl group of the next amino acid. The resulting link between the amino acids is an amide bond which biochemists call a peptide bond.  In this reaction, water is released. In a reverse reaction, the peptide bond can be cleaved by water (hydrolysis).  When two amino acids link together to form an amide link, the resulting structure is called a dipeptide.  Likewise, we can have tripeptides, tetrapeptides, and other polypeptides.  At some point, when the structure is long enough, it is called a protein.  There are many different ways to represent the structure of a polypeptide or protein.  Each shows differing amounts of information.  A heptapeptide, Aspartic Acid-Lysine-Glutamine-Histidine-Cysteine-Arginine-Phenylalanine is shown below.  Each amino acid is denoted by a three letter code (Asp-Lys-Gln-His-Cys-Arg-Phe).. 

Notice that the protein chain has a beginning (an N-terminus with a amino group) and an end (a C-terminus with a carboxyl group).  Also note that every atom in the backbone has a slight charge arising from the presence of the electronegative atoms O and N.  Hence the backbone is polar.  The R groups on each amino acid in the peptide are also called side chains.

The actual linear sequence of a protein is called its primary (1o) structure).  Both the sequence of a protein and it's total length differ from one protein to another. Just for an octapeptide, there are over 25 billion different possible arrangement of amino acids.  Hence the diversity of possible proteins is enormous.

Most proteins do not form an elongated structure as implied by the extended structures shown above.  Rather they collapse on themselves to form compact, mostly globular (roughly spherical) structures. They do so as groups local and distant on the chain attract each other through IMFs which are now exerted within the large protein and not between different proteins.  What kind of IMFs are involved?

To simplify the process, lets consider first just the polar backbone without the side chains.  The main chain can clearly form hydrogen bonds with itself and to water.  If the hydrogen bonds are between the amide H (a hydrogen bond "donor" and a carbonyl O (a hydrogen bond "acceptor") a fixed number of amino acids distant from the amide H, a regular, repetitive secondary (2o) structures called a helix can form.  One especially prevalent helix, the alpha helix,  forms when within a short stretch of amino acids, the amide H of an amino acid (given the number ith ) in the backbone forms a hydrogen bond to the carbonyl C four amino acids in the protein sequence (ith + 4).  There are 3.6 amino acids/turn of the alpha helix.

Alpha Helix (dotted yellow lines represent hydrogen bonds)

Note that all the side chains (R groups) are pointing away from the helix axis.  Also evident from the space-filling model is that there is no opening in the helix as you look down the axis.  The actual atoms are densely packed.  The trace of an alpha helix in a protein is usually represented by a red or purple curly ribbon.

JmolJmol:  An isolated helix from an Antifreeze Protein  Jmol14 (Java) |  JSMol  (HTML5)    

Beta strands/sheets, another type of secondary structure, also occur when H bonds form between adjacent short stretches of amino acids in which the backbone of the short stretches are running either in the same N-C direction (parallel beta strands) or in opposite directions (antiparallel beta strands).   The hydrogen bonds in secondary structures are all among main chain atoms in the backbone, not among side chains.  Beta structure is usually represented in "cartoon" form by yellow flat ribbons with an arrow showing the direction of the protein backbone from the N-terminus to C terminus direction.

Antiparallel Beta Strands (yellow lines represent hydrogen bonds)

Parallel Beta Strands (yellow lines represent hydrogen bonds)

The side chains in the beta sheet are perpendicular to the plane of the sheet, extending out from the plane on alternating sides.  Parallel sheets characteristically distribute nonpolar (or hydrophobic) side chains on both side of the sheet, while antiparallel sheets are usually arranged with all the hydrophobic residues on one side. This requires an alternation of hydrophilic and hydrophobic side chains in the primary sequence. Antiparallel sheets are found in silk with the sheets running parallel to the silk fibers. The following repeat is found in the primary sequence: (Ser-Gly-Ala-Gly)n), with Gly pointing out from one face, and Ser or Ala from the other.

JmolJmol: Twisted beta sheet from Arabinose Binding Protein Jmol14 (Java) |  JSMol  (HTML5) 

Protein folding  is determined by much more than the formation of hydrogen bonds between backbone donors and acceptors.  We must consider the effects of the 20 different R groups (side chains) which complicate the folding process. A protein ultimately folds in space to form a unique 3D shape, which usually contains some alpha helices and beta sheets. The overall 3 D structure is called the tertiary (3o) structure of the protein.  The 3D structure of a protein determines the function of the protein as its shape and surface charge characteristics determines which molecules, both small and large, bind to the protein.

Here are some models of proteins showing secondary and tertiary structures.

JmolJmol: Met-Myoglobin - predominantly alpha-helical protein  Jmol14 (Java) |  JSMol  (HTML5) 

JmolJmol:  Superoxide Dismutase - predominantly beta structure   Jmol14 (Java) |  JSMol  (HTML5) 

Jmol Jmol: Triose Phosphate Isomerase - has both alpha and beta secondary structure  Jmol14 (Java) |  JSMol  (HTML5) 

The structure of proteins is much more complicated than micelles and bilayers.  To a first approximation the protein consists of a polar main chain/backbone from which amino acid side chains of varying polarity and charge hang.  These side chains are polar uncharged, polar charged, and nonpolar.  In general the nonpolar side chains are more stable buried in the center of the protein, surrounded by other nonpolar side chains and away from polar water.  Compare this to the structure of a micelle.  Given the greater complexity of protein primary and tertiary structure, however, not all nonpolar side groups can be buried.  Some are on the surface exposed to solvent.  Likewise, polar and charged polar side chains like to be on the surface exposed to water, but some will find themselves buried.  If they are, they will be surrounded by polar side chains or interact with buried hydrogen bond donors and acceptors on the backbone that stabilize the buried polar group.  Here are some findings about proteins derived from the known 3D structure of thousands of different proteins:

The packing around a buried nonpolar side chain of the amino acid phenylalanine (Phe) is shown in the Jmol below.  It shows the structure of a small protein (protein tyrosine phosphatase) and the amino acids groups surrounding the buried Phe.

Jmol Jmol:  Buried Phe in low molecular weight protein tyrosine phosphatase (bovine)  Jmol14 (Java) |  JSMol  (HTML5) 

Problem IB2.1.

Secondary structure in proteins involves

a.  hydrogen bonds between side chain atoms and backbone atoms

b.  hydrogen bonds between side chain atoms and other side chain atoms

c.  London dispersion forces among side chain atoms

d.  hydrogen bond between backbone atoms and other backbone atoms.

Problem IB2.2.

The structure of the amino acid Ser is shown below



Which groups are involved in forming covalent bonds to other amino acids in a protein?

a.  -CH2OH and -CO2H

b.   -CH2OH and -NH3 +  

c.  -CO2H and -NH3 +  

d.  -H and -CH2OH

Problem IB2.3.

The structure of the amino acid Ser is shown below



If Ser was in an alpha helix, which group would most likely be projecting away from the helix axis?

a.  -CH2OH

b.   -NH3 +  

c.  -CO2H +  

d.  -H

Problem IB2.4.

If Ser were part of a protein, which of the following would most likely NOT describe the environment around the side chain?

a.  it could be buried in the protein surrounded by nonpolar amino acid side chains

b.  it could be one the surface surrounded by water

c.  it would buried adjacent to a glutamine side chain

d.  it is polar so it could not be buried in the protein

Problem IB2.5.

Molecules of a given protein have the following traits:

a. a defined amino acid sequence

b.  an invariant molecular weight

c.  a fixed amino acid composition

d.  all of the above

Problem IB2.6.

______ between tightly packed amino acid side chains in the interior of a protein are a major contribution to the stability of the native state

a.  Dipole-dipole interactions

b.  Ion - ion inteactions

c.  Covalent bonds

d.  London dispersion forces

Problem IB2.7.

If the following section of a polypeptide is folded into an alpha helix, to which amino acid is the carbonyl group of Ala H-bonded?

N term  Ala-Leu-Ser-Asp-Glu-Val  C term

a.  Leu

b.  Ser

c. Asp

d.  Glu

e. Val

Problem IB2.8.

Which form of a 10 amino acid peptide would most likely have the shortest length?

a.  the peptide is in solution and not part of a proteins structure

b.  the peptide is part of a protein and in a antiparallel beta sheet

c.  the peptide is part of a protein and in a parallel beta sheet

d.  the peptide is part of a protein and in an alpha helix

This site is written and maintained by Henry Jakubowski as part of a Structure & Reactivity in Organic, Biological and Inorganic Chemistry written by Chris P. Schaller, Ph.D., College of Saint Benedict / Saint John's University.  It is freely available for educational use.

Creative Commons License
Structure & Reactivity in Organic, Biological and Inorganic Chemistry by Chris Schaller is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License

Send corrections for this chapter to



Back to Structure-Property Relationships

Back to Web Materials on Structure & Reactivity in Chemistry

Back to Chemistry

Back to CSB/SJU