From: Josh Bembenek (email@example.com)
Date: Tue Aug 20 2002 - 11:33:54 EDT
Thank you for the response, here are my comments:
>There is a key simple flaw with Orr's analysis. His simple idea
>that my sequence has more kids than yours is filled with hidden
>goals or targets.To have any sequence generate "more" kids you must have
>1. Stable, Biologically active sequence.
>2. Sequence able to replicate.
>3. Sequence capable of being improved.
However, these targets are enormous.
--According to what calculations? For example what proportion of all
possible protein sequences are even soluble in a water based environment?
What percent fold in solution in a repeatable and predictable fashion? On
what basis do you make this claim?
Dembski's error is to specify the goal much more than nature does. Only
pre-specified complexity is improbable.
--I believe that the specification for a water soluble protein is quite
stringent, not to mention one that folds properly, and has biological
activity that enhances the adaptive capability of an organism or cell or
protobiont. Here is a relevant section from "Biochemistry" by Lubert Stryer
(p. 418) that I posted previously in a massive post:
"The way out of this dilemma is to recognize the power of cumulative
selection. Richard Dawkins, in The Blind Watchmaker, asked how long it would
take a monkey poking randomly at a typewriter to reproduce Hamlet's remark
to Polonius, "Methinks it is like a weasel." An astronomically large number
of keystrokes, of the order of 10^40, would be required. However, suppose
that we preserved each correct character and allowed the monkey to retype
only the wrong ones. In this case, only a few thousand keystrokes on
average would be needed. The crucial difference between these cases is that
the first employs a completely random search [the hemoglobin number] whereas
in the second, partially correct intermediates are retained. The essence of
protein folding is the retention of partially correct intermediates.
However, the protein-folding problem is much more difficult that the one
presented to our simian Shakespeare. First, proteins are only marginally
stable. The free-energy difference between the folded and unfolded states of
a typical 100-residue protein is 10kcal/mol. The average stabilization per
residue is only 0.1kcal/mol, which is less than random thermal energy
(RT=0.6kcal/mol at room temperature). This means that correct
intermediates, especially those formed early in folding, can be lost. The
analogy is that the monkey would be quite free to undo its correct
keystrokes. Second, the criterion of correctness is not a residue-by-residue
scrutiny of conformation by an omniscient observer [as Dawkins analogy
provides with the computer program] but rather the total free energy of the
transient species. Intermediates can be scored only by their free energies.
Third, some intermediates, called kinetic traps, have a favorable free
energy but are not on the path to final folded protein form. No wonder then
that protein folding is such an intriguing problem for both theoriticians
Not only is this an intriguing problem for experimentalists, but it must be
incorporated into whatever theoretical issues we imagine when discussing
useful, productive, protein sequences capable of conferring natural
selective advantage to an organism. Your statement that the target is huge
for nature's search of useful sequence doesn't seem to match Stryer's
comments on the protein folding problem and thus protein structure/function
in general. In addition, we aren't discussing intermediates here in this
section, we are discussing the properties of fully functional, fully
adapted, highly tailored sequences perfectly matched to do their job. Even
these protein sequences are "marginally stable."
To assert that particular complexity is sufficiently specified to be
improbable requires proving that the number of possible ways to achieve a
similar goal is small.
--Conversely what proves that the number of ways to achieve a similar goal
is large? Consider all possible protein sequences available at an average
of 150 amino acids, an astronomical number. Just sorting through the
sequence space to arrive at molecules that can fold in solution in a
predictable three dimensional structure that can eventually do something is
probably quite rare incorporating -- try obtaining soluble proteins in the
lab from molecules that are supposed to fold. Regardless of whether you
find the numbers large or small, what data backs up your claim??
However, we have no idea how many different ways hypothetical living
organisms might be able to do things.
--So why make the statement that there are many ways? The data we have
about organisms we know about don't support this idea-- it tells us just how
difficult it is to do what currently gets done. An overall theme is that a
very limited class of proteins perform a very limited subset of goals. You
do not find histones in chimps and completely different molecules in fish
doing the same job. If it were so easy to perform the job of a histone,
every organism could have its own special one perhaps unrecognizable in
sequence relation to other organisms. The origin of the molecule within the
scheme of evolution doesn't help either, if one sequence is just as good as
the next, eventually the next should be found at an equal rate of
opportunity with the first. The fact that no other sequences have been
found to perform histone's job indicate that this is the best one at its
Furthermore, 1 and 3 are not required. A sequence needs to be moderately
--Even the best are only "marginally stable." What do you mean by moderate,
is your comparison with something other than random thermal energy?
....but it does not have to be biologically active to increase in frequency
in a population, nor must it be biologically active to be mutated into
--For evolution to act, it must start with a template that has some function
that confers advantage to natural selection. If something has no function
to do anything, it cannot be naturally-selected to do nothing better. What
you are talking about is the spontaneous generation of function, not the
evolution of function. If mutation acts in the absence of any function and
thus in the absence of any selective advantage of the organism, then this is
an uncoupling of the evolutionary paradigm and is equivalent to miracle
occurrences at each arrival of novel genetic sequence.
3 is not a requirement in two ways. First, neutral or even detrimental
variants may end up generating more kids than a better variant, either
through genetic drift or through association with other, beneficial
--This is still "improvement" in the sense that the gene is becoming more
abundant. Given enough time evolution will select out detrimental genes
regardless of their association with better genes-- they will either be
mutated back into more fit genes, or they will be breeded out due to their
disadvantages by evolutionary processes. Unless the detrimental gene itself
confers selective advantage like sickle-cell anemia, will that gene persist.
What is more important is that if you completely lose the genes' function
(easily done with stop codons and frame-shifts,) it cannot be selected to be
improved because it isn't doing anything. Therefore if you destroy a gene,
mutations will simply accumulate at random mostly destroying the gene
completely unless you mutate the gene back to a place where it can function.
Secondly, all sequences are capable of being improved upon, depending upon
the function that you chose to look at. Histone 3 is a terrible digestive
enzyme, for example. Even for the current function of a gene, usually
slight improvements are possible.
--Do you care to comment on the minimal number of mutations required to
allow histone 3 act in a natural-selectiion-advantageous manner as a
protease? It will look nothing like histone 3 when your done! I find this
concept to completely disregard the relationship between protein sequence,
protein structure and protein function. They are tightly related, and
suggesting that any given sequence can perform any given function seems to
ignore about this particular relationship.
No; these data suggest that a precursor would have had a different
function, at least in detail, if it were a functional sequence.
--Allen Orr remarks in his criticism of Behe's work, that (Boston Review @
http://bostonreview.mit.edu/bostonreview/br21.6/orr.html ): "First it will
do no good to suggest that all the required parts of some biochemical
pathway popped up simultaneously by mutation. Although this "solution"
yields a functioning system in one fell swoop, it's so hopelessly unlikely
that no Darwinian takes it seriously. As Behe rightly says, we gain nothing
by replacing a problem with a miracle. Second, we might think that some of
the parts of an irreducibly complex system evolved step by step for some
other purpose and were then recruited wholesale to a new function. But this
is also unlikely. You may as well hope that half your car's transmission
will suddenly help out in the airbag department. Such things might happen
very, very rarely, but they surely do not offer a general solution to
I would guess that histone 3 may have common ancestry with other histones
and thus presumably descend from a more generic DNA-binding protein.
However, Dawkin╠s data are outdated or incorrect, as a search on GenBank
for histone 3 yields not only several ¤histone 3-like genesË but also
several genes identified as histone 3.
--All the histone genes you posted have very a related sequence, very
conservative residue changes (like threonine substituted for serine, one
carbon atom plus hydrogen change difference....) I would bet that they all
have the same three-dimensional structure and that critical residues are
invariant throughout all sequences. This analysis does not change the point
I was trying to make nor does it widen the relationship between sequence,
structure, and function for proteins. It does show that histone 3 is
tolerates more substitutions than hemoglobin or cytochrome C, but it only
means my example wasn't perfect. The point is still there.
Not necessarily inactive.
Also, there is a problem here in the definition of novel. Is histone 3
versus histone 2 novel? How different do sequences have to be to count as
novel? Important structural proteins in advanced eyes (vertebrate and
cephalopod) show minimal or no sequence divergence from certain enzymes-the
exact same sequences was capable of a novel function (albeit one probably
not too closely constrained by protein structure).
--And as indicated by Orr, occurs very very rarely. There are such examples
which occur like antifreeze proteins and the like, but generally the trend
is as stated above. Definition of novel would be a class of sequences
sharing the same three-dimensional structure and serve similar functions.
MSN Photos is the easiest way to share and print your photos:
This archive was generated by hypermail 2.1.4 : Tue Aug 20 2002 - 12:10:34 EDT