@Rumraket
He is not trying to turn a protein’s function into another in the 2004 paper. And that’s not what I said, precisely.
He wanted to know how difficult it is to go from non-functional to functional for a specific functional fold. He sets the threshold low for a particular reason.
There are two approaches to asking about the rarity of functional folds. The first, the forward approach, starts with some sort of random sequence, then selects for improvement in some basic function, such as binding to a molecule like ATP or some other hapten. The second is to start with a functional fold, and establish the threshold at which that fold goes from functional to non-functional, and then estimate from that what number of sequences are capable of carrying out that function.
Let me upload an image from his paper. This has to do with why he chose to use a low threshold and weakened enzyme, and why his numbers differ from forward approach studies, where you go from random sequence to some binding function or weak catalysis.
In discussing this figure he says:
How might this picture be reconciled with the much higher prevalence of function often reported
in studies using the forward approach?_ Figure 9 illustrates two possible ways for functional
sequences to appear relatively common when a very low functional threshold is used. Figure 9 (a) represents a global-ascent model of the function landscape, meaning that incremental improvement
of an arbitrary starting sequence will lead to a globally optimal final sequence with reasonably high probability. In this case, sequences exhibiting function at any level are properly regarded as
suboptimal versions of the optimal archetype. Consequently, if we want to know how common
sequences of this functional type are (regardless of optimality), we should set the functional threshold
as low as possible. The higher of the two thresholds shown in Figure 9 (a) would therefore lead to a
considerable underestimate. However, if the real landscape is more like the local-ascent model depicted in Figure 9 (b), where incremental improvement leads to an archetypal sequence for only a relatively tiny set of local starting sequences, then the lower threshold would lead to a considerable overestimate. In essence, activity might be a reliable marker of archetype-like mechanism down to some minimum level, but not below.
If there are lots of ways to achieve local optima related to a particular function then they will appear to be numerous. But they won’t be able to be improved much. If there is a smooth global optimum, There will be fewer candidates at the beginning, but it should be relatively easy to optimize once the first level is achieved.
People keep getting the 2004 paper, which Art Hunt critiqued and which has to do with the rarity of functional folds in sequence space, with my paper with dDoug, where we looked at the conversion of one protein’s function into another protein’s with the same fold. Different scale questions.