Gpuccio: Functional Information Methodology

Fair enough, will be looking forward to that.

1 Like

2 posts were merged into an existing topic: Comments on Gpuccio: Functional Information Methodology

I will start from this statement to deal with what I call the central/core of ID theory.

You must understand that when I was requested to write a summary of my methodology to measure FI in proteins, I did exactly that. I did not include a complete description of ID theory.

Of course, being a very convinced supporter of ID theory, it was very natural for me to conflate the measurement of very high values of FI with an inference to design, because that’s exactly what ID theory warrants. But now, having discussed in some detail the rationale of my measurement of FI in proteins, the focus can shift to ID theory itself. In brief, what is the connection between complex FI and design? And how does that connection apply to biological objects?

An important premise is that my personal approach to ID theory is completely empirical. It requires no special philosophy or worldview, except some good epistemology and philosophy of science. It is, I believe, completely scientific. And it has no connections with any theology. It has always been my personal choice to avoid any reference to theological arguments in all my scientific discussions about ID theory. And I will stick to that choice here too.

More in next post.


You can feel free to refine your case, as long as you are clear about any shifts you are making.

Three questions:

  1. What empirical evidence do you have that demonstrates FI increases are unique to design?

  2. How have you accounted for the similarity between proteins due to common descent?

  3. How have you accounted for differences between proteins caused by neutral drift and neutral coevolution?



Please, let me go on with some linear explanation of ID theory and my approach to it. Then I will answer your three questions.

You may have noticed that I have proposed two different questions about what ID theory is about:

  1. What is the connection between complex FI and the design inference?

  2. How does that apply to biological objects?

Now, if we want to understand each other, we have to focus first on the first question. To do that, we must for the moment forget biological objects. After all, they are the object we are discussing about: are they designed or do they arise by other mechanisms? So, we will for the moment consider the origin of biological objects undecided, and try to understand ID theory without any reference to biology.

To do that, we need an explicit definition of design and of functional information. I have offered a lnk to my two OPs about those two definitions. So, I will just remind here that:

  1. Design is any process where some conscious intelligent and purposeful agent imprints some specific configuration to a material object deriving it from subjective representations in his consciousness. The key point here is that the subjective representation must precede its output to the materila oobject.

  2. FI is the number of bits required to implement some explicitly defined function. Any function can be used. FI is always defined in relation to the defined function, whatever it is. n object exhibits the level of FI linked to the function if it can be used to implement the explicitly defined function at the explicitly defined level.

  3. In general, an explicitly defined function generates a binary partition in a well defined system and set of possible objects: those that can implement it, and those that cannot. FI, in general, is computed as -log2 of the ratio of the target space (the number of objects that can implement the function) to the search space (the number of possible objects) in the defined system.

More in next post.

Finally, a definition of “design”.

It looks like an entirely reasonable definition.

With this definition, it should be trivially obvious that biological organisms are not the result of design.



I am happy that you appreciate the definition. I love definitions.

For the conclusions, we will see…

1 Like

Speaking only for myself, I don’t think there is any reason to discuss design, or “ID theory,” in this context until the very basic questions asked by @swamidass are addressed. And I would reiterate that asking this question outside of even a basic phylogenetic analysis/approach is futile.

It is not possible to talk meaningfully about “functional information” without these basic foundational tasks being done.


I agree. Rather than a primer on ID generalities, let’s focus on what specifically you @gpuccio are doing.

For example, with your definition of design, there must be a pre-existing design. You are empirically based. So what evidence can you produce for a pre-existing specification? As @nwrickert, it seems obvious that this does not exist, at least not in an human accessible form.


To all:

So, the central core of ID theory is the following:

Leaving aside biological objects (for the moment), there is not one single example in the whole known universe where FI higher than 500 bits arises without any intervention of design.

On the contrary, FI higher than 500 bits (often much higher than that) abunds in designed objects. I mean human artifacts here.

Therefore, if we observe FI in any object (leaving aside for the moment biological objects) we can safely infer a design origin for that object.

That procedure will generate no false positives. Of course, it will generate a lot of false negatives. The threshold of 500 bits has been chosen exactly to get that type of result.

If those points are not clear, we are not really discussing iD theory, but something else.

This strong connection between high FI levels and a design origin has, of course, a rationale. But its foundation is completey empirical. We can observe that connection practically everywhere.

The rationale could be expressed as follows: there is no known necessity law that generates those levels of FI without any design intervention. Therefore, FI in non design systems can arise only by chance. But a threshold of 500 bits is so much higher than the probabilistic resources of the known universe, that we can be sure that such an event is empirically impossible. The probabilistic barriers of getting 500 bits of FI are simply too high to be overcome.

Well, that’s ID theory in a nutshell. I will come to the application to biology later. But I am confident that this simple summary will be enough for the moment to generate some answers. :slight_smile:

1 Like

You present a bare assertion as a premise of your analysis.

Why should we agree with this? It seems obviously false. What evidence do can you present to support this assumption?

There are examples of non-designed processes processes we can directly observe producing FI. We can observe high amounts of FI in cancer evolution too, which you agree is not designed. We also see high amounts of FI in viruses, which you also agree are not designed. All these, and more, are all counter examples to your assumption.

As a technical point, without clarifying precisely how FI is defined, this is not at all clearly the case.


I was answering to the remarks made here asking how I got to the design inference from the simple observation of complex FI. If you are not interested in the theory you seem to discuss so often here, please clarify that.

But that seems not to be the case. I see that, as expected, my “primer on ID generalities” has already generated some fierce response. So, I think I will go on, and answer them.

1 Like

This is what you should be working to demonstrate. You have now asserted this, so you should proceed to demonstrate the truth of this assertion.


I’m sorry, I did not mean to communicate any lack of interest in design; for me, the topic is very interesting and I was not criticizing your inclusion of the topic in general. I added “in this context” in a failed attempt to point out that without some foundational and very basic additional information, the analysis that attempts to address “functional information” is meaningless at best and misleading at worst.


I have a 2048 bit gpg key. Okay, maybe the RSA cryptosystem is inefficient, and there are less than 500 bits of FI there. But I could generate a longer key in much the same way. So the 500 bits doesn’t seem much of a limit.

The RSA key itself was mostly produced by a random number generator, with some filtering. Yes, you could say that the random number generator was designed. And you could say that the RSA cryptosystem was designed. Still, the key itself is mostly generated randomly, so does not seem designed.

What it amounts to, is that if you want to consider the role of design, then you have to push the design further back than the FI (functional information).

There are plenty of people at this site who have no problem with saying that there’s design involved in biological organisms. But they see a need to push the design further back than the organisms themselves. For example, they may see the system of evolution as being designed, but they don’t see individual organisms as designed.

As an agnostic, I cannot rule out the possibility that the system of evolution is designed. Possibly there are some atheists, maybe even Richard Dawkins, who might admit that they cannot rule out design at that level. But you cannot have a science of design for design at that level. You can maybe have a philosophy or theology, but not a science.


A few comments from the peanut gallery.

I find it helpful to have this context given because I’m not already familiar with @gpuccio’s work. If it has wandered out-of-bounds for the relevant question, that should soon become apparent.

@gpuccio: Is this Behe’s definition? Your own?? I am just curious to the source due to another recent discussion here.

Again, a source would be interesting. I won’t argue these definitions here (others have already), but I can work them into other ongoing discussions.

I think you could find naturally occurring non-biological examples too.

@colewd I suggest your question is better for a side thread. I’ll start one … no I won’t, it already exists! :slight_smile:


Hi everyone,
In order to keep the discussion clear, substantive, and on-topic, which will help @gpuccio in responding to questions and comments, we are going to start more tightly moderating this thread. Besides @gpuccio, we are going to mostly restrict the comments on the main thread to a limited number of non-ID participants: @swamidass, @Art, @glipsnort, and @sfmatheson. If you have a substantial, on-topic technical contribution to make and wish to join the main thread, please contact the @moderators via PM. For all other reactions, contributions, and comments, please post them on Comments on Gpuccio: Functional Information Methodology.


Consider a cartoon piece of sequence that is 100 bp in length. The sequence has a function, every basepair contributes to the function, and loss of the function is lethal to the organism. 0.1% of all randomly chosen sequences could serve the same function equally well. This means the sequence has 7 bits of FI, correct? 300 different sequences can be reached from the functional sequence by one single-base substitution – which is the only kind of mutation in this cartoon world. The most likely case, then, is that all 100 bases will be conserved through evolution. Will blast return a bitscore of 7 bits for a 100 bp exact match?

ETA: Sorry, that’s 10 bits – I was taking the natural log.

Actually doing a BLAST search in the human genome for a 100 bp perfect match, I get a bit score of 194 bits (calculating from the E score and assuming a target genome size of 3e9). For this scenario, the bit score is clearly not a good estimator of the FI as you have defined it. It’s not because the two are measuring fundamentally different things. The bit score assesses the probability of a specific sequence by chance, with no reference to the other sequences that could perform the same function, while FI represents the probability of finding a sequence with equivalent function.

1 Like

Since we’re dealing with BLAST searches here, I’m assuming that the search space is all possible sequences in a genome of the same size. (I’ve been assuming we’re using nucleotide sequence, but the argument is the same if instead we use amino acid sequence for proteins.)