Information is Additive but Evolutionary Wait Time is Not

swamidass · August 31, 2019, 5:05pm

Continuing the discussion from Gpuccio: Functional Information Methodology:

Gpuccio: Functional Information Methodology

Case 1. Two independent events, each with probability p, and success requires both at the same time, and there is no benefit to one alone.

Case 2. Two independent events, each with probability p, and each event is independently useful, so it can be retained by negative selection when found.

Case 3. One event with probability p^2.

All else being equal, perhaps with some caveats to be clarified:

The FI is the same for all three cases (success at all events).

Single trial success is identical in all cases: p^2, with FI of -2 log2(p) bits.

Evolutionary wait time in Case 1 and 3 is the same: about 1/p^2 trials, with FI of -2 log2(p) bits.

Evolutionary wait time in Case 2 is much less: approximately 2/p trials, with the same FI of -2 log2(p) bits. Note, the wait time is actually LESS than this when both events are sampled in parallel, and it scales very well as we increase decomposability.

Case 1 is equivalent to the strictest (and known to be false) version of irreducible complexity (IC1). Even Behe acknowledges that this is not how biology works.

For very good reason, modern evolutionary theory works like Case 2, which has far lower wait times than Case 1 and 3.

FI does not correlate with wait time! The decomposability of the system breaks this relationship.

This result does not depend on fitness landscapes at all, just random sampling (tornado in a junkyard) plus NEGATIVE selection, not Darwinistic positive selection.

I bulleted out the points here so points can be disputed or affirmed more clearly. Everything I wrote here is directly verifiable with simulations and experiments. We are not even including positive selection here, just negative selection.

This is a simplified model, which is a good starting point for refinement. To do: 1. extend to show math for more decomposability, 2. show my work, and 3. show how dependencies change the computation.
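The three cases can be checked with a small simulation. This is only a minimal sketch under stated assumptions, not the author's own code: one "trial" samples both events once, Case 2 samples both events in every trial and keeps whichever has already been found, and the function names are made up for illustration.

```python
import random

def wait_all_at_once(p, rng):
    """Case 1 (and Case 3): a trial succeeds only if both events hit together."""
    trials = 0
    while True:
        trials += 1
        if rng.random() < p and rng.random() < p:
            return trials

def wait_with_retention(p, rng):
    """Case 2: each event is retained once found (assumed sampled in parallel)."""
    found_a = found_b = False
    trials = 0
    while not (found_a and found_b):
        trials += 1
        found_a = found_a or rng.random() < p
        found_b = found_b or rng.random() < p
    return trials

def mean_wait(sampler, p, runs=10000, seed=0):
    """Average wait over many seeded runs."""
    rng = random.Random(seed)
    return sum(sampler(p, rng) for _ in range(runs)) / runs

def exact_all_at_once(p):
    # Geometric wait with per-trial success probability p^2.
    return 1 / p**2

def exact_with_retention(p):
    # E[max(X, Y)] = E[X] + E[Y] - E[min(X, Y)] for two geometric(p) waits.
    return 2 / p - 1 / (1 - (1 - p) ** 2)

if __name__ == "__main__":
    p = 0.1
    print("all at once:", mean_wait(wait_all_at_once, p), exact_all_at_once(p))
    print("retention:  ", mean_wait(wait_with_retention, p), exact_with_retention(p))
```

The FI is the same in both functions (the target is still "both events found"); only the retention rule differs, and that alone moves the expected wait from about 1/p^2 to less than 2/p.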

swamidass · August 31, 2019, 5:51pm

To be clear,

This is not true. If these objects are independent, then they are exactly 100 bits of FI.

I’d like to know in what context you think it is correct. Can you tell me @glipsnort?

Look at the example offered:

Gpuccio: Functional Information Methodology

A thief enters a building, where he finds the following objects:

a) One set of 100 small safes.

b) One big safe.

The 100 small safes contain, each, 1/100 of the sum in the big safe.

Each small safe is protected by one electronic key of one bit: it opens either with 0 or with 1.

The big safe is protected by a 100 bit long electronic key.

The thief does not know the keys, any of them.

He can do two different things:

a) Try to open the 100 small safes.

b) Try to open the big safe.

What would you do, if you were him?

Rumracket, maybe, would say that there is no difference: the total sum is the same, and according to his reasoning (or your reasoning, maybe) we have 100 bits of FI in both cases.

My compliments to your reasoning! If the thief reasoned that way, he could choose to go for the big safe, and maybe spend his whole life without succeeding. He has to find one functional combination out of 2^100 (about 10^30). Not a good perspective.

On the other hand, if he goes for the small safes, he can open one in, what? one minute? Probably less. Even giving one more minute to take the cash, he would probably be out and rich after a few hours of honest work!

What is going on here is that @gpuccio has (wrongly) equated wait time with exp(FI). Wait time and FI are not the same thing though, and are not related this way. Wait time is much lower than this equation indicates if the system is decomposable. What follows is not precise (and I can derive the exact result if you are curious):

In a fully decomposable system (100 safes) the wait time is about log 100. Note that you can try all the safes at once. So 100 thieves on 100 1-bit safes.
In a non-decomposable system (1 safe, with a 100-bit combination), the wait time is 2^100.

The computation above is not precise (the log 100), and it has a distribution. I can work it out exactly if people want. Though, perhaps @Dan_Eastwood or @nwrickert wants to jump in.
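For anyone who wants the exact expectation rather than the rough log 100 figure, here is a hedged sketch. It assumes each still-closed safe gets one independent random guess per round (success probability 1/2 for a 1-bit key), so the wait is the maximum of 100 independent geometric waits. The function name is illustrative only.

```python
def expected_parallel_wait(n_safes, p=0.5, tol=1e-12):
    """Expected rounds until n independent safes are all open,
    when every still-closed safe gets one guess per round.

    Uses E[T] = sum over t >= 0 of P(T > t), where
    P(T > t) = 1 - (1 - (1 - p)^t)^n.
    """
    total, t = 0.0, 0
    while True:
        still_waiting = 1.0 - (1.0 - (1.0 - p) ** t) ** n_safes
        if t > 0 and still_waiting < tol:
            return total
        total += still_waiting
        t += 1

if __name__ == "__main__":
    print(expected_parallel_wait(1))    # a single 1-bit safe
    print(expected_parallel_wait(100))  # 100 safes tried in parallel
```

Under these assumptions the 100-safe figure comes out at roughly 8 rounds, a little above log2(100), which is the sense in which "about log 100" is right in scale but not exact.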

What is the FI? In both cases the FI is 100 bits to achieve full function. In both cases there is one configuration in a space of 2^100, so the FI is 100 bits. The point is that FI is not wait time. You cannot equate the two.

As the OP states, for an independent multi-component system, FI is additive but wait time is not.

swamidass · August 31, 2019, 5:53pm

@gpuccio, do you see your error in reasoning? You thought wait time (evolutionary difficulty) is about 2^FI. This is not true though, not if partial payouts are available, as in the case with 100 safes.

nwrickert · August 31, 2019, 7:26pm

I can look at this differently.

@gpuccio appears to be looking at FI as something to be built up sequentially – one bit at a time.

When you describe the system as decomposable, you are thinking of something more like parallel processing – several bits at the same time.

What’s important here, is that evolution is massively parallel.

swamidass · August 31, 2019, 7:35pm

Actually the opposite. He thinks it must happen all at once. He needs that, because it MUST happen all at once for his wait time = 2^FI claim to be correct.

Decomposable and parallel are distinct concepts. A decomposable, but non-parallel, problem would change this:

To this:

In a fully decomposable system (100 safes) the wait time is about 100 * 2. Here one thief is working, solving each safe one at a time.

Exactly, so this last case is just not relevant. There are at least two other simplifications in this case study which, if we fixed them, would decrease the wait time even further, to sublinear.

Dan_Eastwood · September 1, 2019, 12:07am

I already played with the safes example and broke it. There I assumed a certain dependence among the safes, where the small safes each holds part of the function, and opening 100 small safes is equivalent to opening the single large safe.
I also noted the waiting time will be considerably shorter. Opening one safe at a time, guessing one more bit with each safe, is a Negative Binomial distribution with an expected value of 200 (at p=0.5).
The thief can do a lot better, approaching O[log(100)], by trying all the safes at once and guessing the bits he has not already learned. As I interpret the problem, the thief has a reasonable chance of guessing the next several bits; 50% chance to guess 1 bit and open 1 safe, 25% chance to open 2 safes, 12.5% to open 3, etc. The number of safes opened on each cycle this way follows a Geometric Distribution until the thief starts running out of safes.
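The sequential figure (expected value 200 at p=0.5) is easy to check. The sketch below assumes one thief who guesses each safe's bit at random with replacement, so the total is a sum of 100 geometric waits, i.e. a Negative Binomial count of trials. The function names are hypothetical.

```python
import random

def sequential_wait(n_safes=100, p=0.5, rng=None):
    """Total guesses for one thief who opens the safes one after another."""
    rng = rng or random.Random()
    guesses = 0
    for _ in range(n_safes):
        while True:
            guesses += 1
            if rng.random() < p:  # this safe opens; move to the next one
                break
    return guesses

def negative_binomial_mean(n_safes, p):
    # Mean number of trials needed for n_safes successes.
    return n_safes / p

def negative_binomial_variance(n_safes, p):
    # Variance of the number of trials needed for n_safes successes.
    return n_safes * (1 - p) / p**2

if __name__ == "__main__":
    rng = random.Random(0)
    runs = [sequential_wait(rng=rng) for _ in range(5000)]
    print(sum(runs) / len(runs), negative_binomial_mean(100, 0.5))
```

The wait grows linearly in the number of safes here, compared with 2^100 for the single big safe.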

swamidass · September 1, 2019, 12:09am

Do you see the set up where he can get all the combinations in exactly 2 guesses, no matter how many safes there are?

This would match biology better than most versions.

Dan_Eastwood · September 1, 2019, 12:30am

I’m assuming the thief is flipping a coin to choose the next bit(s) and applying it to all the remaining unopened safes. If the thief is systematic he can do even better, toggling the next bit past the last one he knows to be correct, guaranteeing at least one success on each cycle after the first.

But no. I don’t see the 2 guesses set up, nor where being systematic matches biology.

I may have a double post here - delete if necessary.

glipsnort · September 1, 2019, 12:42am

What I meant is that he is correct that the probability of success is higher if the problem is decomposable. This was in the context of generating successive 60-bit antibodies – the probability of finding 10 such antibodies by random search is much higher than the probability of finding one 600-bit protein. (That’s because this is equivalent to whichever case you offered, in which selection fixes successful draws in one partition. My statement about the probabilities is also equivalent to your statement about the waiting times being shorter for the decomposed case, since waiting time and probability of success with a fixed number of trials are inversely related.) My point is that even if you look at probability of success via random search rather than FI, selection operating in the immune system is capable of producing results that @gpuccio says can’t occur; that is, finding a fairly small series of antibodies by chance is as improbable as finding a single 500-bit protein by chance.

As far as FI is concerned, you’re right. It doesn’t matter whether the problem can be partitioned or not: the FI is the same. This can be seen clearly from @gpuccio’s own definition of FI, the negative log of (target space / search space). In the case of the safes, the target space and search spaces are the same whether you’re looking at one 100-bit safe or one hundred 1-bit safes. So the claim that FI for the 100 safes is lower is simply wrong. I really don’t see the point of introducing FI to begin with.
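The definition quoted above can be evaluated directly for the two safe setups. This is a small sketch; the function name is an illustrative choice, and the small safes are treated as independent so their FI values add.

```python
import math

def functional_information(target_size, search_size):
    """FI in bits: the negative log2 of (target space / search space)."""
    return -math.log2(target_size / search_size)

# One big safe: a single working key out of 2^100.
fi_big_safe = functional_information(1, 2**100)

# One hundred 1-bit safes: 1 bit each, summed over independent safes.
fi_small_safes = sum(functional_information(1, 2) for _ in range(100))

# The same small safes viewed jointly: one configuration out of 2^100.
fi_small_joint = functional_information(1, 2**100)

if __name__ == "__main__":
    print(fi_big_safe, fi_small_safes, fi_small_joint)
```

All three values are 100 bits, so any difference between the setups has to come from the search process, not from the FI.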

Dan_Eastwood · September 1, 2019, 12:43am

Adding: Gpuccio was clearly NOT making this assumption of dependence, where the 100 small safes each hold a piece of the same function as the one large safe.

Gpuccio seems to think the function cannot be unlocked at all until the large safe is unlocked - maybe not quite that extreme, but I think he would say some minimum number of safes need to be opened before any function ($$$) is gained. I do not completely disagree.

We could play the game of Open N-safes to get K-function Dollars ($). That makes it more complicated, but the thief can still use his same tricks. It will take the thief a bit longer to make initial progress, and progress may be sporadic, but it is still much faster than guessing the 100-bit combination at random. It changes the setup of the problem, but not the form of the answer.

For example, every 5th small safe contains $5, with the others having $0. This is the same problem with the thief needing to guess 5 bits instead of one.

swamidass · September 1, 2019, 1:03am

If every guess is applied to every safe (like a game of bingo), he just needs to guess 1 then 0.

If he is randomized, we expect him to open all safes with on average three guesses, no matter how many safes there are.
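A quick way to see both claims is to simulate the bingo-style setup. This is a sketch under assumptions: every safe has a 1-bit key, each guess is applied to all closed safes at once, and both key values occur somewhere among the safes. Function names are made up for illustration.

```python
import random

def broadcast_rounds(keys, guess_stream):
    """Rounds until every 1-bit safe is open when each guess is tried on all safes."""
    closed = set(range(len(keys)))
    rounds = 0
    for guess in guess_stream:
        if not closed:
            break
        rounds += 1
        closed = {i for i in closed if keys[i] != guess}
    return rounds

def random_bits(rng):
    """Endless stream of fair coin flips."""
    while True:
        yield rng.randrange(2)

def mean_random_rounds(n_safes, runs=5000, seed=0):
    """Average rounds for a randomized thief facing n_safes random 1-bit keys."""
    rng = random.Random(seed)
    keys = [rng.randrange(2) for _ in range(n_safes)]
    return sum(broadcast_rounds(keys, random_bits(rng)) for _ in range(runs)) / runs

if __name__ == "__main__":
    print(broadcast_rounds([0, 1, 1, 0, 1], [1, 0]))  # systematic: 1 then 0
    print(mean_random_rounds(20), mean_random_rounds(200))
```

The randomized average sits near 3 because the first guess always opens one group of safes, and the other value then takes 2 more fair coin flips on average. The number of safes never enters the calculation.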

The reason this fits biology: if each safe is some functional unit, there usually is not a strong genome position dependence. The function could arise from any part of the Genome. Functions do not have to appear on the Genome in some predetermined order.

Of course, it takes more than 1 bit to open a safe. But the parallelism and position independence completely change the game.

It is as if there are:

a large (unobservably large) number of safes, each one a potential function that could increase fitness.
Safe codes are larger, many bits.
There is a whole genome full of thieves checking combinations.
If any thief at any time generates the code for any function, the safe opens and there is an increase in FI.

The parallelism, and the position independence of the thieves with respect to safes, allows us to open many large-code safes in a relatively short amount of time, but certainly not all of them.

The high FI of proteins (large keys of the safe) helps us immensely here, explaining why we are not seeing several new functions pop up every generation.

swamidass · September 1, 2019, 1:08am

So @Dan_Eastwood, for this last variant, if you know the number of safes, thieves, and bits per code, can you compute how long before 1 safe is opened?

How long before 10 are opened?

How long before x are opened if x is much less than the total number of safes and the key size is non trivially long?

How does it scale with the number of thieves (genome size)?

Dan_Eastwood · September 3, 2019, 10:47pm

Clearly I’m not thinking big enough!

If I understand you correctly, parts of the function could arise on different parts of the genome? Also at different times, or the same time, would seem to follow. I think we might even see multiple functions from some units, but I’ll leave that alone.

I didn’t know there was going to be homework!

how long before 1 safe is opened?

For one specific safe this looks like a Geometric distribution, clearly longer codes will be less likely (unless there is repetition? ignoring that).
Wild guess: $\text{probability} \propto 1 - \frac{\text{bits}}{\text{thieves}}$

No … that can’t be right …

For multiple safes, much faster, as I describe in Multiplying Probabilities.

How long before 10 are opened?

Given probability p of finding some function, this is a Negative Binomial probability distribution.

How long before x are opened if x is much less than the total number of safes and the key size is non trivially long?

Also negative binomial, or approximately so. With each discovered function the probability of the next will be slightly less, assuming a finite number of functions.

How does it scale with the number of thieves (genome size)?

Depends on which “IT” you mean. But bigger genomes means many more opportunities to discover function, similar to my Multiplying Probabilities situation.
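One way to make the homework concrete is a rough sketch under assumptions that the thread does not fix: every safe has a distinct key of the same length, every thief draws one uniformly random key per round, and any thief can open any safe. The function names and the example numbers are purely illustrative.

```python
import math

def p_round(n_safes, n_thieves, key_bits):
    """Chance that at least one thief hits at least one closed safe in a round.

    Assumes distinct keys and n_safes < 2**key_bits.
    """
    q = n_safes / 2**key_bits  # one thief, one draw, any safe
    # 1 - (1 - q)^n_thieves, written to stay accurate for tiny q.
    return -math.expm1(n_thieves * math.log1p(-q))

def expected_wait_first(n_safes, n_thieves, key_bits):
    """Expected rounds until the first safe opens (a geometric wait)."""
    return 1 / p_round(n_safes, n_thieves, key_bits)

def approx_wait_for_x(x, n_safes, n_thieves, key_bits):
    """Approximate rounds until x safes are open.

    Ignores rounds that open several safes at once, so it is only
    reasonable when hits are rare and x is much less than n_safes.
    """
    return sum(expected_wait_first(n_safes - i, n_thieves, key_bits)
               for i in range(x))

if __name__ == "__main__":
    print(expected_wait_first(1000, 10**6, 40))
    print(approx_wait_for_x(10, 1000, 10**6, 40))
    print(expected_wait_first(1000, 2 * 10**6, 40))
```

In this toy model, while hits are rare the wait for the first safe scales roughly as 2^bits / (safes * thieves), the wait for x safes is close to x times that when x is much less than the number of safes, and doubling the thieves roughly halves the wait.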

I think I’m failing this homework, but I haven’t Googled.
