Holloway: Fallacy of the Phylogenetic Signal: Nucleotide Level

I wondered if this blog post by Eric Holloway (@EricMH) would be of interest. Dr Holloway is a member here though he hasn’t commented since February, 2019. In his post

http://theskepticalzone.com/wp/fallacy-of-the-phylogenetic-signal-nucleotide-level/

Dr Holloway claims to have falsified a claim about common descent on the Talk Origins website saying:

Talk Origin’s claim that the nested hierarchy of species is well attested by the data is highly questionable if not outright false, and should be retracted as evidence for evolution until such time as a much more rigorous analysis with DAG eliminating controls is established.

As TSZ is not a particularly busy site these days, I thought I would bring Dr Holloway’s potentially earth-shattering revelation to a wider audience. Also, I hope to hear from anyone with expertise in phylogenetics as to whether we need to start rewriting the text books.

Very odd. It’s not completely clear just what he did, what fake sequences he actually analyzed, or how he aligned or analyzed them. He doesn’t say. But my first point to make is that consistency index is not a measure of phylogenetic signal. I can’t say much more until I see how he actually produced his data matrices. His little figure is no help, though oddly enough his model appears to be a form of phylogeny that has some hierarchical structure to it.

1 Like

Put this up on TSZ just now:

Corneel,

Based on his brief description, he didn’t actually do a phylogenetic analysis. He just generated a tree (probably many trees, though he doesn’t say) and looked at its parsimony score. There is no suggestion that this tree is the most parsimonious, and no indication that he did any sort of test of support, which would require an actual analysis. CI, RI, etc. are not indices of hierarchical structure.

I can’t figure out, based on the description, what his taxa were or how he aligned the data set. It may be that his scheme does produce data with hierarchical structure. But that could be the result of the inheritance and branching in the model, even though the inheritance is of an odd sort, and the branching involves some reticulation.

1 Like

That figure is exceptionally confusing. It looks like he’s generating characters de novo, passing them on to descendants, and then combines them with characters from other lineages while also introducing some degree of shuffling(it sorta kinda looks like what you get from sexual recombination). It would be false to say there is no genealogy going on in this process.

As DNA_Jock points out, there’s plenty of common descent going on in the scenario depicted in the figure.

What, in that figure, are the terminal taxa? How would he align the sequences, which it would seem are composed of short fragments that are either identical among taxa or unshared? How would you even get a CI from alignments of these sequences, which would seem to consist entirely of a combination of identical sites and gaps? Puzzling.

That’s reassuring. I thought it was my lack of reading comprehension causing me not being able to follow Eric’s argument.

I really have no idea.

Summary? The best I can gather thus far is that he is able to manipulate the data for common descent or against common descent.

The other members in this discussion, most of whom have professional expertise in the relevant fields, are having difficulty seeing how he actually did that. Why don’t you help them out?

I don’t want to get in the way of professionals who can’t figure it out.

I see. You can answer their questions, but you just won’t.

I’m sure everyone here believes that…

5 Likes

I already like Eric Holloway’s conclusions and I don’t want to work hard, so I am good. Thank you.

…he wrote with, apparently, not the slightest hint of irony.

3 Likes

I’m a little confused by this as I’ve gained the impression that, as a YEC, you are absolutely convinced that Biblical truth trumps empirical experience. Which is why I wonder why Eric’s attempt at an empirical test that purports to undermine common descent matters.

2 Likes

I can clear up the confusion. Neither his analysis nor yours qualifies as empirical evidence. I have told you many times your computer simulations and sequencing and what not are not empirically based. All I care about is that he is able to manipulate the data against your paradigm. That makes my case that neither is empirically based.

Who is this directed at?

Did he do that? How? If it’s not too much hard work for you to answer, of course…

More details can be found in a discussion over at the Biologos forum: Nested Clades, The Consistency Index, and Affirming the Consequent - Scientific Evidence - The BioLogos Forum

3 Likes

I seem to have reached an impasse, as the data set he provided on TSZ is not a valid NEXUS format. For one thing, it has different numbers of characters for the various taxa.

This is how a phylogenetic tree comes out after obtaining an alignment from one of these DAGs(containing 30 “taxa”): tree

I just obtained a new DAG from the link he gives to his data generator tool.

I first sent the “taxa”(which comes in fasta format) from the DAG to a multiple sequence alignment tool (MAFFT) using default settings.
Btw the alignment consists mostly of gaps as the sequences from different taxa have wildly different lengths.

For example this is one of his taxa:

>3
GTTGAATTACCACGGGTTATCGCAT

Here’s another:

>4
GCCTTTACCAGACTTTACTTGCTCGAAATTCTATTGGTCGTATTACTTAGCCATCACCTTAAGGTACGAGCTGGTCCGAGTAAAGATTGTTGTTGTTACTGATTCTTCTACTAACATGCTCCTTCTGTCTACTTGGGGAACAACACAGACCCGCCGTACATTAAGTAGTAAGACTTGGGTGTCTGGAAACCTAGCTACCAACGAAGCTGGGTTTGTCGAAAATCGCACAGCTACCAGACGGGGGCTCGAGCGCATCTGAGGGCATAAGGGTAGATTCTTCCAGTTGGGCATGCATGACACGTTGCTCCAAGGAGTGCGGCCTAGTTCCTGCTCAAGAACGGCTCCTCAAAGGACAGCATTTGTTCGGATTTGGCGATAAACGACAATCTCATAAGCTCTCCTAAAC

Then sent the alignment to a ML program again using default settings, but with bootstrap 100 turned on.

Had to force it into dendrogram mode to see the bootstrap values:


Most of them are pretty crap.

Regardless, none of this makes much sense because I’m still not sure how Holloway generates his data. It appears like his terminal taxa basically consists of anything output by his DAG generator, be they ancestors or descendants. I’m not sure.

I suspect that, to the extend there is any tree-like structure in the data, it’s because his DAG actually happened to implement a tree-like process, so if and when Holloway claims to measure any noteworthy CI, it’s because his DAG really has coincidentally implemented something like common descent.

2 Likes