Chimpanzee Contigs and the Human Genome: 84% Similar?

As long as the differences are accurately described I don’t have a problem with using a straight base to base comparison. The chimp genome paper has the figures we need for comparisons to Tomkins’ work.

67 million bases of unique DNA in a 3 billion base genome is an additional ~2% difference stacked on top of the 1-2% difference due to substitutions. This puts the total at around 96-97% for a base to base comparison.

As mentioned by others, he used an ungapped analysis which means that a 300 bp comparison that differs by a single indel in the middle of the sequence would be counted as ~75% similar even though it differs by a single base.

At least this time around Tomkins appears to be including gaps, but time will tell if they were correctly accounted for.

2 Likes