For those who share my enthusiasm for protein evolution and diversity, and for the protein universe, two recent papers to read, both open access.
Evolving concepts of the protein universe is a Perspective piece that provides a very useful historical account (which I didn’t know) of how views of protein structure/function evolved (heh).
These observations led Anfinsen to postulate the “thermodynamic hypothesis”, which states that the protein’s native conformation is comprised of the totality of interatomic interactions which is determined by the amino acid sequence in a given environment.
However, classic work by Kauzmann (1959) explicitly refers to an alternative view of the protein folding problem: “According to all that we know now about protein structure, we have good reason to believe that disorder might be introduced into a protein in small increments.”
Then:
The idea that excited states in the funnel landscape can have functional relevance became a major research topic, and in further major developments started in the 1990s, it became clear that proteins need not always be highly structured to be functional as presciently suspected by Kauzmann back in 1959. Indeed, it is now well recognized that a large fraction of the proteomes of organisms across all three domains of life is comprised of intrinsically disordered proteins/peptides (IDPs) and many more proteins with ordered domains contain intrinsically disordered regions (IDRs) that, by definition, lack rigid 3D structure yet are functional. Furthermore, it is now clear that some ordered proteins can switch folds and gain new function and that regions in certain folded proteins (and even entire proteins) can “unfold”, in a transition from order to disorder, in response to physical or chemical stimuli.
These concepts are known to those who read the literature, but of course many people don’t, and I suspect that obsolete views of protein structure/function are common, and not just at the decrepit creationist/ID potlucks.
The whole paper is a fun read and approachable for non-specialists.
This second one (EvoWeaver: large-scale prediction of gene functional associations from coevolutionary signals) is a lot more technical but worth a look. One thing to be enthused about is the fact that they used (co)evolutionary knowledge to identify protein relationships that are not already known–a great example of how different domains of evolutionary biology feed back/forward onto each other. The other (to me) interesting facet is the ability to find functions (and modules) in poorly-explored (and often newly-discovered) regions of the protein universe. Here’s how they put it in their nice abstract:
The known universe of uncharacterized proteins is expanding far faster than our ability to annotate their functions through laboratory study. Computational annotation approaches rely on similarity to previously studied proteins, thereby ignoring unstudied proteins. Coevolutionary approaches hold promise for injecting new information into our knowledge of the protein universe by linking proteins through ‘guilt-by-association’. However, existing coevolutionary algorithms have insufficient accuracy and scalability to connect the entire universe of proteins. We present EvoWeaver, a method that weaves together 12 signals of coevolution to quantify the degree of shared evolution between genes. EvoWeaver accurately identifies proteins involved in protein complexes or separate steps of a biochemical pathway. We show the merits of EvoWeaver by partly reconstructing known biochemical pathways without any prior knowledge other than that available from genomic sequences. Applying EvoWeaver to 1545 gene groups from 8564 genomes reveals missing connections in popular databases and potentially undiscovered links between proteins.