New paper introducing a new database for reconstructed ancestral proteins

Rumraket · June 29, 2020, 9:01pm

I’ve been deeply fascinated with the ability of scientists to reconstruct past evolutionary histories using phylogenetic methods, and the possibility of reconstructing ancestral protein sequences to understand molecular evolutionary transitions from the deep past is no exception.

The database can be found here: http://revenant.inf.pucp.edu.pe/

The paper is:
Matias Sebastian Carletti, Alexander Miguel Monzon, Emilio Garcia-Rios, Guillermo Benitez, Layla Hirsh, Maria Silvina Fornasari, Gustavo Parisi, Revenant: a database of resurrected proteins, Database , Volume 2020, 2020, baaa031, Revenant: a database of resurrected proteins | Database | Oxford Academic

Abstract

Revenant is a database of resurrected proteins coming from extinct organisms. Currently, it contains a manually curated collection of 84 resurrected proteins derived from bibliographic data. Each protein is extensively annotated, including structural, biochemical and biophysical information. Revenant contains a browse capability designed as a timeline from where the different proteins can be accessed. The oldest Revenant entries are between 4200 and 3500 million years ago, while the younger entries are between 8.8 and 6.3 million years ago. These proteins have been resurrected using computational tools called ancestral sequence reconstruction techniques combined with wet-laboratory synthesis and expression. Resurrected proteins are commonly used, with a noticeable increase during the past years, to explore and test different evolutionary hypotheses such as protein stability, to explore the origin of new functions, to get biochemical insights into past metabolisms and to explore specificity and promiscuous behaviour of ancient proteins.

It is rather mindblowing to me that it is possible to reconstruct, with high probability, the amino acid sequence of a protein that has not existed on Earth for three quarters the total age of the planet.

Simplified overview of the process of ancestor reconstruction in this:

Figure 1

Open in new tab Download slide

Schematic representation of the different steps to obtain resurrected proteins. The first step involves sequence similarity searches of a given protein to obtain a set of homologous sequences, involving the ancestral nodes to be studied. For example, one could be interested in studying biochemical properties of the studied protein in the last common ancestor for all vertebrates. Using these sequences, it is possible to estimate a phylogenetic tree to define the ancestral node to be reconstructed. In the second step, ancestral sequence reconstruction techniques are applied to estimate most probable sequences in the studied node. The third step involves the ancestral sequence synthesis. This sequence is then inserted into a vector, cloned, expressed and purified (fourth step). The fifth and final step involves a series of biochemical and biophysical characterization.

Rumraket · June 29, 2020, 10:24pm

The timeline view on the browse function of the database is pretty cool atm, but I suspect that will have to change to a different format as the list of resurrected proteins is sure to grow quickly.

Topic		Replies	Views
Seven amino acid types suffice to reconstruct the core fold of RNA polymerase Conversation Science	4	711	August 5, 2021
The origin of genetic code: Study finds textbook version needs revision Conversation Science	2	60	February 17, 2025
Simulating 500 million years of evolution with a language model Conversation Science , Artificial-Intelligence	9	178	February 2, 2025
Order of amino acid recruitment into the genetic code resolved by LUCA's protein domains Conversation Science , Article	1	62	December 19, 2024
Gauger and Mercer: Bifunctional Proteins and Protein Sequence Space Office Hours Design	188	7403	November 15, 2018

New paper introducing a new database for reconstructed ancestral proteins

Abstract

Related topics