Abstract

In this work, we investigate whether the construction of signaling pathways during evolution follows a deterministic law through a study of the eventual link between age of appearance in the tree of life and position in the signaling pathway of genes involved in these pathways. We use the 47 human signaling pathways described in the Kyoto Encyclopedia of Genes and Genomes and investigate the orthologs of these genes in 315 animal species plus a yeast taxon, representing 15 large clades. Many genes appear on two key branches: those between the last common ancestor of Opisthokonta and Metazoa and between Deuterostomia and Chordata. We look for a link between the age of appearance of an upstream A gene and that of its downstream B partner. We observe that for all the interactions of two partners, only 20.6% of the corresponding genes arose simultaneously in the tree of life, 40.7% being called “backward” (i.e. B appearing before A) and 38.7% “forward” (A appearing before B). For 16 of the 47 pathways, there is a positive correlation between the age rank difference between interacting partner genes and the position of the corresponding proteins in the pathway: the more upstream a protein is involved in the pathway, the greater the rank difference is (the correlation, positive or negative, is not significant for 30 pathways). For the sole insulin signaling pathway, this correlation is negative. Moreover, by permutation test, we find that 14 of the 47 observed pathway contained larger modules (subset respecting a homogeneous appearance pattern) than expected by chance alone. Finally, for 20 of the 47 pathways, the construction scenario appears to be random, as these pathways do not validate any of our statistical tests (permutation tests on interaction direction and module sizes as well as correlation test on pathway position and age rank). Given that only 14.9% of the tests are significant and that significant effects are different among pathways, we conclude that there is no deterministic rule in the establishment of the pathways herein studied or that the patterns have been obscured by subsequent transformations.

Significance

This work shows that the majority of genes coding for intracellular signaling proteins appeared early and are associated with the origin of Metazoa and Chordata. It also shows that for 20 of the 47 signaling pathways, there is no significant correlation between the age rank difference between interacting partner genes and the position of the corresponding protein in the signaling pathway. Surprisingly, the age of appearance of the genes that code proteins in these pathways appears to have occurred randomly, rather than in an order reflecting position in the proteins in the pathway.

Introduction

The co-evolution of genes encoding interacting molecules has interested many scientists (Fraser et al. 2004; Rand et al. 2004; Andreani and Guerois 2014; Lynch and Hagner 2015) because of the intriguing question of the modes of mutation and selection that act on two molecules simultaneously. In particular, co-evolution of the binding motif has been well investigated (Lewis et al. 2010), focusing on the fitness (Williams et al. 2001; Bloom et al. 2004), conservation of interaction (Wuchty et al. 2003; Lovell and Robertson 2010; Kachroo et al. 2015), or evolution of the residues at the interface of the molecules (Mintseris and Weng 2005; Jack et al. 2016; Echave and Wilke 2017). These studies on the co-evolution of binding partners often require the integration of different disciplines (chemistry, evolutionary and molecular biology, etc.). Paradoxically, establishment of the interaction in a phylogenetic context, which involves fewer fields, is less studied. Little is known, for example, about the age of appearance and evolution of the different partners prior to their first interaction. Does the emergence of one partner favor the emergence of the second partner, and if so, in which order?

In the case of interacting molecules, the appearance of genes coding for molecules included in a complex is intricate (Kauffman 1993). For two molecules that will eventually interact, the appearance of one may depend on the appearance and conservation of the other. This may be the case, for example, when the presence of the first molecule is not advantageous if its partner has not yet appeared.

The existence of interacting proteins without partners (“orphan protein”) has been described frequently (Howard et al. 2001), even though it is sometimes difficult to assess whether an interacting protein is a true orphan or whether its ligand is simply unknown (Benoit et al. 2006). The relative order of appearance of genes encoding protein partners is thus an open question. Furthermore, several types of interactions can be observed in living organisms, with different numbers of interacting partners (Koshland 1958; Albelda and Buck 1990; Maslov and Sneppen 2002; Sullivan and Holyoak 2008), varying affinities (Kent et al. 1980), or different durations for the interactions (Nooren and Thornton 2003) through evolution, making the problem even more complex.

Concerning the pairs of ligands and their receptor(s), Thornton (2001) has shown that the first steroid receptor, present in lamprey and believed to be present in the last common ancestor of vertebrates, was an estrogen receptor and that several duplications led to other steroid receptors specialized in other functions with other ligands. However, more recent investigations suggested that the ancestral ligand for the ancestral steroid receptor was a molecule with a structure distinct from modern estrogen (an aromatized steroid with a side chain) called paraestrol A (Markov et al. 2017). In a previous work, we found that 41% of the membrane receptors and their respective first ligands appeared on the same branch of the evolutionary tree, which is 2.5-fold more often than expected by chance, thus suggesting an evolutionary dynamic of interdependence and conservation between these partners (Grandchamp and Monget 2018). In contrast, 21% of the receptors appeared after their ligand (i.e. 3-fold less often than expected by chance) and 38% appeared before their first ligand (i.e. as much as expected by chance). These results suggest that selective pressure is exerted on ligands and receptors once they appear, which would eliminate molecules whose partner does not appear quickly.

Using both descriptive and hypothesis testing approaches, we investigate the age rank of appearance of genes encoding proteins involved in signaling pathways downstream of membrane receptors in the animal tree of life (Fig. 1). We also study the relationship between the position of the protein in the signaling pathway relative to its partners, and their relative age rank of appearance in the tree of life. In other words, does the order of appearance of genes in the tree of life reflect that of their coded protein in the signaling pathway (upstream or downstream of the pathway), with a direct, reverse or random order (Fig. 2)? We also test whether the size of modules, defined as subsets respecting a homogeneous appearance pattern, departs from randomness.

Simplified animal tree of life and sampled clades. The rectangles represent the branches on which proteins appeared, with estimated times (ages of the nodes to the right of the rectangles) and can include one to several lower-ranking taxa included in this study. The circles represent the terminal taxa (A–Y) included in this study; these are sorted from the most distantly related to the most closely related to humans. The nodes available on Ensembl Metazoa (or Genomicus Metazoa) appear in italics, and the nodes available on Ensembl Vertebrata (or Genomicus Vertebrata) appear in bold.
Fig. 1.

Simplified animal tree of life and sampled clades. The rectangles represent the branches on which proteins appeared, with estimated times (ages of the nodes to the right of the rectangles) and can include one to several lower-ranking taxa included in this study. The circles represent the terminal taxa (A–Y) included in this study; these are sorted from the most distantly related to the most closely related to humans. The nodes available on Ensembl Metazoa (or Genomicus Metazoa) appear in italics, and the nodes available on Ensembl Vertebrata (or Genomicus Vertebrata) appear in bold.

Three hypotheses about the possible scenarios (A–C) linking the age rank of appearance of a gene to the pathway position of the corresponding protein. The colors represent the age of genes (dated by their appearances on branches below internal nodes 1–15). We arbitrarily chose 10 proteins/positions per pathway for the illustration. In scenario A (“backward”), the pattern of protein activation in the pathway represents the reverse of the pattern of appearance of genes in the tree. Scenario B (“forward”) is characterized by the opposite relationship (the pattern between protein position in the pathway matches the appearance of the genes coding these proteins). In scenario C (“random”), there is no link between the position of the protein in the pathway and the pattern of appearance of the corresponding gene in evolution.
Fig. 2.

Three hypotheses about the possible scenarios (A–C) linking the age rank of appearance of a gene to the pathway position of the corresponding protein. The colors represent the age of genes (dated by their appearances on branches below internal nodes 1–15). We arbitrarily chose 10 proteins/positions per pathway for the illustration. In scenario A (“backward”), the pattern of protein activation in the pathway represents the reverse of the pattern of appearance of genes in the tree. Scenario B (“forward”) is characterized by the opposite relationship (the pattern between protein position in the pathway matches the appearance of the genes coding these proteins). In scenario C (“random”), there is no link between the position of the protein in the pathway and the pattern of appearance of the corresponding gene in evolution.

Three mutually exclusive hypotheses can be formulated about the possible relationships between gene position in a pathway and its relative age of origin (which we call below “scenarios”), based on principles of molecular biology, evolution, and paleontology. The first is that the genes expressed downstream in the sequence appeared first because this is where the interactions with the genome occur, in the nucleus (Fig. 2A). The second hypothesis, which is diametrically opposed, is that the genes expressed upstream appeared first because the first life forms most likely had an outer cell membrane, but they almost certainly lacked a cell nucleus. Indeed, recent paleontological research suggests that crown eukaryotes (hence, potentially, the nucleus) may have appeared later than usually thought (Porter, 2020), and this may add support for this hypothesis because a greater proportion of the evolution of life may have occurred prior to the appearance of the nucleus (Figure 2B). Also, the outer cell membrane is where interactions with the environment occur. The third hypothesis, to an extent complementary with the first two, is that some genes are recruited in a pathway (an example of exaptation) randomly, and their age of appearance bears no logical relationship with the age of other genes in the cascade (Figure 2C). With time, this phenomenon, a kind of evolutionary tinkering (Jacob, 1977), could erase the order originally created by successive gene appearance following a backward or forward rule. The signal loss generated by this process is vaguely reminiscent of the well-known saturation in phylogenetic signal in gene sequences that makes fast-evolving genes unsuitable to infer ancient divergence events, which was evidenced long ago (Hillis and Huelsenbeck, 1992).

Results

Age of Appearance of Genes Encoding Proteins of Signaling Pathways

Our results, based on 2,298 unique genes involved in 47 human intracellular signaling pathways available in KEGG database V104.0 (see Materials and Methods), reveal interesting patterns on the distribution of ages of appearance for the genes encoding proteins involved in animal signaling pathways. Two parts of the tree of life are overrepresented in gene origins (Fig. 3): at the base of the tree, on the branch leading to Metazoa (before 765 mya), and at the base of Chordata, which includes Cephalochordata, Tunicata, and Vertebrata, around 635 to 558 mya (Kumar et al. 2022). Overall, in cumulative data, 29% of genes present in humans from our dataset had appeared before the origin of Metazoa, 45% before diversification of Deuterostomia, 72% before Chordata, and 92% before the origin of Vertebrata.

Cumulative distribution of genes encoding proteins involved in the 47 signaling pathways. The colored bars represent the cumulative percentage of genes involved in a signaling pathway progressively appearing during evolution. The light gray bars represent the raw data.
Fig. 3.

Cumulative distribution of genes encoding proteins involved in the 47 signaling pathways. The colored bars represent the cumulative percentage of genes involved in a signaling pathway progressively appearing during evolution. The light gray bars represent the raw data.

The temporal relationship of gene appearances among interacting partners in a pathway also reveals interesting patterns. Of the 2,666 interactions studied, 20.6% of interactions are “simultaneous” (i.e. both partners in the interaction appeared on the same branch), ranging from 9% for the FoxO and chemokine pathway to 35% for the VEGF pathway. Therefore, 79.4% are asynchronous, of which 40.7% are “backward”, with partner B (downstream in the pathway) appearing first in the tree of life (ranging from 18% for toll-like receptor pathways to 79% for the ovarian steroidogenesis pathway) and 38.7% are “forward,” with partner A (upstream in the pathway) appearing first (ranging from 11% for ovarian steroidogenesis to 65% for IL-17; Fig. 4). Some pathways have a greater proportion of forward relationships, such as the toll-like receptor, IL-17, and T-cell receptor pathways; it is the opposite for other pathways, such as the PPAR, cAMP, and ovarian steroidogenesis pathways (Fig. 4).

Forward, backward, or simultaneous interaction for each signaling pathway. Each horizontal column represents a signaling pathway (described on the left). On each column, for an A × B interaction, the red color corresponds to a backward interaction (A was born after B), the orange color corresponds to a simultaneous interaction (A was born at the same age as B), and the green color corresponds to a forward interaction (A was born before B).
Fig. 4.

Forward, backward, or simultaneous interaction for each signaling pathway. Each horizontal column represents a signaling pathway (described on the left). On each column, for an A × B interaction, the red color corresponds to a backward interaction (A was born after B), the orange color corresponds to a simultaneous interaction (A was born at the same age as B), and the green color corresponds to a forward interaction (A was born before B).

The age rank difference or delta (i.e. the age rank of appearance of the first interactant and that of the second interactant) is highly variable among the 47 pathways (supplementary Data S1, Supplementary Material online). For eight pathways (at the top of supplementary Data S1, Supplementary Material online), the delta median is > 0 (between 1 and 4; more backward than forward); for nine pathways, the median is < 0 (between −2.5 and −0.5; more forward than backward); and for 30 pathways, the median is 0. Moreover, some pathways have specific profiles. More precisely, some pathways exhibit a particularly eccentric median and biased distribution compared with 0, such as the prolactin pathway (min = −13, Q1 = −7, median = −2.5, Q3 = 2, and max = 8) and the ovarian steroidogenesis pathway (min = −2, Q1 = 0, median = 4, Q3 = 7, and max = 12), where Q1 and Q3 are the first and third quantiles of a dataset (supplementary Data S1, Supplementary Material online, columns Q–U in supplementary Data S4, Supplementary Material online).

For each pathway, the delta distribution in relation to the age rank of appearance of the first interactant displays surprising patterns (supplementary Data S2 and S3, Supplementary Material online). Within the Hippo pathway, for example, several proteins appearing in either Opisthokonta or Metazoa (in blue) have “waited” a long time for their partner (delta = −13), whereas proteins appearing with Vertebrata (in orange) appeared after their partner. For this Hippo pathway, more than half of partners (that appeared on a different branch) present a delta of −6 to 13 (21.1% of delta = 0; supplementary Data S2, Supplementary Material online). For the ovarian steroidogenesis and cAMP pathways, most of the deltas are > 0 (median = 4 and 1, respectively) and the first partner of the interaction appeared after their direct downstream partner, the interaction being A (position n in the pathway) → B (position n+ 1) (supplementary Data S2, Supplementary Material online). All the pathways are represented in supplementary Data S2, Supplementary Material online and supplementary Data S3, Supplementary Material online. These examples suggest that these pathways settled with genes that have been replaced since.

Relationship Between the Delta of Age Rank of Appearance of the Gene Encoding a Partner Involved in Each Pathway and the Upstream or Downstream Position in the Pathway

For 16 of the 47 pathways studied here (from the AMPK to mTOR pathway; Table 1), there is a positive correlation (P ranges from 2.63 × 10−11 to 0.04) between the delta of a gene pair and the position of the corresponding protein in the pathway, which means that the more upstream a protein is involved in the pathway, the older is the upstream gene compared with downstream gene of the pair (hypothesis B of Fig. 2, Table 1, and Fig. 5). These include the NOD-like pathway, among others. For a single pathway (the insulin signaling pathway), there is a negative correlation (r = −0.18, P = 0.03), which means that the more upstream a protein is involved in the pathway, the younger is the activator compared with the downstream gene of the pair (hypothesis A of Fig. 2 and Table 1). For 30 pathways, such as the Hippo and Notch pathways, we do not find any significant correlation between the age of appearance and the position in the pathway (Table 1).

Relative age (rank) of appearance of genes encoding proteins involved in each pathway according to the pathway position of the protein. Each row corresponds to a pathway. The upstream proteins of the pathway (close to the cell membrane) are on the left, and the downstream proteins of the pathway (close to the nucleus) are on the right. Depending on the signaling pathway, there are 3–16 positions on the abscissa. For a given pathway position, each rectangle represents the distribution of age rank of appearance for each of the proteins occupying the pathway position. For example, for the GnRH pathway at position 1, all the proteins appeared at the base of Osteichthyes. The pathways are sorted by correlation between pathway position and delta age rank (Table 1).
Fig. 5.

Relative age (rank) of appearance of genes encoding proteins involved in each pathway according to the pathway position of the protein. Each row corresponds to a pathway. The upstream proteins of the pathway (close to the cell membrane) are on the left, and the downstream proteins of the pathway (close to the nucleus) are on the right. Depending on the signaling pathway, there are 3–16 positions on the abscissa. For a given pathway position, each rectangle represents the distribution of age rank of appearance for each of the proteins occupying the pathway position. For example, for the GnRH pathway at position 1, all the proteins appeared at the base of Osteichthyes. The pathways are sorted by correlation between pathway position and delta age rank (Table 1).

Table 1

Correlation between rank age differences of appearance and pathway position

pathcategoriesnb interactsnb genesnb subpathmax subpathr pvalue
AMPKSignal transduction109120184100.4199.49E−06*
ErbBSignal transduction11490309100.3812.88E−05*
JAK-STATSignal transduction38118312,958120.3332.63E−11*
AdipocytokineEndocrine system72685480.3045.54E−03*
cAMPSignal transduction2032061966110.2551.47E−04*
WntSignal transduction129126402110.2464.91E−03*
HIF-1Signal transduction72775470.2454.78E−02*
OxytocinEndocrine system79905190.2432.59E−02*
AGE-RAGEEndocrine and metabolic disease22014635860.2022.11E−03*
NOD-like receptorImmune system21020842090.2003.71E−03*
IL-17Immune system17616870990.1583.63E−02*
MAPKSignal transduction4103542083110.1541.82E−03*
T cell receptorImmune system21616037690.1532.46E−02*
Toll-like receptorImmune system200174371100.1513.33E−02*
NeurotrophinNervous system231193952160.1373.81E−02*
mTORSignal transduction293282375140.1362.12E−02*
InsulinEndocrine system13313025511−0.1842.95E−02*
RelaxinEndocrine system115131140130.1865.02E−02
ProlactinEndocrine system1231316314−0.1656.63E−02
FoxOSignal transduction74787090.2166.64E−02
Ovarian steroidogenesisEndocrine system1929156−0.3232.27E−01
cGMP-PKGSignal transduction607920112−0.1382.46E−01
HedgehogSignal transduction5164415−0.1612.65E−01
NotchSignal transduction35346440.1743.17E−01
CalciumSignal transduction4059778−0.1563.40E−01
Phospholipase DSignal transduction5564235110.1193.49E−01
ApelinSignal transduction798626110.1043.88E−01
PPAREndocrine system58506120.1104.30E−01
NF-kappa BSignal transduction1221241106−0.0634.91E−01
GnRHEndocrine system565716160.0885.19E−01
ChemokineImmune system918418311−0.0585.76E−01
B cell receptorImmune system1091048813−0.0545.90E−01
PI3K–AktSignal transduction26426766813−0.0335.97E−01
RasSignal transduction1151122329−0.0456.43E−01
Thyroid hormoneEndocrine system1131211027−0.0446.53E−01
RIG-I-like receptorImmune system988837480.0397.03E−01
TNFSignal transduction64702780.0487.04E−01
Fc epsilon RIImmune system49494210−0.0377.81E−01
Rap1Signal transduction89901085−0.0258.12E−01
GlucagonEndocrine system65634090.0298.27E−01
HippoSignal transduction901067360.0228.39E−01
VEGFSignal transduction404119100.0338.42E−01
p53Cell growth and death72712547−0.0228.52E−01
C-type lectin receptorImmune system1591741078−0.0148.59E−01
TGF-betaSignal transduction8593617−0.0119.21E−01
EstrogenEndocrine system58692916−0.0109.47E−01
SphingolipidSignal transduction56625160.0029.86E-01
pathcategoriesnb interactsnb genesnb subpathmax subpathr pvalue
AMPKSignal transduction109120184100.4199.49E−06*
ErbBSignal transduction11490309100.3812.88E−05*
JAK-STATSignal transduction38118312,958120.3332.63E−11*
AdipocytokineEndocrine system72685480.3045.54E−03*
cAMPSignal transduction2032061966110.2551.47E−04*
WntSignal transduction129126402110.2464.91E−03*
HIF-1Signal transduction72775470.2454.78E−02*
OxytocinEndocrine system79905190.2432.59E−02*
AGE-RAGEEndocrine and metabolic disease22014635860.2022.11E−03*
NOD-like receptorImmune system21020842090.2003.71E−03*
IL-17Immune system17616870990.1583.63E−02*
MAPKSignal transduction4103542083110.1541.82E−03*
T cell receptorImmune system21616037690.1532.46E−02*
Toll-like receptorImmune system200174371100.1513.33E−02*
NeurotrophinNervous system231193952160.1373.81E−02*
mTORSignal transduction293282375140.1362.12E−02*
InsulinEndocrine system13313025511−0.1842.95E−02*
RelaxinEndocrine system115131140130.1865.02E−02
ProlactinEndocrine system1231316314−0.1656.63E−02
FoxOSignal transduction74787090.2166.64E−02
Ovarian steroidogenesisEndocrine system1929156−0.3232.27E−01
cGMP-PKGSignal transduction607920112−0.1382.46E−01
HedgehogSignal transduction5164415−0.1612.65E−01
NotchSignal transduction35346440.1743.17E−01
CalciumSignal transduction4059778−0.1563.40E−01
Phospholipase DSignal transduction5564235110.1193.49E−01
ApelinSignal transduction798626110.1043.88E−01
PPAREndocrine system58506120.1104.30E−01
NF-kappa BSignal transduction1221241106−0.0634.91E−01
GnRHEndocrine system565716160.0885.19E−01
ChemokineImmune system918418311−0.0585.76E−01
B cell receptorImmune system1091048813−0.0545.90E−01
PI3K–AktSignal transduction26426766813−0.0335.97E−01
RasSignal transduction1151122329−0.0456.43E−01
Thyroid hormoneEndocrine system1131211027−0.0446.53E−01
RIG-I-like receptorImmune system988837480.0397.03E−01
TNFSignal transduction64702780.0487.04E−01
Fc epsilon RIImmune system49494210−0.0377.81E−01
Rap1Signal transduction89901085−0.0258.12E−01
GlucagonEndocrine system65634090.0298.27E−01
HippoSignal transduction901067360.0228.39E−01
VEGFSignal transduction404119100.0338.42E−01
p53Cell growth and death72712547−0.0228.52E−01
C-type lectin receptorImmune system1591741078−0.0148.59E−01
TGF-betaSignal transduction8593617−0.0119.21E−01
EstrogenEndocrine system58692916−0.0109.47E−01
SphingolipidSignal transduction56625160.0029.86E-01

The correlation index (r) quantifies the strength of the relationship: strong between [−1, −0.5] and [0.5, 1], low between [−0.5, 0] and [0, 0.5], and no correlation at r = 0. Significant P-values (*P < 0.05) are presented in bold italics.

Table 1

Correlation between rank age differences of appearance and pathway position

pathcategoriesnb interactsnb genesnb subpathmax subpathr pvalue
AMPKSignal transduction109120184100.4199.49E−06*
ErbBSignal transduction11490309100.3812.88E−05*
JAK-STATSignal transduction38118312,958120.3332.63E−11*
AdipocytokineEndocrine system72685480.3045.54E−03*
cAMPSignal transduction2032061966110.2551.47E−04*
WntSignal transduction129126402110.2464.91E−03*
HIF-1Signal transduction72775470.2454.78E−02*
OxytocinEndocrine system79905190.2432.59E−02*
AGE-RAGEEndocrine and metabolic disease22014635860.2022.11E−03*
NOD-like receptorImmune system21020842090.2003.71E−03*
IL-17Immune system17616870990.1583.63E−02*
MAPKSignal transduction4103542083110.1541.82E−03*
T cell receptorImmune system21616037690.1532.46E−02*
Toll-like receptorImmune system200174371100.1513.33E−02*
NeurotrophinNervous system231193952160.1373.81E−02*
mTORSignal transduction293282375140.1362.12E−02*
InsulinEndocrine system13313025511−0.1842.95E−02*
RelaxinEndocrine system115131140130.1865.02E−02
ProlactinEndocrine system1231316314−0.1656.63E−02
FoxOSignal transduction74787090.2166.64E−02
Ovarian steroidogenesisEndocrine system1929156−0.3232.27E−01
cGMP-PKGSignal transduction607920112−0.1382.46E−01
HedgehogSignal transduction5164415−0.1612.65E−01
NotchSignal transduction35346440.1743.17E−01
CalciumSignal transduction4059778−0.1563.40E−01
Phospholipase DSignal transduction5564235110.1193.49E−01
ApelinSignal transduction798626110.1043.88E−01
PPAREndocrine system58506120.1104.30E−01
NF-kappa BSignal transduction1221241106−0.0634.91E−01
GnRHEndocrine system565716160.0885.19E−01
ChemokineImmune system918418311−0.0585.76E−01
B cell receptorImmune system1091048813−0.0545.90E−01
PI3K–AktSignal transduction26426766813−0.0335.97E−01
RasSignal transduction1151122329−0.0456.43E−01
Thyroid hormoneEndocrine system1131211027−0.0446.53E−01
RIG-I-like receptorImmune system988837480.0397.03E−01
TNFSignal transduction64702780.0487.04E−01
Fc epsilon RIImmune system49494210−0.0377.81E−01
Rap1Signal transduction89901085−0.0258.12E−01
GlucagonEndocrine system65634090.0298.27E−01
HippoSignal transduction901067360.0228.39E−01
VEGFSignal transduction404119100.0338.42E−01
p53Cell growth and death72712547−0.0228.52E−01
C-type lectin receptorImmune system1591741078−0.0148.59E−01
TGF-betaSignal transduction8593617−0.0119.21E−01
EstrogenEndocrine system58692916−0.0109.47E−01
SphingolipidSignal transduction56625160.0029.86E-01
pathcategoriesnb interactsnb genesnb subpathmax subpathr pvalue
AMPKSignal transduction109120184100.4199.49E−06*
ErbBSignal transduction11490309100.3812.88E−05*
JAK-STATSignal transduction38118312,958120.3332.63E−11*
AdipocytokineEndocrine system72685480.3045.54E−03*
cAMPSignal transduction2032061966110.2551.47E−04*
WntSignal transduction129126402110.2464.91E−03*
HIF-1Signal transduction72775470.2454.78E−02*
OxytocinEndocrine system79905190.2432.59E−02*
AGE-RAGEEndocrine and metabolic disease22014635860.2022.11E−03*
NOD-like receptorImmune system21020842090.2003.71E−03*
IL-17Immune system17616870990.1583.63E−02*
MAPKSignal transduction4103542083110.1541.82E−03*
T cell receptorImmune system21616037690.1532.46E−02*
Toll-like receptorImmune system200174371100.1513.33E−02*
NeurotrophinNervous system231193952160.1373.81E−02*
mTORSignal transduction293282375140.1362.12E−02*
InsulinEndocrine system13313025511−0.1842.95E−02*
RelaxinEndocrine system115131140130.1865.02E−02
ProlactinEndocrine system1231316314−0.1656.63E−02
FoxOSignal transduction74787090.2166.64E−02
Ovarian steroidogenesisEndocrine system1929156−0.3232.27E−01
cGMP-PKGSignal transduction607920112−0.1382.46E−01
HedgehogSignal transduction5164415−0.1612.65E−01
NotchSignal transduction35346440.1743.17E−01
CalciumSignal transduction4059778−0.1563.40E−01
Phospholipase DSignal transduction5564235110.1193.49E−01
ApelinSignal transduction798626110.1043.88E−01
PPAREndocrine system58506120.1104.30E−01
NF-kappa BSignal transduction1221241106−0.0634.91E−01
GnRHEndocrine system565716160.0885.19E−01
ChemokineImmune system918418311−0.0585.76E−01
B cell receptorImmune system1091048813−0.0545.90E−01
PI3K–AktSignal transduction26426766813−0.0335.97E−01
RasSignal transduction1151122329−0.0456.43E−01
Thyroid hormoneEndocrine system1131211027−0.0446.53E−01
RIG-I-like receptorImmune system988837480.0397.03E−01
TNFSignal transduction64702780.0487.04E−01
Fc epsilon RIImmune system49494210−0.0377.81E−01
Rap1Signal transduction89901085−0.0258.12E−01
GlucagonEndocrine system65634090.0298.27E−01
HippoSignal transduction901067360.0228.39E−01
VEGFSignal transduction404119100.0338.42E−01
p53Cell growth and death72712547−0.0228.52E−01
C-type lectin receptorImmune system1591741078−0.0148.59E−01
TGF-betaSignal transduction8593617−0.0119.21E−01
EstrogenEndocrine system58692916−0.0109.47E−01
SphingolipidSignal transduction56625160.0029.86E-01

The correlation index (r) quantifies the strength of the relationship: strong between [−1, −0.5] and [0.5, 1], low between [−0.5, 0] and [0, 0.5], and no correlation at r = 0. Significant P-values (*P < 0.05) are presented in bold italics.

Number of Appearance Pattern Occurrences

Among the 47 pathways, we observe that six pathways have significantly more backward pairs than expected by chance alone, while only three are more forward and nine have more simultaneous relationships (Table 2).

Table 2

Summary of the statistical tests about appearance patterns

pathforwardbackwardsimultsizeeffct_fpvalue_fsizeeffct_bpvalue_bsizeeffct_spvalue_s
Ovarian steroidogenesis2152−5.6816.00E−03*7.4261.00E−03*−1.7451.27E−01
JAK-STAT218847973.3526.00E−03*−59.6072.50E−02*−13.7454.02E−01
IL-17114382437.5121.90E−02*−36.8423.60E−02*−0.6708.11E−01
Toll-like receptor124373934.9722.80E−02*−39.4412.70E−02*4.4695.34E−01
VEGF101614−6.8004.70E−02*−0.6368.10E−017.4366.00E−03*
RIG-I-like receptor4821295.2924.20E−01−19.7463.00E−03*14.4543.00E−03*
cAMP4813322−43.1537.40E−0256.8904.00E−03*−11.9643.39E−01
ErbB336318−11.1588.80E−0218.9425.00E−03*−7.7841.42E−01
NF-kappa B426515−9.6882.01E−0113.7814.30E−02*−3.5584.00E−01
NOD-like receptor109633812.0623.16E−01−22.1914.70E−02*10.1291.02E−01
Wnt374151−11.7518.90E−02−7.7553.03E−0119.5062.00E−03*
TNF222121−6.4668.70E−02−2.2935.20E−018.7591.10E−02*
MAPK151129130−13.5376.53E−01−35.0852.35E−0148.6221.20E−02*
B cell receptor383338−4.1376.03E−01−8.3953.24E−0120.8341.60E−02*
Sphingolipid162119−6.0238.20E−02−1.2856.84E−017.3082.00E−02*
C-type lectin receptor536838−13.2847.60E−021.8247.49E−0111.4602.10E−02*
p533616205.4445.33E−01−14.2612.12E−018.8173.60E−02*
Hedgehog181716−2.5694.00E−01−3.7382.67E−016.3073.60E−02*
AMPK5530249.0784.08E−01−18.3551.14E−019.2778.30E−02
Ras5038272.4206.90E−01−10.0481.33E−017.6288.70E−02
TGF-beta3531195.8641.73E−011.8186.23E−01−7.6821.07E−01
FoxO30377−1.3708.90E−015.7215.37E−01−4.3511.84E−01
Phospholipase D2124100.1609.22E−015.1441.93E−01−4.3582.03E−01
Oxytocin413175.9394.21E−01−0.9688.87E−01−4.9712.09E−01
Estrogen232411−1.8955.61E−01−0.8677.35E−012.7622.19E−01
HIF-1174015−13.2771.52E−019.3804.01E−013.8972.35E−01
cGMP-PKG222612−5.1593.83E−012.5187.05E−012.6413.17E−01
Apelin3827146.3932.18E−01−2.5624.99E−01−3.8313.22E−01
Relaxin465514−6.4453.12E−0110.3151.34E−01−3.8703.62E−01
PI3K–Akt1201143011.0325.88E−014.0548.56E−01−10.6844.19E−01
Thyroid hormone464522−1.2598.82E−01−1.8568.13E−013.1154.29E−01
Insulin486322−4.9455.56E−0110.1042.95E−01−5.1594.41E−01
Calcium191471.6515.38E−01−2.4855.43E−011.3235.57E−01
Fc epsilon RI1722100.8177.12E−011.7205.37E−010.1695.93E−01
GnRH262372.1545.71E−01−0.9697.28E−01−1.1856.59E−01
AGE-RAGE10187327.6307.80E−01−3.2939.32E−01−4.3377.47E−01
PPAR83812−15.8341.46E−0114.4541.29E−011.3807.56E−01
Chemokine393517−2.6444.68E−011.4186.20E−011.2267.60E−01
Adipocytokine3026160.7378.02E−01−1.5986.81E−010.8617.70E−01
Prolactin65382014.0132.73E−01−12.9163.83E−01−1.0978.42E−01
Notch161093.1603.33E−01−3.0823.19E−01−0.0788.63E−01
T cell receptor126583232.8279.80E−02−30.7131.29E−01−2.1148.75E−01
Rap1264617−10.1208.90E−029.9789.20E−020.1428.88E−01
Hippo353619−0.3129.40E−010.9207.81E−010.1198.88E−01
Glucagon183710−9.5341.24E−0110.0621.65E−01−0.5289.02E−01
Neurotrophin889845−5.8206.09E−015.2386.66E−010.5829.05E−01
mTOR9013964−25.6934.57E−0125.7424.71E−01−0.0499.52E−01
pathforwardbackwardsimultsizeeffct_fpvalue_fsizeeffct_bpvalue_bsizeeffct_spvalue_s
Ovarian steroidogenesis2152−5.6816.00E−03*7.4261.00E−03*−1.7451.27E−01
JAK-STAT218847973.3526.00E−03*−59.6072.50E−02*−13.7454.02E−01
IL-17114382437.5121.90E−02*−36.8423.60E−02*−0.6708.11E−01
Toll-like receptor124373934.9722.80E−02*−39.4412.70E−02*4.4695.34E−01
VEGF101614−6.8004.70E−02*−0.6368.10E−017.4366.00E−03*
RIG-I-like receptor4821295.2924.20E−01−19.7463.00E−03*14.4543.00E−03*
cAMP4813322−43.1537.40E−0256.8904.00E−03*−11.9643.39E−01
ErbB336318−11.1588.80E−0218.9425.00E−03*−7.7841.42E−01
NF-kappa B426515−9.6882.01E−0113.7814.30E−02*−3.5584.00E−01
NOD-like receptor109633812.0623.16E−01−22.1914.70E−02*10.1291.02E−01
Wnt374151−11.7518.90E−02−7.7553.03E−0119.5062.00E−03*
TNF222121−6.4668.70E−02−2.2935.20E−018.7591.10E−02*
MAPK151129130−13.5376.53E−01−35.0852.35E−0148.6221.20E−02*
B cell receptor383338−4.1376.03E−01−8.3953.24E−0120.8341.60E−02*
Sphingolipid162119−6.0238.20E−02−1.2856.84E−017.3082.00E−02*
C-type lectin receptor536838−13.2847.60E−021.8247.49E−0111.4602.10E−02*
p533616205.4445.33E−01−14.2612.12E−018.8173.60E−02*
Hedgehog181716−2.5694.00E−01−3.7382.67E−016.3073.60E−02*
AMPK5530249.0784.08E−01−18.3551.14E−019.2778.30E−02
Ras5038272.4206.90E−01−10.0481.33E−017.6288.70E−02
TGF-beta3531195.8641.73E−011.8186.23E−01−7.6821.07E−01
FoxO30377−1.3708.90E−015.7215.37E−01−4.3511.84E−01
Phospholipase D2124100.1609.22E−015.1441.93E−01−4.3582.03E−01
Oxytocin413175.9394.21E−01−0.9688.87E−01−4.9712.09E−01
Estrogen232411−1.8955.61E−01−0.8677.35E−012.7622.19E−01
HIF-1174015−13.2771.52E−019.3804.01E−013.8972.35E−01
cGMP-PKG222612−5.1593.83E−012.5187.05E−012.6413.17E−01
Apelin3827146.3932.18E−01−2.5624.99E−01−3.8313.22E−01
Relaxin465514−6.4453.12E−0110.3151.34E−01−3.8703.62E−01
PI3K–Akt1201143011.0325.88E−014.0548.56E−01−10.6844.19E−01
Thyroid hormone464522−1.2598.82E−01−1.8568.13E−013.1154.29E−01
Insulin486322−4.9455.56E−0110.1042.95E−01−5.1594.41E−01
Calcium191471.6515.38E−01−2.4855.43E−011.3235.57E−01
Fc epsilon RI1722100.8177.12E−011.7205.37E−010.1695.93E−01
GnRH262372.1545.71E−01−0.9697.28E−01−1.1856.59E−01
AGE-RAGE10187327.6307.80E−01−3.2939.32E−01−4.3377.47E−01
PPAR83812−15.8341.46E−0114.4541.29E−011.3807.56E−01
Chemokine393517−2.6444.68E−011.4186.20E−011.2267.60E−01
Adipocytokine3026160.7378.02E−01−1.5986.81E−010.8617.70E−01
Prolactin65382014.0132.73E−01−12.9163.83E−01−1.0978.42E−01
Notch161093.1603.33E−01−3.0823.19E−01−0.0788.63E−01
T cell receptor126583232.8279.80E−02−30.7131.29E−01−2.1148.75E−01
Rap1264617−10.1208.90E−029.9789.20E−020.1428.88E−01
Hippo353619−0.3129.40E−010.9207.81E−010.1198.88E−01
Glucagon183710−9.5341.24E−0110.0621.65E−01−0.5289.02E−01
Neurotrophin889845−5.8206.09E−015.2386.66E−010.5829.05E−01
mTOR9013964−25.6934.57E−0125.7424.71E−01−0.0499.52E−01

The columns forward/backward and simultaneous represent the number of interaction of this direction; Size effect_f/b/s: average number of interaction that meets the “forward”/“backward”/“simultaneous” condition; pvalue_f/b/s: P-value of “forward”/“backward”/“simultaneous” condition. Significant P-values (*P < 0.05) are presented in bold italics.

Table 2

Summary of the statistical tests about appearance patterns

pathforwardbackwardsimultsizeeffct_fpvalue_fsizeeffct_bpvalue_bsizeeffct_spvalue_s
Ovarian steroidogenesis2152−5.6816.00E−03*7.4261.00E−03*−1.7451.27E−01
JAK-STAT218847973.3526.00E−03*−59.6072.50E−02*−13.7454.02E−01
IL-17114382437.5121.90E−02*−36.8423.60E−02*−0.6708.11E−01
Toll-like receptor124373934.9722.80E−02*−39.4412.70E−02*4.4695.34E−01
VEGF101614−6.8004.70E−02*−0.6368.10E−017.4366.00E−03*
RIG-I-like receptor4821295.2924.20E−01−19.7463.00E−03*14.4543.00E−03*
cAMP4813322−43.1537.40E−0256.8904.00E−03*−11.9643.39E−01
ErbB336318−11.1588.80E−0218.9425.00E−03*−7.7841.42E−01
NF-kappa B426515−9.6882.01E−0113.7814.30E−02*−3.5584.00E−01
NOD-like receptor109633812.0623.16E−01−22.1914.70E−02*10.1291.02E−01
Wnt374151−11.7518.90E−02−7.7553.03E−0119.5062.00E−03*
TNF222121−6.4668.70E−02−2.2935.20E−018.7591.10E−02*
MAPK151129130−13.5376.53E−01−35.0852.35E−0148.6221.20E−02*
B cell receptor383338−4.1376.03E−01−8.3953.24E−0120.8341.60E−02*
Sphingolipid162119−6.0238.20E−02−1.2856.84E−017.3082.00E−02*
C-type lectin receptor536838−13.2847.60E−021.8247.49E−0111.4602.10E−02*
p533616205.4445.33E−01−14.2612.12E−018.8173.60E−02*
Hedgehog181716−2.5694.00E−01−3.7382.67E−016.3073.60E−02*
AMPK5530249.0784.08E−01−18.3551.14E−019.2778.30E−02
Ras5038272.4206.90E−01−10.0481.33E−017.6288.70E−02
TGF-beta3531195.8641.73E−011.8186.23E−01−7.6821.07E−01
FoxO30377−1.3708.90E−015.7215.37E−01−4.3511.84E−01
Phospholipase D2124100.1609.22E−015.1441.93E−01−4.3582.03E−01
Oxytocin413175.9394.21E−01−0.9688.87E−01−4.9712.09E−01
Estrogen232411−1.8955.61E−01−0.8677.35E−012.7622.19E−01
HIF-1174015−13.2771.52E−019.3804.01E−013.8972.35E−01
cGMP-PKG222612−5.1593.83E−012.5187.05E−012.6413.17E−01
Apelin3827146.3932.18E−01−2.5624.99E−01−3.8313.22E−01
Relaxin465514−6.4453.12E−0110.3151.34E−01−3.8703.62E−01
PI3K–Akt1201143011.0325.88E−014.0548.56E−01−10.6844.19E−01
Thyroid hormone464522−1.2598.82E−01−1.8568.13E−013.1154.29E−01
Insulin486322−4.9455.56E−0110.1042.95E−01−5.1594.41E−01
Calcium191471.6515.38E−01−2.4855.43E−011.3235.57E−01
Fc epsilon RI1722100.8177.12E−011.7205.37E−010.1695.93E−01
GnRH262372.1545.71E−01−0.9697.28E−01−1.1856.59E−01
AGE-RAGE10187327.6307.80E−01−3.2939.32E−01−4.3377.47E−01
PPAR83812−15.8341.46E−0114.4541.29E−011.3807.56E−01
Chemokine393517−2.6444.68E−011.4186.20E−011.2267.60E−01
Adipocytokine3026160.7378.02E−01−1.5986.81E−010.8617.70E−01
Prolactin65382014.0132.73E−01−12.9163.83E−01−1.0978.42E−01
Notch161093.1603.33E−01−3.0823.19E−01−0.0788.63E−01
T cell receptor126583232.8279.80E−02−30.7131.29E−01−2.1148.75E−01
Rap1264617−10.1208.90E−029.9789.20E−020.1428.88E−01
Hippo353619−0.3129.40E−010.9207.81E−010.1198.88E−01
Glucagon183710−9.5341.24E−0110.0621.65E−01−0.5289.02E−01
Neurotrophin889845−5.8206.09E−015.2386.66E−010.5829.05E−01
mTOR9013964−25.6934.57E−0125.7424.71E−01−0.0499.52E−01
pathforwardbackwardsimultsizeeffct_fpvalue_fsizeeffct_bpvalue_bsizeeffct_spvalue_s
Ovarian steroidogenesis2152−5.6816.00E−03*7.4261.00E−03*−1.7451.27E−01
JAK-STAT218847973.3526.00E−03*−59.6072.50E−02*−13.7454.02E−01
IL-17114382437.5121.90E−02*−36.8423.60E−02*−0.6708.11E−01
Toll-like receptor124373934.9722.80E−02*−39.4412.70E−02*4.4695.34E−01
VEGF101614−6.8004.70E−02*−0.6368.10E−017.4366.00E−03*
RIG-I-like receptor4821295.2924.20E−01−19.7463.00E−03*14.4543.00E−03*
cAMP4813322−43.1537.40E−0256.8904.00E−03*−11.9643.39E−01
ErbB336318−11.1588.80E−0218.9425.00E−03*−7.7841.42E−01
NF-kappa B426515−9.6882.01E−0113.7814.30E−02*−3.5584.00E−01
NOD-like receptor109633812.0623.16E−01−22.1914.70E−02*10.1291.02E−01
Wnt374151−11.7518.90E−02−7.7553.03E−0119.5062.00E−03*
TNF222121−6.4668.70E−02−2.2935.20E−018.7591.10E−02*
MAPK151129130−13.5376.53E−01−35.0852.35E−0148.6221.20E−02*
B cell receptor383338−4.1376.03E−01−8.3953.24E−0120.8341.60E−02*
Sphingolipid162119−6.0238.20E−02−1.2856.84E−017.3082.00E−02*
C-type lectin receptor536838−13.2847.60E−021.8247.49E−0111.4602.10E−02*
p533616205.4445.33E−01−14.2612.12E−018.8173.60E−02*
Hedgehog181716−2.5694.00E−01−3.7382.67E−016.3073.60E−02*
AMPK5530249.0784.08E−01−18.3551.14E−019.2778.30E−02
Ras5038272.4206.90E−01−10.0481.33E−017.6288.70E−02
TGF-beta3531195.8641.73E−011.8186.23E−01−7.6821.07E−01
FoxO30377−1.3708.90E−015.7215.37E−01−4.3511.84E−01
Phospholipase D2124100.1609.22E−015.1441.93E−01−4.3582.03E−01
Oxytocin413175.9394.21E−01−0.9688.87E−01−4.9712.09E−01
Estrogen232411−1.8955.61E−01−0.8677.35E−012.7622.19E−01
HIF-1174015−13.2771.52E−019.3804.01E−013.8972.35E−01
cGMP-PKG222612−5.1593.83E−012.5187.05E−012.6413.17E−01
Apelin3827146.3932.18E−01−2.5624.99E−01−3.8313.22E−01
Relaxin465514−6.4453.12E−0110.3151.34E−01−3.8703.62E−01
PI3K–Akt1201143011.0325.88E−014.0548.56E−01−10.6844.19E−01
Thyroid hormone464522−1.2598.82E−01−1.8568.13E−013.1154.29E−01
Insulin486322−4.9455.56E−0110.1042.95E−01−5.1594.41E−01
Calcium191471.6515.38E−01−2.4855.43E−011.3235.57E−01
Fc epsilon RI1722100.8177.12E−011.7205.37E−010.1695.93E−01
GnRH262372.1545.71E−01−0.9697.28E−01−1.1856.59E−01
AGE-RAGE10187327.6307.80E−01−3.2939.32E−01−4.3377.47E−01
PPAR83812−15.8341.46E−0114.4541.29E−011.3807.56E−01
Chemokine393517−2.6444.68E−011.4186.20E−011.2267.60E−01
Adipocytokine3026160.7378.02E−01−1.5986.81E−010.8617.70E−01
Prolactin65382014.0132.73E−01−12.9163.83E−01−1.0978.42E−01
Notch161093.1603.33E−01−3.0823.19E−01−0.0788.63E−01
T cell receptor126583232.8279.80E−02−30.7131.29E−01−2.1148.75E−01
Rap1264617−10.1208.90E−029.9789.20E−020.1428.88E−01
Hippo353619−0.3129.40E−010.9207.81E−010.1198.88E−01
Glucagon183710−9.5341.24E−0110.0621.65E−01−0.5289.02E−01
Neurotrophin889845−5.8206.09E−015.2386.66E−010.5829.05E−01
mTOR9013964−25.6934.57E−0125.7424.71E−01−0.0499.52E−01

The columns forward/backward and simultaneous represent the number of interaction of this direction; Size effect_f/b/s: average number of interaction that meets the “forward”/“backward”/“simultaneous” condition; pvalue_f/b/s: P-value of “forward”/“backward”/“simultaneous” condition. Significant P-values (*P < 0.05) are presented in bold italics.

We find that 18 of the observed pathways (ovarian steroidogenesis, JAK-STAT, IL-17, toll-like receptor, VEGF, RIG-I-like receptor, cAMP, ErbB, NF-kappa B, NOD-like receptor, Wnt, TNF, MAPK, B-cell receptor, Sphingolipid, C-type lectin receptor, p53, and Hedgehog) display a distribution of appearance pattern that has significantly more or less backward/simultaneous/forward pairs than expected by chance alone based on 1,000 of random permutations. For 29 other pathways, the statistical test suggests no deviation from a random gene appearance pattern (Table 2 and supplementary Data S4, Supplementary Material online). Given the large number of tests, it cannot be ruled out that the significant results are random. This further supports the third scenario (Fig. 2c).

Size of Modules

Fourteen of the observed pathways (adipocytokine, C-type lectin receptor, cAMP, ErbB, Hedgehog, IL-17, JAK-STAT, mTOR, ovarian steroidogenesis, RIG-I-like receptor, sphingolipid, T-cell receptor, toll-like receptor, and VEGF) display modules significantly larger than expected by chance alone (Table 3 and supplementary Data S4, Supplementary Material online). Thus, 33 other pathways do not have any deviation from distributions under randomness. In a similar way to the number of appearance patterns, the size of modules is mainly random in the majority of the pathways.

Table 3

Summary of module size statistical tests

 backward or simultbackwardsimultaneousforwardforward or simult
pathsizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvalue
IL-17−1.2031.10E−02*−1.1209.00E−03*0.1266.29E−010.4213.34E−01−0.0069.76E−01
Ovarian steroidogenesis0.7211.40E−02*0.9931.10E−02*−0.4681.63E−01−1.0995.00E−03*−1.1701.00E−03*
mTOR1.2673.50E−02*0.1625.20E−010.3681.70E−01−0.6269.60E−02−0.7282.37E−01
Adipocytokine3.0204.30E−02*0.7381.96E−010.0826.91E−01−0.4345.13E−01−1.3684.06E−01
cAMP5.5204.30E−02*6.3783.00E−03*−1.3071.16E−01−1.9201.65E−01−3.7571.51E−01
RIG-I-like receptor−0.2347.88E−01−1.0883.20E−02*1.1633.00E−03*0.3095.19E−011.1705.00E−02
Sphingolipid0.6109.20E−02−0.3142.84E−011.1351.00E−03*−0.3252.80E−010.1925.21E−01
JAK-STAT0.5571.70E−01−0.5632.41E−010.7585.00E−03*−0.5641.53E−01−0.5962.05E−01
Hedgehog0.2712.74E−01−0.2024.43E−010.6028.00E−03*−0.2513.02E−010.1824.89E−01
C-type lectin receptor0.2977.90E−02−0.0278.59E−010.3092.40E−02*−0.0965.12E−010.1085.10E−01
VEGF2.8389.10E−02−0.8995.58E−011.1694.40E−02*−1.3433.04E−010.1076.66E−01
Toll-like receptor−1.8945.70E−02−1.0228.90E−02−0.1565.73E−011.5531.20E−02*1.5606.80E−02
T-cell receptor−2.4593.57E−01−1.7271.66E−01−0.0919.57E−012.5524.60E−02*2.9741.07E−01
ErbB−0.3967.45E010.7022.33E−01−0.3765.26E−01−1.5165.10E−02−2.4442.70E−02*
TGF-beta0.6371.85E010.4971.01E01−0.4131.76E01−0.1107.61E01−0.9187.40E02
Apelin−1.0972.40E010.0058.10E01−0.4273.03E010.8425.70E021.2668.20E02
NF-kappa B0.3971.38E010.3421.41E01−0.1843.63E01−0.3411.69E01−0.4788.90E02
AMPK−0.1609.44E01−0.6092.47E010.5485.40E02−0.0319.48E010.9198.90E02
Oxytocin−0.0168.26E01−0.1599.13E01−0.3762.29E01−0.3965.18E01−1.6311.30E01
Chemokine0.3056.20E01−0.1958.61E01−0.5521.97E010.1686.33E01−1.9081.30E01
p53−0.3259.46E01−2.0893.46E010.9372.30E011.0933.33E013.0591.34E01
Relaxin0.0288.69E010.4889.30E02−0.3351.87E01−0.0119.81E01−0.8211.47E01
PPAR2.9341.99E013.4871.24E010.6994.04E01−3.4581.10E01−2.8251.50E01
Insulin−0.7586.02E010.0448.11E01−0.2046.42E01−0.3716.18E01−1.4432.69E01
GnRH−0.9093.19E01−0.1988.76E01−0.4432.20E01−0.3337.29E01−0.8703.32E01
NOD-like receptor−0.0059.81E01−0.4241.09E010.2552.11E01−0.1924.82E010.2923.54E01
AGE-RAGE2.2261.16E010.2606.58E01−0.1048.98E01−0.7345.68E01−1.7253.99E01
MAPK0.2497.35E010.5664.55E010.2495.25E01−1.5521.67E01−2.1284.16E01
Rap10.9811.48E010.9075.80E02−0.1017.04E01−0.4294.08E01−0.8294.17E01
B cell receptor0.6193.85E01−0.6603.17E010.6656.70E02−0.1378.92E010.5204.48E01
Neurotrophin0.4946.56E010.5334.47E010.3243.83E01−1.5271.20E01−1.6285.26E01
HIF-10.9574.44E010.0429.15E010.4813.02E01−1.4732.25E01−1.0275.49E01
Wnt4.3317.60E021.8538.10E020.8141.25E01−0.8176.28E010.6575.52E01
Fc epsilon RI−0.2966.70E01−0.1269.05E010.3322.19E01−0.3725.52E01−0.4495.55E01
Prolactin0.0968.39E01−0.6702.57E010.3044.08E010.1047.12E010.4155.67E01
Hippo0.1056.66E01−0.0578.05E010.0667.13E01−0.0826.81E010.0837.13E01
PI3K–Akt−0.4765.37E01−0.5132.68E010.2053.96E010.3403.00E010.0867.75E01
FoxO−1.0665.56E01−0.6056.66E01−0.5933.39E01−0.1759.82E01−0.6047.93E01
Phospholipase D−1.2272.58E01−0.5323.25E01−0.7131.82E010.4312.14E010.0038.26E01
Thyroid hormone−0.2424.84E01−0.1606.13E01−0.0747.76E010.1765.64E010.0608.44E01
Estrogen0.2984.10E010.1725.37E010.1167.80E−01−0.0029.65E−01−0.0049.06E−01
Notch−0.3396.71E−01−0.3496.80E−01−0.1916.90E−010.4933.43E−01−0.0579.39E−01
Ras−0.2669.62E−01−0.6145.44E−01−0.0078.69E−010.0667.96E−01−0.3459.42E−01
Glucagon0.7841.07E−010.5851.46E−01−0.1017.31E−01−0.5532.15E−01−0.1689.44E−01
TNF0.4612.30E−01−0.0937.60E−010.2873.47E−01−0.1646.12E−01−0.0789.59E−01
Calcium−0.5531.71E−01−0.1317.01E−01−0.2014.36E−010.1256.84E−01−0.0209.62E−01
cGMP-PKG−0.0529.86E−01−0.0369.32E−010.1586.05E−01−0.1819.01E−01−0.1059.98E−01
 backward or simultbackwardsimultaneousforwardforward or simult
pathsizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvalue
IL-17−1.2031.10E−02*−1.1209.00E−03*0.1266.29E−010.4213.34E−01−0.0069.76E−01
Ovarian steroidogenesis0.7211.40E−02*0.9931.10E−02*−0.4681.63E−01−1.0995.00E−03*−1.1701.00E−03*
mTOR1.2673.50E−02*0.1625.20E−010.3681.70E−01−0.6269.60E−02−0.7282.37E−01
Adipocytokine3.0204.30E−02*0.7381.96E−010.0826.91E−01−0.4345.13E−01−1.3684.06E−01
cAMP5.5204.30E−02*6.3783.00E−03*−1.3071.16E−01−1.9201.65E−01−3.7571.51E−01
RIG-I-like receptor−0.2347.88E−01−1.0883.20E−02*1.1633.00E−03*0.3095.19E−011.1705.00E−02
Sphingolipid0.6109.20E−02−0.3142.84E−011.1351.00E−03*−0.3252.80E−010.1925.21E−01
JAK-STAT0.5571.70E−01−0.5632.41E−010.7585.00E−03*−0.5641.53E−01−0.5962.05E−01
Hedgehog0.2712.74E−01−0.2024.43E−010.6028.00E−03*−0.2513.02E−010.1824.89E−01
C-type lectin receptor0.2977.90E−02−0.0278.59E−010.3092.40E−02*−0.0965.12E−010.1085.10E−01
VEGF2.8389.10E−02−0.8995.58E−011.1694.40E−02*−1.3433.04E−010.1076.66E−01
Toll-like receptor−1.8945.70E−02−1.0228.90E−02−0.1565.73E−011.5531.20E−02*1.5606.80E−02
T-cell receptor−2.4593.57E−01−1.7271.66E−01−0.0919.57E−012.5524.60E−02*2.9741.07E−01
ErbB−0.3967.45E010.7022.33E−01−0.3765.26E−01−1.5165.10E−02−2.4442.70E−02*
TGF-beta0.6371.85E010.4971.01E01−0.4131.76E01−0.1107.61E01−0.9187.40E02
Apelin−1.0972.40E010.0058.10E01−0.4273.03E010.8425.70E021.2668.20E02
NF-kappa B0.3971.38E010.3421.41E01−0.1843.63E01−0.3411.69E01−0.4788.90E02
AMPK−0.1609.44E01−0.6092.47E010.5485.40E02−0.0319.48E010.9198.90E02
Oxytocin−0.0168.26E01−0.1599.13E01−0.3762.29E01−0.3965.18E01−1.6311.30E01
Chemokine0.3056.20E01−0.1958.61E01−0.5521.97E010.1686.33E01−1.9081.30E01
p53−0.3259.46E01−2.0893.46E010.9372.30E011.0933.33E013.0591.34E01
Relaxin0.0288.69E010.4889.30E02−0.3351.87E01−0.0119.81E01−0.8211.47E01
PPAR2.9341.99E013.4871.24E010.6994.04E01−3.4581.10E01−2.8251.50E01
Insulin−0.7586.02E010.0448.11E01−0.2046.42E01−0.3716.18E01−1.4432.69E01
GnRH−0.9093.19E01−0.1988.76E01−0.4432.20E01−0.3337.29E01−0.8703.32E01
NOD-like receptor−0.0059.81E01−0.4241.09E010.2552.11E01−0.1924.82E010.2923.54E01
AGE-RAGE2.2261.16E010.2606.58E01−0.1048.98E01−0.7345.68E01−1.7253.99E01
MAPK0.2497.35E010.5664.55E010.2495.25E01−1.5521.67E01−2.1284.16E01
Rap10.9811.48E010.9075.80E02−0.1017.04E01−0.4294.08E01−0.8294.17E01
B cell receptor0.6193.85E01−0.6603.17E010.6656.70E02−0.1378.92E010.5204.48E01
Neurotrophin0.4946.56E010.5334.47E010.3243.83E01−1.5271.20E01−1.6285.26E01
HIF-10.9574.44E010.0429.15E010.4813.02E01−1.4732.25E01−1.0275.49E01
Wnt4.3317.60E021.8538.10E020.8141.25E01−0.8176.28E010.6575.52E01
Fc epsilon RI−0.2966.70E01−0.1269.05E010.3322.19E01−0.3725.52E01−0.4495.55E01
Prolactin0.0968.39E01−0.6702.57E010.3044.08E010.1047.12E010.4155.67E01
Hippo0.1056.66E01−0.0578.05E010.0667.13E01−0.0826.81E010.0837.13E01
PI3K–Akt−0.4765.37E01−0.5132.68E010.2053.96E010.3403.00E010.0867.75E01
FoxO−1.0665.56E01−0.6056.66E01−0.5933.39E01−0.1759.82E01−0.6047.93E01
Phospholipase D−1.2272.58E01−0.5323.25E01−0.7131.82E010.4312.14E010.0038.26E01
Thyroid hormone−0.2424.84E01−0.1606.13E01−0.0747.76E010.1765.64E010.0608.44E01
Estrogen0.2984.10E010.1725.37E010.1167.80E−01−0.0029.65E−01−0.0049.06E−01
Notch−0.3396.71E−01−0.3496.80E−01−0.1916.90E−010.4933.43E−01−0.0579.39E−01
Ras−0.2669.62E−01−0.6145.44E−01−0.0078.69E−010.0667.96E−01−0.3459.42E−01
Glucagon0.7841.07E−010.5851.46E−01−0.1017.31E−01−0.5532.15E−01−0.1689.44E−01
TNF0.4612.30E−01−0.0937.60E−010.2873.47E−01−0.1646.12E−01−0.0789.59E−01
Calcium−0.5531.71E−01−0.1317.01E−01−0.2014.36E−010.1256.84E−01−0.0209.62E−01
cGMP-PKG−0.0529.86E−01−0.0369.32E−010.1586.05E−01−0.1819.01E−01−0.1059.98E−01

Size effect: average size of the module that meets each condition, calculated by averaging the size of the module across 1,000 simulated pathways; pvalue: P-value of each condition. Significant P-values (*P < 0.05) are presented in bold italics.

Table 3

Summary of module size statistical tests

 backward or simultbackwardsimultaneousforwardforward or simult
pathsizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvalue
IL-17−1.2031.10E−02*−1.1209.00E−03*0.1266.29E−010.4213.34E−01−0.0069.76E−01
Ovarian steroidogenesis0.7211.40E−02*0.9931.10E−02*−0.4681.63E−01−1.0995.00E−03*−1.1701.00E−03*
mTOR1.2673.50E−02*0.1625.20E−010.3681.70E−01−0.6269.60E−02−0.7282.37E−01
Adipocytokine3.0204.30E−02*0.7381.96E−010.0826.91E−01−0.4345.13E−01−1.3684.06E−01
cAMP5.5204.30E−02*6.3783.00E−03*−1.3071.16E−01−1.9201.65E−01−3.7571.51E−01
RIG-I-like receptor−0.2347.88E−01−1.0883.20E−02*1.1633.00E−03*0.3095.19E−011.1705.00E−02
Sphingolipid0.6109.20E−02−0.3142.84E−011.1351.00E−03*−0.3252.80E−010.1925.21E−01
JAK-STAT0.5571.70E−01−0.5632.41E−010.7585.00E−03*−0.5641.53E−01−0.5962.05E−01
Hedgehog0.2712.74E−01−0.2024.43E−010.6028.00E−03*−0.2513.02E−010.1824.89E−01
C-type lectin receptor0.2977.90E−02−0.0278.59E−010.3092.40E−02*−0.0965.12E−010.1085.10E−01
VEGF2.8389.10E−02−0.8995.58E−011.1694.40E−02*−1.3433.04E−010.1076.66E−01
Toll-like receptor−1.8945.70E−02−1.0228.90E−02−0.1565.73E−011.5531.20E−02*1.5606.80E−02
T-cell receptor−2.4593.57E−01−1.7271.66E−01−0.0919.57E−012.5524.60E−02*2.9741.07E−01
ErbB−0.3967.45E010.7022.33E−01−0.3765.26E−01−1.5165.10E−02−2.4442.70E−02*
TGF-beta0.6371.85E010.4971.01E01−0.4131.76E01−0.1107.61E01−0.9187.40E02
Apelin−1.0972.40E010.0058.10E01−0.4273.03E010.8425.70E021.2668.20E02
NF-kappa B0.3971.38E010.3421.41E01−0.1843.63E01−0.3411.69E01−0.4788.90E02
AMPK−0.1609.44E01−0.6092.47E010.5485.40E02−0.0319.48E010.9198.90E02
Oxytocin−0.0168.26E01−0.1599.13E01−0.3762.29E01−0.3965.18E01−1.6311.30E01
Chemokine0.3056.20E01−0.1958.61E01−0.5521.97E010.1686.33E01−1.9081.30E01
p53−0.3259.46E01−2.0893.46E010.9372.30E011.0933.33E013.0591.34E01
Relaxin0.0288.69E010.4889.30E02−0.3351.87E01−0.0119.81E01−0.8211.47E01
PPAR2.9341.99E013.4871.24E010.6994.04E01−3.4581.10E01−2.8251.50E01
Insulin−0.7586.02E010.0448.11E01−0.2046.42E01−0.3716.18E01−1.4432.69E01
GnRH−0.9093.19E01−0.1988.76E01−0.4432.20E01−0.3337.29E01−0.8703.32E01
NOD-like receptor−0.0059.81E01−0.4241.09E010.2552.11E01−0.1924.82E010.2923.54E01
AGE-RAGE2.2261.16E010.2606.58E01−0.1048.98E01−0.7345.68E01−1.7253.99E01
MAPK0.2497.35E010.5664.55E010.2495.25E01−1.5521.67E01−2.1284.16E01
Rap10.9811.48E010.9075.80E02−0.1017.04E01−0.4294.08E01−0.8294.17E01
B cell receptor0.6193.85E01−0.6603.17E010.6656.70E02−0.1378.92E010.5204.48E01
Neurotrophin0.4946.56E010.5334.47E010.3243.83E01−1.5271.20E01−1.6285.26E01
HIF-10.9574.44E010.0429.15E010.4813.02E01−1.4732.25E01−1.0275.49E01
Wnt4.3317.60E021.8538.10E020.8141.25E01−0.8176.28E010.6575.52E01
Fc epsilon RI−0.2966.70E01−0.1269.05E010.3322.19E01−0.3725.52E01−0.4495.55E01
Prolactin0.0968.39E01−0.6702.57E010.3044.08E010.1047.12E010.4155.67E01
Hippo0.1056.66E01−0.0578.05E010.0667.13E01−0.0826.81E010.0837.13E01
PI3K–Akt−0.4765.37E01−0.5132.68E010.2053.96E010.3403.00E010.0867.75E01
FoxO−1.0665.56E01−0.6056.66E01−0.5933.39E01−0.1759.82E01−0.6047.93E01
Phospholipase D−1.2272.58E01−0.5323.25E01−0.7131.82E010.4312.14E010.0038.26E01
Thyroid hormone−0.2424.84E01−0.1606.13E01−0.0747.76E010.1765.64E010.0608.44E01
Estrogen0.2984.10E010.1725.37E010.1167.80E−01−0.0029.65E−01−0.0049.06E−01
Notch−0.3396.71E−01−0.3496.80E−01−0.1916.90E−010.4933.43E−01−0.0579.39E−01
Ras−0.2669.62E−01−0.6145.44E−01−0.0078.69E−010.0667.96E−01−0.3459.42E−01
Glucagon0.7841.07E−010.5851.46E−01−0.1017.31E−01−0.5532.15E−01−0.1689.44E−01
TNF0.4612.30E−01−0.0937.60E−010.2873.47E−01−0.1646.12E−01−0.0789.59E−01
Calcium−0.5531.71E−01−0.1317.01E−01−0.2014.36E−010.1256.84E−01−0.0209.62E−01
cGMP-PKG−0.0529.86E−01−0.0369.32E−010.1586.05E−01−0.1819.01E−01−0.1059.98E−01
 backward or simultbackwardsimultaneousforwardforward or simult
pathsizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvaluesizeeffctpvalue
IL-17−1.2031.10E−02*−1.1209.00E−03*0.1266.29E−010.4213.34E−01−0.0069.76E−01
Ovarian steroidogenesis0.7211.40E−02*0.9931.10E−02*−0.4681.63E−01−1.0995.00E−03*−1.1701.00E−03*
mTOR1.2673.50E−02*0.1625.20E−010.3681.70E−01−0.6269.60E−02−0.7282.37E−01
Adipocytokine3.0204.30E−02*0.7381.96E−010.0826.91E−01−0.4345.13E−01−1.3684.06E−01
cAMP5.5204.30E−02*6.3783.00E−03*−1.3071.16E−01−1.9201.65E−01−3.7571.51E−01
RIG-I-like receptor−0.2347.88E−01−1.0883.20E−02*1.1633.00E−03*0.3095.19E−011.1705.00E−02
Sphingolipid0.6109.20E−02−0.3142.84E−011.1351.00E−03*−0.3252.80E−010.1925.21E−01
JAK-STAT0.5571.70E−01−0.5632.41E−010.7585.00E−03*−0.5641.53E−01−0.5962.05E−01
Hedgehog0.2712.74E−01−0.2024.43E−010.6028.00E−03*−0.2513.02E−010.1824.89E−01
C-type lectin receptor0.2977.90E−02−0.0278.59E−010.3092.40E−02*−0.0965.12E−010.1085.10E−01
VEGF2.8389.10E−02−0.8995.58E−011.1694.40E−02*−1.3433.04E−010.1076.66E−01
Toll-like receptor−1.8945.70E−02−1.0228.90E−02−0.1565.73E−011.5531.20E−02*1.5606.80E−02
T-cell receptor−2.4593.57E−01−1.7271.66E−01−0.0919.57E−012.5524.60E−02*2.9741.07E−01
ErbB−0.3967.45E010.7022.33E−01−0.3765.26E−01−1.5165.10E−02−2.4442.70E−02*
TGF-beta0.6371.85E010.4971.01E01−0.4131.76E01−0.1107.61E01−0.9187.40E02
Apelin−1.0972.40E010.0058.10E01−0.4273.03E010.8425.70E021.2668.20E02
NF-kappa B0.3971.38E010.3421.41E01−0.1843.63E01−0.3411.69E01−0.4788.90E02
AMPK−0.1609.44E01−0.6092.47E010.5485.40E02−0.0319.48E010.9198.90E02
Oxytocin−0.0168.26E01−0.1599.13E01−0.3762.29E01−0.3965.18E01−1.6311.30E01
Chemokine0.3056.20E01−0.1958.61E01−0.5521.97E010.1686.33E01−1.9081.30E01
p53−0.3259.46E01−2.0893.46E010.9372.30E011.0933.33E013.0591.34E01
Relaxin0.0288.69E010.4889.30E02−0.3351.87E01−0.0119.81E01−0.8211.47E01
PPAR2.9341.99E013.4871.24E010.6994.04E01−3.4581.10E01−2.8251.50E01
Insulin−0.7586.02E010.0448.11E01−0.2046.42E01−0.3716.18E01−1.4432.69E01
GnRH−0.9093.19E01−0.1988.76E01−0.4432.20E01−0.3337.29E01−0.8703.32E01
NOD-like receptor−0.0059.81E01−0.4241.09E010.2552.11E01−0.1924.82E010.2923.54E01
AGE-RAGE2.2261.16E010.2606.58E01−0.1048.98E01−0.7345.68E01−1.7253.99E01
MAPK0.2497.35E010.5664.55E010.2495.25E01−1.5521.67E01−2.1284.16E01
Rap10.9811.48E010.9075.80E02−0.1017.04E01−0.4294.08E01−0.8294.17E01
B cell receptor0.6193.85E01−0.6603.17E010.6656.70E02−0.1378.92E010.5204.48E01
Neurotrophin0.4946.56E010.5334.47E010.3243.83E01−1.5271.20E01−1.6285.26E01
HIF-10.9574.44E010.0429.15E010.4813.02E01−1.4732.25E01−1.0275.49E01
Wnt4.3317.60E021.8538.10E020.8141.25E01−0.8176.28E010.6575.52E01
Fc epsilon RI−0.2966.70E01−0.1269.05E010.3322.19E01−0.3725.52E01−0.4495.55E01
Prolactin0.0968.39E01−0.6702.57E010.3044.08E010.1047.12E010.4155.67E01
Hippo0.1056.66E01−0.0578.05E010.0667.13E01−0.0826.81E010.0837.13E01
PI3K–Akt−0.4765.37E01−0.5132.68E010.2053.96E010.3403.00E010.0867.75E01
FoxO−1.0665.56E01−0.6056.66E01−0.5933.39E01−0.1759.82E01−0.6047.93E01
Phospholipase D−1.2272.58E01−0.5323.25E01−0.7131.82E010.4312.14E010.0038.26E01
Thyroid hormone−0.2424.84E01−0.1606.13E01−0.0747.76E010.1765.64E010.0608.44E01
Estrogen0.2984.10E010.1725.37E010.1167.80E−01−0.0029.65E−01−0.0049.06E−01
Notch−0.3396.71E−01−0.3496.80E−01−0.1916.90E−010.4933.43E−01−0.0579.39E−01
Ras−0.2669.62E−01−0.6145.44E−01−0.0078.69E−010.0667.96E−01−0.3459.42E−01
Glucagon0.7841.07E−010.5851.46E−01−0.1017.31E−01−0.5532.15E−01−0.1689.44E−01
TNF0.4612.30E−01−0.0937.60E−010.2873.47E−01−0.1646.12E−01−0.0789.59E−01
Calcium−0.5531.71E−01−0.1317.01E−01−0.2014.36E−010.1256.84E−01−0.0209.62E−01
cGMP-PKG−0.0529.86E−01−0.0369.32E−010.1586.05E−01−0.1819.01E−01−0.1059.98E−01

Size effect: average size of the module that meets each condition, calculated by averaging the size of the module across 1,000 simulated pathways; pvalue: P-value of each condition. Significant P-values (*P < 0.05) are presented in bold italics.

Summary of Hypotheses Testing

Among the 47 pathways, 30 have no correlation between the age rank differences (delta) with respect to the position in the pathway (Table 1), 29 do not have more backward/simultaneous/forward pairs than randomness (Table 2), and 33 do not have modules larger than obtained by chance alone (Table 3). Interestingly, 20 pathways constitute the intersection of these tests: they can be interpreted as randomly organized for all the variables considered in our study. These 20 pathways are: apelin, B-cell receptor, calcium, cGMP-PKG, chemokine, estrogen, Fc epsilon RI, FoxO, glucagon, GnRH, Hippo, NF-kappa B, Notch, p53, phospholipase D, PI3K–Akt, PPAR, prolactin, Rap1, Ras, relaxin, TGF-beta, thyroid hormone, and TNF. A total of 581 genes are involved only in the pathways for which a correlation and/or permutation statistical test yielded significant results, and 334 genes are exclusively associated with the pathways that provide support for one of these architectures. Finally, 14.9% of the tests are significant and their effects are different among pathways. This low ratio can rule out that scenarios A and B but is compatible with scenario C.

Discussion

Ages of Appearance and Their Patterns in Evolution

For most of the pathways with significant correlations between position of proteins in the pathway and the delta of appearance of their corresponding gene, the correlation is surprisingly positive: the more upstream a protein is involved in the pathway, the older is the upstream gene (Fig. 2b). The sole exception is the insulin pathway that has a negative correlation (Fig. 2a).

Finally, for 24 of the 47 pathways studied here, there is no significant correlation between the position of the proteins in the pathway and the delta of appearance of the corresponding gene in the tree of life, which reflects evolutionary tinkering (Fig. 2c); furthermore, the architecture of proteins and their interactions is not significantly different from 1,000 simulation taken at random (a simulation is a possible configuration randomly generated from a signaling pathway, where the interactions and their directions are maintained, but the ages of the involved proteins are randomly swapped). This result suggests that for these 24 pathways, the organization of proteins and interactions during evolution was not guided by a molecular determinism. One may formulate two hypotheses: that the first interactions may have arisen under determinism/pressure (e.g. 100% backward; Fig. 2a) and may subsequently have been modified by new interactions during evolution (a form of evolutionary tinkering that may improve fitness) that erased the signal in the initial pathway (Fig. 2c) and that several modules arose partly by chance and then further new interactions made the entire pathway necessary. This “chance and necessity” hypothesis was proposed by Monod (1970) in his famous book “Le hasard et la nécessité”. This evolutionary concept has been widely discussed and remains accepted by many (Merlin 2015; Davankov 2021; Xie et al. 2021; Alexander 2022).

Moreover, even if the correlation between the position of proteins in the pathway and the delta of appearance of their corresponding genes is not significant, we observe that almost all signaling pathways have target factors (e.g. factors entering the nucleus) that appeared very early during evolution (dark blue factors to the right in the colored KEGG diagrams; supplementary Data S3, Supplementary Material online). For various pathways, some factors are close to the targets and appeared very early (Opisthokonta or Metazoa), whereas those further upstream in the pathway appeared later (PPAR and cAMP). For other pathways (IL-17 and T-cell receptor), some factors close to the targets appeared later than the factors acting upstream of the pathway. This also supports that pathways or parts thereof were not formed/refined according to a unique deterministic law like scenarios A or B but rather conforms to scenario C (Fig. 2c) and suggests that some evolutionary tinkering took place.

In a previous work (Grandchamp and Monget 2018), we have shown that the genes encoding the pairs of ligand/membrane receptors mainly appeared at the root of Metazoa, Vertebrata, and Osteichthyes. In the present work, we show that the genes encoding proteins of signaling pathways mainly appeared between the last common ancestor of Opisthokonta and Metazoa and between Deuterostomia and Chordata. Moreover, as expected, among all the pathways, at least 119 proteins (from 1 to 23 by pathway) appeared in Opisthokonta, showing, as expected, that numerous genes encoding proteins of signaling pathways appeared long before the membrane receptors and their ligands. Our previous study on membrane receptors and their ligands was done on a tree with only 10 nodes, but we refine it by using 15 nodes in the present paper.

Some signaling pathways are relatively old (i.e. contain more members that appeared in Opisthokonta or Metazoa than in more recent clades), such as the MAPK pathway, and others are more recent, such as the pathways involved in immunity (Cooper and Alder 2006).

Some Interesting Cases

In the signaling pathways studied, numerous proteins interact with their partner(s) in humans, but their appearance was desynchronized in the tree of life (Fig. 2c). This raises the question of the functioning of these pathways without the full set of their members. For example, p53, which appeared in Metazoa, is inhibited by several factors whose genes appeared later, such as MDM2 and MDM4 (Olfactora). Without these inhibitors, mammalian embryos would die in utero, as shown in the mouse (Jones et al. 1995; Montes de Oca Luna et al. 1995). This suggests that another p53 inhibitor existed before Olfactora appeared. In addition, human p53 targets also emerged later in evolution, such as IGFBP3 (Vertebrata) and pro-apoptotic factor BAX (Olfactora). It is these and other factors that explain the tumor suppressor role of p53 in humans (Lehmann-Che et al. 2007) and in Drosophila (Zhou 2019). We can therefore hypothesize a progressive refinement of the mechanisms of action and inhibition of p53 during evolution; for example, a possible first inhibitor of p53 in an ancestor may have been replaced by another during evolution.

Another example is the interaction between PIN1 and IRF3 in the RIG-I-like receptor pathway, characterized by a delta of 8, the PIN1 gene having appeared at the root of Opisthokonta and IRF3 in stem-gnathostomes. PIN1 also activates p53, which is an apomorphy of Metazoa (Berger et al. 2005). Another example of desynchronization concerns the sphingolipid pathway, in which CTSD interacts only with BID, whereas BID interacts with CTSD and BAX in humans. The CTSD gene appeared in Metazoa, BAX in Olfactora, and BID in Mammalia. Moreover, although the yeast genome does not contain genes encoding BCL2 proteins, the heterologous expression of mammalian BAX in yeast induces a suppressible lethal phenotype that is associated with characteristics of metazoan apoptosis, strongly suggesting that its targets are present in yeast and thus hark back to Opisthokonta (Zha et al. 1996; Khoury and Greenwood 2008).

Limits and Future Improvements

This study could have been completed by an analysis based on known pathways in yeast to verify certain interactions. However, only one pathway from our list, MAPK, is described in KEGG for Saccharomyces cerevisiae. Unsurprisingly, this pathway is widely documented (Zou et al. 2008; Saito 2010; Chavel et al. 2014), including in plants (Meng and Zhang 2013). In the literature, several other pathways (not studied here) are documented in yeast, such as cAMP (Tamaki 2007; Portela and Rossi 2020), mTOR (Powers et al. 2004), Ras (Tisi et al. 2014), or sphingolipid (Montefusco et al. 2014). Thus, using the KEGG database, it is not possible to establish an exhaustive evolutionary bridge of signaling pathways between yeast and humans. Moreover, certain taxa lack specific pathways; for example, the yeast S. cerevisiae does not have the NF-kappa B pathway (Ho et al. 2017; Saleski et al. 2017), and yet, some elements of the pathway are present in S. cerevisiae, such as casein kinase 2 (CSNK2A1/2/3 and CSNK2B). In yeast, these caseins are known to be essential for mitophagy (Kanki et al. 2013).

Another problem concerns the limitations of our method of dating the appearance of a gene. In this, we are limited by the various versions of the Genomicus and Genomicus Metazoa databases. In particular, the species common to these two databases are Drosophila melanogaster and Caenorhabditis elegans. Indeed, if for a given gene the oldest ortholog found is not from these two species, or if this gene is lost in both species but present in taxa more distantly related to humans, our method does not allow us to adequately use the Genomicus Metazoa trees. Genomicus trees are modified trees of the Ensembl database, and therefore, the limit comes from its origin, Ensembl. Furthermore, depending on the versions of Ensembl, there may be mega-trees with all the paralogs of a gene in the same tree or in sub-trees with one paralog per sub-tree. This change from mega-trees into several sub-trees in Ensembl V94 and further (Emily 2018) makes the attribution of appearance times more complex, to an extent that depends on the size of the paralog family.

Not all of the 15 clades that we have sampled have been studied to the same extent. As shown in supplementary Data S5, Supplementary Material online, we can see that yeast are represented by a single species (S. cerevisiae) out of about 64,000 known species, whereas for Aves, we were able to collect data for 13 species, out of a little more than 11,000 currently recognized. However, our data show that appearance of the genes is concentrated on two nodes (Opisthokonta and Vertebrata), and the genes that appeared at the base of these taxa represent 75.7% of all the genes involved in the KEGG pathways.

It must also be considered that the signaling pathways noted on KEGG (or databases more broadly) are human representations and that simplifications have been made to facilitate reading and understanding. However, as given in supplementary Data S3, Supplementary Material online, some proteins, such as Shc, Grb2, SOS, Ras, Raf1, MEK, and ERK, are involved in multiple pathways. In fact, interactions within these proteins are involved (partially or entirely) in 22 pathways, including ErbB, estrogen, GnRH, insulin, prolactin, relaxin, B-cell receptor, chemokine, F epsilon RI, T-cell receptor, neurotrophin, cAMP, FoxO, JAK-STAT, MAPK, mTOR, phospholipase D, PI3K–Akt, Rap1, Ras, sphingolipid, and VEGF. The RTK/RAS/ERK component is known to be common to Drosophila, Nematoda, and humans (Ashton-Beaucage and Therrien 2010).

In a previous study, we described cases where the genes coding for membrane receptors appeared before their ligand (Grandchamp and Monget 2020). We had studied more precisely the 30 cases for which 3D structures were known in the Protein Data Bank, to formulate hypotheses on the plausible scenarios of evolution of the amino acids involved in the binding pocket of the receptor until the ligand appeared. In future, a similar study of the partners that appeared before the proteins with which they interact would be worthwhile. Such a study could be performed for Ras/SOS complexes, for example, on Grb2/SOS, Akt/Gsk3, or Akt/mTor, for which the 3D structure is available.

Materials and Methods

Implementation of the Database

Signaling Pathways by KEGG

We follow the method described in our previous paper (Picolo et al. 2023). We use the KEGG pathways because they are annotated in humans and therefore its genes are also annotated in humans, and we use parsimony and the trees to locate the branch on which each gene originated. We retrieved a list of 2,298 genes encoding proteins involved in the 47 human intracellular signaling pathways available on the KEGG V104.0 database (https://www.genome.jp/kegg) (Bader et al. 2006) using the keywords “signaling pathway” and “human” (columns C and D in supplementary Data S4, Supplementary Material online). KEGG is one of the most referenced and used databases listing signaling pathways.

Each signaling pathway was retrieved from the KEGG pathway analysis tool, and the information was compiled into an Extensible Markup Language (XML) file. These files were read using the R XML (Lang and Kalibera 2023) library, and pathways were represented graphically using the R library igraph (Csárdi and Nepusz 2006; Csárdi et al. 2023).

The gene products of KEGG pathways are inscribed in rectangular blocks that we call “labels”; for the XML file, this is the label “name,” and to keep it simple for our readers, these labels can cover several paralogs (supplementary Data S3, Supplementary Material online). Some paralogous genes appear under different labels, as in the PPAR pathway for which the proteins PPARα, PPARγ, and PPARβ/δ have similar interactions (with FABP3, thiolase B, AP2, and UCP-1) and more specific interactions (e.g. PPARα → HMGCS2 and PPARγ → GyK). Each pathway is constructed in such a way that there can be one or more inputs (e.g. a ligand) and one or more outputs (e.g. transcription factor). Each subpathway is analyzed, which means that the same gene can be involved in several subpathways (e.g. A → B → C and A → D → E).

Tree of Life From Genomicus

To determine the time of origin of genes, two phylogenetic trees from Genomicus (Nguyen et al. 2018) were used: the vertebrate tree and the metazoan tree. The vertebrate tree is the V109 tree (https://www.genomicus.bio.ens.psl.eu/genomicus-109.01), comprising 199 animal species and covering a range of vertebrate species, as well as D. melanogaster and C. elegans, and to this is added the yeast species S. cerevisiae. The metazoan tree is that of V51 (https://www.genomicus.bio.ens.psl.eu/genomicus-metazoa-51.01), comprising 116 animal species and covering all main metazoan clades except for Vertebrata. Drosophila melanogaster and C. elegans are the two species present in both of these databases (supplementary Data S5, Supplementary Material online).

From these Genomicus trees, we determined 15 clades on a similar and “enriched” model of a previous work (Grandchamp and Monget 2018): (i) Opisthokonta (∼1,300 my; taxon age here and below refers crown groups and to the divergence between the various taxa (S. cerevisiae, in this case) and Homo sapiens, unless stated otherwise); (ii) Metazoa (∼765 my); (iii) Eumetazoa (∼743 my); (iv) Bilateria (∼708 my); (v) Deuterostomia (∼635 my); (vi) Chordata (∼588 my); (vii) Olfactora (∼570 my); (viii) Vertebrata (∼563 my); (ix) Gnathostomata (∼462 my); (x) Osteichthyes (462 my); (xi) Sarcopterygii (∼415 my); (xii) Tetrapoda (∼350 my); (xiii) Amniota (∼330 my); (xiv) Mammalia (∼166 my); and (xv) Theria (∼98 my). To provide consistent ages and given that several of the oldest nodes are poorly constrained by the fossil record, we have used molecular time estimates throughout (de Vienne 2016; Kumar et al. 2022; Nguyen et al. 2022). These clades are our references for the present study. The gene is considered to appear on the branch that leads to the clade for which parsimony unambiguously indicates its presence ancestrally.

Dating the Appearance of Genes

In order to identify the origin of a gene, we trace its orthologs along the evolutionary tree, starting with humans (our reference species because our knowledge of these pathways is human-centered). We identify the deepest evolutionary node that includes H. sapiens where orthologs are already present as the age of appearance. For example, if we identify a teleost orthologue of a human gene, but we do not find one for more distantly related taxon, we infer that the node of emergence of this gene is probably Osteichthyes. (https://github.com/florianepicolo/origin-gene).

Some cases require a more complex analysis. If, under a label in the KEGG pathways, several paralogs are listed, then the latest possible date of origin of this gene is the oldest node at which paralogs occur. If several paralogs are present in a pathway but under different labels, we consider them independently from other paralogs. In the case of a protein complex, we consider the complex to be a group, so we consider the node of origin of the complex to be the deepest node for which all the genes of the complex are present, because the complex cannot be functional unless all the proteins are present.

All elements in a pathway are assigned a position in the pathway, and multiple positions are possible for the elements that are present in several pathways. We have not considered the “components” of the pathways present on KEGG in the allocation of the positions if they are not proteins and therefore do not involve genes, such as Ca2+ or lactate (e.g. A1 → B2 → Ca2+ → C3, with pathway position in index).

Furthermore, in an interaction A → B (A is closer to the receptor than B), the interaction is said to be “backward” if gene A was born after B, “forward” if gene A was born before B, and “simultaneous” if A and B were born on the same branch (this condition may reflect extinction of the relevant taxa or insufficient taxonomic resolution of our study, which prevents us from untangling the pattern of appearance).

Analyses and Statistics

To explore the relative age of genes in the signaling pathways, we used several variables with both descriptive and hypothesis testing approaches (supplementary Data S6, Supplementary Material online). Indeed, for each pathway, we considered the pathway position of genes (supplementary Data S6-b, Supplementary Material online), rank age of appearance (supplementary Data S6-c, Supplementary Material online), appearance pattern in an interaction (supplementary Data S6-d, Supplementary Material online), rank age difference between genes in an interaction (supplementary Data S6-e, Supplementary Material online), and size of module sharing the same appearance pattern. A module is a subset of a pathway that achieves a specific condition, in our case a pattern of appearance for a given interaction (e.g. A -> B): forward (A before B), backward (B before A), or simultaneous (A at the same time as B) (supplementary Data S6-f, Supplementary Material online). As the distribution of variable values in networks is usually not normal and implies complex structures of dependences, we used a permutation algorithm to test the hypotheses (supplementary Data S6-g, Supplementary Material online). The algorithm works as follows for each pathway: (i) compute the statistics of interest; (ii) randomly permute the rank ages of origin of all genes from the pathway; (iii) compute again the statistics on the pathway with random rank ages; (iv) redo steps 2 and 3 a large number of times (here, 1,000 times); (v) if the statistics have more than one value per pathway, compute the medians; and (vi) use the values of the statistics (or the medians) as the distribution, under the null hypothesis that there is no relationship between the rank ages of origin and the statistics under scrutiny. Comparing the values from the raw data (step 1) to the distribution of values under the null hypothesis (step 6) allows P-value (here, two-sided) to be computed. In other words, we evaluate how probable the values of the statistics on the raw data are if we accept the null hypothesis. For clarity, the nomenclature of the two types of pathway subsets (subpaths and modules) is distinguished in supplementary Data S6-f and S6-i, Supplementary Material online.

Finally, we tested the correlation between the differences of rank ages (delta age; supplementary Data S6-e, Supplementary Material online) and the position of the upstream gene in the pathway (supplementary Data S6-b, Supplementary Material online). Because pathways are networks, cycles imply several pathway position attributions for some genes (supplementary Data S6-h, Supplementary Material online). We were not able to choose one pathway position hypothesis over another in networks with cycles. Therefore, all hypotheses were included, and each was down-weighted proportionally to the number of alternative hypotheses.

Supplementary Material

Supplementary material is available at Genome Biology and Evolution online.

Acknowledgments

The authors thank Éric Reiter and Pascale Crépieux for helpful and encouraging discussion.

Data Availability

All data are available free of charge online or on request.

Literature Cited

Albelda
 
SM
,
Buck
 
CA
.
Integrins and other cell adhesion molecules
.
FASEB J
.
1990
:
4
(
11
):
2868
2880
. .

Alexander
 
DR
.
Evolution, chance, necessity, and design
.
Zygon®
.
2022
:
57
(
4
):
1069
1082
. .

Andreani
 
J
,
Guerois
 
R
.
Evolution of protein interactions: from interactomes to interfaces
.
Arch Biochem Biophys
.
2014
:
554
:
65
75
. .

Ashton-Beaucage
 
D
,
Therrien
 
M
.
La signalisation RTK/RAS/ERK élargie—contributions de la génétique à l’assemblage d’un réseau de signalisation
.
Med Sci (Paris)
.
2010
:
26
(
12
):
1067
1073
. .

Bader
 
GD
,
Cary
 
MP
,
Sander
 
C
.
Pathguide: a pathway resource list
.
Nucleic Acids Res
.
2006
:
34
(
Database issue
):
D504
D506
. .

Benoit
 
G
,
Cooney
 
A
,
Giguere
 
V
,
Ingraham
 
H
,
Lazar
 
M
,
Muscat
 
G
,
Perlmann
 
T
,
Renaud
 
J-P
,
Schwabe
 
J
,
Sladek
 
F
, et al.  
International Union of Pharmacology. LXVI. Orphan nuclear receptors
.
Pharmacol Rev
.
2006
:
58
:
798
836
. .

Berger
 
M
,
Stahl
 
N
,
Del Sal
 
G
,
Haupt
 
Y
.
Mutations in proline 82 of p53 impair its activation by Pin1 and Chk2 in response to DNA damage
.
Mol Cell Biol
.
2005
:
25
(
13
):
5380
5388
. .

Bloom
 
JD
,
Wilke
 
CO
,
Arnold
 
FH
,
Adami
 
C
.
Stability and the evolvability of function in a model protein
.
Biophys J
.
2004
:
86
(
5
):
2758
2764
. ://doi.org/10.1016/S0006-3495(04)74329-5.

Chavel
 
CA
,
Caccamise
 
LM
,
Li
 
B
,
Cullen
 
PJ
.
Global regulation of a differentiation MAPK pathway in yeast
.
Genetics
.
2014
:
198
(
3
):
1309
1328
. .

Cooper
 
MD
,
Alder
 
MN
.
The evolution of adaptive immune systems
.
Cell
.
2006
:
124
(
4
):
815
822
. .

Csárdi
 
G
,
Nepusz
 
T
.
The igraph software package for complex network research
.
InterJ, Complex Syst.
 
2006
:
1695
.

Csárdi
 
G
,
Nepusz
 
T
,
Müller
 
K
,
Horvát
 
S
,
Traag
 
V
,
Zanini
 
F
,
Noom
 
D
.
2023
. igraph for R: R interface of the igraph library for graph theory and network analysis.
Zenodo
. .

Davankov
 
VA
.
Chance and necessity in the evolution of matter to life: a comprehensive hypothesis
.
Symmetry (Basel).
 
2021
:
13
(
10
):
1918
. .

de Vienne
 
DM
.
Lifemap: exploring the entire tree of life
.
PLoS Biol
.
2016
:
14
(
12
):
e2001624
. .

Echave
 
J
,
Wilke
 
CO
.
Biophysical models of protein evolution: understanding the patterns of evolutionary sequence divergence
.
Annu Rev Biophys
.
2017
:
46
:
85
103
. .

Emily
. Changes to paralogy in release 94. Ensembl Blog; 2018 [accessed 2021 Feb 11]. https://www.ensembl.info/2018/10/24/changes-to-paralogy-in-release-94/.

Fraser
 
HB
,
Hirsh
 
AE
,
Wall
 
DP
,
Eisen
 
MB
.
Coevolution of gene expression among interacting proteins
.
Proc Natl Acad Sci U S A
.
2004
:
101
(
24
):
9033
9038
. .

Grandchamp
 
A
,
Monget
 
P
.
The membrane receptors that appeared before their ligand: the different proposed scenarios
.
PLoS One
.
2020
:
15
(
5
):
e0231813
. .

Grandchamp
 
A
,
Monget
 
P
.
Synchronous birth is a dominant pattern in receptor-ligand evolution
.
BMC Genomics
.
2018
:
19
(
1
):
611
. .

Hillis
 
DM
,
Huelsenbeck
 
JP
.
Signal, noise, and reliability in molecular phylogenetic analyses
.
J Hered
.
1992
:
83
(
3
):
189
195
. .

Ho
 
JML
,
Mattia
 
JR
,
Bennett
 
MR
.
Tunable NF-κB oscillations in yeast
.
Cell Syst
.
2017
:
5
(
5
):
440
442
. .

Howard
 
AD
,
McAllister
 
G
,
Feighner
 
SD
,
Liu
 
Q
,
Nargund
 
RP
,
Van der Ploeg
 
LH
,
Patchett
 
AA
.
Orphan G-protein-coupled receptors and natural ligand discovery
.
Trends Pharmacol Sci
.
2001
:
22
(
3
):
132
140
. .

Jack
 
BR
,
Meyer
 
AG
,
Echave
 
J
,
Wilke
 
CO
.
Functional sites induce long-range evolutionary constraints in enzymes
.
PLoS Biol
.
2016
:
14
(
5
):
e1002452
. .

Jacob
 
F
.
Evolution and tinkering
.
Science
.
1977
:
196
(
4295
):
1161
1166
. .

Jones
 
SN
,
Roe
 
AE
,
Donehower
 
LA
,
Bradley
 
A
.
Rescue of embryonic lethality in mdm2-deficient mice by absence of p53
.
Nature
.
1995
:
378
(
6553
):
206
208
. .

Kachroo
 
AH
,
Laurent
 
JM
,
Yellman
 
CM
,
Meyer
 
AG
,
Wilke
 
CO
,
Marcotte
 
EM
.
Evolution. Systematic humanization of yeast genes reveals conserved functions and genetic modularity
.
Science
.
2015
:
348
(
6237
):
921
925
. .

Kanki
 
T
,
Kurihara
 
Y
,
Jin
 
X
,
Goda
 
T
,
Ono
 
Y
,
Aihara
 
M
,
Hirota
 
Y
,
Saigusa
 
T
,
Aoki
 
Y
,
Uchiumi
 
T
, et al.  
Casein kinase 2 is essential for mitophagy
.
EMBO Rep
.
2013
:
14
(
9
):
788
794
. .

Kauffman
 
SA
.
The origins of order: self-organization and selection in evolution
.
Oxford University Press
;
1993
.

Kent
 
RS
,
De Lean
 
A
,
Lefkowitz
 
RJ
.
A quantitative analysis of beta-adrenergic receptor interactions: resolution of high and low affinity states of the receptor by computer modeling of ligand binding data
.
Mol Pharmacol
.
1980
:
17
(
1
):
14
23
. .

Khoury
 
CM
,
Greenwood
 
MT
.
The pleiotropic effects of heterologous Bax expression in yeast
.
Biochim Biophys Acta
.
2008
:
1783
(
7
):
1449
1465
. .

Koshland
 
DE
.
Application of a theory of enzyme specificity to protein synthesis
.
Proc Natl Acad Sci U S A
.
1958
:
44
(
2
):
98
104
. .

Kumar
 
S
,
Suleski
 
M
,
Craig
 
JM
,
Kasprowicz
 
AE
,
Sanderford
 
M
,
Li
 
M
,
Stecher
 
G
,
Hedges
 
SB
.
TimeTree 5: an expanded resource for species divergence times
.
Mol Biol Evol
.
2022
:
39
(
8
):
msac174
. .

Lang
 
DT
,
Kalibera
 
T
.
XML: Tools for Parsing and Generating XML Within R and S-Plus. 2023
. https://rdrr.io/cran/XML/.

Lehmann-Che
 
J
,
Turpin
 
E
,
Bertheau
 
P
,
Espié
 
M
,
de Thé
 
H
.
La fonction de TP53—Facteur prédictif majeur de la réponse à une chimiothérapie dose-intense dans le cancer du sein
.
Med Sci (Paris)
.
2007
:
23
(
11
):
1021
1023
. .

Lewis
 
ACF
,
Saeed
 
R
,
Deane
 
CM
.
Predicting protein-protein interactions in the context of protein evolution
.
Mol Biosyst
.
2010
:
6
(
1
):
55
64
. .

Lovell
 
SC
,
Robertson
 
DL
.
An integrated view of molecular coevolution in protein-protein interactions
.
Mol Biol Evol
.
2010
:
27
(
11
):
2567
2575
. .

Lynch
 
M
,
Hagner
 
K
.
Evolutionary meandering of intermolecular interactions along the drift barrier
.
Proc Natl Acad Sci U S A
.
2015
:
112
(
1
):
E30
E38
. .

Markov
 
GV
,
Gutierrez-Mazariegos
 
J
,
Pitrat
 
D
,
Billas
 
IML
,
Bonneton
 
F
,
Moras
 
D
,
Hasserodt
 
J
,
Lecointre
 
G
,
Laudet
 
V
.
Origin of an ancient hormone/receptor couple revealed by resurrection of an ancestral estrogen
.
Sci Adv
.
2017
:
3
(
3
):
e1601778
. .

Maslov
 
S
,
Sneppen
 
K
.
Specificity and stability in topology of protein networks
.
Science
.
2002
:
296
(
5569
):
910
913
. .

Meng
 
X
,
Zhang
 
S
.
MAPK cascades in plant disease resistance signaling
.
Annu Rev Phytopathol
.
2013
:
51
(
1
):
245
266
. .

Merlin
 
F
.
Monod's conception of chance: its diversity and relevance today
.
C R Biol
.
2015
:
338
(
6
):
406
412
. .

Mintseris
 
J
,
Weng
 
Z
.
Structure, function, and evolution of transient and obligate protein-protein interactions
.
Proc Natl Acad Sci U S A
.
2005
:
102
(
31
):
10930
10935
. .

Monod
 
J
.
Le hasard et la nécessité : essai sur la philosophie naturelle de la biologie moderne, editions du seuils. Ed, points essais
.
Paris
:
Institut Catholique De Toulouse édition
;
1970
.

Montefusco
 
DJ
,
Matmati
 
N
,
Hannun
 
YA
.
The yeast sphingolipid signaling landscape
.
Chem Phys Lipids.
 
2014
:
177
:
26
40
. .

Montes de Oca Luna
 
R
,
Wagner
 
DS
,
Lozano
 
G
.
Rescue of early embryonic lethality in mdm2-deficient mice by deletion of p53
.
Nature
.
1995
:
378
(
6553
):
203
206
. .

Nguyen
 
NTT
,
Vincens
 
P
,
Dufayard
 
JF
,
Roest Crollius
 
H
,
Louis
 
A
.
Genomicus in 2022: comparative tools for thousands of genomes and reconstructed ancestors
.
Nucleic Acids Res
.
2022
:
50
(
D1
):
D1025
D1031
. .

Nguyen
 
NTT
,
Vincens
 
P
,
Roest Crollius
 
H
,
Louis
 
A
.
Genomicus 2018: karyotype evolutionary trees and on-the-fly synteny computing
.
Nucleic Acids Res
.
2018
:
46
(
D1
):
D816
D822
. .

Nooren
 
IMA
,
Thornton
 
JM
.
Diversity of protein-protein interactions
.
EMBO J
.
2003
:
22
(
14
):
3486
3492
. .

Picolo
 
F
,
Piégu
 
B
,
Monget
 
P
.
Genes encoding teleost orthologues of human signal transduction proteins remain duplicated or triplicated more frequently than the whole genome
.
Heliyon
.
2023
:
9
(
9
):
e20217
. .

Portela
 
P
,
Rossi
 
S
.
cAMP-PKA signal transduction specificity in Saccharomyces cerevisiae
.
Curr Genet
.
2020
:
66
(
6
):
1093
1099
. .

Porter
 
SM
.
Insights into eukaryogenesis from the fossil record
.
Interface Focus
.
2020
:
10
(
4
):
20190105
. .

Powers
 
T
,
Dilova
 
I
,
Chen
 
CY
,
Wedaman
 
K
.
Yeast TOR signaling: a mechanism for metabolic regulation
.
Curr Top Microbiol Immunol
.
2004
:
279
:
39
51
. .

Rand
 
DM
,
Haney
 
RA
,
Fry
 
AJ
.
Cytonuclear coevolution: the genomics of cooperation
.
Trends Ecol Evol
.
2004
:
19
(
12
):
645
653
. .

Saito
 
H
.
Regulation of cross-talk in yeast MAPK signaling pathways
.
Curr Opin Microbiol.
 
2010
:
13
(
6
):
677
683
. .

Saleski
 
T
,
Tan
 
JY
,
Lin
 
XN
.
Dissecting the ecology of microbes using a systems toolbox
.
Cell systms
.
2017
:
5
:
442
444
. .

Sullivan
 
SM
,
Holyoak
 
T
.
Enzymes with lid-gated active sites must operate by an induced fit mechanism instead of conformational selection
.
Proc Natl Acad Sci U S A
.
2008
:
105
(
37
):
13829
13834
. .

Tamaki
 
H
.
Glucose-stimulated cAMP-protein kinase A pathway in yeast Saccharomyces cerevisiae
.
J Biosci Bioeng
.
2007
:
104
(
4
):
245
250
. .

Thornton
 
JW
.
Evolution of vertebrate steroid receptors from an ancestral estrogen receptor by ligand exploitation and serial genome expansions
.
Proc Natl Acad Sci U S A
.
2001
:
98
(
10
):
5671
5676
. .

Tisi
 
R
,
Belotti
 
F
,
Martegani
 
E
.
Yeast as a model for Ras signalling
.
Methods Mol Biol
.
2014
:
1120
:
359
390
. .

Williams
 
PD
,
Pollock
 
DD
,
Goldstein
 
RA
.
Evolution of functionality in lattice proteins
.
J Mol Graph Model
.
2001
:
19
(
1
):
150
156
. .

Wuchty
 
S
,
Oltvai
 
ZN
,
Barabási
 
A-L
.
Evolutionary conservation of motif constituents in the yeast protein interaction network
.
Nat Genet
.
2003
:
35
(
2
):
176
179
. .

Xie
 
VC
,
Pu
 
J
,
Metzger
 
BP
,
Thornton
 
JW
,
Dickinson
 
BC
.
Contingency and chance erase necessity in the experimental evolution of ancestral proteins
.
eLife
.
2021
:
10
:
e67336
. .

Zha
 
H
,
Fisk
 
HA
,
Yaffe
 
MP
,
Mahajan
 
N
,
Herman
 
B
,
Reed
 
JC
.
Structure-function comparisons of the proapoptotic protein Bax in yeast and mammalian cells
.
Mol Cell Biol
.
1996
:
16
(
11
):
6494
6508
. .

Zhou
 
L
.
P53 and apoptosis in the Drosophila model
.
Adv Exp Med Biol
.
2019
:
1167
:
105
112
. .

Zou
 
X
,
Peng
 
T
,
Pan
 
Z
.
Modeling specificity in the yeast MAPK signaling networks
.
J Theor Biol
.
2008
:
250
(
1
):
139
155
. .

Author notes

Floriane Picolo and Jérémie Bardin equally contributed to the present work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact [email protected] for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact [email protected].
Associate Editor: Luis Delaye
Luis Delaye
Associate Editor
Search for other works by this author on:

Supplementary data