Abstract

Background: Network motifs within biological networks show non-random abundances in systems at different scales. Large directed protein networks at the cellular level are now well defined in several diverse species. We aimed to compare the nature of significantly observed two- and three-node network motifs across three different kingdoms (Arabidopsis thaliana for multicellular plants, Saccharomyces cerevisiae for unicellular fungi and Homo sapiens for animals).

Results: ‘Two-node feedback’ is the most significant motif in all three species. By considering the sign of each two-node feedback interaction, we examined the enrichment of the three types of two-node feedbacks [positive–positive (PP), negative–negative (NN) and positive–negative (PN)]. We found that PN is enriched in the network of A.thaliana, NN in the network of S.cerevisiae and PP and NN in the network of H.sapiens. Each feedback type has characteristic features of robustness, multistability and homeostasis.

Conclusions: We suggest that amplification of particular network motifs emerges from contrasting dynamical and topological properties of the motifs, reflects the evolutionary design principles selected by the characteristic behavior of each species and provides a signature pointing to their behavior and function.

Contact:  [email protected]

Supplementary information:  Supplementary data are available at Bioinformatics online.

1 INTRODUCTION

Systems biology involves the study of complex interactions between components of a biological system. The study of network motifs (Alon, 2007) within larger networks can help to unravel the functions of complex biological networks (Milo et al., 2002, 2004). In biology, network motifs—subgraphs that recur within a network much more often than expected in random networks—include negative autoregulation, positive autoregulation, feedforward loops, single-input modules, dense overlapping regulons and feedback loops (Alon, 2007). These network motifs are known to have different dynamical functions, and distinct subsets have been found to predominate in various biological networks such as transcription, signal transduction, protein interaction and neuronal networks, or even larger scale ecosystem-wide networks. Most of the previous studies on network motifs (Conant and Wagner, 2003; Prill et al., 2005; Wuchty et al., 2003; Yeger-Lotem et al., 2004) were, however, limited to the networks with no interaction sign (activation or inhibition) although such interaction sign is indispensable for characterizing various dynamical properties. Now that networks involving a substantial proportion of all the signed directed protein networks (SDPNs) within cells are available for organisms from the animal, plant and fungal kingdoms, we can compare the network motifs and ask whether there are characteristics which may be kingdom specific, and allow us to unravel evolutionary design principles. In this article, we examine the enrichment of particular network motifs in the Arabidopsis thaliana (multicellular plant), Saccharomyces cerevisiae (unicellular fungus) and Homo sapiens (multicellular animal). We suggest biological implications for the sets of network motifs in each species that arise from their characteristic dynamical and topological features.

2 METHODS

2.1 Construction of directed protein networks

The network data were retrieved from all available manually curated databases using Pathway Studio 6 (Nikitin et al., 2003). This software provides five biological network databases (ResNet Mammalian Database 6.0, ResNet Plant Database 2.0, Yeast-3, Drosophila-3 and C-elegans-3). Each database contains eight types of nodes (Small Molecule, Functional Class, Complex, Protein, Cell Process, Cell Object, Treatment and Disease) and eight types of links (Expression, Regulation, MolTransport, ProtModification, Binding, PromoterBinding, MolSynthesis and DirectRegulation). In order to obtain reliable directed protein interaction data, we retrieved only the referenced links comprised of ‘Protein’ nodes and five types of interaction (Regulation, MolTransport, ProtModification, MolSynthesis and DirectRegulation).

2.2 Identification of network motifs

In order to identify network motifs using mfinder (Kashtan et al., 2004; Milo et al., 2002), 1000 random networks were generated by the switching method that switches between edges while preserving the degree distribution of the nodes in the original network—in-degree, out-degree and mutual degree are preserved for each node. The number of switches was a random number in the range of 100–200 times the total number of edges in the network. Network motifs with P < 0.01 was considered as statistically significant.

2.3 Enrichment analysis for two-node feedbacks and their couplings

In each directed protein network, the enrichment of positive–positive (PP), negative–negative (NN) and positive–negative (PN) was examined by performing one-sample one-sided z-tests for empirical binomial distributions of PP, NN and PN. The empirical binomial distributions were constructed by randomizing 10 000 networks while preserving the numbers of positive links, negative links and two-node feedbacks. The enrichment of the six types of couplings (PP–PP, NN–NN, PN–PN, PP–NN, NN–PN and PN–PP) were also examined in a similar way where the empirical binomial distributions were constructed by randomizing 10 000 networks while preserving the numbers of PP, NN and PN.

2.4 Tissue-specific and tissue-general subnetworks for H.sapiens

Human tissue-specific and tissue-general subnetworks were extracted by reflecting the mRNA expression data in 79 human tissues (Su et al., 2004). We first identified upper outliers for each tissue, whose mRNA expressions were higher than the sum of the upper quartile and 1.5 times the interquartile range (Rosner, 2000). Here, we assumed that protein abundances are proportional to their corresponding mRNA expressions. Then, a tissue-specific and a tissue-general protein sets were defined as the upper outliers that were only expressed in ‘less than k (0 to k − 1)’ and ‘at least k (k to 79)’ tissues, respectively. Here, k was varied from 1 to 10, and hence 10 tissue-specific and 10 tissue-general subnetworks were generated in total. Finally, a tissue-specific subnetwork (a tissue-general subnetwork) was constructed by extracting the protein nodes in a tissue-specific protein set (a tissue-general protein set) and their interactions from the SDPN of H.sapiens. For the comparison, 10 random subnetworks were also constructed by randomly extracting the same number of nodes with each tissue-specific/general subnetworks; hence, 200 random subnetworks were generated in total.

2.5 Average distance from a node to transcription factors or receptors

In the SDPNs of S.cerevisiae and H.sapiens, transcription factors (TFs) and receptors were selected as the nodes whose corresponding genes are classified into the Gene Ontology (GO) term ‘regulation of transcription (GO:0045449)’ and ‘receptor activity (GO:0004872)’, respectively. According to this criterion, there are 503 TFs and 13 receptors for S.cerevisiae and 1894 TFs and 1354 receptors for H.sapiens. Then, the average distance from a node to TFs (ADTF) was defined as the average of the shortest paths from the node to all reachable downstream TFs. Similarly, the average distance from a node to receptors (ADR) was defined as the average of the shortest paths from the node to all reachable upstream receptors.

2.6 Degree-free sampling

The degree of a node is a confounding factor in comparison of the topological features of PP and NN–ADTF and ADR are inversely proportional to the degree of a node in a network. In order to standardize the topological features, we considered four degree intervals ([1, 5], [6, 10], [11, 20] and >20) for S.cerevisiae and seven degree intervals ([1, 5], [6, 10], [11, 20], [21, 30], [31, 50], [51, 100] and >100) for H.sapiens. In each mutually exclusive set of PP and NN, 100 nodes with degrees in each degree interval were randomly sampled with replacement; hence, 400 and 700 nodes were sampled for S.cerevisiae and H.sapiens, respectively. Then, we compared the average of each topological feature of PP with that of NN, and this procedure was repeated for 100 times.

2.7 GO analysis for two-node positive feedbacks

GO analysis enables us to determine which GO categories are statistically overrepresented in a set of genes or a subgraph of a biological network. In order to differentiate biological functions between the two-node positive feedbacks (PP and NN), we performed GO analysis by utilizing BiNGO 2.31 (Maere et al., 2005), which is implemented as a plugin for Cytoscape (Shannon et al., 2003). Since we were interested in general functional profiling, a slimmed-down version of the whole GO (i.e. GOSlim_GOA) was used for the analysis. Each mutually exclusive set of PP and NN was tested against its corresponding SDPN, and hypergeometric tests were performed for statistical analysis. P-values were corrected with Benjamini and Hochberg false discovery rate (FDR) and the significance level was set to 0.05.

3 RESULTS

3.1 Two-node feedback is the most significant network motif in the three different species

In constructing the biological networks of the three kingdoms (represented by A.thaliana, S.cerevisiae and H.sapiens), we retrieved directed protein interactions from all available databases (see Section 2.1; Fig. 1A). Directed protein interaction data for other model organisms such as Drosophila melanogaster and Caenorhabditis elegans also exist, but the network sizes are smaller, with many unknown interactions or more intensive study of particular pathways, and therefore are not sufficient for comparison with the three selected organisms (Supplementary Tables 1 and 2). Based on the networks, we performed network motif analysis (see Section 2.2). Significantly observed (P < 0.01) two- and three-node motifs are depicted as bold and colored structures in Figure 1B. The two-node feedback was the most significant network motif in all of the examined species (Fig. 1C and Supplementary Table 2). Notably, the number of significantly observed network motifs decreased for H.sapiens.

Network motifs in the three different species (A.thaliana, S.cerevisiae and H.sapiens). Interactions in each network were retrieved from Pathway Studio with nodes involved in two-node feedback motifs colored yellow. (A) Directed protein networks of A.thaliana (blue: 1750 nodes and 3705 links), S.cerevisiae (red: 1653 nodes and 3860 links) and H.sapiens (green: 8053 nodes and 72588 links). (B) Network motifs of the three different species. Significantly observed (P < 0.01) two- and three-node motifs are represented as bold and colored structures below each network topology. (C) Z-scores of significantly observed two- and three-node network motifs for the three different species. Note that the two-node feedback is the most significant network motif in each species.
Fig. 1.

Network motifs in the three different species (A.thaliana, S.cerevisiae and H.sapiens). Interactions in each network were retrieved from Pathway Studio with nodes involved in two-node feedback motifs colored yellow. (A) Directed protein networks of A.thaliana (blue: 1750 nodes and 3705 links), S.cerevisiae (red: 1653 nodes and 3860 links) and H.sapiens (green: 8053 nodes and 72588 links). (B) Network motifs of the three different species. Significantly observed (P < 0.01) two- and three-node motifs are represented as bold and colored structures below each network topology. (C) Z-scores of significantly observed two- and three-node network motifs for the three different species. Note that the two-node feedback is the most significant network motif in each species.

3.2 Two-node feedback motifs show enrichment characteristic of species: PN in A.thaliana, NN in S.cerevisiae and PP + NN in H.sapiens

To unravel the dynamical functions of two-node feedbacks, we examined the enrichment of three types of two-node feedbacks (PP, NN and PN) in each SDPN (Fig. 2A) where only signed interactions are extracted from the corresponding directed protein network (Fig. 1A). The Z-scores in Figure 2B (see Section 2.3; Supplementary Table 3) show that PN was enriched (Z-score > 0) only in A.thaliana, while in S.cerevisiae NN was significantly (P < 0.01) enriched and in H.sapiens both PP and NN were significantly enriched. To examine whether this enrichment characteristic depends on the particular network reconstruction method or the (possibly) incomplete network information we employed, randomly subsampled networks for each species were further investigated. As a result, we confirmed that the enrichment results are not biased by such a network reconstruction method or incomplete network information (see ‘Enrichment of two-node feedbacks in the sub-sampled networks’ in Supplementary Material and Supplementary Fig. 1).

Enrichment of two-node feedbacks and their couplings in the three different species (A.thaliana, S.cerevisiae and H.sapiens). (A) SDPNs of A.thaliana (1286 nodes and 1959 links), S.cerevisiae (1140 nodes and 1848 links) and H.sapiens (6958 nodes and 47469 links). (B) Enrichment of the three types of two-node feedbacks in each SDPN. Filled bars represent the significantly enriched (P < 0.01) network motifs. (C) Representation of the three types of two-node feedbacks (PP, NN and PN) and the six types of two-node feedback couplings (PP–PP, NN–NN, PN–PN, PP–NN, NN–PN and PN–PP). Blue arrows and red arrows with blunted ends represent positive and negative regulations, respectively. (D) Two-node feedback coupling networks of A.thaliana (102 nodes and 185 links), S.cerevisiae (76 nodes and 116 links) and H.sapiens (1631 nodes and 19213 links). (E) Enrichment of the six types of two-node feedback couplings in each two-node feedback coupling network. Filled bars represent the significantly enriched (P < 0.01) network motifs.
Fig. 2.

Enrichment of two-node feedbacks and their couplings in the three different species (A.thaliana, S.cerevisiae and H.sapiens). (A) SDPNs of A.thaliana (1286 nodes and 1959 links), S.cerevisiae (1140 nodes and 1848 links) and H.sapiens (6958 nodes and 47469 links). (B) Enrichment of the three types of two-node feedbacks in each SDPN. Filled bars represent the significantly enriched (P < 0.01) network motifs. (C) Representation of the three types of two-node feedbacks (PP, NN and PN) and the six types of two-node feedback couplings (PP–PP, NN–NN, PN–PN, PP–NN, NN–PN and PN–PP). Blue arrows and red arrows with blunted ends represent positive and negative regulations, respectively. (D) Two-node feedback coupling networks of A.thaliana (102 nodes and 185 links), S.cerevisiae (76 nodes and 116 links) and H.sapiens (1631 nodes and 19213 links). (E) Enrichment of the six types of two-node feedback couplings in each two-node feedback coupling network. Filled bars represent the significantly enriched (P < 0.01) network motifs.

Why is PN enriched only in A.thaliana? Negative feedback is important in homeostasis and robustness to perturbation (Brandman and Meyer, 2008; Brandman et al., 2007). Consistently, as we removed the protein nodes classified into the GO (The Gene Ontology Consortium, 2006) term ‘homeostatic process (GO:0042592)’, PN was no longer enriched (Z-score < 0; Supplementary Table 4). PN enrichment was also found in the SDPN of rice, Oryza sativa (Supplementary Table 3). Moreover, some types of three- and four-node negative feedbacks (PPN, NNN and PPPN) were also enriched in A.thaliana although only PPPN was enriched in case of O.sativa (Supplementary Table 5). The contrast with fungi and animals suggests that negative feedback leading to robust homeostasis is selected in plants.

Like H.sapiens, two other multicellular animals (Mus musculus and Rattus norvegicus) also exhibited the significant enrichment of PP and NN (Supplementary Table 3). Species-specific differential enrichment of PP and NN suggests that the characteristics of each species might have emerged from different dynamical functions of PP and NN. Positive feedbacks are widespread in developmental processes as a result of their dynamical functions of hysteresis and multistationarity (Kim et al., 2008a): PP drives the developmental system towards a monostable regime (Chickarmane et al., 2006; Legewie et al., 2006), while NN leads to a bi- or multistable regime in development (Hwang et al., 2005; McClean et al., 2007), outcomes that have been notated as progression and decision switches (Guantes and Poyatos, 2008). Might alternative controls be favored in a single-cell organism such as S.cerevisiae where rapid changes in cell environment are encountered and most cells remain totipotent? Decision switches might be utilized in multipotent regimes in developmental processes. In contrast, a multicellular animal such as H.sapiens has many types of terminally differentiated cells in progressive differentiation processes, each protected from the environment. Hence, it might undergo developmental processes not only through decision switches but also through progression switches. Arguably, NN might act as a decision switch for bi- or multistable differentiation (e.g. from a stem cell to a pluripotent progenitor cell, from a pluripotent progenitor cell to a unipotent progenitor cell), whereas PP could act as a progression switch for monostable differentiation (e.g. from a unipotent progenitor cell to a terminally differentiated cell).

3.3 Coupling tendency of two-node feedbacks is different for each species

Biological networks involve many interlinked feedbacks (Brandman et al., 2005; Kim et al., 2007a, b, 2008b; Tsai et al., 2008), and we aimed to examine how the three types of two-node feedbacks are coupled in the species. Each SDPN (Fig. 2A) was converted into a two-node feedback coupling network (Fig. 2C and D) where each node denotes one of the three types of two-node feedbacks (PP: blue, NN: red, PN: green) and each link denotes a coupling, or a node sharing between two two-node feedbacks (right box of Fig. 2). Statistical analysis in Figure 2E (see Section 2.3; Supplementary Table 3) shows that PN–PP was relatively enriched in the network of A.thaliana. Only homogeneous couplings (PP–PP, NN–NN and PN–PN) were significantly enriched in the network of S.cerevisiae. In the case of H.sapiens, PP–PP, PN–PN and PN–PP were significantly enriched in the network. These findings and the results from other species (Supplementary Table 3) suggest the following three points: (i) homogeneous couplings are more enriched in the networks of single-cell organisms (S.cerevisiae) and multicellular animals (H.sapiens, M.musculus and R.norvegicus) than heterogeneous couplings (PP–NN, NN–PN and PN–PP), (ii) PP–PP is enriched in all the species, whereas PP–NN is not enriched in any of the species and (iii) for heterogeneous couplings, PN tends to be coupled with PP in the networks of multicellular organisms, whereas PN tends to be coupled with NN in the networks of single-cell organisms.

3.4 Dynamical features of two-node feedbacks

We noted above that species-specific differential enrichment of two-node feedbacks could originate from their different functions. To validate this, we investigated the dynamical features of PP, NN and PN, first by evolving artificial networks with a preference for ‘multistationarity’ (Fig. 3A), ‘robustness of point attractor’ (Fig. 3B) or ‘robustness of cyclic attractor’ (Fig. 3C) using Boolean network simulations (Kim et al., 2008a) (see ‘Network evolution’ in Supplementary Material). The artificial evolution toward ‘multistationarity’ increased the numbers of two-node positive feedbacks while suppressing negative feedbacks (Fig. 3A and D). We note that NN becomes more preferential than PP as the number of nodes in artificial networks increases (Supplementary Fig. 2 and Supplementary Table 6). The preference of NN over PP, although both are two-node positive feedbacks, might be because PP-only couplings can incorporate only two ‘point attractors’ whereas NN-only couplings can give rise to multiple ‘point attractors’ (Kim et al., 2008a). When artificial networks were evolved to enhance ‘robustness of point attractor’, PN and PP interactions were increased whereas NN was decreased (Fig. 3B and E). In the case of ‘robustness of cyclic attractor’, only PN was increased while suppressing positive feedbacks (Fig. 3C and F).

Dynamical features of the two-node feedbacks. (A–C) The average numbers of two-node feedbacks during the evolution of four-node artificial networks with a preference for multistationarity (A), robustness of point attractor (B), or robustness of cyclic attractor (C). (D–F) Attractor landscape of a network (inset figure) having the best fitness score for multistationarity (D), robustness of point attractor (E) or robustness of cyclic attractor (F). Attraction fields for every possible state of given networks are mapped to the corresponding attractor landscapes. The z-axis in the attractor landscape denotes the energy potential measured by a distance to the corresponding attractor.
Fig. 3.

Dynamical features of the two-node feedbacks. (A–C) The average numbers of two-node feedbacks during the evolution of four-node artificial networks with a preference for multistationarity (A), robustness of point attractor (B), or robustness of cyclic attractor (C). (D–F) Attractor landscape of a network (inset figure) having the best fitness score for multistationarity (D), robustness of point attractor (E) or robustness of cyclic attractor (F). Attraction fields for every possible state of given networks are mapped to the corresponding attractor landscapes. The z-axis in the attractor landscape denotes the energy potential measured by a distance to the corresponding attractor.

In summary, we found that NN is exploited only for multistationarity, PP increases both multistationarity and robustness of point attractor, and PN enhances robustness of attractors. These results support our previous findings that PP can be employed for a unipotent regime as a discreet decision maker, NN for a multipotent regime for multistability—multistationarity is essential for multistable differentiation (Laslo et al., 2006)—and PN for robust homeostatic behavior.

3.5 Topological features of two-node positive feedbacks

From the different dynamical roles of two-node positive feedbacks in developmental processes, we speculated that if PPs (NNs) are exploited for unipotent (multipotent) regimes, genes comprising PP (NN) structure would be expressed in specific (diverse) conditions for single-cell organisms and in specific (diverse) tissues for multicellular organisms. To test this idea, we compared the responsiveness, defined as the sum of squares of the log2-ratios measured from curated expression datasets under more than 1500 conditions (Choi and Kim, 2009; Tirosh et al., 2006), of PP versus NN and PP–PP versus NN–NN in S.cerevisiae. We found that NN and NN–NN show higher responsiveness to various conditions than PP and PP–PP, respectively (Fig. 4A). This result implies that in adaptation to rapidly changing environment, NNs are exploited for totipotent regimes leading to different cell types in S.cerevisiae. For H.sapiens, we examined the enrichment of PP and NN in 10 tissue-specific (filled green circles in Fig. 4B) and 10 tissue-general subnetworks (filled blue triangles in Fig. 4B), comparing with 10 random subnetworks for each (empty sky blue circles for tissue-specific and empty orange triangles for tissue-general subnetworks in Fig. 4B). As a result, the tissue-specific (tissue-general) subnetworks showed higher enrichment for PP (NN) than the random subnetworks, implying that PPs (NNs) are utilized for unipotent (multipotent) regimes leading to fully differentiated cells (various cell types) in H.sapiens.

Topological features of the two-node positive feedbacks (see Section 2). (A) Responsiveness of the mutually exclusive sets of PP versus NN and PP–PP versus NN–NN in S.cerevisiae. (B) Enrichment of PP and NN in tissue-specific and tissue-general subnetworks for H.sapiens. (C, D) Scatter plots for the average ADTFs and ADRs for the mutually exclusive sets of PP and NN in S.cerevisiae and H.sapiens. In (B–D), red data points indicate mean values for each data category.
Fig. 4.

Topological features of the two-node positive feedbacks (see Section 2). (A) Responsiveness of the mutually exclusive sets of PP versus NN and PP–PP versus NN–NN in S.cerevisiae. (B) Enrichment of PP and NN in tissue-specific and tissue-general subnetworks for H.sapiens. (C, D) Scatter plots for the average ADTFs and ADRs for the mutually exclusive sets of PP and NN in S.cerevisiae and H.sapiens. In (B–D), red data points indicate mean values for each data category.

In order to further understand the different biological aspects of two-node positive feedbacks, we performed GO analysis (see Section 2.7). According to the result shown in Table 1, PPs (NNs) are involved in exogenous (endogenous) processes. Exogenous processes are a means to cope with the cell environment through receptor-mediated signal transduction, whereas endogenous processes are employed for various cellular decisions by utilizing transcriptional programs in the nucleus. In support of this, as shown in Figure 4C, PPs were closer to the receptors, while NNs were closer to the TFs in the SDPN of S.cerevisiae (see Section 2.5). Moreover, PPs were relatively closer to the receptors than NNs were, although both PPs and NNs were closer to the receptors than TFs in H.sapiens (Fig. 4D). In multicellular animals, cells need to communicate in order to act in concert with other cells, thereby increasing the importance of exogenous processes such as cell-to-cell communications as indicated by the increased proportion of PP interactions in H.sapiens (Fig. 2B).

Table 1.

GO terms related to two-node positive feedbacks in S.cerevisiae and H.sapiens

SpeciesGO-terms for PPGO-terms for NN
Saccharomyces cerevisiaeCell communicationaRegulation of biological process
Signal transducer activityaNucleusb
MembraneEnzyme regulator activity
Homo sapiensCell communicationaEnzyme regulator activity
Regulation of biological processBinding
Signal transducer activitya
Response to stimulusa
Protein binding
Kinase activity
Cellular process
Behavior
Binding
Extracellular spacea
Receptor activitya
Biological process
Extracellular regiona
Transferase activity
Cell motility
Macromolecule metabolic process
Cell death
Molecular function
Metabolic process
SpeciesGO-terms for PPGO-terms for NN
Saccharomyces cerevisiaeCell communicationaRegulation of biological process
Signal transducer activityaNucleusb
MembraneEnzyme regulator activity
Homo sapiensCell communicationaEnzyme regulator activity
Regulation of biological processBinding
Signal transducer activitya
Response to stimulusa
Protein binding
Kinase activity
Cellular process
Behavior
Binding
Extracellular spacea
Receptor activitya
Biological process
Extracellular regiona
Transferase activity
Cell motility
Macromolecule metabolic process
Cell death
Molecular function
Metabolic process

Only the GO terms with corrected P < 0.05 are listed (see Supplementary Tables 7–10 for more details).

aGO terms related to exogenous processes.

bGO terms related to endogenous processes.

Table 1.

GO terms related to two-node positive feedbacks in S.cerevisiae and H.sapiens

SpeciesGO-terms for PPGO-terms for NN
Saccharomyces cerevisiaeCell communicationaRegulation of biological process
Signal transducer activityaNucleusb
MembraneEnzyme regulator activity
Homo sapiensCell communicationaEnzyme regulator activity
Regulation of biological processBinding
Signal transducer activitya
Response to stimulusa
Protein binding
Kinase activity
Cellular process
Behavior
Binding
Extracellular spacea
Receptor activitya
Biological process
Extracellular regiona
Transferase activity
Cell motility
Macromolecule metabolic process
Cell death
Molecular function
Metabolic process
SpeciesGO-terms for PPGO-terms for NN
Saccharomyces cerevisiaeCell communicationaRegulation of biological process
Signal transducer activityaNucleusb
MembraneEnzyme regulator activity
Homo sapiensCell communicationaEnzyme regulator activity
Regulation of biological processBinding
Signal transducer activitya
Response to stimulusa
Protein binding
Kinase activity
Cellular process
Behavior
Binding
Extracellular spacea
Receptor activitya
Biological process
Extracellular regiona
Transferase activity
Cell motility
Macromolecule metabolic process
Cell death
Molecular function
Metabolic process

Only the GO terms with corrected P < 0.05 are listed (see Supplementary Tables 7–10 for more details).

aGO terms related to exogenous processes.

bGO terms related to endogenous processes.

4 CONCLUSIONS

Network motifs have particular functional importance in various biological networks, and our results show that different classes of motif are enriched in different species from the three kingdoms. The data are still limited because network topology is only available for an adequate number of nodes in three species, and even so there may be ascertainment biases because the networks do not account for all cellular protein networks. However, for the first time we reveal species-specific network motif signatures that implicate contrasting evolutionary driving forces. In summary, (i) PN enrichment only in A.thaliana represents the importance of homeostasis in plants, (ii) NN enrichment in S.cerevisiae indicates the prevalence of multipotency in the developmental processes of this unicellular fungus and (iii) PP and NN enrichment in H.sapiens represents the progressive and often largely irreversible nature of the developmental processes. As further data are collected, it will be critical to contrast different abundance of network motifs between kingdoms with unicellular and multicellular forms, and to see whether the most abundant motifs are more similar in network regions concerned with similar functions but without conserved pathways, such as resistance to biotic or abiotic stresses. Network motif analysis may provide a valuable numerical tool to represent and compare large cellular protein networks between species, giving a signature that is a pointer to their function, behavior and evolution.

ACKNOWLEDGEMENTS

The authors would like to thank Jung Kyoon Choi and Philip Murray for their helpful discussions.

Funding: National Research Foundation of Korea (NRF) grants funded by the Korea Government, the Ministry of Education, Science & Technology (MEST) (2009-0086964 and 2010-0017662).

Conflict of Interest: none declared.

REFERENCES

Alon
U
,
Network motifs: theory and experimental approaches
Nat. Rev. Genet.
,
2007
, vol.
8
(pg.
450
-
461
)
Brandman
O
Meyer
T
,
Feedback loops shape cellular signals in space and time
Science
,
2008
, vol.
322
(pg.
390
-
395
)
Brandman
O
et al.
,
Interlinked fast and slow positive feedback loops drive reliable cell decisions
Science
,
2005
, vol.
310
(pg.
496
-
498
)
Brandman
O
et al.
,
STIM2 is a feedback regulator that stabilizes basal cytosolic and endoplasmic reticulum Ca2+ levels
Cell
,
2007
, vol.
131
(pg.
1327
-
1339
)
Chickarmane
V
et al.
,
Transcriptional dynamics of the embryonic stem cell switch
PLoS Comput. Biol.
,
2006
, vol.
2
pg.
e123
Choi
JK
Kim
YJ
,
Intrinsic variability of gene expression encoded in nucleosome positioning sequences
Nat. Genet.
,
2009
, vol.
41
(pg.
498
-
503
)
Conant
GC
Wagner
A
,
Convergent evolution of gene circuits
Nat. Genet.
,
2003
, vol.
34
(pg.
264
-
266
)
The Gene Ontology Consortium
,
The Gene Ontology (GO) project in 2006
Nucleic Acids Res.
,
2006
, vol.
34
(pg.
D322
-
D326
)
Guantes
R
Poyatos
JF
,
Multistable decision switches for flexible control of epigenetic differentiation
PLoS Comput. Biol.
,
2008
, vol.
4
pg.
e1000235
Hwang
ES
et al.
,
T helper cell fate specified by kinase-mediated interaction of T-bet with GATA-3
Science
,
2005
, vol.
307
(pg.
430
-
433
)
Kashtan
N
et al.
,
Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs
Bioinformatics
,
2004
, vol.
20
(pg.
1746
-
1758
)
Kim
D
et al.
,
Coupled positive and negative feedback circuits form an essential building block of cellular signaling pathways
Bioessays
,
2007
, vol.
29
(pg.
85
-
90
)
Kim
TH
et al.
,
Interlinked mutual inhibitory positive feedbacks induce robust cellular memory effects
FEBS Lett.
,
2007
, vol.
581
(pg.
4899
-
4904
)
Kim
J
et al.
,
Evolutionary design principles of modules that control cellular differentiation: consequences for hysteresis and multistationarity
Bioinformatics
,
2008
, vol.
24
(pg.
1516
-
1522
)
Kim
JR
et al.
,
Coupled feedback loops form dynamic motifs of cellular networks
Biophys J.
,
2008
, vol.
94
(pg.
359
-
365
)
Laslo
P
et al.
,
Multilineage transcriptional priming and determination of alternate hematopoietic cell fates
Cell
,
2006
, vol.
126
(pg.
755
-
766
)
Legewie
S
et al.
,
Mathematical modeling identifies inhibitors of apoptosis as mediators of positive feedback and bistability
PLoS Comput. Biol.
,
2006
, vol.
2
pg.
e120
Maere
S
et al.
,
BiNGO: a Cytoscape plugin to assess overrepresentation of gene ontology categories in biological networks
Bioinformatics
,
2005
, vol.
21
(pg.
3448
-
3449
)
McClean
MN
et al.
,
Cross-talk and decision making in MAP kinase pathways
Nat. Genet.
,
2007
, vol.
39
(pg.
409
-
414
)
Milo
R
et al.
,
Network motifs: simple building blocks of complex networks
Science
,
2002
, vol.
298
(pg.
824
-
827
)
Milo
R
et al.
,
Superfamilies of evolved and designed networks
Science
,
2004
, vol.
303
(pg.
1538
-
1542
)
Nikitin
A
et al.
,
Pathway studio–the analysis and navigation of molecular networks
Bioinformatics
,
2003
, vol.
19
(pg.
2155
-
2157
)
Prill
RJ
et al.
,
Dynamic properties of network motifs contribute to biological network organization
PLoS Biol.
,
2005
, vol.
3
pg.
e343
Rosner
B
Fundamentals of Biostatistics.
,
2000
Pacific Grove, CA
Duxbury
Shannon
P
et al.
,
Cytoscape: a software environment for integrated models of biomolecular interaction networks
Genome Res.
,
2003
, vol.
13
(pg.
2498
-
2504
)
Su
AI
et al.
,
A gene atlas of the mouse and human protein-encoding transcriptomes
Proc. Natl Acad. Sci. USA
,
2004
, vol.
101
(pg.
6062
-
6067
)
Tirosh
I
et al.
,
A genetic signature of interspecies variations in gene expression
Nat. Genet.
,
2006
, vol.
38
(pg.
830
-
834
)
Tsai
TY
et al.
,
Robust, tunable biological oscillations from interlinked positive and negative feedback loops
Science
,
2008
, vol.
321
(pg.
126
-
129
)
Wuchty
S
et al.
,
Evolutionary conservation of motif constituents in the yeast protein interaction network
Nat. Genet.
,
2003
, vol.
35
(pg.
176
-
179
)
Yeger-Lotem
E
et al.
,
Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction
Proc. Natl Acad. Sci. USA
,
2004
, vol.
101
(pg.
5934
-
5939
)

Author notes

The authors wish it to be known that, in their opinion, the first two authors should be regarded as joint First Authors.

Associate Editor: Trey Ideker

Supplementary data