Abstract

The completion of the rice genome sequence paved the way for rice functional genomics research. Additionally, the functional characterization of transcription factors is currently a popular and crucial objective among researchers. Transcription factors are one of the groups of proteins that bind to either enhancer or promoter regions of genes to regulate expression. On the basis of several typical examples of transcription factor analyses, we herein summarize selected research strategies and methods and introduce their advantages and disadvantages. This review may provide some theoretical and technical guidelines for future investigations of transcription factors, which may be helpful to develop new rice varieties with ideal traits.

Introduction

Rice is one of the most important crops worldwide, with more than half of the global population relying on it as a food source. To promote rice development, a high-quality fully sequenced rice genome is imperative. Therefore, the International Rice Genome Sequencing Project was launched in 1998 [1, 2]. The sequenced rice genome and the development of sequencing technologies ushered in a new and exciting chapter regarding rice functional genomics research [3, 4], which aims to clarify the functions and structures of large numbers of genes to eventually functionally characterize the whole rice genome [5]. The proteins encoded by genes, as well as the associated regulatory networks, collectively determine specific agronomic traits. Additionally, genes may be assembled and integrated to breed rice varieties with ideal traits, including high grain yield and quality and resistance to multiple abiotic and biotic stresses. Rice, which is an excellent model monocotyledonous plant species, shares a close syntenic and collinear relationship with other crops, such as maize and wheat. Therefore, rice functional genomics research may also provide lots of support for genetic analyses of other crops.

The identification of transcription factors (TFs) is a crucial aspect of functional genomics research [6]. TFs are vital for controlling gene expression [7]. Moreover, TFs are involved in the regulation of various biological processes in rice, such as development, hormone signal transduction, metabolism and responses to various stresses [8–13]. Interactions between proteins and DNA influence almost all aspects of cellular function [14, 15]. In this regard, several computational predictors have been proposed to predict the DNA-binding proteins only based on the sequence information [16–19]. Furthermore, some predictors have been proposed to detect the homology relationship among proteins sharing similar structures and functions [20–22]. The Database of Rice Transcription Factors contains 2025 putative TFs in Oryza sativa L. ssp. indica and 2384 putative TFs in O. sativa L. ssp. japonica [23], all of which are distributed in 63 families. TFs are frequently classified based on their DNA-binding domains. For example, the WRKY family is named according to the first four amino acids of the most highly conserved DNA-binding domain [24]. Additionally, MYB proteins consist of a highly conserved MYB domain [25], whereas the basic leucine zipper (bZIP) TFs harbor a conserved bZIP domain [26]. In rice, most of the studied TFs belong to the bZIP, MYB/MYC, WRKY, AP2/ERF and NAC families [27–35]. Examples include OsFD1, which is a bZIP TF that is closely related to flowering [28], and OsMYB2P-1, which affects abiotic stress responses and root architecture [29]. The SHALLOT-LIKE1 (SLL1) gene encodes an MYB TF that regulates leaf abaxial cell development [30]. Salicylic acid (SA) plays an important role in rice basic defense [36]. Rice WRKY45 is a key factor of the branched SA pathway and regulates rice resistance to rice blast disease and bacterial leaf blight disease, which are major rice diseases [31, 32]. The SHAT1 gene is reportedly expressed as part of the regulatory pathways related to seed shattering [33]. Several members of the AP2/ERF family play important regulatory roles that enable rice plants to adapt to abiotic stresses [34]. Moreover, OsNAC10 improves rice tolerance to environmental stresses and increases grain yield under drought conditions [35].

Table 1

TFBS database

DatabaseSpiecesredundancyFreeSitesReferences
TRANSFACEukaryoticYNhttp://gene-regulation.com/[51]
JASPAREukaryoticNYhttp://jaspar.binf.ku.dk/[56]
PlantTFDBPlantNYhttp://planttfdb.cbi.pku.edu.cn/[41]
PlantPromPlantNYhttp://softberry.com[58]
PlantCAREPlantUYhttp://bioinformatics.psb.ugent.be/webtools/plantcare/html/[49]
PlacePlantUYhttp://www.dna.affrc.go.jp/htdocs/PLACE/[59]
AGRISArabidopsis thalianaUYhttps://agris-knowledgebase.org/[60]
DatabaseSpiecesredundancyFreeSitesReferences
TRANSFACEukaryoticYNhttp://gene-regulation.com/[51]
JASPAREukaryoticNYhttp://jaspar.binf.ku.dk/[56]
PlantTFDBPlantNYhttp://planttfdb.cbi.pku.edu.cn/[41]
PlantPromPlantNYhttp://softberry.com[58]
PlantCAREPlantUYhttp://bioinformatics.psb.ugent.be/webtools/plantcare/html/[49]
PlacePlantUYhttp://www.dna.affrc.go.jp/htdocs/PLACE/[59]
AGRISArabidopsis thalianaUYhttps://agris-knowledgebase.org/[60]

Y: Yes, N: No , U: Unkown.

Table 1

TFBS database

DatabaseSpiecesredundancyFreeSitesReferences
TRANSFACEukaryoticYNhttp://gene-regulation.com/[51]
JASPAREukaryoticNYhttp://jaspar.binf.ku.dk/[56]
PlantTFDBPlantNYhttp://planttfdb.cbi.pku.edu.cn/[41]
PlantPromPlantNYhttp://softberry.com[58]
PlantCAREPlantUYhttp://bioinformatics.psb.ugent.be/webtools/plantcare/html/[49]
PlacePlantUYhttp://www.dna.affrc.go.jp/htdocs/PLACE/[59]
AGRISArabidopsis thalianaUYhttps://agris-knowledgebase.org/[60]
DatabaseSpiecesredundancyFreeSitesReferences
TRANSFACEukaryoticYNhttp://gene-regulation.com/[51]
JASPAREukaryoticNYhttp://jaspar.binf.ku.dk/[56]
PlantTFDBPlantNYhttp://planttfdb.cbi.pku.edu.cn/[41]
PlantPromPlantNYhttp://softberry.com[58]
PlantCAREPlantUYhttp://bioinformatics.psb.ugent.be/webtools/plantcare/html/[49]
PlacePlantUYhttp://www.dna.affrc.go.jp/htdocs/PLACE/[59]
AGRISArabidopsis thalianaUYhttps://agris-knowledgebase.org/[60]

Y: Yes, N: No , U: Unkown.

There is currently an abundance of bioinformatics databases and tools useful for functionally annotating TFs and predicting TF binding sites (TFBSs) in rice [23, 37–41]. The public databases provide large amounts of important information relevant for studying rice TFs. However, the TFBSs predicted by the available tools may be incorrect. Consequently, researchers must either validate the accuracy of the predicted binding sites or identify the real TFBSs with molecular biology techniques. Additionally, proving how TFs regulate the downstream target genes can be difficult, with researchers needing to select suitable strategies and methods to generate the necessary evidence.

The classical view of interactions between TFs and methylated DNA is that DNA methylation inhibited TFs from binding target elements [42]. However, emerging evidence showed that some TFs bind to methylated DNA motifs and possess sequence-dependent mCpG-binding activity [43–45]. To the best of our knowledge, there are no reports of methylated DNA–TFs interactions in rice.

In this review article, we summarize some of the typical and effective techniques for testing protein–DNA interactions, including methylated DNA–TFs interactions, which may be useful for future studies involving rice TFs.

Research technology

The increasing number of experimentally characterized TFBSs prompted some databases to start collecting TFBS-related information (Table 1) [46–49]. Examples include TRANSFAC and JASPAR, which are two major TF databases [50]. In TRANSFAC, the experimentally derived TFBSs are annotated with the associated experimental techniques and conditions, and the credibility of the detected sites has been evaluated [51]. Moreover, TRANSFAC contains the most data as well as the most comprehensive related information. However, there are some drawbacks to this database, including information redundancy and the considerable variability in model quality. Furthermore, TRANSFAC only provides some data to nonprofit organizations. In contrast, JASPAR is the most comprehensive and publicly available database of matrix-based TF-binding profiles. These profiles were exclusively derived from published experimentally defined sites and are stored as position frequency matrices [52–57].

Both major TF-related databases enable researchers to predict the DNA-binding motifs of TFs. The TFBSs are conserved, but are not identical, instead exhibiting a specific pattern (i.e. motif). With the development of algorithms, the weight matrix is considered to be a general and accurate approach for describing motif characteristics [50, 61]. A position weight matrix (PWM), which is also known as a position-specific weight matrix or position-specific scoring matrix, is an essential and commonly used modern algorithm for detecting motifs. Match™ is a powerful web-based and weight matrix-based tool that is closely interconnected with the PWM library in the TRANSFAC database. The matrix similarity score and the core similarity score are used by Match™ to search for putative TFBSs in the promoter sequence of target genes [37]. The JASPAR database recommends the use of its data sets and the ConSite tool instead of its own tools [38]. ConSite is a web-based and user-friendly tool, which uses high-quality TF and phylogenetic footprinting to predict TFBSs [38]. Phylogenetic footprinting refers to the fact that selective pressure makes functional regulatory regions more conserved than regions with no sequence-specific function in non-coding regions [62, 63]. Accordingly, TFBSs are predicted by searching for conserved motifs of homologous genes in multiple species. Although profile-based model frameworks are able to accurately describe TFBSs, the predicted TFBSs are not necessarily correct. For example, some functional regulatory sites are less conserved than background sequences.

Assay for Transposase Accessible Chromatin using sequencing (ATAC-seq) is a rapid and sensitive method to predict protein-binding motifs occupancy genome-widely [64]. What is known to all is that DNA is inaccessible in eukaryotic cell, because DNA and nucleosomes are packaged into chromatin [65]. When the chromatin changes from condensed state to accessible state, TFs or other DNA-binding proteins are able to access DNA and subsequently regulate gene expression [65]. Transposons are considered inserted preferentially into regions of accessible chromatin. ATAC-seq, based on the feature of hyperactive Tn5 transposase, is able to capture open chromatin sites [64]. Compared with DNase-seq and FAIRE-seq, ATAC-seq is simple and more efficient. The existence of cell walls and the pollution of subcellular organelles affect the application of ATAC-seq in plant. Roger B. Deal isolated nuclei by INTACT (isolation of nuclei tagged in specific cell types) to optimize ATAC-seq [66]. Fluorescence-activated nuclei sorting-assay was also used to optimize ATAC-seq [67]. Chromatin immunoprecipitation sequencing technique (ChIP-seq) is a comprehensive experimental combining biology method with silico analyses. The combination of ATAC-seq and ChIP-seq can be used to predict the binding of the TFs of interest to DNA [64, 68]. However, these new technologies inevitably have certain deviations [69]. Therefore, researchers will need to test the predicted sites with another experiment.

Yeast one-hybrid assay

The yeast one-hybrid (Y1H) assay is an important technique for assessing physical interactions between TFs and their target promoters. The assay is based on the yeast two-hybrid protein–protein interaction assay. The Y1H assay consists of the following two components (Figure 1A): a reporter construct with the DNA element of interest and an expression construct comprising the TF of interest and a yeast transcription activation domain. The reporter construct with the ‘bait’ DNA element is commonly called the ‘bait’, while the expression construct is often referred to as the ‘prey’. A typical TF generally consists of a DNA-binding domain, a trans-activating domain and a signal-sensing domain [7]. In a Y1H system, the DNA-binding domain of a yeast TF (e.g. GAL4) is replaced by a TF lacking the transcription activation function. Therefore, if there are physical interactions between the TF and DNA, the downstream reporter will be activated regardless of whether the TF is an activator or repressor [70].

Cartoon illustration of the mechanism underlying the Y1H assay.
(A) Structures of a reporter construct and a prey construct.
(B) Yeast cells are transformed with both ‘bait’ and ‘prey’ constructs.
Figure 1

Cartoon illustration of the mechanism underlying the Y1H assay. (A) Structures of a reporter construct and a prey construct. (B) Yeast cells are transformed with both ‘bait’ and ‘prey’ constructs.

There are several types of Y1H reporter constructs, each of which was designed for distinct experiments. The yeast strains used for Y1H commonly carry mutations in multiple auxotrophic genes. However, trp1 (tryptophan), leu2 (leucine), his3 (histidine) and ura3 (uracil) are generally used. The auxotrophic mutants with these mutated genes are unable to grow in the absence of various essential substances, and the reporter gene is activated via an interaction with a specific TF that rescues the auxotrophic phenotype [71]. The Y1H system reporter gene is HIS3, which enables yeast cells to grow in the absence of exogenously supplied histidine. Additionally, the Y1H reporter construct is often designed to express the gene encoding beta-galactosidase. All of the colonies that are able to grow are then tested in a colorimetric assay. Specifically, the production of beta-galactosidase results in the cleavage of X-gal to produce blue colonies.

As indicated in Figure 1, the reporter construct with the ‘bait’ cis-acting element and the ‘prey’ construct with a TF-lacking activating domain are transferred into yeast cells. The interaction between the DNA-binding domain of the ‘prey’ and the ‘bait’ will activate the expression of the reporter gene. A lack of interaction between the ‘prey’ and ‘bait’ results in no expression.

The Y1H assay can confirm the interactions between proteins and the DNA of interest without requiring purified protein. Additionally, the TF-drived protein retains its natural structure in yeast cells. Moreover, Y1H assays are relatively simple to complete. These advantages have made the Y1H assay a widely used technique to prove protein–DNA interactions. From its initial application to the present, the Y1H assay has been important for verifying the binding of TFs to DNA [72–76].

Dual-luciferase reporter assay system

The luciferase genes have been one of the most generally and widely used reporter genes since they were first used as a reporter of gene expression in rice. The luciferase reporter assay can detect bioluminescence generated by the reaction between luciferase and its substrates. This system has been used to verify interactions between TFs and their target DNA sequences. The expression of the reporter gene is measured to quantify the activity of the TF. The luciferase reporter assay consists of the following two components: a reporter construct with the target promoter and an effector construct with TFs. These components are somewhat similar to those of the Y1H system. Both plasmids are transferred into plant cells, and if the TF activates the target promoter, the expression of the luciferase gene will yield luciferase. The intensity of the fluorescence produced by the reaction between luciferase and its substrate may be used to determine the luciferase activity level, which indicates whether the TF is capable of interacting with the target promoter fragment. However, differences in the transfection efficiency of each sample may result in diverse fluorescence intensities between the experimental group and the control group. Therefore, it may be difficult to judge whether any differences between the experimental and control groups are caused by the activity of the promoter itself or operational errors affecting variables such as transfection efficiency.

Promega introduced the Dual-Luciferase® Reporter (DLR™) assay system, thereby providing researchers with a highly efficient method for applying dual reporters to improve the experimental accuracy. Dual reporters transfer two reporter constructs into a single system. One reporter construct with an experimental reporter gene (e.g. pGL3 family vector) is coupled to a regulated promoter, while the other reporter functions as an internal control, which contains a distinct reporter gene (e.g. pRL family vector), and is coupled to a constitutive promoter [77]. The expression of the pGL3 vector sequence is related to the transcriptional activity of its regulated promoter. The transcriptional activity of the pRL vector sequence is unaffected by the external conditions and is used to normalize and measure the transcriptional activity of the pGL3 vector. It is possible to minimize the effects of several inherent factors that can influence the accuracy of the experiment, including differences among cell plates, cell numbers, cell viability, transfection efficiency and lysis efficiency [77]. Compared with the luciferase reporter assay, the dual-luciferase reporter assay likely decreases the effects of extraneous factors to optimize the experimental accuracy.

Both pGreenII62-SK and pGreenII0800-LUC are commonly coupled to TFs and the sequence of interest, respectively, in rice dual-luciferase reporter assays [78, 79]. The pGreenII0800-LUC vector carries the firefly luciferase (LUC) gene under the control of the promoter of interest as well as the Renilla luciferase (REN) gene under the control of the CaMV 35S promoter [5]. To generate the construct for a dual-luciferase reporter assay, a coding sequence is inserted into the pGreenII62-SK vector as an effector and a promoter sequence is cloned into pGreenII0800-LUC as a reporter. Rice cells are then co-infiltrated with these vectors. If the TF is able to specifically bind to the promoter, the LUC reporter gene will be expressed to produce the encoded firefly luciferase enzyme. The REN reporter gene is simultaneously activated under the control of CaMV 35S promoter. The rice cells are subsequently lysed in passive lysis buffer to release the firefly and Renilla luciferase enzymes. Next, Luciferase Assay Reagent II is added to the cell lysates to activate the firefly luminescence signal. Upon completion of the firefly luciferase reaction, the Renilla luciferase enzyme catalyzes the production of a luminescence signal, while the firefly luminescence signal is rapidly quenched following the addition of the Stop & Glo Reagent to the solution [77]. This is the process in a classical dual-luciferase reporter assay in rice (Figure 2). Additionally, the negative control cells are co-infiltrated with the pGreenII62-SK empty vector and the reporter construct.

Dual-luciferase reporter assay.
Figure 2

Dual-luciferase reporter assay.

Electrophoretic mobility shift assay

The electrophoretic mobility shift assay (EMSA) is a rapid and simple technique that is generally used to detect and study sequence-specific DNA-binding proteins such as TFs in vitro [80, 81]. It is based on the principle that proteins binding to DNA probes form macromolecular complexes, which shift more slowly than free nucleic acids in non-denaturing polyacrylamide gel. The DNA probes can be labeled by covalent or non-covalent fluorophores, radioisotopes or biotin [82–84]. Although diverse DNA sequences may be used in an EMSA, from short oligonucleotides to sequences comprising several thousand nucleotides/base pairs, short nucleic acids may be preferable when the target DNA is well defined [80, 81, 85]. Compared with long nucleic acid sequences, shorter sequences are cheaper and more easily synthesized. Additionally, the associated decrease in the number of non-specific protein-binding sites helps to prevent interference from other proteins. The proteins may be derived from crude nuclear or whole cell extracts or from prokaryotic expression systems. However, the concentration of the protein of interest must be greater than 1 μg/μl.

The EMSA requires at least four reaction mixtures, which are used to prove that a protein can directly and specifically bind to the promoter of a target gene. These mixtures are as follows (Figure 3): a negative control with only a biotin-labeled probe; a conventional control group with a biotin-labeled probe and the protein; a cold-competition control with a biotin-labeled probe, a cold-probe (unlabeled) and the protein; and a mutant-probe cold-competition control with a biotin-labeled probe, a mutant-probe (unlabeled) and the protein. Both the cold-probe and mutant-probe need to be used at considerably higher concentrations than the biotin-probe. If the band corresponding to the biotin-probe is not at the lowest position, there are likely some impurities in the control. If the band position in the lane with the mixture of biotin-labeled probe and protein is higher than the biotin-probe band, this suggests that the protein interacts with the DNA probe. The biotin tag may bind to proteins, thereby resulting in a false-positive mobility shift. Thus, the cold-competition control is used to rule out this possibility. The band position of this control group should be at the same level as the band of the negative control group if the mobility shift is due to the protein specifically binding to the probe. Conversely, cold-probes do not affect the interaction between the biotin-labeled probe and the protein. Consequently, this control is used to indicate that the mobility shift of the conventional control group band is due to biotin. The mutant-probe is unable to affect the interaction between the biotin-labeled probe and the protein. Thus, this control is used to indicate the specific binding between the protein and the probe. All of the results collectively indicate whether TFs can specifically bind to promoters. Additionally, the positive control in commercially available EMSA kits is used to ensure that the assay is conducted correctly under optimal experimental conditions.

Electrophoretic mobility shift assay.
Figure 3

Electrophoretic mobility shift assay.

Chromatin immunoprecipitation

Chromatin immunoprecipitation (ChIP) is an excellent method for analyzing the interaction between proteins and DNA in vivo [86]. This technique involves immobilizing protein–DNA complexes in living cells with a chemical cross-linking agent (e.g. formaldehyde) and then randomly breaking them into small fragments of chromatin of a certain length with a physicochemical-based method, including sonication (hydrodynamic shearing), enzyme digestion or a micrococcal nuclease treatment [87]. When formaldehyde is used as the cross-linking agent, sonication is usually the preferred option. Finally, the specificity of antibodies for antigens is applied to acquire the protein of interest with a DNA fragment [88]. A quantitative real-time polymerase chain reaction (PCR) is the preferred method for analyzing the ChIP DNA fragment [87]. In addition with the maturity and low cost of next-generation sequencing technology, ChIP-seq has gradually become the most common technique to study these protein–DNA relationships. However, ChIP-seq is unable to provide precise set of binding sites because of non-specific genomic DNA [89, 90]. Lambda exonuclease only degrades naked double-strand DNA in the 5′ to 3′ orientation. Compared with ChIP-seq, ChIP-exo reduces background noise by lambda exonuclease [90]. Whereas the digestion step reduces the amount of recovered DNA from ChIP, which influences the subsequent sequencing. For constructing library, it is required to complete two inefficient ligation steps to add adaptors on DNA paired ends. During the PCR process, low amounts of starting DNA often produce noise data because of over-amplification. A robust ChIP-exo protocol named ChIP experiments with nucleotide resolution through exonuclease, unique barcode and single ligation (ChIP-nexus) overcomes the problem caused by low amounts of DNA [91]. The new technique employs an efficient DNA self-circulation step to add an adaptor on DNA ends. Therefore, ChIP-nexus without more starting DNA is available for subsequent sequencing.

In rice, ChIP experiments have been widely used to investigate the biological interactions between proteins of interest and the promoter of target genes [78, 79, 92–95]. A ChIP experiment generally requires the development of transgenic rice seedlings for detecting TFBSs rather than an analysis of epigenetics. A GFP coding sequence is usually fused in frame to the 3′ end of the TF gene sequence. However, the Flag-tag or other tags may alternatively be used to label the TFs [92].

Regarding rice, the cross-linking of protein and DNA is difficult in old tissue. Therefore, ChIP experiments are usually completed with 2–4-week-old transgenic rice seedlings. Formaldehyde is an excellent cross-linking agent that is able to form stable protein–protein, protein–DNA and protein–RNA complexes in cells. Moreover, the cross-linked complexes generated by formaldehyde are completely reversible, which facilitates the subsequent analysis of the protein and DNA [96, 97]. The duration of the cross-linking step is one of the crucial factors determining whether the ChIP experiment will be successfully completed. If this step is too short, there will be an insufficient amount of the target segment, ultimately leading to a false-negative result. If this step is allowed to continue for too long, it will not only cause the excessive background noise but also interfere with the sonication treatment [86]. As a general guideline, the cross-linking should be stopped when the material becomes transparent [98]. Additionally, the cross-linked material will need to be thoroughly washed with ddH2O to remove any residual formaldehyde, which may adversely influence the subsequent experimental steps. Furthermore, liquid nitrogen may be used to grind cross-linked rice tissue into a fine powder, from which pure chromatin will be extracted [98]. After resuspending the chromatin in nuclei lysis buffer, the solution is sonicated to shear the chromatin into certain sizes, which will depend on the study objective. The size of the resulting fragments is measured by 1% agarose gel electrophoresis. The protein of interest is immunoprecipitated from the fragmented chromatin solution with an antibody specific for the tag fused to the target protein [92, 95]. Note that the quality of the antibody is a critical factor for the recovery of DNA fragments. Diverse antibodies have their own affinities for epitopes, which can affect the ChIP results [86, 87, 99]. The DNA is recovered from the precipitate of the protein–DNA fragment complex by reverse the cross-links, after which the DNA is purified prior to any analyses. The process involved in a ChIP experiment is presented in Figure 4.

Chromatin immunoprecipitation assay process.
Figure 4

Chromatin immunoprecipitation assay process.

Conclusion

There is currently considerable interest in TFs, which are important for rice functional genomics research. Analyses of the interactions between TFs and DNA sequences can be divided into the following three steps (Figure 5):

  • (i) Characterize the TFs by confirming their effects on rice (e.g. phenotype and physiological processes).

  • (ii) Search for target genes. Apply the publicly available information, experimental data and/or bioinformatics-based data to identify potential downstream target genes that may be directly regulated by TFs. For example, predict the TFBSs using bioinformatics tools, like those described herein, or genome-wide gene expression profiles to detect TF-related genes.

  • (iii) Confirm interactions between TFs and target genes. Experimentally verify that the TFs bind directly to the target genes.

The effective use of bioinformatics tools may increase the efficiency and accuracy of TFBS predictions. However, the above-mentioned methods all have their own drawbacks. In Y1H, it is well known that auto-activity is a contributing factor for false-positive results. Yeast endogenous TFs can bind to the DNA of interest, leading to an activated reporter. This phenomenon, in which target elements activate reporter genes without TFs, is called auto-activity. Additionally, the ‘prey’ proteins are toxic to yeast cells or the proteins cannot be stably expressed in host cells, both of which will cause false-negative results. Regarding the EMSA, the cost of the biotin label is high, and biotin may affect protein–DNA interactions. Although 32P does not influence the interactions, the associated radioactivity is hazardous. The disadvantages of the ChIP assay include the fact it requires highly specific and quality antibodies, and the cross-linking by formaldehyde may cause non-specific binding, resulting in a false-positive signal [86, 100]. Moreover, the transgenic rice seedlings required for the ChIP assay may be relatively difficult to produce [101]. Furthermore, the chromatin is randomly sheared into fragments, which may make it difficult to repeat. Thus, two or more techniques may be needed to confirm the proteins of interest directly bind to target DNA sequences [78, 79, 93, 94, 102, 103]. Several yield-related TFs have been molecularly identified and characterized by these experiments (Table 2). These results and related information is collected in rice-related database [23, 40, 41, 104].

Proposed process for analyzing protein–DNA interactions.
Figure 5

Proposed process for analyzing protein–DNA interactions.

Table 2

Identified yield-related genes

GenesRegulated traitsMethodReferences
OsNAC2Plant height, flowering,ChIP-PCR, Dual-LUC, Y1H[79]
OsSPL16 /GW8Grain widthChIP-PCR, EMSA, LUC, Y1H[109]
OsTB1TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsMADS1/ LGY3Grain number and sizeChIP-PCR, EMSA[111]
OsMADS57TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsLIC1Tiller angleChIP-PCR, EMSA[112]
IPA1Tillering ,panicle developmentChIP-PCR, EMSA[113]
DSTGrain numberChIP-PCR, EMSA[114]
SERF1Grain weightChIP-PCR, EMSA, LUC[115]
GenesRegulated traitsMethodReferences
OsNAC2Plant height, flowering,ChIP-PCR, Dual-LUC, Y1H[79]
OsSPL16 /GW8Grain widthChIP-PCR, EMSA, LUC, Y1H[109]
OsTB1TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsMADS1/ LGY3Grain number and sizeChIP-PCR, EMSA[111]
OsMADS57TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsLIC1Tiller angleChIP-PCR, EMSA[112]
IPA1Tillering ,panicle developmentChIP-PCR, EMSA[113]
DSTGrain numberChIP-PCR, EMSA[114]
SERF1Grain weightChIP-PCR, EMSA, LUC[115]
Table 2

Identified yield-related genes

GenesRegulated traitsMethodReferences
OsNAC2Plant height, flowering,ChIP-PCR, Dual-LUC, Y1H[79]
OsSPL16 /GW8Grain widthChIP-PCR, EMSA, LUC, Y1H[109]
OsTB1TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsMADS1/ LGY3Grain number and sizeChIP-PCR, EMSA[111]
OsMADS57TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsLIC1Tiller angleChIP-PCR, EMSA[112]
IPA1Tillering ,panicle developmentChIP-PCR, EMSA[113]
DSTGrain numberChIP-PCR, EMSA[114]
SERF1Grain weightChIP-PCR, EMSA, LUC[115]
GenesRegulated traitsMethodReferences
OsNAC2Plant height, flowering,ChIP-PCR, Dual-LUC, Y1H[79]
OsSPL16 /GW8Grain widthChIP-PCR, EMSA, LUC, Y1H[109]
OsTB1TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsMADS1/ LGY3Grain number and sizeChIP-PCR, EMSA[111]
OsMADS57TilleringChIP-PCR, EMSA, LUC, Y1H[110]
OsLIC1Tiller angleChIP-PCR, EMSA[112]
IPA1Tillering ,panicle developmentChIP-PCR, EMSA[113]
DSTGrain numberChIP-PCR, EMSA[114]
SERF1Grain weightChIP-PCR, EMSA, LUC[115]

In previous studies on DNA methylation in rice, researchers tended to ignore the interactions between DNA and proteins without methyl-CpG-binding domain. LUC, EMSA and ChIP were applied in identifying methylated DNA–TF interactions in mammal [105]. These methods applied in mammals provide excellent instances for studying methylated DNA–TF interactions in rice. Methylated DNA–TF interactions may play important role in rice growth and development, which should not be ignored.

The available experimental techniques for characterizing TFs can generate new information and data that can be used to improve algorithms, ultimately leading to more accurate predictions for future studies. More accurate predictors for DNA-binding protein identification considering the features of proteins will promote the development of this important field [106]. Particularly, feature representation learning recently has proven to be capable of capturing the high-latent sequential characteristics of proteins and DNAs [107, 108]. These methods are useful for elucidating the regulatory mechanisms of TFs, which may be relevant for the application of molecular techniques to develop new rice varieties with ideal traits.

Key Points

  • On the basis of several typical examples of transcription factor analyses, we summarize selected research strategies and methods and introduce their advantages and disadvantages.

  • We divided research process of the interactions between TFs–DNA sequences into the following three steps.

  • Agriculture researchers should pay more attention to methylated DNA–TF interactions in rice.

  • Bioinformatics algorithms and biological research can promote each other, then paved the way for developing new rice varieties with ideal traits.

Acknowledgements

We thank Liwen Bianji, Edanz Editing China (www.liwenbianji.cn/ac) for editing the English text of a draft of this manuscript.

Funding

National Key R&D Program of China (2018YFC0910405) and the Natural Science Foundation of China (No. 61771331).

Zijie Shen is a research assistant in the Institute of Fundamental and Frontier Sciences at the University of Electronic Science and Technology of China, Chengdu

Yuan Lin is a system architect in Sparebanken Vest, Norway. She received her PhD degree from Norwegian University of Science and Technology in 2009.

Quan Zou is a professor in the Institute of Fundamental and Frontier Sciences at the University of Electronic Science and Technology of China, Chengdu. He received his PhD from Harbin Institute of Technology in 2009.

References

1.

Sasaki
T
.
The rice genome project in Japan
.
Proc Natl Acad Sci U S A
1998
;
95
:
2027
8
.

2.

Yu
J
,
Hu
S
,
Wang
J
, et al.
A draft sequence of the rice genome (Oryza sativa L. ssp. indica)
.
Science
2002
;
296
:
79
92
.

3.

International Rice Genome Sequencing Project
.
The map-based sequence of the rice genome
.
Nature
2005
;
436
:
793
800
.

4.

Han
B
,
Xue
Y
,
Li
J
, et al.
Rice functional genomics research in China
.
Philos Trans R Soc Lond B Biol Sci
2007
;
362
:
1009
21
.

5.

Hieter
P
,
Boguski
M
.
Functional genomics: it's all how you read it
.
Science
1997
;
278
:
601
2
.

6.

Liu
B
,
Fang
Y
,
Huang
D-S
,
Chou
KC
.
iPromoter-2L: a two-layer predictor for identifying promoters and their types by multi-window-based PseKNC
.
Bioinformatics
2018
;
34
:
33
40
.

7.

Latchman
DS
.
Transcription factors: an overview
.
Int J Biochem Cell Biol
1997
;
29
:
1305
12
.

8.

Liu
B
,
Weng
F
,
Huang
D-S
,
Chou
KC
.
iRO-3wPseKNC: identify DNA replication origins by three-window-based PseKNC
.
Bioinformatics
2018
;
34
:
3086
93
.

9.

Chen
X
,
Ronald
PC
.
Innate immunity in rice
.
Trends Plant Sci
2011
;
16
:
451
9
.

10.

E
ZG
,
Zhang
YP
,
Zhou
JH
,
Wang
L
.
Mini review roles of the bZIP gene family in rice
.
Genet Mol Res
2014
;
13
:
3025
36
.

11.

Rushton
PJ
,
Somssich
IE
,
Ringler
P
,
Shen
QJ
.
WRKY transcription factors
.
Trends Plant Sci
2010
;
15
:
247
58
.

12.

Wang
L
,
Zhang
Q
.
Boosting rice yield by fine-tuning SPL gene expression
.
Trends Plant Sci
2017
;
22
:
643
6
.

13.

De Vleesschauwer
D
,
Gheysen
G
,
Hofte
M
.
Hormone defense networking in rice: tales from a different world
.
Trends Plant Sci
2013
;
18
:
555
65
.

14.

Dey
B
,
Thukral
S
,
Krishnan
S
, et al.
DNA-protein interactions: methods for detection and analysis
.
Mol Cell Biochem
2012
;
365
:
279
99
.

15.

Liu
B
,
Li
K
,
Huang
D-S
,
Chou
KC
.
iEnhancer-EL: identifying enhancers and their strength with ensemble learning approach
.
Bioinformatics
2018
;
34
:
3835
42
.

16.

Zhang
J
,
Liu
B
.
PSFM-DBT: identifying DNA-binding proteins by combing position specific frequency matrix and distance-bigram transformation
.
Int J Mol Sci
2017
;
18
:
1856
.

17.

Liu
B
,
Wang
S
,
Dong
Q
, et al.
Identification of DNA-binding proteins by combining auto-cross covariance transformation and ensemble learning
.
IEEE Trans Nanobioscience
2016
;
15
:
328
34
.

18.

Liu
B
,
Xu
J
,
Lan
X
, et al.
iDNA-Prot|dis: identifying DNA-binding proteins by incorporating amino acid distance-pairs and reduced alphabet profile into the general pseudo amino acid composition
.
PLoS One
2014
;
9
:
e106691
.

19.

Wei
L
,
Tang
J
,
Zou
Q
.
Local-DPP: an improved DNA-binding protein prediction method by exploring local evolutionary information
.
Inform Sci
2017
;
384
:
135
44
.

20.

Liu
B
,
Jiang
S
,
Zou
Q
.
HITS-PR-HHblits: protein remote homology detection by combining PageRank and hyperlink-induced topic search
.
Brief Bioinform
2018
. DOI: .

21.

Li
S
,
Chen
J
,
Liu
B
.
Protein remote homology detection based on bidirectional long short-term memory
.
BMC Bioinformatics
2017
;
18
:
443
.

22.

Chen
J
,
Guo
M
,
Wang
X
,
Liu
B
.
A comprehensive review and comparison of different computational methods for protein remote homology detection
.
Brief Bioinform
2018
;
9
:
231
44
.

23.

Gao
G
,
Zhong
Y
,
Guo
A
, et al.
DRTF: a database of rice transcription factors
.
Bioinformatics
2006
;
22
:
1286
7
.

24.

Rushton
PJ
,
Torres
JT
,
Parniske
M
, et al.
Interaction of elicitor-induced DNA-binding proteins with elicitor response elements in the promoters of parsley PR1 genes
.
EMBO J
1996
;
15
:
5690
700
.

25.

Dubos
C
,
Stracke
R
,
Grotewold
E
, et al.
MYB transcription factors in Arabidopsis
.
Trends Plant Sci
2010
;
15
:
573
81
.

26.

Landschulz
WH
,
Johnson
PF
,
McKnight
SL
.
The leucine zipper: a hypothetical structure common to a new class of DNA-binding proteins
.
Science
1988
;
240
:
1759
64
.

27.

Zhu
Q
,
Ordiz
MI
,
Dabi
T
, et al.
Rice TATA binding protein interacts functionally with transcription factor IIB and the RF2a bZIP transcriptional activator in an enhanced plant in vitro transcription system
.
Plant Cell
2002
;
14
:
795
803
.

28.

Brambilla
V
,
Martignago
D
,
Goretti
D
, et al.
Antagonistic transcription factor complexes modulate the floral transition in rice
.
Plant Cell
2017
;
29
:
2801
16
.

29.

Dai
X
,
Wang
Y
,
Yang
A
,
Zhang
WH
.
OsMYB2P-1, an R2R3 MYB transcription factor, is involved in the regulation of phosphate-starvation responses and root architecture in rice
.
Plant Physiol
2012
;
159
:
169
83
.

30.

Zhang
GH
,
Xu
Q
,
Zhu
XD
, et al.
SHALLOT-LIKE1 is a KANADI transcription factor that modulates rice leaf rolling by regulating leaf abaxial cell development
.
Plant Cell
2009
;
21
:
719
35
.

31.

Shimono
M
,
Sugano
S
,
Nakayama
A
, et al.
Rice WRKY45 plays a crucial role in benzothiadiazole-inducible blast resistance
.
Plant Cell
2007
;
19
:
2064
76
.

32.

Takatsuji
H
.
Development of disease-resistant rice using regulatory components of induced disease resistance
.
Front Plant Sci
2014
;
5
:
630
.

33.

Zhou
Y
,
Lu
D
,
Li
C
, et al.
Genetic control of seed shattering in rice by the APETALA2 transcription factor shattering abortion1
.
Plant Cell
2012
;
24
:
1034
48
.

34.

Kazan
K
.
Diverse roles of jasmonates and ethylene in abiotic stress tolerance
.
Trends Plant Sci
2015
;
20
:
219
29
.

35.

Jeong
JS
,
Kim
YS
,
Baek
KH
, et al.
Root-specific expression of OsNAC10 improves drought tolerance and grain yield in rice under field drought conditions
.
Plant Physiol
2010
;
153
:
185
97
.

36.

Yang
DL
,
Yang
Y
,
He
Z
.
Roles of plant hormones and their interplay in rice immunity
.
Mol Plant
2013
;
6
:
675
85
.

37.

Kel
AE
,
Gossling
E
,
Reuter
I
, et al.
MATCH: a tool for searching transcription factor binding sites in DNA sequences
.
Nucleic Acids Res
2003
;
31
:
3576
9
.

38.

Sandelin
A
,
Wasserman
WW
,
Lenhard
B
.
ConSite: web-based prediction of regulatory elements using cross-species comparison
.
Nucleic Acids Res
2004
;
32
:
W249
52
.

39.

Dubchak
I
,
Munoz
M
,
Poliakov
A
, et al.
Whole-genome rVISTA: a tool to determine enrichment of transcription factor binding sites in gene promoters from transcriptomic data
.
Bioinformatics
2013
;
29
:
2059
61
.

40.

Priya
P
,
Jain
M
.
RiceSRTFDB: a database of rice transcription factors containing comprehensive expression, cis-regulatory element and mutant information to facilitate gene function analysis
.
Database (Oxford)
2013
;
2013
:
bat027
.

41.

Jin
J
,
Tian
F
,
Yang
DC
, et al.
PlantTFDB 4.0: toward a central hub for transcription factors and regulatory interactions in plants
.
Nucleic Acids Res
2017
;
45
:
D1040
5
.

42.

Robertson
KD
,
Jones
PA
.
DNA methylation: past, present and future directions
.
Carcinogenesis
2000
;
21
:
461
7
.

43.

Yin
Y
,
Morgunova
E
,
Jolma
A
, et al.
Impact of cytosine methylation on DNA binding specificities of human transcription factors
.
Science
2017
;
356
:eaaj2239; see also 489
.

44.

Hu
S
,
Wan
J
,
Su
Y
, et al.
DNA methylation presents distinct binding sites for human transcription factors
.
Elife
2013
;
2
:
e00726
.

45.

Zhu
H
,
Wang
G
,
Qian
J
.
Transcription factors as readers and effectors of DNA methylation
.
Nat Rev Genet
2016
;
17
:
551
65
.

46.

Wingender
E
,
Dietze
P
,
Karas
H
,
Knuppel
R
.
TRANSFAC: a database on transcription factors and their DNA binding sites
.
Nucleic Acids Res
1996
;
24
:
238
41
.

47.

Sandelin
A
,
Alkema
W
,
Engstrom
P
, et al.
JASPAR: an open-access database for eukaryotic transcription factor binding profiles
.
Nucleic Acids Res
2004
;
32
:
D91
4
.

48.

Guo
AY
,
Chen
X
,
Gao
G
, et al.
PlantTFDB: a comprehensive plant transcription factor database
.
Nucleic Acids Res
2008
;
36
:
D966
9
.

49.

Lescot
M
,
Dehais
P
,
Thijs
G
, et al.
PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences
.
Nucleic Acids Res
2002
;
30
:
325
7
.

50.

Wasserman
WW
,
Sandelin
A
.
Applied bioinformatics for the identification of regulatory elements
.
Nat Rev Genet
2004
;
5
:
276
87
.

51.

Matys
V
,
Kel-Margoulis
OV
,
Fricke
E
, et al.
TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes
.
Nucleic Acids Res
2006
;
34
:
D108
10
.

52.

Vlieghe
D
,
Sandelin
A
,
De Bleser
PJ
, et al.
A new generation of JASPAR, the open-access repository for transcription factor binding site profiles
.
Nucleic Acids Res
2006
;
34
:
D95
7
.

53.

Portales-Casamar
E
,
Thongjuea
S
,
Kwon
AT
, et al.
JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles
.
Nucleic Acids Res
2010
;
38
:
D105
10
.

54.

Mathelier
A
,
Zhao
X
,
Zhang
AW
, et al.
JASPAR 2014: an extensively expanded and updated open-access database of transcription factor binding profiles
.
Nucleic Acids Res
2014
;
42
:
D142
7
.

55.

Mathelier
A
,
Fornes
O
,
Arenillas
DJ
, et al.
JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles
.
Nucleic Acids Res
2016
;
44
:
D110
5
.

56.

Khan
A
,
Fornes
O
,
Stigliani
A
, et al.
JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework
.
Nucleic Acids Res
2018
;
46
:
D1284
.

57.

Bryne
JC
,
Valen
E
,
Tang
MH
, et al.
JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update
.
Nucleic Acids Res
2008
;
36
:
D102
6
.

58.

Shahmuradov
IA
,
Gammerman
AJ
,
Hancock
JM
, et al.
PlantProm: a database of plant promoter sequences
.
Nucleic Acids Res
2003
;
31
:
114
7
.

59.

Higo
K
,
Ugawa
Y
,
Iwamoto
M
,
Korenaga
T
.
Plant cis-acting regulatory DNA elements (PLACE) database: 1999
.
Nucleic Acids Res
1999
;
27
:
297
300
.

60.

Yilmaz
A
,
Mejia-Guerra
MK
,
Kurz
K
, et al.
AGRIS: the Arabidopsis Gene Regulatory Information Server, an update
.
Nucleic Acids Res
2011
;
39
:
D1118
22
.

61.

Stormo
GD
,
Schneider
TD
,
Gold
L
,
Korenaga
T
.
Use of the ‘Perceptron’ algorithm to distinguish translational initiation sites in E. coli
.
Nucleic Acids Res
1982
;
10
:
2997
3011
.

62.

Tagle
DA
,
Koop
BF
,
Goodman
M
, et al.
Embryonic epsilon and gamma globin genes of a prosimian primate (Galago crassicaudatus). Nucleotide and amino acid sequences, developmental regulation and phylogenetic footprints
,
J Mol Biol
.
1988
;
203
:
439
55
.

63.

Wasserman
WW
,
Palumbo
M
,
Thompson
W
, et al.
Human–mouse genome comparisons to locate regulatory sites
.
Nat Genet
2000
;
26
:
225
8
.

64.

Buenrostro
JD
,
Giresi
PG
,
Zaba
LC
, et al.
Transposition of native chromatin for fast and sensitive epigenomic profiling of open chromatin, DNA-binding proteins and nucleosome position
.
Nat Methods
2013
;
10
:
1213
8
.

65.

Mellor
J
.
The dynamics of chromatin remodeling at promoters
.
Mol Cell
2005
;
19
:
147
57
.

66.

Maher
KA
,
Bajic
M
,
Kajala
K
, et al.
Profiling of accessible chromatin regions across multiple plant species and cell types reveals common gene regulatory principles and new control modules
.
Plant Cell
2018
;
30
:
15
36
.

67.

Lu
Z
,
Hofmeister
BT
,
Vollmers
C
, et al.
Combining ATAC-seq with nuclei sorting for discovery of cis-regulatory regions in plant genomes
.
Nucleic Acids Res
2017
;
45
:
e41
.

68.

Mahony
S
,
Pugh
BF
.
Protein-DNA binding in high-resolution
.
Crit Rev Biochem Mol Biol
2015
;
50
:
269
83
.

69.

Meyer
CA
,
Liu
XS
.
Identifying and mitigating bias in next-generation sequencing methods for chromatin biology
.
Nat Rev Genet
2014
;
15
:
709
21
.

70.

Brent
R
,
Ptashne
M
.
A eukaryotic transcriptional activator bearing the DNA specificity of a prokaryotic repressor
.
Cell
1985
;
43
:
729
36
.

71.

Reece-Hoyes
JS
,
Marian Walhout
AJ
.
Yeast one-hybrid assays: a historical and technical perspective
.
Methods
2012
;
57
:
441
7
.

72.

Wilson
TE
,
Fahrner
TJ
,
Johnston
M
,
Milbrandt
J
.
Identification of the DNA binding site for NGFI-B by genetic selection in yeast
.
Science
1991
;
252
:
1296
300
.

73.

Inouye
C
,
Remondelli
P
,
Karin
M
,
Elledge
S
.
Isolation of a cDNA encoding a metal response element binding protein using a novel expression cloning procedure: the one hybrid system
.
DNA Cell Biol
1994
;
13
:
731
42
.

74.

Li
JJ
,
Herskowitz
I
.
Isolation of ORC6, a component of the yeast origin recognition complex by a one-hybrid system
.
Science
1993
;
262
:
1870
4
.

75.

Dowell
SJ
,
Romanowski
P
,
Diffley
JF
.
Interaction of Dbf4, the Cdc7 protein kinase regulatory subunit, with yeast replication origins in vivo
.
Science
1994
;
265
:
1243
6
.

76.

Li
W
,
Zhu
Z
,
Chern
M
, et al.
A natural allele of a transcription factor in rice confers broad-spectrum blast resistance
.
Cell
2017
;
170
:
114
26
.

77.

Sherf
BA
,
Navarro
SL
,
Hannah
RR
, et al.
Dual-luciferase reporter assay: an advanced co-reporter technology integrating firefly and Renilla luciferase assays
.
Promega Notes
1996
;
57
:
2
.

78.

Mao
C
,
Ding
J
,
Zhang
B
, et al.
OsNAC2 positively affects salt-induced cell death and binds to the OsAP37 and OsCOX11 promoters
.
Plant J
2018
;
94
:
454
68
.

79.

Chen
X
,
Lu
S
,
Wang
Y
, et al.
OsNAC2 encoding a NAC transcription factor that affects plant height through mediating the gibberellic acid pathway in rice
.
Plant J
2015
;
82
:
302
14
.

80.

Hellman
LM
,
Fried
MG
.
Electrophoretic mobility shift assay (EMSA) for detecting protein-nucleic acid interactions
.
Nat Protoc
2007
;
2
:
1849
61
.

81.

Fried
MG
,
Daugherty
MA
.
Electrophoretic analysis of multiple protein-DNA interactions
.
Electrophoresis
1998
;
19
:
1247
53
.

82.

Rye
HS
,
Drees
BL
,
Nelson
HC
,
Glazer
AN
.
Stable fluorescent dye-DNA complexes in high sensitivity detection of protein–DNA interactions. Application to heat shock transcription factor
.
J Biol Chem
1993
;
268
:
25229
38
.

83.

Maxam
AM
,
Gilbert
W
.
A new method for sequencing DNA
.
Proc Natl Acad Sci U S A
1977
;
74
:
560
4
.

84.

Kang
J
,
Lee
MS
,
Gorenstein
DG
.
Quantitative analysis of chemiluminescence signals using a cooled charge-coupled device camera
.
Anal Biochem
2005
;
345
:
66
71
.

85.

Rasimas
JJ
,
Kar
SR
,
Pegg
AE
,
Fried
MG
.
Interactions of human O6-alkylguanine-DNA alkyltransferase (AGT) with short single-stranded DNAs
.
J Biol Chem
2007
;
282
:
3357
66
.

86.

Das
PM
,
Ramachandran
K
,
vanWert
J
, et al.
Chromatin immunoprecipitation assay
.
Biotechniques
2004
;
37
:
961
9
.

87.

Haring
M
,
Offermann
S
,
Danker
T
, et al.
Chromatin immunoprecipitation: optimization, quantitative analysis and data normalization
.
Plant Methods
2007
;
3
:
11
.

88.

Orlando
V
,
Strutt
H
,
Paro
R
.
Analysis of chromatin structure by in vivo formaldehyde cross-linking
.
Methods
1997
;
11
:
205
14
.

89.

Fock-Chow-Tho
D
,
Topp
E
,
Ibeagha-Awemu
EA
, et al.
Comparison of commercial DNA extraction kits and quantitative PCR systems for better sensitivity in detecting the causative agent of paratuberculosis in dairy cow fecal samples
.
J Dairy Sci
2017
;
100
:
572
81
.

90.

Rhee
HS
,
Pugh
BF
.
Comprehensive genome-wide protein-DNA interactions detected at single-nucleotide resolution
.
Cell
2011
;
147
:
1408
19
.

91.

He
Q
,
Johnston
J
,
Zeitlinger
J
.
ChIP-nexus enables improved detection of in vivo transcription factor binding footprints
.
Nat Biotechnol
2015
;
33
:
395
401
.

92.

Qiao
S
,
Sun
S
,
Wang
L
, et al.
The RLA1/SMOS1 transcription factor functions with OsBZR1 to regulate brassinosteroid signaling and rice architecture
.
Plant Cell
2017
;
29
:
292
309
.

93.

Nemoto
Y
,
Nonoue
Y
,
Yano
M
, et al.
Hd1,a CONSTANS ortholog in rice, functions as an Ehd1 repressor through interaction with monocot-specific CCT-domain protein Ghd7
.
Plant J
2016
;
86
:
221
33
.

94.

Mao
C
,
Lu
S
,
Lv
B
, et al.
A rice NAC transcription factor promotes leaf senescence via ABA biosynthesis
.
Plant Physiol
2017
;
174
:
1747
63
.

95.

Hwang
Y
,
Choi
HS
,
Cho
HM
, et al.
Tracheophytes contain conserved orthologs of a basic helix-loop-helix transcription factor that modulate ROOT HAIR SPECIFIC genes
.
Plant Cell
2017
;
29
:
39
53
.

96.

Solomon
MJ
,
Varshavsky
A
.
Formaldehyde-mediated DNA-protein crosslinking: a probe for in vivo chromatin structures
.
Proc Natl Acad Sci U S A
1985
;
82
:
6470
4
.

97.

Solomon
MJ
,
Larsen
PL
,
Varshavsky
A
.
Mapping protein-DNA interactions in vivo with formaldehyde: evidence that histone H4 is retained on a highly transcribed gene
.
Cell
1988
;
53
:
937
47
.

98.

Gendrel
AV
,
Lippman
Z
,
Martienssen
R
, et al.
Profiling histone modification patterns in plants using genomic tiling microarrays
.
Nat Methods
2005
;
2
:
213
8
.

99.

Hanlon
SE
,
Lieb
JD
.
Progress and challenges in profiling the dynamics of chromatin and transcription factor binding with DNA microarrays
.
Curr Opin Genet Dev
2004
;
14
:
697
705
.

100.

Nishikori
S
,
Hattori
T
,
Fuchs
SM
, et al.
Broad ranges of affinity and specificity of anti-histone antibodies revealed by a quantitative peptide immunoprecipitation assay
.
J Mol Biol
2012
;
424
:
391
9
.

101.

Park
PJ
.
ChIP-seq: advantages and challenges of a maturing technology
.
Nat Rev Genet
2009
;
10
:
669
80
.

102.

Zhu
F
,
Luo
T
,
Liu
C
, et al.
An R2R3-MYB transcription factor represses the transformation of alpha- and beta-branch carotenoids by negatively regulating expression of CrBCH2 and CrNCED5 in flavedo of Citrus reticulate
.
New Phytol
2017
;
216
:
178
92
.

103.

Wang
H
,
Guan
S
,
Zhu
Z
, et al.
A valid strategy for precise identifications of transcription factor binding sites in combinatorial regulation using bioinformatic and experimental approaches
.
Plant Methods
2013
;
9
:
34
.

104.

Jiang
J
,
Xing
F
,
Zeng
X
, et al.
RicyerDB: a database for collecting rice yield-related genes with biological analysis
.
Int J Biol Sci
2018
;
14
:
965
70
.

105.

Rishi
V
,
Bhattacharya
P
,
Chatterjee
R
, et al.
CpG methylation of half-CRE sequences creates C/EBPalpha binding sites that activate some tissue-specific genes
.
Proc Natl Acad Sci U S A
2010
;
107
:
20311
6
.

106.

Liu
B
.
BioSeq-analysis: a platform for DNA, RNA, and protein sequence analysis based on machine learning approaches
.
Brief Bioinform
2017
. DOI: .

107.

Wei
L
,
Su
R
,
Wang
B
, et al.
Integration of deep feature representations and handcrafted features to improve the prediction of N6-methyladenosine sites
.
Neurocomputing
2019
;
324
:
3
9
.

108.

Wei
L
,
Zhou
C
,
Chen
H
, et al.
ACPred-FL: a sequence-based predictor based on effective feature representation to improve the prediction of anti-cancer peptides
.
Bioinformatics
2018
;
34
:
4007
16
.

109.

Wang
S
,
Li
S
,
Liu
Q
, et al.
The OsSPL16-GW7 regulatory module determines grain shape and simultaneously improves rice yield and grain quality
.
Nat Genet
2015
;
47
:
949
54
.

110.

Guo
S
,
Xu
Y
,
Liu
H
, et al.
The interaction between OsMADS57 and OsTB1 modulates rice tillering via DWARF14
.
Nat Commun
2013
;
4
:
1566
.

111.

Liu
Q
,
Han
R
,
Wu
K
, et al.
G-protein betagamma subunits determine grain size through interaction with MADS-domain transcription factors in rice
.
Nat Commun
2018
;
9
:
852
.

112.

Zhang
C
,
Xu
Y
,
Guo
S
, et al.
Dynamics of brassinosteroid response modulated by negative regulator LIC in rice
.
PLoS Genet
2012
;
8
:
e1002686
.

113.

Lu
Z
,
Yu
H
,
Xiong
G
, et al.
Genome-wide binding analysis of the transcription activator ideal plant architecture1 reveals a complex network regulating rice plant architecture
.
Plant Cell
2013
;
25
:
3743
59
.

114.

Li
S
,
Zhao
B
,
Yuan
D
, et al.
Rice zinc finger protein DST enhances grain production through controlling Gn1a/OsCKX2 expression
.
Proc Natl Acad Sci U S A
2013
;
110
:
3167
72
.

115.

Schmidt
R
,
Schippers
JH
,
Mieulet
D
, et al.
SALT-RESPONSIVE ERF1 is a negative regulator of grain filling and gibberellin-mediated seedling establishment in rice
.
Mol Plant
2014
;
7
:
404
21
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/journals/pages/open_access/funder_policies/chorus/standard_publication_model)