Transcriptional Regulation: a Genomic Overview

José Luis Riechmann

doi:10.1199/tab.0085

4 April 2002


The Arabidopsis Book, 2002(1): (2002). https://doi.org/10.1199/tab.0085

Abstract

The availability of the Arabidopsis thaliana genome sequence allows a comprehensive analysis of transcriptional regulation in plants using novel genomic approaches and methodologies. Such a genomic view of transcription first necessitates the compilation of lists of elements. Transcription factors are the most numerous of the different types of proteins involved in transcription in eukaryotes, and the Arabidopsis genome codes for more than 1,500 of them, or approximately 6% of its total number of genes. A genome-wide comparison of transcription factors across the three eukaryotic kingdoms reveals the evolutionary generation of diversity in the components of the regulatory machinery of transcription. However, as illustrated by Arabidopsis, transcription in plants follows similar basic principles and logic to those in animals and fungi. A global view and understanding of transcription at a cellular and organismal level requires the characterization of the Arabidopsis transcriptome and promoterome, as well as of the interactome, the localizome, and the phenome of the proteins involved in transcription.

1. Introduction

Many of the biological processes in a plant are regulated at the level of transcription. Changes in gene expression have been shown to underlie the response to environmental cues and stresses (such as light, temperature, and nutrient availability), the defense response against pathogens, the regulation of metabolic pathways, the regulation of photosynthesis, and the establishment of symbiotic relationships, to name a few. In plants, as well as in animals, development is based on the cellular capacity for differential gene expression (reviewed in: Scott, 2000; Benfey and Weigel, 2001). Accordingly, many of the genes identified in screens for Arabidopsis mutants with altered development (of the flower or root, for example) have been found to encode transcription factors. Alterations in gene expression are also emerging as a major source of the diversity and change that underlie the morphological evolution of eukaryotic organisms (Doebley and Lukens, 1998; Cubas et al., 1999b; Carroll, 2000; Tautz, 2000). In particular, morphological changes that occurred during plant domestication and crop improvement in agriculture have been associated with mutations in transcription factors (Peng et al., 1999), alterations in their expression (Doebley et al., 1997; Wang et al., 1999b), or changes in the expression of other types of regulatory proteins (Frary et al., 2000). Related transcription factors, such as the Arabidopsis MYB proteins WEREWOLF (WER) and GLABROUS1 (GL1), have been shown to be functionally equivalent, and owe their particular roles in plant development to differences in their expression patterns (Lee and Schiefelbein, 2001).

The availability of the Arabidopsis genome sequence (Lin et al., 1999; Mayer et al., 1999; Arabidopsis Genome Initiative, 2000; Salanoubat et al., 2000; Tabata et al., 2000; Theologis et al., 2000) allows a global, or genomic, analysis of transcriptional regulation in plants. Whereas the mechanisms of transcription are largely common across eukaryotes, their components vary among kingdoms. The complement of genes coding for transcriptional regulators in Arabidopsis has been described (Arabidopsis Genome Initiative, 2000; Riechmann et al., 2000). Their systematic functional characterization can be pursued with a variety of reverse genetic methods (Riechmann and Ratcliffe, 2000). In addition, gene expression profiling technologies, such as DNA microarrays, allow monitoring transcription factor activity at a genome-wide level. These studies should eventually lead to an understanding of the interplay of the transcription factors with the genome whose expression they control.

This chapter intends to provide a genomic perspective on transcriptional regulation in Arabidopsis. The first section briefly reviews the different types of proteins directly involved in transcription in eukaryotes, and our current understanding of how they function. The following sections consist of a description of the Arabidopsis complement of genes and proteins involved in transcriptional control, in particular sequence-specific DNA-binding transcription factors and chromatin-related proteins. Transcriptional regulators often act in a combinatorial fashion, and this mode of action is reviewed in the context of Arabidopsis promoters and cis-regulatory sequences, and of protein-protein interactions. Finally, genome-wide functional analyses of transcription factors, the characterization of the Arabidopsis promoterome, and of the transcriptome by gene expression profiling experiments, are considered. The availability of the genome sequence of different prokaryotic and eukaryotic organisms has provided for new ways of searching for unity and diversity among biological systems, and given birth to the field of comparative genomics. Although the subject of this book is Arabidopsis, reference is made in this chapter to other eukaryotic organisms, in order to situate the Arabidopsis genome information in a broader biological context.

2. Transcription machinery: concepts, components, and mechanisms

In eukaryotic organisms, regulation of gene expression proceeds through mechanisms that are fundamentally different from those in prokaryotes, which explains both the large number and diversity of proteins that are involved in the process, as well as how it can be tightly regulated to facilitate the diversification in expression patterns that is required for biological complexity (Struhl, 1999). In a prokaryote such as E. coli, the ground state for transcription is non-restrictive, that is, the RNA polymerase complex is not limited in its ability to gain access to the DNA and initiate RNA synthesis (Struhl, 1999). Negative regulation is rare, and exerted by sequence-specific repressors. Furthermore, it has been estimated that the global structure of the E. coli gene regulatory network possesses low complexity. On average, a transcription factor would regulate three genes, and an E. coli gene would be under the direct control of two transcription factors (Thieffry et al., 1998). There is a prominence of promoters controlled by a single regulator, and whereas many of the regulators regulate themselves (usually through auto-inhibitions), very rarely do they regulate other transcription factors (Thieffry et al., 1998).

In contrast, the ground state for transcription in eukaryotes is restrictive, as a result of the packing of the DNA into chromatin, which blocks the recognition of the core promoters by the basic transcription machinery (Kornberg, 1999; Struhl, 1999). The effects of chromatin structure on promoter accessibility make chromatin modifying activities necessary for eukaryotic transcription, and have important implications for the way transcription factors act. In addition to the components of the basic transcription machinery and to scores of sequence-specific DNA-binding transcription factors, eukaryotic genomes contain a variety of genes that code for chromatin-related proteins. Furthermore, transcriptional regulators in eukaryotes operate following a combinatorial logic (an efficient way of increasing the number and diversity of gene regulatory activities), and the complexity of the regulatory networks can be great.

Prokaryotic sequence-specific DNA-binding transcription factors often recognize binding sites longer than 12 base-pairs (bp) (see RegulonDB, http://www.cifn.unam.mx/regulondb/ and DPInteract, http://arep.med.harvard.edu/dpinteract) (Robison et al., 1998; Salgado et al., 2001), whereas binding sites for eukaryotic transcription factors are usually shorter, 5 to 10 bp long. A combinatorial mechanism composed of factors that recognize short sequences is probably a more economical way (requires a reduced number of factors) of selectively regulating the expression of tens of thousands of genes, than a mechanism based upon factors that are each dedicated to control a small number of genes and operate through longer target sites. Thus, the DNA binding characteristics of eukaryotic transcription factors, and the mechanisms of transcription themselves, might be operationally and evolutionarily related to features of eukaryotic genomes such as the vast increase in genome size and in the number of genes to be regulated.
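The link between binding-site length and genome size can be illustrated with a back-of-the-envelope calculation. The sketch below is only an illustration and is not part of the analyses cited above: it assumes a random sequence with equal base frequencies, counts both DNA strands, and uses rounded genome sizes (about 4.6 Mb for E. coli and about 125 Mb for Arabidopsis) that are supplied here as assumptions. The function name expected_random_sites is hypothetical.

```python
def expected_random_sites(genome_length_bp: int, site_length_bp: int) -> float:
    """Expected number of chance matches to one exact site of a given length.

    Assumes equal base frequencies (p = 1/4 per position), independent
    positions, and a search over both DNA strands.
    """
    positions = genome_length_bp - site_length_bp + 1
    if positions <= 0:
        return 0.0
    return 2 * positions / 4 ** site_length_bp


# Illustrative, rounded genome sizes in base pairs (assumptions).
GENOMES = {"E. coli": 4_600_000, "Arabidopsis": 125_000_000}

if __name__ == "__main__":
    for name, size in GENOMES.items():
        for k in (6, 12):
            count = expected_random_sites(size, k)
            print(f"{name}, {k}-bp site: {count:,.1f} expected chance matches")
```

Under these simplifying assumptions, a 12-bp site is expected less than once by chance in a genome the size of that of E. coli, whereas a 6-bp site is expected tens of thousands of times in a genome the size of that of Arabidopsis. A single short site therefore cannot specify a unique target on its own, which is consistent with selectivity arising from combinations of factors.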

Briefly, the proteins involved in transcription in eukaryotes can be classified into four different functional groups: (1) the basic transcription apparatus and intrinsic associated factors (also known as general transcription factors, or GTFs); (2) large multi-subunit coactivators and other cofactors; (3) sequence-specific DNA-binding transcription factors; and (4) chromatin-related proteins. In contrast to the components of the basal transcription machinery, which in general are highly conserved, coregulators and transcription factors have diverged largely among eukaryotes. The roles that the proteins in these four classes play can be summarized as follows (after Lee and Young, 2000; Lemon and Tjian, 2000, and http://web.wi.mit.edu/young/pub/regulation.html) (for an extensive coverage of the mechanisms of eukaryotic transcription, see: Latchman, 1998; Elgin and Workman, 2000; White, 2001).

(1) The basic transcription apparatus, and intrinsic associated factors. In eukaryotic organisms, there are three different RNA polymerases, which are responsible for the synthesis of rRNA (Pol I), mRNA (Pol II), and tRNA, 5S rRNA, and other small RNA molecules (Pol III). The focus of this chapter is the transcription of protein-encoding genes, which is carried out by Pol II exclusively. Pol II is a multi-subunit enzyme (Cramer et al., 2001) that requires accessory factors to recognize promoter sequences and accurately initiate transcription. These general transcription factors (GTFs) include TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIIH. GTFs carry out a variety of different functions, from positioning the polymerase on the promoter (TFIIB) to unwinding its DNA (TFIIH). TFIID is a multi-subunit complex that is generally responsible for promoter recognition. It contains the TATA-box binding protein (TBP) and several TBP-associated factors (TAFs) (reviewed in Green, 2000). The TAF subunits of TFIID are critical for the responsiveness of the basic apparatus to transcriptional activators. However, individual TAFs are not essential for transcription of all genes in a genome. TAFs contribute to the specificity and variety of transcriptional responses: distinct TAFs can be targeted by different classes of activators, and individual TAFs can function as promoter selectivity factors. Furthermore, some TAFs can form part of other multi-subunit regulatory complexes, in addition to TFIID, such as the histone acetylation SAGA complex; and whereas most of the TAFs are ubiquitously expressed, some are expressed in a tissue or cell-type specific manner, which can lead to the formation of different TAF-containing complexes (for a review on gene-selective roles of GTFs and TAFs, see Veenstra and Wolffe, 2001).

(2) Large multi-subunit coactivators, and cofactors that bind sequence-specific transcription factors. This heterogeneous class of regulatory proteins includes cofactors that interact with sequence-specific transcription factors and modulate their DNA binding or interaction with the core machinery, as well as large multi-subunit coactivators such as the Mediator complex, initially identified in yeast. Multi-subunit coactivators interact with Pol II and/or with multiple types of activators, serving as a modular adapter to regulate transcription initiation (Hampsey and Reinberg, 1999). The Mediator (or Mediator-like) complex is found in organisms from yeast to humans, but its number of subunits varies, and the complex from one organism might contain subunits that have no orthologs in another (Malik and Roeder, 2000; Rachez and Freedman, 2001).

(3) Sequence-specific DNA-binding transcription factors (activators and repressors). These are transcription factors of the classic type: usually defined as proteins that show sequence-specific DNA binding and are capable of activating and/or repressing transcription. They are responsible for the selectivity in gene regulation, and are often themselves expressed in a tissue, cell-type, temporal, or stimulus-dependent specific manner. Transcription factors are modular proteins, with distinct and functionally separable domains, such as DNA-binding and activation domains. Most known transcription factors can be grouped into families according to their DNA binding domain (Luscombe et al., 2000). Transcription factors can interact directly with different components of the general machinery and with coactivators, affecting complex formation. They can also interact with chromatin remodeling complexes.

(4) Chromatin-related proteins. This group includes factors that covalently modify histones (such as histone acetylases and deacetylases), and remodeling complexes that hydrolyze ATP for reorganizing chromatin structure (such as the SWI/SNF and ISWI complexes). Histone acetylation is generally a characteristic of transcribed chromatin, whereas deacetylation is associated with repression. Accordingly, histone acetyltransferase activities are found in coactivators, and deacetylase activities in corepressors. Chromatin proteins usually form part of multi-subunit complexes.

Using the regulation of the HO endonuclease gene in yeast (Saccharomyces cerevisiae) as a paradigm, the steps leading to transcriptional activation can be summarized as follows (Cosma et al., 1999; Cosma et al., 2001). Upstream sequences are recognized by a transcription (enhancer-binding) factor, with accessibility to its target sites despite the packing of the DNA into chromatin fibers. This transcription factor recruits the SWI/SNF complex, which then recruits SAGA, and results in the remodeling of chromatin and localized histone acetylation, which facilitates the access of additional transcription factors to cis-regulatory sequences. The secondary activators direct gene transcription through multiple interactions with cofactors and the core machinery, recruiting the RNA polymerase complex to the transcription initiation site. The specific order in which the different chromatin-modifying complexes are recruited can vary among promoters and organisms, but the dual role of activators, first enlisting chromatin modifying activities and then inducing localization of the basal transcription apparatus, appears to be widespread in eukaryotes, including plants (see below, and: Agalioti et al., 2000; Brown et al., 2001; Merika and Thanos, 2001).

In many instances, the correct functioning of a gene requires the termination of the activation of its transcription to be as rapid or precise as its initial triggering. Termination of activation can be accomplished by several mechanisms, among them the targeted destruction of transcription factors after their interaction with the basal transcription machinery. Phosphorylation of a transcription factor molecule by kinases that form part of the Pol II holoenzyme (such as Srb10 or TFIIH) would mark it for ubiquitin-mediated destruction, effectively preventing it from engaging in another Pol II initiation event, and freeing the promoter sequence to interact with another transcription factor molecule (reviewed in Tansey, 2001).

In addition to the mechanisms of transcriptional control that the classes of proteins described in this section mediate, there are at least two other possible levels of regulation of gene expression in eukaryotes: DNA methylation and nuclear organization. DNA methylation is associated with suppressed gene expression, and is reviewed in other chapters of this book (see also: Finnegan et al., 2000; Habu et al., 2001). Nuclear organization could provide for a higher level of regulation of gene expression, where different transcriptional functions might be segregated into distinct compartments (for models and reviews, see: Francastel et al., 2000; Lemon and Tjian, 2000; Cremer and Cremer, 2001; Misteli, 2001).

Of all the groups of proteins involved in transcription, the most numerous one is that of sequence-specific DNA-binding transcription factors. They are the principal factors upon which the mechanisms for selectivity of gene activation are built, and the basic (although not the only) protein components of the combinatorial logic of transcription.

3. Transcription factor gene content of the Arabidopsis genome

The analysis of the Arabidopsis genome sequence indicates that it codes for at least 1,572 transcription factors, which account for ∼6% of its estimated ∼26,000 genes (Arabidopsis Genome Initiative, 2000; Riechmann et al., 2000) (Table 1). This observation, however, represents an underestimate of the total number of transcription factors, given that, at present, approximately 40% of the proteins predicted from the genome sequence cannot be assigned to functional categories on the basis of sequence similarity to proteins of known biochemical function (Lin et al., 1999; Mayer et al., 1999; Arabidopsis Genome Initiative, 2000; Salanoubat et al., 2000; Tabata et al., 2000; Theologis et al., 2000). Some of those uncharacterized proteins are expected to be transcriptional regulators and, in fact, novel classes of transcription factors are still being discovered (for example: Boggon et al., 1999; Schauser et al., 1999; Kawaoka et al., 2000; Nagano et al., 2001; Windhövel et al., 2001). Therefore, the total number of transcription factor genes present in Arabidopsis (as well as, for the same reasons, in any other of the sequenced eukaryotic genomes) will be uncertain for some time.

A question pertaining to genome-wide surveys is whether all the proteins identified by sequence similarity searches do indeed belong to the functional groups into which they are being catalogued. In the case of transcription factors, the answer depends on the particular gene family that is considered. If the conserved DNA-binding domain that defines a gene family is poor in sequence information (for example, some zinc-coordinating motifs), the ratio of false positives in the searches can be relatively high (although it can often be reduced by additional sequence comparison strategies that are beyond the scope of this chapter, see: Riechmann et al., 2000). On the other hand, many families are defined by long DNA-binding domains (50 to 70 amino acids) with multiple residues being highly conserved (that is, the domains are rich in sequence information). The three-dimensional structure of these domains might have been solved, and revealed the contacts between some of the conserved residues and the DNA. In cases like these, such as for example the homeobox and the AP2/ERF (APETALA2/ethylene response factor) families, it is reasonable to expect all the members of the gene family to be transcription factors (activators or repressors) (the AP2/ERF family was initially referred to as AP2/EREBP, for AP2/ethylene responsive element binding protein). However, there are cases in which a family of bona fide transcription factors might also contain members that have additional functions (for reviews on multifunctional transcription factors: Ladomery, 1997; Wilkinson and Shyu, 2001). For example, the Drosophila homeodomain protein Bicoid directs anterior embryo development both by regulating transcription and by interacting with Caudal mRNA and inhibiting its translation, thus restricting Caudal (which is another homeodomain protein) accumulation to the posterior part of the embryo through posttranscriptional control. 
Both DNA- and RNA-binding are specified by the Bicoid homeodomain, but by distinct subregions or residues in it (Niessing et al., 2000). The Arabidopsis MYB-related protein AtCDC5 is known to be homologous to the S. cerevisiae CEF1 and S. pombe Cdc5 proteins (Hirayama and Shinozaki, 1996; Ohi et al., 1998). Cdc5 proteins are essential for the G2/M progression, but their molecular functions are not completely understood, as they are required for pre-mRNA splicing and associate with core components of the splicing machinery, but also show sequence-specific binding to double stranded DNA and transactivation potential (Burns et al., 1999; Lei et al., 2000). It is thus possible that Cdc5 proteins are another example of factors with several distinct molecular functions. Two other members of the Arabidopsis MYB-related family, AtTRP1 and AtTBP1, have been identified as telomere-binding proteins (Chen et al., 2001a; Hwa Ng et al., 2001), although a possible role in transcription cannot be ruled out because the 5′ regions of some Arabidopsis genes contain two or more non-contiguous telomeric repeats (Regad et al., 1994). These examples illustrate the limitations of using sequence similarity to assign potential roles to proteins that are otherwise uncharacterized, and also how the determination of their molecular functions can be elusive. Similar cases might occur within some of the zinc-coordinating protein families, since the same or related motifs can be involved in DNA- and RNA-binding, and may be present in proteins with functions involving nucleic acid binding but not necessarily transcriptional regulation. For example, vertebrate Y-box proteins contain a zinc-coordinating cold-shock domain, and are often dual DNA- and RNA-binding proteins that can regulate transcription and/or translation (reviewed in: Matsumoto and Wolffe, 1998; Sommerville, 1999).

With these caveats in mind, the Arabidopsis complement of transcription factors has been the subject of an extensive genome-wide descriptive analysis, which also included a comparison with those of Drosophila melanogaster, Caenorhabditis elegans, and Saccharomyces cerevisiae (Riechmann et al., 2000). The main conclusions of that study are summarized here.

The 1,572 transcriptional regulator genes identified in the Arabidopsis genome are classified into more than 45 different gene families (Table 1; Figure 1), all of which are scattered throughout the genome. In addition, there are a few single-copy or “orphan” genes, such as LEAFY (LFY). Transcriptional regulators represent approximately 4.6, 3.5, and 3.5% of the genes in Drosophila, C. elegans, and yeast, respectively (Riechmann et al., 2000). Thus, the Arabidopsis content of transcription factors is 1.3 times that of Drosophila, and 1.7 times that of C. elegans and yeast (Riechmann et al., 2000). The large number and diversity of transcription factors in Drosophila were proposed to be related to its substantial regulatory complexity (Adams et al., 2000). Applying the same logic to Arabidopsis suggests that the regulation of transcription in plants is as complex as that in Drosophila. Furthermore, if the estimated total number of genes in humans, 30,000–40,000, and of transcription factor genes, 1,850–2,000, are correct (International Human Genome Sequencing Consortium, 2001; Tupler et al., 2001; Venter et al., 2001), then the transcription factor gene content of Arabidopsis and of H. sapiens are similar (∼6% in Arabidopsis, versus 4.6–6.6% in humans). It should be noted, however, that there is a substantial degree of uncertainty about these estimates of gene numbers in humans (see, for example: Hogenesch et al., 2001; Wright et al., 2001).
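The ratios quoted in this paragraph follow from simple arithmetic on the percentages and gene counts given in the text. The sketch below only re-derives them; the dictionary and function names are hypothetical, and the value of 6% for Arabidopsis is the rounded figure used above.

```python
# Percentage of genes coding for transcriptional regulators, as quoted above.
TF_PERCENT = {
    "Arabidopsis": 6.0,
    "Drosophila": 4.6,
    "C. elegans": 3.5,
    "yeast": 3.5,
}


def fold_over(reference: str, other: str) -> float:
    """Ratio of transcription factor gene content between two organisms."""
    return TF_PERCENT[reference] / TF_PERCENT[other]


def percent_range(tf_low: int, tf_high: int, genes_low: int, genes_high: int):
    """Lowest and highest percentage implied by two ranges of counts."""
    return 100 * tf_low / genes_high, 100 * tf_high / genes_low


if __name__ == "__main__":
    print(round(fold_over("Arabidopsis", "Drosophila"), 1))
    print(round(fold_over("Arabidopsis", "yeast"), 1))
    low, high = percent_range(1850, 2000, 30_000, 40_000)
    print(f"human estimate: {low:.2f}% to {high:.2f}%")
```

The lower bound for humans pairs the smallest transcription factor count with the largest gene count, and the upper bound does the reverse. The upper bound evaluates to 6.67%, which corresponds to the 6.6% quoted above when truncated rather than rounded.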

Transcription factors, the networks that they form, and the genes that they regulate, have been proposed as a possible objective measurement (connectivity of gene-regulation networks) of the biological complexity of an organism (Szathmáry et al., 2001). From that point of view, the large number of transcription factors in Arabidopsis was interpreted in the context of the complexity of secondary metabolism in plants (Szathmáry et al., 2001), but it might also be related to the complex interactions between plants and the environment (both biotic and abiotic) as well as to the degree of duplications in the genome (see below, and Arabidopsis Genome Initiative, 2000).

The extent to which the Arabidopsis complement of transcription factors represents that of other plants is still an open question. Since the evolutionary divergence between the monocot and dicot lineages is a relatively recent event, which perhaps occurred ∼200 million years ago (Savard et al., 1994), it could be expected that the overall composition of monocot and dicot transcription factor complements would be similar. In fact, the largest transcription factor families in Arabidopsis also appear to be the most prevalent ones in monocotyledonous plants. For example, the phylogenetic comparison of a subset of maize and Arabidopsis MYB-(R1)R2R3 sequences shows that the amplification of the gene family occurred prior to the separation of monocots and dicots (Rabinowicz et al., 1999). In addition, within phylogenetically well-studied families of transcription factors, such as the MADS-box family, many examples of orthology can be identified between Arabidopsis genes and those from rice or maize, and even from gymnosperms (reviewed in: Theissen et al., 2000; Ng and Yanofsky, 2001) (see also http://www.mpiz-koeln.mpg.de/mads). Putative orthologous MADS-box genes have regularly maintained conserved functions, even after substantial sequence divergence (Theissen et al., 2000). However, it is also apparent that diversity in transcriptional regulators will be found within the plant kingdom, and between monocots and dicots. Many MADS-box gene duplication and diversification events occurred after separation of the moss and fern lineages from the lineage that originated the flowering plants (Münster et al., 1997; Hasebe et al., 1998; Krogan and Ashton, 2000; Svensson et al., 2000), and at least two clades of MADS-box genes appear to have been amplified in the phylogenetic lineage that led to grasses with respect to Arabidopsis (Theissen et al., 2000). 
Similarly, whereas most of the amplification of the MYB-(R1)R2R3 gene family occurred prior to the separation between monocots and dicots, several subgroups in maize appear to have originated recently or undergone duplication (some of these duplications are likely to be associated with the allotetraploid origin of the maize genome, but others do not reflect it: Rabinowicz et al., 1999). These recent expansions could have allowed a functional diversification that might not be present in Arabidopsis.

An issue that impinges on the question of the similarity of the Arabidopsis complement of transcription factors with that of other plants is the degree of completeness of the current characterization (i.e., sequence determination and analysis) of the Arabidopsis genome, in particular if that question is to be addressed on a gene-by-gene basis. TRM1 is a maize C2H2 zinc finger transcription factor involved in the repression of rbcS gene expression in mesophyll cells that is related to the mammalian transcription activator-repressor YY1 (Xu et al., 2001). A BLAST search of the higher plant DNA sequences available in GenBank (July 2001) identifies homologous genes in other monocots (Triticum aestivum) as well as in dicotyledonous plants (Nicotiana tabacum, Solanum tuberosum), but not in Arabidopsis. It is possible that an Arabidopsis TRM1 homolog resides in one of the still unsequenced segments of the genome (see http://www.arabidopsis.org). Similarly, there are a few Arabidopsis transcription factor genes represented by ESTs or BAC-end sequences that still cannot be identified in the genome sequence. The limitations of the current sequencing technologies make it impractical or impossible to determine the sequence of eukaryotic genomes to absolute completeness. Thus, a failure to identify a particular gene in the genome sequence of an organism should not be taken as definitive proof of the absence of that gene. In addition, gene sequences might diverge more than expected, in which case the identification of homologous genes might require more sophisticated sequence analysis than a standard BLAST search. For example, a homolog of the mammalian tumor suppressor gene p53 can be identified in the sequence of the C. elegans genome, despite initial reports that no p53-like gene was present in that organism (Derry et al., 2001; Schumacher et al., 2001).

The genome-wide comparison of transcription factors among eukaryotic organisms (Arabidopsis, Drosophila, C. elegans, and S. cerevisiae, encompassing the plant, animal, and fungal kingdoms) reveals the evolutionary generation of diversity in the regulation of transcription (Riechmann et al., 2000). Each of these eukaryotic kingdoms has its own set of particular transcription factor families and genes. Members of kingdom-specific families represent 45% of the Arabidopsis complement of transcriptional regulators, whereas those of families that are present in all four organisms account for 53% (Figure 2). In each organism, a minority (2–5%) of its transcription factors belong to families that are present in two of the three kingdoms: in animals and yeast (SOX/TCF, Fork head, and RFX1-like transcription factors) or in plants and animals (TULP, CPP, and E2F/DP families) (Figure 2) (Riechmann et al., 2000). This distribution of genes and gene families reflects the evolutionary history of eukaryotes. According to molecular phylogenetic analyses, plants, animals and fungi all diverged from a common ancestor during a short period of time, ∼1.5 billion years ago (Wang et al., 1999a; Philippe et al., 2000; Nei et al., 2001). Thus, most of the transcription factor families are either shared by the three kingdoms (those that were present in the common ancestor), or specific to each one (those families that arose independently following divergence).
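The grouping of families described above (shared by all three kingdoms, present in two, or specific to one) amounts to a simple classification by presence or absence. The following sketch illustrates that logic only. The two-kingdom entries are taken from the text, the homeodomain and MADS families are listed as shared because they are described in this chapter as present in all eukaryotes, and the AP2/ERF entry is an assumption added here as an example of a plant-specific family. All names in the code are hypothetical.

```python
from collections import Counter

KINGDOMS = frozenset({"plants", "animals", "fungi"})

# Kingdoms in which each family is found (a small illustrative subset).
FAMILY_KINGDOMS = {
    "homeodomain": {"plants", "animals", "fungi"},
    "MADS": {"plants", "animals", "fungi"},
    "SOX/TCF": {"animals", "fungi"},
    "Fork head": {"animals", "fungi"},
    "E2F/DP": {"plants", "animals"},
    "TULP": {"plants", "animals"},
    "AP2/ERF": {"plants"},  # assumption: treated here as plant-specific
}


def classify(kingdoms) -> str:
    """Assign a family to a category from the kingdoms in which it occurs."""
    present = set(kingdoms) & KINGDOMS
    if present == KINGDOMS:
        return "shared by all three kingdoms"
    if len(present) == 2:
        return "present in two kingdoms"
    if len(present) == 1:
        return "kingdom-specific"
    raise ValueError("family is not assigned to any known kingdom")


def tally(families: dict) -> Counter:
    """Count families per category."""
    return Counter(classify(k) for k in families.values())


if __name__ == "__main__":
    for category, n in tally(FAMILY_KINGDOMS).items():
        print(f"{category}: {n}")
```

The percentages given in the text count genes rather than families, so reproducing them would require weighting each family by its number of members, which this sketch does not attempt.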

Many of the Arabidopsis transcription factor gene families are large (Table 1). However, none has been so disproportionately amplified as the nuclear hormone receptors in C. elegans (∼38% of its transcription factors), the C2H2 zinc finger proteins in Drosophila (∼46%), or the C6 and C2H2 families in yeast (∼25% each one) (Figure 3) (Riechmann et al., 2000). The three largest families of transcription factors in Arabidopsis, AP2/ERF, bHLH (basic-region helix-loop-helix), and MYB-(R1)R2R3, each represent only ∼9% of the total, and there are several other families with comparable numbers of genes (Figure 3) (Riechmann et al., 2000). The two transcription factor families that have been more substantially amplified in plants, as compared to animals and yeast, are the MYB and the MADS families. Another difference between the Arabidopsis complement of transcription factors and those of the other eukaryotes is that less than 25% of it consists of zinc coordinating proteins, whereas zinc coordinating transcription factors represent ∼51% of the total in Drosophila, ∼64% in C. elegans, and ∼56% in yeast (Riechmann et al., 2000).

The Arabidopsis transcription factors that belong to families that are common to all eukaryotes do not share significant similarity with those from the other kingdoms, except in the conserved DNA binding domains that define the respective families (Riechmann et al., 2000). Furthermore, diversity in protein sequence and structure is increased by domain shuffling (Figure 1). Shuffling of some of the DNA-binding domains that are present in all eukaryotes has generated novel transcription factors with plant-specific combinations of modules, as for example in the homeodomain, MADS, and ARID protein families (Figures 1 and 4) (Riechmann et al., 2000).

The Arabidopsis genome contains many tandem gene duplications and large-scale duplications on different chromosomes (Arabidopsis Genome Initiative, 2000; Blanc et al., 2000; Vision et al., 2000). Whereas some of these duplications have been followed by rearrangements and divergent evolution, up to 40 to 60% of the Arabidopsis genes might comprise pairs of highly related sequences (the percentage depending on the parameters used in the analyses) (Arabidopsis Genome Initiative, 2000; Blanc et al., 2000). Transcription factor genes follow these general observations. A comparison of the transcription factor complement to itself (all-against-all) revealed that, on average, closely related genes account for ∼45% of the total number in the major families (a pair of proteins was considered highly similar if they showed >60% amino acid sequence identity along at least two-thirds of the length of one of them) (Riechmann et al., 2000). The pairs or groups of closely related genes correspond more often to duplications in different chromosomes (∼65% on average), or to duplications in the same chromosome but at very large distances (∼22%), than to tandem repeats (∼13%) (Riechmann et al., 2000). In addition, clusters of three or more homologous transcription factor genes are very rare in the genome (Riechmann et al., 2000). This distribution indicates that it will be feasible to generate double or triple mutants for the majority of the pairs or groups of highly related genes that, because of their sequence similarity, might have overlapping or partially redundant functions (which might not be revealed by single mutant analyses; see below).
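The pairwise criterion mentioned above (>60% identity along at least two-thirds of the length of one of the two proteins) can be written as a small predicate. This is a minimal sketch of one possible reading of that criterion, not the procedure used in the cited study: it assumes that a pairwise alignment has already been computed and summarized as a count of identical residues and a count of aligned residue pairs, and the function and parameter names are hypothetical.

```python
def is_highly_similar(
    identical: int,
    aligned: int,
    len_a: int,
    len_b: int,
    min_identity: float = 0.60,
    min_coverage: float = 2 / 3,
) -> bool:
    """Apply the pairwise similarity criterion to alignment summary statistics.

    identical: number of identical residue pairs in the alignment
    aligned:   number of aligned residue pairs (gap columns excluded)
    len_a, len_b: full lengths of the two proteins, in residues
    """
    if aligned <= 0 or min(len_a, len_b) <= 0:
        return False
    identity = identical / aligned
    # "One of them": the aligned region covers the largest fraction of the
    # shorter protein, so coverage is measured against the shorter length.
    coverage = aligned / min(len_a, len_b)
    return identity > min_identity and coverage >= min_coverage


if __name__ == "__main__":
    # 70% identity over 100 residues of a 120-residue protein
    print(is_highly_similar(70, 100, 120, 300))
    # same identity, but the alignment covers only half of the shorter protein
    print(is_highly_similar(70, 100, 200, 300))
```

Measuring coverage against the shorter protein is a design choice that follows from the wording "one of them". A stricter variant would require the coverage threshold to be met for both proteins.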

The analysis of ∼120,000 Arabidopsis expressed sequence tags (ESTs) (sequences available in GenBank in January 2001) suggests that, in terms of overall expression and considered as a whole, transcription factor genes are not substantially different from the rest of the genes in the genome. Approximately half of the ∼26,000 predicted genes are matched by an EST (Arabidopsis Genome Initiative, 2000; Theologis et al., 2000). Similarly, when the major Arabidopsis transcription factor families are considered, ∼47% of the genes are represented by an EST (Table 2). This observation is in contrast to the sometimes common assumption that, because of their regulatory nature, genes of this class are generally expressed at low levels.

4. Chromatin remodeling proteins.

Chromatin structure is an important element of the mechanisms that determine gene expression patterns in eukaryotes, because nucleosome assembly eliminates the accessibility of promoter sequences for the basal transcription machinery. The unfolding of packed chromatin is necessary for gene expression and, conversely, repression requires the formation and maintenance of condensed chromatin structures. Gene silencing and epigenetic phenomena, in which chromatin structure and histone modifications play a role, are by themselves the subject of other chapters in this book.

As summarized above, one of the mechanisms of transcription factor action is the recruitment of chromatin remodeling complexes to target promoters. This mechanism has been deduced from research on transcription in yeast and mammalian cells, but studies on the regulation of the β-phaseolin (phas) gene in bean (Phaseolus vulgaris) suggest that it also operates in plants (reviewed in Li et al., 2001a). The phas gene is silenced in vegetative tissues as a consequence of the positioning of a nucleosome over the TATA boxes of the promoter, making them inaccessible to TBP, whereas nucleosome displacement allows the gene to be highly expressed during seed development (Li et al., 1998). Such modification in chromatin structure results from the presence of the seed-specific transcription factor PvALF, a member of the ABI3/VP1 family (Li et al., 1999). However, PvALF is not sufficient for phas transcriptional activation, which does not occur in the absence of abscisic acid (ABA) (Li et al., 1998). Thus, a plausible model is that PvALF mediates chromatin reconfiguration, then allowing the binding of ABA-responsive transcription factors and the recruitment and assembly of the basal transcription machinery on the phas promoter (Li et al., 2001a).

The remodeling or reconfiguration of chromatin involves different types of enzymes, such as members of the SWI2/SNF2 subfamily of the DEAD/H box superfamily of nucleic-acid stimulated ATPases, and proteins that covalently modify histones, such as acetyltransferases (HATs) and deacetylases (HDACs), kinases, and methyltransferases (for reviews: Kadonaga, 1998; Elgin and Workman, 2000; Fry and Peterson, 2001; Jenuwein, 2001; Urnov and Wolffe, 2001). All eukaryotes appear to contain several proteins belonging to each one of these types, and each type can be further divided into different structural subclasses. Such structural diversity allows different proteins with the same biochemical activity to be involved in specialized cellular functions. The chromatin proteins with enzymatic activity usually form part of multi-subunit complexes, which might be necessary for their specificity and functionality.

In general, histone acetylation is a characteristic of transcribed chromatin, whereas deacetylation is associated with repression. HATs acetylate the ε-amino groups of specific lysine residues in the amino-terminal tails of the histone proteins that form the octamer around which the DNA wraps in the nucleosomes. Histone deacetylase-containing complexes reverse this covalent modification (reviewed in Khochbin et al., 2001). The molecular mechanisms by which histone acetylation affects chromatin structure and influences transcription could involve the destabilization of interactions between the DNA and the histone octamer (by neutralizing positive charges), interference with the high-order packing of chromatin, or the modification of interactions between histones and other proteins (reviewed in: Marmorstein, 2001a; Marmorstein and Roth, 2001; Roth et al., 2001). Other types of post-translational modifications, such as phosphorylation and methylation, also occur on histones, and can regulate chromatin structure and transcriptional activation and repression (reviewed in Marmorstein, 2001a). Methylation of specific lysine residues in histone tails is a relatively stable modification, thus providing a stable epigenetic mark for transcriptional regulation and gene silencing via heterochromatin assembly (reviewed in: Jenuwein, 2001; Rice and Allis, 2001). Lysine methyltransferase activity has been demonstrated for several eukaryotic SET domain proteins (Table 3). In addition, histone tails can also be methylated at arginine residues by a different class of enzymes that act as coactivators of transcription (Chen et al., 1999).

A “histone code” hypothesis has been proposed, suggesting that distinct covalent histone modifications might be used by the cell, sequentially or in combination, to generate a “code” that could be read by other proteins to produce different transcriptional outputs (Strahl and Allis, 2000). Reading the “histone code” would necessitate protein domains that recognize, in a receptor-ligand type of interaction, the different covalent modifications that can occur on histones (Strahl and Allis, 2000). Binding activities have been identified in several of the protein domains that are frequently found in chromatin-related proteins, such as the bromodomain and the chromodomain, which can recognize acetylated- and methylated-lysine residues of the histone tails, respectively (Table 3). A further level of complexity in regulatory mechanisms is inferred from the observation that the same covalent modifications that can be found on histones also occur on other proteins involved in transcriptional control. For example, histones are not the only targets for HATs, as HAT-catalyzed acetylation can also regulate the activity of transcription factors and co-factors (reviewed in: Sterner and Berger, 2000; Chen et al., 2001b). Lastly, another group of enzymes involved in chromatin remodeling is that of the DNA-dependent ATPases of the SWI2/SNF2 type. Yeast SWI2/SNF2 is the catalytic subunit of the multiprotein SWI/SNF remodeling complex, which can mediate the repositioning of nucleosomes by sliding histone octamers to other sites on the same DNA molecule, as well as by transferring them to other DNA molecules (reviewed in: Vignali et al., 2000; Flaus and Owen-Hughes, 2001).

A catalogue of known and putative chromatin proteins in Arabidopsis and maize has been compiled in ChromDB, a database that aims to present information on the entire complement of chromatin proteins in plants (http://chromdb.biosci.arizona.edu/). ChromDB lists over 220 different Arabidopsis chromatin genes, including SWI2/SNF2 homologs (22 genes), HATs (12 genes; 10 are listed as HATs and 2 as TAF_II250 homologs; Table 4), HDACs (17 genes; Table 4), and SET-domain-protein genes (29 genes), and also includes histones (50 genes) and homologs of subunits of global transcription factors.

The definition and identification of the complement of chromatin proteins in Arabidopsis, or in any other eukaryotic organism, and in particular of the subset of those proteins that might be involved in transcriptional control, is complicated by several factors. First, chromatin remodeling is mediated by multiprotein complexes, some of which have already been purified and characterized from yeast and animal (mammalian, Drosophila) cells, but none from plants. Some of these complexes (for example, the yeast SAGA and human PCAF complexes) show a remarkable conservation in subunit composition, but there are also cases of proteins and complexes that are specific to one kingdom (Sterner and Berger, 2000, and see below). Thus, biochemical studies will be needed to obtain a complete description of the Arabidopsis complement of chromatin proteins. Another complication for the identification of bona fide chromatin proteins arises from their multi-domain architecture. Chromatin proteins frequently combine different domains or motifs of distinct molecular functions (Table 3). However, those domains are not necessarily unique to chromatin proteins, as they (or related sequences) can be present in other types of proteins. For example, an Arabidopsis protein that contains sequences related to the chromodomain is localized to the chloroplast and forms part of the chloroplast signal recognition particle pathway (Klimyuk et al., 1999). Lastly, the structure of chromatin influences not only transcription, but also other nuclear processes that are physically associated with the genome, such as replication, recombination, and DNA repair. Thus, that a protein is chromatin-related does not necessarily imply that it is involved in transcriptional control. 
For these different reasons, the identification and description by sequence similarity searches of the complement of chromatin proteins involved in transcriptional regulation, and of their biochemical and molecular functions, is more complicated than that of the sequence-specific DNA-binding transcription factors discussed above.

It is apparent from the content in known chromatin genes of the Arabidopsis genome that chromatin remodeling is important in plants for the control of gene expression. That some of the molecular mechanisms for chromatin reconfiguration and transcriptional control are conserved among plants and the other eukaryotic kingdoms can be deduced from the presence of orthologous genes. Furthermore, similarities or functional equivalence at the molecular or physiological level has been demonstrated in some cases, as illustrated with the following examples.

An RPD3-type maize histone deacetylase has been shown to complement a S. cerevisiae null mutant in the homologous RPD3 gene (Rossi et al., 1998).

The Arabidopsis gene BUSHY (BSH), which codes for a protein with high sequence similarity to S. cerevisiae SNF5 (a component of the SWI/SNF remodeling complex), can partially complement a snf5 mutation in yeast (Brzeski et al., 1999).

Arabidopsis homologs of human CBP/p300 proteins recapitulate the binding specificity of p300 for the adenoviral oncoprotein E1A, in addition to being capable of activating transcription in mammalian cells (Bordoli et al., 2001).

The Arabidopsis protein that is orthologous to yeast GCN5 possesses HAT activity, and can interact with Arabidopsis ADA2 proteins, suggesting that a complex analogous to yeast SAGA (of which GCN5 and ADA2 form part) and human PCAF also exists in plants (Stockinger et al., 2001).

PICKLE (PKL; also initially referred to as GYMNOS) is an Arabidopsis protein of the SWI2/SNF2-type that appears to be involved in the repression of several important developmental regulators, such as LEC1 and meristematic genes (Eshed et al., 1999; Ogas et al., 1999). PKL is homologous to human Mi-2, a component of the NuRD complex. By virtue of its different subunits, the NuRD complex combines both ATP-dependent chromatin remodeling and HDAC activity. The homology between PKL and Mi-2 suggests that a NuRD-like complex might exist in plants; thus, a plausible mechanism for gene repression by PKL is via histone deacetylation mediated by NuRD (reviewed in Ahringer, 2000).

The Polycomb group (PcG) and the trithorax group (trxG) of proteins in Drosophila and mammals control the cellular inheritance of mitotically stable states of gene expression, homeotic genes in particular. PcG and trxG proteins (repressors and activators, respectively) are thought to regulate transcription by modulating the structure of chromatin (reviewed in: Brock and van Lohuizen, 2001; Francis and Kingston, 2001; Mahmoudi and Verrijzer, 2001). The proteins within each group (PcG or trxG) can be unrelated in sequence; rather, their relationship to each other comes from the fact that they operate together in the form of multi-subunit complexes of a genetically defined function (Gould, 1997). Homologous or related proteins for some PcG and trxG factors have been identified in Arabidopsis, and in some cases functionally characterized. Three Arabidopsis proteins show homology to the Drosophila SET-domain PcG protein Enhancer of zeste (E(z)): CURLY LEAF (CLF), CURLY LEAF LIKE (CLK), and MEDEA (MEA) (Goodrich et al., 1997; Grossniklaus et al., 1998) (Table 5). CLF is a repressor of floral organ identity (i.e., homeotic) gene expression in vegetative tissues (Goodrich et al., 1997), whereas MEA is involved in the maternal control of embryogenesis (Grossniklaus et al., 1998). Another Arabidopsis PcG protein involved in seed development is FERTILIZATION-INDEPENDENT ENDOSPERM (FIE), which shows homology to PcG proteins with WD repeats, such as Drosophila extra sex combs (esc) (Ohad et al., 1999). Animal E(z) and esc proteins have been shown to interact and to co-localize in unique complexes. Similarly, Arabidopsis FIE and MEA also interact, which provides a molecular explanation for the similarities between the fie and mea mutant phenotypes (Spillane et al., 2000; Yadegari et al., 2000).
Other Arabidopsis proteins, such as EMBRYONIC FLOWER2 (EMF2), FERTILIZATION-INDEPENDENT SEED 2 (FIS2), and VERNALIZATION 2 (VRN2) are related to a different Drosophila PcG protein, Suppressor of zeste 12 (Su(z)12) (Luo et al., 1999; Birve et al., 2001; Gendall et al., 2001; Yoshida et al., 2001).

Despite these similarities, however, novel features in the chromatin-mediated regulation of gene expression have also evolved in plants. Plants contain what appears to be a kingdom-specific family of histone deacetylases, the HD2 class (Lusser et al., 1997; Aravind and Koonin, 1998; Dangl et al., 2001) (Table 4). Orthologs for some Arabidopsis chromatin proteins are not found in yeast or animals. This is the case, for example, of MOM1, a SWI2/SNF2-related protein that is involved in the maintenance of transcriptional gene silencing (Amedeo et al., 2000; Arabidopsis Genome Initiative, 2000). In addition, homologous chromatin proteins can show structural variation among the different eukaryotic kingdoms, and some of those variations appear to be specific to plants. In fact, eukaryotic chromatin proteins are a prominent example of evolutionary innovation by domain shuffling, deletion, and accretion (International Human Genome Sequencing Consortium, 2001). For example, Arabidopsis CBP/p300-like proteins lack the bromodomain and the CREB-binding region that are highly conserved in animal CBP/p300 proteins (Bordoli et al., 2001) (Table 4; CBP/p300 proteins are not found in yeast). Instead, one of these Arabidopsis proteins (PCAT1) contains a repeated motif of unknown function that does not show sequence similarity to any other known amino acid motif (Bordoli et al., 2001).

Other Arabidopsis chromatin genes that have already been genetically or functionally characterized further show the importance of chromatin-mediated regulation of gene expression in multiple aspects of the plant life cycle (Table 5). Reduction of AtHD1 (an HDAC-coding gene, also referred to as AtRPD3A) transcript levels by using antisense RNA caused pleiotropic developmental alterations, suggesting a global role for AtHD1 in regulating gene expression during development (Wu et al., 2000a; Tian and Chen, 2001). Similarly, reduction of AtHD2A activity (which codes for an HDAC of the plant-specific HD2 class) resulted in aborted seed development (Wu et al., 2000b). In another study, mutants in the HDAC gene AtHDA6 were isolated, which were morphologically wild-type but showed deregulated expression of transgenes, suggesting that AtHDA6 might be specifically involved in (transgene) silencing processes (Murfett et al., 2001). In addition to MOM1 and PKL, mentioned above, another Arabidopsis gene coding for a SWI2/SNF2-type protein that has been functionally characterized is DECREASE IN DNA METHYLATION1 (DDM1). DDM1 is required to maintain normal cytosine methylation patterns and to stabilize transposon behavior (Jeddeloh et al., 1999; Miura et al., 2001; Singer et al., 2001).

In summary, genetic studies on a variety of biological processes in Arabidopsis, the determination of its genome sequence, and biochemical studies performed in maize, have all started to illuminate the different physiological functions that chromatin remodeling might play in plants. However, our understanding of chromatin remodeling at the molecular level, and of how it influences plant nuclear processes, is extremely limited, and mostly derived from comparisons with the better-studied systems of yeast, Drosophila, and mammalian cells. If chromatin research in these model organisms is to be viewed as an example, it is clear that biochemical studies will be essential to understand chromatin in plants.

5. The combinatorial nature of transcriptional regulation: promoters, cis-elements and trans-acting factors.

Whereas plants and animals (or, to be more precise, Arabidopsis and Drosophila, C. elegans, and humans) might have comparable contents of transcription factors (3.5–6.6% of the total number of genes; see above), the organization of the regulatory sequences on which these transcription factors act can be different in the two kingdoms. In animals, the regulatory sequences that determine the correct temporal and spatial expression of a gene can extend over tens of kilobases (kbs) of DNA (for a review, see Bonifer, 2000). In contrast, regulatory sequences of plant genes usually span much shorter DNA intervals, often less than 1 or 2 kbs. This is reflected in the compact organization of the Arabidopsis genome, in which gene density is high. Out of the sequenced 115.4 megabases (Mb) of the 125 Mb genome, 51.2 Mb (or 44% of the sequenced regions) correspond to predicted exons and introns (Arabidopsis Genome Initiative, 2000). On average, there is one gene per 4.5 kb of DNA: the gene length (exons plus introns) is approximately 2 kb, and ∼2.5 kb correspond to intergenic regions. Considering the whole genome, transposons account for ∼20% of the intergenic DNA, resulting in an average of 2 kb of DNA for the 5′ and 3′ regions of a particular gene (Arabidopsis Genome Initiative, 2000). Other plants, maize for example, have genomes that are much larger than that of Arabidopsis, but with a similar organization of promoter sequences: in the maize genome, active genes are usually distributed in compact gene-rich islands, with much of the genomic DNA corresponding to repetitive sequences made up of retrotransposons (SanMiguel et al., 1996; Fu et al., 2001). As a result, regulatory sequences in Arabidopsis, and in plants in general, are easier to identify and delimit experimentally than in, for example, humans (for an introduction to the problem in mammals, see: Gumucio et al., 1993; Hardison et al., 1997; Bonifer, 2000; Fickett and Wasserman, 2000; Scherf et al., 2001). 
Compact Arabidopsis 5′ promoter sequences often recapitulate faithfully the expression of the native gene when assayed in transgenic plants by reporter gene fusions, that is, in a chromatin context. However, this is not always the case, because regulatory elements can also be localized downstream of the transcription start site: in introns, in the 5′ untranslated region, or in 3′ sequences (Larkin et al., 1993; Sieburth and Meyerowitz, 1997; Deyholos and Sieburth, 2000; Yu et al., 2001). For example, the large second intron of the MADS-box floral organ-identity gene AGAMOUS (AG) is essential for the correct expression of the gene, and contains binding sites for at least two AG regulators, LFY and WUSCHEL (WUS) (Sieburth and Meyerowitz, 1997; Bomblies et al., 1999; Busch et al., 1999; Deyholos and Sieburth, 2000; Lohmann et al., 2001).
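
The genome-density figures quoted above (Arabidopsis Genome Initiative, 2000) are related by simple arithmetic, which the short Python sketch below reproduces. It uses only the rounded values given in the text; the variable names are introduced here for illustration, and the last step assumes that the transposon share of intergenic DNA is simply subtracted from the average intergenic interval.

```python
# Rounded values quoted in the text (Arabidopsis Genome Initiative, 2000).
sequenced_mb = 115.4         # sequenced portion of the ~125 Mb genome
genic_mb = 51.2              # predicted exons plus introns
gene_spacing_kb = 4.5        # one gene per 4.5 kb on average
gene_length_kb = 2.0         # average gene length (exons plus introns)
transposon_fraction = 0.20   # approximate share of intergenic DNA

# Fraction of the sequenced regions occupied by predicted genes.
genic_fraction = genic_mb / sequenced_mb

# Average intergenic interval left between neighbouring genes.
intergenic_kb = gene_spacing_kb - gene_length_kb

# Assumption: removing the transposon share leaves the DNA available
# for the 5' and 3' regions of a gene.
flanking_kb = intergenic_kb * (1 - transposon_fraction)

print(f"genic fraction of sequenced DNA: {genic_fraction:.0%}")
print(f"average intergenic interval: {intergenic_kb:.1f} kb")
print(f"non-transposon flanking DNA per gene: {flanking_kb:.1f} kb")
```

The three results agree with the 44%, ∼2.5 kb, and 2 kb values given in the text.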

In spite of the structural differences between animal and plant cis-regulatory and promoter regions, regulation of gene expression is often in both cases the result of multiple inputs, reflecting, or taking advantage of, the combinatorial nature of the mechanisms of eukaryotic transcription. Multiple stimuli can converge through different cis-acting elements on a promoter to coordinately regulate the expression of the corresponding gene (Arnone and Davidson, 1997; Yuh et al., 1998). The cis-acting elements are usually organized in a modular fashion: both in animals and plants, the regulatory region of a gene can be partitioned into discrete subelements, each one containing one or several binding sites for transcription factors and performing a certain regulatory function (Benfey and Chua, 1990; Arnone and Davidson, 1997). The modular nature of cis-regulatory systems is exemplified by the 2.3 kb promoter region of the sea urchin developmentally regulated Endo16 gene, one of the best characterized eukaryotic promoters (Yuh et al., 1998, 2001). It consists of six different regulatory modules, which provide different regulatory functions that are integrated through interrelations between the modules, and result in the spatial expression, and repression, of the gene, as well as in its variable rates of transcription (Yuh et al., 1998, 2001). This cis-regulatory system therefore acts like an information processing unit, and computational models for the modes of action of some of its modules have been established (Arnone and Davidson, 1997; Yuh et al., 1998, 2001). The view of cis-regulatory regions as information processing systems in which the output of developmental (or other) inputs is hardwired, is probably applicable to the eukaryotic genome as a whole (Arnone and Davidson, 1997; Davidson, 2001).

In plants, regulation of gene expression by systems of cis-acting modules, and the fact that these modules can interact synergistically (i.e., that combinations of modules direct gene expression in a manner not observed with the modules in isolation), was first described for the cauliflower mosaic virus (CaMV) 35S promoter (Benfey and Chua, 1990). The CaMV 35S promoter directs high levels of expression in most tissues and developmental stages when introduced as a transgene in plants, but can be dissected into subdomains that confer tissue-specific expression (Benfey and Chua, 1990; Benfey et al., 1990a, b).

The combinatorial interaction of cis-elements has also been demonstrated, for example, for Arabidopsis light-regulated promoters. Several consensus cis-sequences that are necessary for high activity in the light have been identified in the promoters of photosynthesis-associated nuclear genes (such as the rbcS and cab genes). These consensus sequences are referred to as ‘light responsive elements (LREs)’. Minimal promoters, sufficient to confer light-dependent expression, contain several LREs, but no single LRE is found in all light-regulated promoters (in fact, some LREs are also present in promoters that are not regulated by light) (Argüello-Astorga and Herrera-Estrella, 1998). LREs function combinatorially: whereas they cannot confer proper light responsiveness in isolation, paired combinations of them are able (1) to respond to a wide spectrum of light through the phytochrome signal transduction pathways, (2) to respond to the chloroplast developmental state, and (3) to confer a photosynthetic-cell specific expression pattern, therefore satisfying the strict definition of light-inducible (Puente et al., 1996; Chattopadhyay et al., 1998b). Thus, it is the combination of LREs in the promoter that serves as the integration system for the coordination of different light and developmental inputs to regulate the expression of the photosynthesis-related genes (Puente et al., 1996; Chattopadhyay et al., 1998b). Similarly, the promoter of the meristem identity gene LEAFY serves as the convergence point for different signals that control flowering time in Arabidopsis, including both environmental cues (daylength pathway) and endogenous signals (gibberellins) (Blázquez and Weigel, 2000), in accordance with the concept of promoters acting as information processing systems.

The combinatorial and synergistic function of cis-elements in eukaryotic promoters is logically accompanied by the combinatorial mode of action of the trans-acting factors that bind to those sites, and allows for the generation of regulatory diversity by a limited number of factors and binding sites. The requirement of several, often adjacent, cis-elements for the regulation of gene expression can be related to direct interactions between the proteins that bind to those elements. Direct interactions among transcription factors, however, are not the only molecular mechanism by which they can function combinatorially to regulate gene expression, since they can also interact with other components of the transcription machinery and with other classes of regulatory proteins. For example, LFY and WUS cooperatively participate in the regulation of AG expression, yet they bind independently to AG cis-regulatory sequences and a direct interaction between the two proteins has not yet been detected (Lohmann et al., 2001).

Several examples of direct interactions between different Arabidopsis transcription factors have been reported, although the number is still small. In addition to increasing the regulatory repertoire, direct interactions between transcription factors are one of the mechanisms by which proteins with very similar DNA binding domains might achieve regulatory specificity (see, for example, Grotewold et al., 2000). Direct interactions can occur between members of the same protein family, to form dimeric complexes that bind to palindromic DNA sequences (such as in the case of the MADS domain proteins: Huang et al., 1996; Riechmann et al., 1996a; Riechmann et al., 1996b), or between transcription factors of different families. Examples of the latter include Arabidopsis, maize, and petunia proteins of the MYB and bHLH families (Table 6), interactions between bZIP and ABI3/VP1 proteins in rice and Arabidopsis (Hobo et al., 1999; Nakamura et al., 2001), between soybean C2H2 zinc finger and bZIP proteins (Kim et al., 2001), and between Dof and bZIP transcription factors in Arabidopsis and maize (Chen et al., 1996; Vicente-Carbajosa et al., 1997). The interaction between bZIP and ABI3/VP1 proteins (TRAB1 and VP1, respectively, in the case of rice, and ABI5 and ABI3 in Arabidopsis) provides a mechanism for VP1-mediated, ABA-inducible gene expression (Hobo et al., 1999; Nakamura et al., 2001). The interaction between bZIP and Dof proteins might mediate the endosperm-specific expression of seed-storage proteins (Vicente-Carbajosa et al., 1997).

Within the MADS domain family, interactions are not limited to the formation of protein dimers, but also include the formation of ternary complexes. APETALA1 (AP1), APETALA3 (AP3), PISTILLATA (PI), and AGAMOUS (AG) are MADS domain proteins that, together with AP2, control the development of floral organs in Arabidopsis (Bowman et al., 1991; Coen and Meyerowitz, 1991; Goto et al., 2001; Jack, 2001; Theißen, 2001). AP3 and PI bind to DNA forming a heterodimer, whereas AP1 and AG can both bind to DNA as homodimers or as heterodimers with other MADS domain proteins (Huang et al., 1996; Riechmann et al., 1996a; Riechmann et al., 1996b). The activity of AP1, AP3, PI, and AG, however, requires floral cofactors, also MADS domain proteins, that are encoded by the SEPALLATA genes, SEP1, SEP2, and SEP3 (Pelaz et al., 2000; Honma and Goto, 2001; Pelaz et al., 2001a; Pelaz et al., 2001b). In yeast two-hybrid experiments, AP3 and PI together, but neither one of the two proteins individually, can physically interact with AP1 and with SEP3 (Honma and Goto, 2001). The ectopic expression of AP3, PI, and AP1, or of AP3, PI, and SEP3 converts vegetative leaves into petaloid organs (Honma and Goto, 2001), whereas AP3 and PI alone are not sufficient for such organ conversion (Krizek and Meyerowitz, 1996). These results indicate that the formation of ternary complexes might be necessary for the function of AP3 and PI. The role that ternary complex formation might play in AP3 and PI function could be several fold: from providing an activation domain that AP3 and PI appear to lack, but that AP1 and SEP3 have (Honma and Goto, 2001), to increasing the DNA-binding specificity/affinity of the complex versus that of the protein dimers, given that the organ identity activity of AP1, AP3, PI, and AG is independent of their individual DNA-recognition properties (Riechmann and Meyerowitz, 1997a).
These results with the Arabidopsis floral organ identity proteins parallel and expand previously obtained data for the Antirrhinum majus MADS-domain proteins SQUAMOSA, DEFICIENS, and GLOBOSA (SQUA, DEF, and GLO, which are AP1, AP3, and PI orthologs, respectively). DEF and GLO were also found to form ternary complexes with SQUA, and the three proteins together to bind DNA with increased affinity versus SQUA or DEF/GLO alone (Egea-Cortines et al., 1999).

Whereas these isolated examples illustrate the importance of interactions between transcription factors for the regulation of transcription, and how the combinatorial logic can operate, they do not convey the scope of the regulatory interactions in which transcription factors could be involved. For this, the whole complement of proteins should be considered (see below).

6. Genome-wide analyses of transcriptional regulation.

The future of biological sciences in the “post-genome era” has been anticipated as an endeavor to generate a collection of comprehensive “functional maps” (corresponding to the “transcriptome”, the “phenome”, the “interactome”, the “localizome”, and so on), that would be compiled into a “biological atlas” which would represent the modular nature of biological processes in a holistic manner and allow the formulation of new hypotheses (Greenbaum et al., 2001; Kim, 2001; Vidal, 2001). These maps could be visualized as two-dimensional matrices in which one axis represents all the genes or proteins that can be tested in an organism, and the other a comprehensive series of mutant backgrounds, conditions to which the organism can be exposed, etc. (Vidal, 2001). For instance, a yeast transcriptome map of this type is already being developed (Hughes et al., 2000b). The interactome would represent the map of physical interactions among all the proteins of a proteome (reviewed in Walhout and Vidal, 2001) (for attempts to construct the yeast interactome, see: Uetz et al., 2000; Ito et al., 2001). The localizome map would describe in what cells and cellular compartments, and when, all the proteins of an organism's proteome can be found; and to produce the phenome, a collection of mutants encompassing all the genes in a genome would be screened in a large series of phenotypic assays (for C. elegans and yeast, see: Ross-Macdonald et al., 1999; Fraser et al., 2000; Gönczy et al., 2000; Maeda et al., 2001).
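
The two-dimensional matrix view of a functional map described above can be illustrated with a small data structure. The Python sketch below is only one possible representation, not a published format: the class name, its methods, the gene and condition labels, and the recorded value are all hypothetical placeholders rather than data from any experiment.

```python
class FunctionalMap:
    """A genes-by-conditions matrix in which untested cells stay empty."""

    def __init__(self, genes, conditions):
        self.genes = list(genes)            # one axis: genes or proteins
        self.conditions = list(conditions)  # other axis: mutants, treatments
        self._values = {}                   # (gene, condition) -> measurement

    def record(self, gene, condition, value):
        # Reject entries that fall outside the declared axes.
        if gene not in self.genes or condition not in self.conditions:
            raise KeyError((gene, condition))
        self._values[(gene, condition)] = value

    def get(self, gene, condition):
        # None marks a cell of the matrix that has not been measured yet.
        return self._values.get((gene, condition))

    def row(self, gene):
        # All measurements for one gene, in condition order.
        return [self.get(gene, c) for c in self.conditions]

    def coverage(self):
        # Fraction of the matrix that has been filled in so far.
        total = len(self.genes) * len(self.conditions)
        return len(self._values) / total if total else 0.0


# Placeholder labels and value, for illustration only.
fmap = FunctionalMap(["gene_1", "gene_2"], ["condition_A", "condition_B"])
fmap.record("gene_1", "condition_A", 1.5)
print(fmap.row("gene_1"))
print(fmap.coverage())
```

Keeping untested cells explicit makes the incompleteness of a map measurable, which is relevant because each of these maps is expected to be filled in gradually as data accumulate.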

Such a view is also appropriate when considering Arabidopsis transcriptional regulation at a global level in a cellular and organismal context, for whose understanding several of those functional maps would be required: the genome-wide transcriptome map, the interactome and the phenome of the transcriptional regulators, as well as other “-ome” maps not previously considered, such as the promoterome. Intrinsic to this view is the realization that none of these different “-ome” maps would lead, in isolation from the others, to a comprehensive or even logical understanding of transcriptional regulation and of its role as a major determinant of cellular and organismal functions and phenotypes.

To generate these functional maps, the systematic investigation of transcription factor function and transcriptional regulation in Arabidopsis can be pursued with a variety of tools for functional genomic analyses, including reverse genetics methods, gene expression profiling experiments, and protein-protein interaction screens (reviewed in Riechmann and Ratcliffe, 2000). Whereas the availability of the Arabidopsis genome sequence allows us to compile lists of proteins that are involved in the regulation of transcription, and of putative promoter and cis-acting sequences, a global understanding of this process is still in its infancy. However, somewhere along the way of generating these functional maps, and once a sufficient amount of data has been collected, it should be possible to start decoding the “language” of transcriptional control, and to eventually be able, for instance, to build synthetic promoters directing gene expression in novel, designed spatial and temporal patterns (for an example of an initial attempt to design an artificial expression cassette in plants simply by statistical analysis of nucleotide sequences, see Sawant et al., 2001).

6.1 The transcription factor phenome map.

The number of Arabidopsis transcription factors that have been functionally characterized is still small, approximately 10% of the total (an incipient phenome; Table 1). Most of these genes were characterized through the traditional genetic approach, whereby genes are first defined by a mutant phenotype and then isolated. For the majority of these transcriptional regulators, functional characterization is limited to the description of phenotypic differences between mutant and wild-type plants, and determination of their expression pattern, but very little is known about their modes of action, that is, about the genes that they regulate (the transcriptome map) and the mechanisms that they use to achieve that regulation (the interactome and the promoterome). As a result, the dynamic relationship between the genome, the transcriptional regulators, and the transcriptome remains largely uncharacterized.

Different reverse genetics strategies are or can be used in plants to generate and isolate mutants in known genes: T-DNA or transposon insertional mutagenesis (Krysan et al., 1999; Maes et al., 1999; Parinov and Sundaresan, 2000; Young et al., 2001), fast neutron deletion mutagenesis (Li et al., 2001b), targeted screening for induced local lesions (TILLING) (Colbert et al., 2001), and DNA/RNA oligonucleotide-mediated site-directed mutagenesis (Oh and May, 2001). In addition, gene function can be inhibited by RNA interference (RNAi) or by virus-induced gene silencing (Baulcombe, 1999; Chuang and Meyerowitz, 2000; Levin et al., 2000; Hammond et al., 2001; Wesley et al., 2001). All these methods have been extensively reviewed and will not be discussed here. They are being used in several large-scale reverse genetics efforts to characterize the function of Arabidopsis transcription factors and chromatin-related proteins (for example, Meissner et al., 1999) (see also http://Ag.Arizona.Edu/chromatin/chromatin.html).

Probably the two main difficulties for generating a comprehensive phenome of Arabidopsis transcriptional regulators are the finite number of assays in which the mutants can usually be screened, and the existence of functional redundancy or overlap among different genes (Riechmann and Ratcliffe, 2000). Many of the Arabidopsis knockout mutants thus far isolated through reverse genetics approaches, in transcription factor genes as well as in genes of other classes, do not exhibit obvious morphological phenotypic alterations (Meissner et al., 1999; Bouche and Bouchez, 2001). This finding parallels what has been observed in other eukaryotic organisms, such as C. elegans, Drosophila, and yeast, in both forward and reverse genetics screens (for an overview, see Thatcher et al., 1998). For instance, the systematic analysis by RNAi in C. elegans of 4,590 genes (contained in chromosomes 1 and 3) only revealed mutant phenotypes in ∼14% of the cases (Fraser et al., 2000; Gönczy et al., 2000). However, it is likely that Arabidopsis mutants in “silent” or “nonessential” transcription factor genes (that is, they show no overt phenotype) might in fact reveal informative phenotypes when tested in comprehensive assays to characterize their physiology, metabolism, etc. (for an example of the use of metabolome data to reveal the phenotype of silent mutations in yeast, see Raamsdonk et al., 2001). For those genes that are involved in the plant's response to the environment, either biotic or abiotic, mutant phenotypes might not be revealed unless specific environmental conditions are used in the experiments. However, the assumption that if a gene is expressed or induced under a particular set of conditions then that gene is important for the organism's growth or survival in those conditions, should be taken with some caution: in yeast, there appears to be little correlation between the two when large sets of genes are considered (Winzeler et al., 1999). 
Lastly, detection of slightly deleterious effects caused by mutations in “silent” genes might require multigenerational competition studies in which fitness can be assessed, as shown in Arabidopsis (actin genes) and in yeast (Gilliland et al., 1998; Thatcher et al., 1998; Winzeler et al., 1999). The analysis by deletion of 2,026 genes in yeast indicated that ∼80% of them were nonessential for viability, but 40% of those silent deletants showed impaired growth in a simple competitive assay (Winzeler et al., 1999).

The extent of functional redundancy among related Arabidopsis transcription factors has been illustrated by several recent studies on factors from different groups, such as the MADS, GARP, YABBY, and GRAS gene families. MADS-box genes that act redundantly include: AP1, CAULIFLOWER (CAL), and FRUITFULL (FUL), in the control of floral meristem identity (Bowman et al., 1993; Kempin et al., 1995; Ferrándiz et al., 2000); the SHATTERPROOF genes (SHP1 and SHP2), which are required for proper development of the fruit-valve margin (Liljegren et al., 2000); and the SEPALLATA genes (SEP1, SEP2, and SEP3), which are cofactors or interactors for the floral organ identity genes AP1, AP3, PI, and AG (see above, and: Pelaz et al., 2000; Honma and Goto, 2001; Pelaz et al., 2001a; Pelaz et al., 2001b). The redundancy among AP1, CAL, and FUL in specifying floral meristem identity is partial. ap1 plants show a mutant phenotype (a partial conversion of flowers into inflorescences and a disruption of sepal and petal development), whereas a mutation in CAL results in a mutant phenotype only when combined with an ap1 allele (Bowman et al., 1993; Kempin et al., 1995). ap1 cal mutant plants show a complete conversion of the floral meristems into inflorescence meristems (Bowman et al., 1993). In other words, AP1 can completely compensate for the loss of CAL function, but CAL can only compensate for part of AP1 activity. A mutation in FUL does not alter floral meristem identity in the presence of a functional copy of AP1 or CAL (Ferrándiz et al., 2000). The SEP genes appear to have largely overlapping, although not identical, functions: the triple sep1 sep2 sep3 mutant shows a clear conversion of petals, stamens, and carpels, to sepals, whereas single or double sep mutants exhibit more subtle phenotypic alterations (Pelaz et al., 2000; Pelaz et al., 2001a) (for a review on the SEP genes and the ABC model of flower development: Jack, 2001). 
Similarly, only the shp1 shp2 double mutant, and not the single mutants, shows drastic phenotypic effects, in this case fruit that fails to dehisce (Liljegren et al., 2000).

Another example of related genes that act redundantly is provided by KANADI1 (KAN1) and KANADI2 (KAN2), which participate in the establishment of polarity in Arabidopsis lateral organs by determining abaxial cell fate (Eshed et al., 2001). KAN1 and KAN2 are members of the GARP family of plant-specific transcription factors, and they form part of a monophyletic group within the family (Eshed et al., 2001; Kerstetter et al., 2001). In fact, the genetic mechanism or network that controls lateral organ polarity in Arabidopsis appears to consist of multiple transcription factors from different gene families, with the corresponding genes within each group acting, at least in part, in a functionally overlapping manner (see below, and Eshed et al., 2001). Finally, GAI and RGA, which are highly related members of the GRAS gene family, have partially redundant functions as negative regulators of the gibberellin (GA) signaling pathway (Dill and Sun, 2001; King et al., 2001). In summary, situations of overlapping or partially redundant gene function among related genes are frequent within the different Arabidopsis transcription factor families (for general discussions on gene function after duplication, and on genetic redundancy and how it might be maintained by selection, see: Thomas, 1993; Cooke et al., 1997; Massingham et al., 2001).

Furthermore, in addition to redundancy resulting from the incomplete functional divergence between highly related (duplicated) members of a gene family, it can also arise from functional convergence of more distantly related genes (reviewed in: Pickett and Meeks-Wagner, 1995; Cooke et al., 1997). For instance, two divergent forkhead transcription factor genes from C. elegans, pes-1 and fkh-2, are partially redundant in embryonic development (Molin et al., 2000). Inactivation of pes-1 or fkh-2 alone caused no apparent phenotypic alteration during embryogenesis, whereas inactivation of both genes severely disrupted it. The functional association between pes-1 and fkh-2 was investigated because of the similarity in their expression patterns, but not because of sequence homology: pes-1 and fkh-2 belong to different clades within the forkhead gene family (Molin et al., 2000). The C. elegans genome contains 15 different forkhead genes (Riechmann et al., 2000), and the expression patterns had been determined for all of them. This example illustrates the limitations of sequence analysis as a tool to explore genetic redundancy. In addition, it suggests another reason why the complete functional characterization of the Arabidopsis complement of transcription factors will require the determination of the expression patterns of all of its members. Although the extent of this type of redundancy (genes that belong to the same family, but to different clades within it, and yet have the same or overlapping functions) in Arabidopsis is unknown, there is evidence that it exists. For example, AINTEGUMENTA (ANT) acts redundantly with APETALA2 (AP2) to repress AG expression in cells of the second whorl of developing Arabidopsis flowers (Krizek et al., 2000). Both ANT and AP2 are AP2/ERF proteins, but they belong to different clades within the AP2 subfamily.

Last, functional redundancy can also exist between genes of different classes or families (i.e., with distinct molecular functions), for instance if they form part of independent pathways controlling the same process. An example of this type is provided by members of the KANADI and YABBY gene families, involved in the determination of abaxial polarity in lateral organs (Eshed et al., 1999; Bowman et al., 2001; Eshed et al., 2001). For example, CRABS CLAW (CRC), the founding member of the YABBY family (Bowman and Smyth, 1999; Bowman, 2000), and KAN1 participate in the determination of abaxial polarity in the carpels (Eshed et al., 1999; Bowman et al., 2001). In contrast to redundancy between duplicated genes, it is not possible to predict functional redundancy between unrelated proteins by simply analyzing the genome sequence. Rather, these cases of functional overlap will be uncovered through classic mutagenesis screens for enhancers of a particular mutant phenotype (for example: Eshed et al., 1999; Bowman et al., 2001; Eshed et al., 2001), and as a result of genome-wide analyses of gene expression designed to identify the targets of different transcription factors (see below).

6.2 The transcriptome and promoterome maps.

Two of the different “-ome” maps are essential to understand transcriptional regulation at a molecular, genome-wide level and, ultimately, to explain the basis for the effects that differential gene expression has on the functions and phenotypes of cells and organisms: the transcriptome and the promoterome maps. The transcriptome can be viewed as the collection of transcripts that are expressed from the genome at any particular temporal and physiological instance, considering both transcript identity and abundance (i.e., a description that is both qualitative and quantitative). The promoterome, as defined in this chapter, would consist of all the promoters and cis-acting elements in a genome, and of their interactions with the complement of transcriptional regulators.

6.2.1 DNA microarrays.

A comprehensive characterization of the Arabidopsis transcriptome in its multiple forms can be achieved using DNA microarray technologies, which allow the parallel monitoring of the expression of thousands of genes and, eventually, of the complete Arabidopsis genome (for reviews on DNA microarrays and gene expression: Lockhart and Winzeler, 2000; Richmond and Somerville, 2000; Young, 2000; Altman and Raychaudhuri, 2001; Schulze and Downward, 2001) (for information on microarray resources, see: http://www.arabidopsis.org/links/microarrays.html). Currently, the expression of up to ∼8,500 different Arabidopsis genes, or approximately one third of the genome, has been monitored in DNA microarray experiments to generate catalogues of genes that are expressed in response to particular stresses or stimuli, or in certain tissues or developmental processes (Table 7). These early studies have included the response to different nutrient concentrations, to drought and cold stresses, to wounding and insect feeding, the disease response, and light-related processes, such as the circadian clock and phytochrome A signaling (Table 7). The most extensive dynamic reprogramming of the expression of the genome has been observed upon light stimulus or in light-related processes (Harmer et al., 2000; Schaffer et al., 2001; Tepperman et al., 2001) (Table 7). For instance, the analysis of circadian changes in the mRNA levels of more than 8,000 genes reveals how differential gene expression underlies many of the physiological changes that the plant undergoes in its daily life cycle. 
The expression of genes implicated in photosynthesis, in phenylpropanoid biosynthesis, in lipid modification, and in carbon, nitrogen, and sulfur pathways was found to be regulated by the circadian clock, and a physiological explanation can be reasoned for it: to prepare for light-harvesting, for protection against UV light, to increase chilling-resistance at night, and to coordinate the metabolism of the plant with its environment (Harmer et al., 2000).

Catalogues of expressed genes provide insights into a variety of biological processes. However, elucidating the relationship between the transcriptome, the promoterome, and the complement of transcription factors, to eventually understand the logic of transcription, requires additional types of genome-wide analyses and experiments, such as the following.

6.2.2 Comparative analysis of promoter sequences of genes with similar expression profiles.

The results of multiple DNA microarray experiments can be combined and analyzed together using clustering techniques, by which those genes that show similar expression patterns across the set of experiments are identified and grouped (reviewed in: Sherlock, 2000; Quackenbush, 2001; Raychaudhuri et al., 2001). The set of experiments to be compared might consist of different cell types or tissue samples, different physiological conditions, or might characterize over a time course the transcriptional response to a given stimulus. One assumption underlying the clustering analysis of time course experiments is that genes with highly related expression profiles might be regulated by the same mechanism. Thus, once groups of co-regulated genes are established, their promoter sequences can be compared to identify common cis-acting elements. The success of the approach is determined, at least in part, by the structural organization (i.e., size and complexity) of the regulatory regions, and it has proven particularly fruitful in yeast (Cho et al., 1998; Roth et al., 1998; Spellman et al., 1998; Tavazoie et al., 1999; Wolfsberg et al., 1999; Gasch et al., 2000; Hughes et al., 2000a; Lyons et al., 2000; Jakt et al., 2001). In animals, in which complete regulatory regions are difficult to delimit from sequence information alone, and cis-acting elements might be distributed over very long distances, the analyses might have to be restricted to the proximal promoter sequences (for example, Livesey et al., 2000). In contrast, as discussed above, plant regulatory regions are more similar to those of yeast, in that they are often completely encompassed within a few hundred base pairs upstream from the transcription start site. 
The comparison of the promoter sequences of a group of Arabidopsis genes that are co-regulated by the circadian clock, and that in the experiment showed the highest level of expression near the end of the subjective day, identified a novel motif that is conserved among those promoters, and that was then experimentally shown to mediate their regulation (Harmer et al., 2000) (Table 7). Similarly, a group of Arabidopsis genes that co-regulate with PR-1 over a series of systemic acquired resistance (SAR) inducing or repressing conditions was identified, and their promoters searched for the presence of known cis-elements for transcription factors. W boxes, the binding site for WRKY proteins, were the only known cis-element that was present in all the promoters, suggesting that WRKY transcription factor(s) participate in the control of the PR-1 regulon (Maleck et al., 2000). This latter example also illustrates that the identification of common elements in the upstream regions of co-regulated genes will usually not be sufficient, in the absence of other information, to pinpoint the identity of the specific regulators, because the majority of the Arabidopsis transcription factors form part of multigene families (Table 1), in which different members have related or similar DNA-binding specificities. The same limitation applies to genome-wide searches for transcription factor binding sites that are carried out without reference to expression data (for an example in Arabidopsis, Du and Chen, 2000). Such searches can lead to the identification of potential downstream genes, especially if the target sites for the DNA-binding protein(s) or complex(es) of interest are well characterized, but additional experimentation is usually required to establish the association between the identified elements and the transcription factor(s) under study (for an example in yeast, Zhong et al., 1999).
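
The two steps described above (grouping genes by expression profile, then searching their promoters for a shared cis-element) can be illustrated with a minimal sketch. It is not the procedure used in the cited studies: the gene names, expression values, and promoter sequences are invented, the correlation threshold is arbitrary, and the W box is represented by the commonly cited core TTGAC(C/T), which is an assumption not taken from this chapter.

```python
import re
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equally long expression profiles."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0

def coregulated_with(seed, profiles, min_r=0.9):
    """Genes whose profile correlates with that of the seed gene (seed included)."""
    ref = profiles[seed]
    return sorted(g for g, p in profiles.items() if pearson(ref, p) >= min_r)

# Assumed W box core, TTGAC followed by C or T (not stated in the text).
W_BOX = re.compile(r"TTGAC[CT]")

def reverse_complement(seq):
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def has_motif(promoter, pattern=W_BOX):
    """True if the motif occurs on either strand of the promoter."""
    seq = promoter.upper()
    return bool(pattern.search(seq) or pattern.search(reverse_complement(seq)))

def motif_in_all(genes, promoters, pattern=W_BOX):
    """True if every promoter in the group carries the motif."""
    return all(has_motif(promoters[g], pattern) for g in genes)

# Invented toy data: four genes measured over four conditions.
profiles = {
    "geneA": [1.0, 2.0, 4.0, 8.0],
    "geneB": [1.1, 2.1, 3.9, 8.2],
    "geneC": [8.0, 4.0, 2.0, 1.0],
    "geneD": [0.9, 1.8, 4.2, 7.5],
}
promoters = {
    "geneA": "AAATTGACCAAA",
    "geneB": "GGGGTCAAGGG",   # motif lies on the opposite strand
    "geneC": "ACGTACGTACGT",
    "geneD": "CCTTGACTCC",
}

group = coregulated_with("geneA", profiles)
shared = motif_in_all(group, promoters)
```

As noted in the text, finding a shared element in such a group points to a family of candidate regulators rather than to a single transcription factor.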

6.2.3 Phylogenetic footprinting.

A valuable approach to identify unknown cis-regulatory regions and elements, and that can be applied at a genome-wide scale, is phylogenetic footprinting: sequence comparisons across phylogenetically related species that reveal conserved cis-elements in the non-coding regions of homologous genes. Phylogenetic footprinting is based on the observation that regulatory regions are more conserved throughout evolution than regions that do not have a function that is dependent on their sequence (for reviews: Gumucio et al., 1996; Duret and Bucher, 1997; Hardison et al., 1997; Fickett and Wasserman, 2000). A critical factor for the success of the phylogenetic footprinting method is the choice of species for the comparative analysis: at a genome-wide scale, they should be similar enough so that most sequences can be aligned with the corresponding ortholog(s), but distant enough so that non-functional sequences have diverged by accumulating mutations at neutral positions. Logically, the frequency of detectable conserved elements in the non-coding regions of orthologous genes decreases as species separated by increasing evolutionary distances are compared (Duret and Bucher, 1997). Identifying the optimal species (or group of species) for phylogenetic footprinting might require surveying several species within the corresponding genus or family (for an example of such survey in Saccharomyces, see Cliften et al., 2001). In fact, it appears that the most comprehensive and meaningful results will be obtained in comparisons that include several species of different evolutionary distances (Duret and Bucher, 1997; Cliften et al., 2001). The use of groups of species might be particularly important for genome-wide phylogenetic footprinting, because the optimal evolutionary distance for comparison might vary across genes. The phylogenetic footprinting method has been used to define regulatory elements by comparisons between human and mouse sequences and among mammals, and between C. 
elegans and C. briggsae, among other organisms (Gumucio et al., 1996; Thacker et al., 1999; Loots et al., 2000; Wasserman et al., 2000). Comparisons of the promoters of the CHALCONE SYNTHASE and AP3 genes across different cruciferous plant species have demonstrated the value of phylogenetic footprinting as a basis to functionally analyze Arabidopsis cis-regulatory regions (Koch et al., 2001), although the ideal species or group of species for that type of comparison with Arabidopsis at a genome-wide scale still needs to be identified.
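
The core idea of phylogenetic footprinting can be sketched in a deliberately simplified, alignment-free form: short sequence words that are present in the upstream regions of the orthologous gene in every species compared are reported as candidate conserved elements. Actual analyses rely on sequence alignments and on a careful choice of species; the species labels, sequences, and word length below are invented for illustration.

```python
def kmers(seq, k):
    """All distinct words of length k in a sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def conserved_kmers(orthologs, k=6):
    """Words of length k present in the upstream region of every species."""
    sets = [kmers(s, k) for s in orthologs.values()]
    return set.intersection(*sets) if sets else set()

# Invented upstream sequences of one orthologous gene in three species.
orthologs = {
    "species_1": "ACGTCACGTGTTAGC",
    "species_2": "TTTACACGTGGCAAT",
    "species_3": "GGCACGTGAACCTTA",
}

candidates = conserved_kmers(orthologs, k=6)
```

With species that are too closely related, nearly every word would be shared and the comparison would be uninformative, which is the point made above about choosing appropriate evolutionary distances.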

Phylogenetic footprinting, however, will provide only a partial description of the promoterome, because only those elements that maintain similar functions across the species compared are likely to be conserved. Alterations in gene expression are an important mechanism of evolutionary change, and regulatory elements and functional features that arose after the divergence of the species used in the comparison might not be identified in the analysis (for variations of the phylogenetic footprinting method that, combined with experimental data, try to overcome these limitations, see Gumucio et al., 1996). Furthermore, there are instances in which a cis-region might maintain a given regulatory function despite considerable sequence variation.

6.2.4 Inducible activation of transcription factor activity.

The identification of the downstream genes of the many transcriptional regulators encoded by the genome is a necessary step to define the networks of gene activity that occur in a cell, tissue, or organism. The combination of DNA microarray technology with systems for the inducible activation of transcription factor function offers a way to dissect regulatory programs. The activity of transcription factors can be transcriptionally or posttranslationally regulated, using inducible gene expression systems or generating protein fusions to steroid-binding domains, such as the glucocorticoid receptor (GR) (Aoyama, 1999; Picard, 2000; Zuo and Chua, 2000). The advantage of using posttranslational regulation is that simple direct and indirect effects of transcription factor activity can be separated by using inhibitors of protein synthesis (for examples in Arabidopsis: Sablowski and Meyerowitz, 1998; Wagner et al., 1999; Samach et al., 2000; Sakai et al., 2001). Thus, an experiment to identify at a genome-wide scale the target genes of a particular transcription factor would consist of generating transgenic plants expressing a fusion of the factor to a steroid binding domain (ideally, in a mutant background in which the native transcription factor gene is inactivated), applying the hormone to the tissues under study (ideally, those in which the endogenous gene would be normally active), both in the presence and in the absence of an inhibitor of protein synthesis (cycloheximide), and following the effects on mRNA accumulation over time using DNA microarrays. 
Direct posttranslational regulation by fusion to GR has already been engineered for several plant transcription factors, including the maize bHLH R protein (Lloyd et al., 1994), the Arabidopsis homeodomain proteins ATHB-1, ATHB-2, and KNAT2 (Aoyama et al., 1995; Ohgishi et al., 2001; Pautot et al., 2001), CONSTANS, a zinc finger transcriptional regulator (Simon et al., 1996; Samach et al., 2000), the MADS domain protein AP3 (Sablowski and Meyerowitz, 1998), LFY (Wagner et al., 1999), and ARR1, a GARP transcription factor of the ARR-B subclass (Sakai et al., 2001). However, the technique might not be universally applicable, since some transcription factor-GR fusion proteins might be inactive, or constitutively active in the absence of the hormone.
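
The logic of the experiment outlined above can be summarized in a short sketch that classifies genes from two sets of microarray measurements: the response to the hormone alone, and the response to the hormone in the presence of the protein synthesis inhibitor. The gene names, log2 ratios, and the twofold threshold are illustrative assumptions; each ratio is assumed to be expressed relative to the matching control without hormone.

```python
def classify_targets(log2fc_hormone, log2fc_hormone_chx, threshold=1.0):
    """Label each gene as a direct target, an indirect target, or unresponsive.

    log2fc_hormone:     log2 ratio, hormone versus mock treatment
    log2fc_hormone_chx: log2 ratio, hormone plus cycloheximide versus
                        cycloheximide alone
    A gene that still responds when protein synthesis is blocked is taken
    to be a direct target; one that responds only when protein synthesis
    is allowed is taken to be indirect.
    """
    result = {}
    for gene, fc in log2fc_hormone.items():
        if abs(fc) < threshold:
            result[gene] = "unresponsive"
        elif abs(log2fc_hormone_chx.get(gene, 0.0)) >= threshold:
            result[gene] = "direct"
        else:
            result[gene] = "indirect"
    return result

# Invented measurements for three hypothetical genes.
hormone_only = {"gene1": 2.5, "gene2": 1.8, "gene3": 0.2}
hormone_plus_chx = {"gene1": 2.1, "gene2": 0.1, "gene3": 0.0}

labels = classify_targets(hormone_only, hormone_plus_chx)
```

A fuller analysis would also follow the response over time and account for the effects of the inhibitor itself on mRNA accumulation.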

6.2.5 Genome-wide maps of in vivo DNA binding by transcription factors.

An alternative approach to identify transcription factor downstream targets has been recently developed in yeast, combining chromatin immunoprecipitation and DNA microarrays. The underlying assumption is that transcription factors bind to the promoters or regulatory regions of the genes whose expression they control. In this method, proteins are crosslinked to genomic DNA in living cells using formaldehyde. The DNA that is specifically crosslinked to the protein of interest is then enriched by immunoprecipitation, amplified by PCR, and labeled for its use as a probe in dual-color microarray experiments (the corresponding control consisting of a sample DNA that was not enriched). The microarray used for the hybridization contains all the intergenic regions of the yeast genome, and might also contain the corresponding ORFs (Ren et al., 2000; Iyer et al., 2001; Lieb et al., 2001). This approach has been used to identify the binding sites for several transcriptional regulators in the yeast genome (Ren et al., 2000; Iyer et al., 2001; Lieb et al., 2001), and to study the targeted recruitment of the yeast histone acetylase Esa1 (Reid et al., 2000). In a more global study, the binding by the nine yeast transcription factors that are known to regulate the cell cycle (Mbp1, Swi4, Swi6, Mcm1, Fkh1, Fkh2, Ndd1, Swi5, and Ace2) was analyzed, showing that these factors themselves form a circular network of serial regulation (Simon et al., 2001). The results of all these experiments are also a testimony to the complexity of transcription, and to how much we still have to learn and to explain when considering the regulation of the expression of eukaryotic genomes as a whole. First, Gal4, SBF, MBF, and Rap1 were all found to bind preferentially to potential promoter regions, despite the fact that consensus binding site sequences for all of them are distributed all over the yeast genome (Ren et al., 2000; Iyer et al., 2001; Lieb et al., 2001). 
This biased recognition of binding sites suggests the existence of a superimposed level of regulation that might mark or distinguish regulatory regions from coding sequences, a component of which might be chromatin structure. The distribution of binding sites for a given transcription factor in the promoters of the genes that such factor regulates is not random either, suggesting the existence of, and constraints in, long-range interactions with other components of the transcription machinery. For example, Rap1 binding sequences were found to occur more often in tandem and at a certain upstream distance (250–450 bp), and to be located preferentially on the minus strand relative to the corresponding open reading frame (Lieb et al., 2001). In addition, not every promoter bound by, for example, SBF and MBF contains recognizable consensus sites (Iyer et al., 2001), indicating the existence of additional sources for specificity in transcription factor activity in vivo. Once again, the structural similarity between regulatory regions in yeast and plants suggests that the technique might in principle be applicable to Arabidopsis, provided that the corresponding experimental protocols (for crosslinking and immunoprecipitation) can be established, and with the caveat of the multicellular nature of plants (see below).
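
The read-out of such a dual-color experiment can be illustrated with a minimal sketch in which each arrayed intergenic region is scored by the log2 ratio of the immunoprecipitated signal to the non-enriched control signal. The region names, signal values, and the twofold cut-off are invented; the published studies used replicates and statistical error models that are not reproduced here.

```python
from math import log2

def enriched_regions(ip_signal, control_signal, min_log2_ratio=1.0):
    """Regions whose immunoprecipitated signal exceeds the control signal.

    Returns a mapping from region name to log2(IP / control) for every
    region at or above the chosen cut-off. Regions with a non-positive
    signal in either channel are skipped.
    """
    hits = {}
    for region, ip in ip_signal.items():
        ctrl = control_signal[region]
        if ip > 0 and ctrl > 0:
            ratio = log2(ip / ctrl)
            if ratio >= min_log2_ratio:
                hits[region] = ratio
    return hits

# Invented fluorescence intensities for three hypothetical intergenic regions.
ip = {"intergenic_1": 800.0, "intergenic_2": 210.0, "intergenic_3": 100.0}
control = {"intergenic_1": 200.0, "intergenic_2": 200.0, "intergenic_3": 100.0}

bound = enriched_regions(ip, control)
```

Regions recovered in this way can then be checked for consensus binding sites, keeping in mind that, as described above, not every bound promoter contains a recognizable site.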

An alternative technique for the genome-wide identification of in vivo target loci has been developed in Drosophila, which makes use of E. coli DNA adenine methyltransferase (Dam). In this method, named DamID, a protein fusion between a chromatin protein of interest and Dam is expressed at low levels in Drosophila cells (in culture, or in the whole fly) (van Steensel and Henikoff, 2000; van Steensel et al., 2001). This leads to the methylation of the GATC sequences that might occur in the genome surrounding the binding sites of the target protein. Methylated regions are purified (by size fractionation of genomic DNA that has been cleaved with DpnI, which cuts at methylated GATC sites), and labeled for their use as a probe in dual-color microarray experiments (the corresponding control consisting of an equivalent sample from cells in which unfused Dam was expressed) (van Steensel and Henikoff, 2000; van Steensel et al., 2001). Chromatin profiling by DamID has not been developed for plants yet, and the method presents several potential technical difficulties. In particular, the fusion protein must be expressed at very low levels to allow distinguishing specific and non-specific methylation events, at least in Drosophila; and it is not yet known if the approach will work for proteins that bind as single molecules (or as dimers) to specific, short cis-elements (as many transcription factors do), since all the experiments reported so far involved potential cooperative binding that could ‘coat’ a region of DNA (for more information and discussion on DamID, see: http://blocks.fhrc.org/DamID). If these technical hurdles can be overcome, chromatin profiling by DamID could represent a valuable alternative to immunoprecipitation-based methods.

The genome-wide maps of in vivo protein-DNA association will help to clarify a longstanding unanswered question in transcriptional regulation: the correlation between the DNA binding properties of the transcription factors (i.e., affinities, which are measured in vitro), and their effects on transcription in vivo (for a discussion on this topic, see Biggin and Tjian, 2001). Ultimately, quantitative studies and information will be needed to understand the transcriptional code, and to be able to model transcriptional regulation, both in vivo and in silico.

The identification of the target genes for the many transcriptional regulators encoded by the Arabidopsis genome, the compilation of lists of genes that are differentially regulated (activated or repressed) in particular biological processes, and, most importantly, the integration and combined analysis of these large genome-wide data sets, will eventually define the networks by which transcription factors act, and the pathways downstream of them. Questions that are difficult to address in gene-by-gene studies can now be considered. To what extent do different environmental responses, or distinct developmental pathways, share effector mechanisms (that is, the same target genes)? How many different patterns of expression are triggered by a particular stimulus, and how are those differences achieved molecularly (that is, the complexity of response pathways at the level of the regulation of effector, or “realizator”, genes)? Combined with the characterization and analysis of the promoterome, such studies would lead to an understanding of the transcriptional code, and eventually to the rational manipulation of transcriptional regulation. These genome-wide studies would also assess the degree of connectivity among regulatory networks (and thus start defining the “networkome”). Furthermore, the identification of (direct) target genes that are shared by different transcription factors might also provide clues about which regulators act together, both molecularly (interacting proteins, or proteins binding to the same promoters but not interacting directly), and at the genetic level (either related genes that are (partially) redundant, or genes that form part of different pathways that control the same process). Such analysis would complement, and guide, other genetic studies to characterize overlapping functions among transcriptional regulators (for example, by identifying pairs or groups of genes to be analyzed in double, or multiple, mutant combinations; see above).
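
Once target gene lists are available for several transcription factors, the search for shared targets described above reduces to comparing sets. The following sketch, with invented factor and gene names, lists every pair of factors that shares at least a chosen number of targets; such pairs would be candidates for analysis in double mutant combinations.

```python
from itertools import combinations

def shared_targets(targets_by_factor, min_shared=1):
    """Pairs of factors and the target genes they have in common.

    targets_by_factor maps a factor name to the set of its target genes.
    Only pairs sharing at least min_shared targets are returned.
    """
    pairs = {}
    for a, b in combinations(sorted(targets_by_factor), 2):
        common = targets_by_factor[a] & targets_by_factor[b]
        if len(common) >= min_shared:
            pairs[(a, b)] = common
    return pairs

# Invented target lists for three hypothetical transcription factors.
targets = {
    "TF1": {"g1", "g2", "g3"},
    "TF2": {"g2", "g3", "g4"},
    "TF3": {"g5"},
}

overlaps = shared_targets(targets)
```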

Last, it should be noted that the characterization of the Arabidopsis transcriptome, and its explanation in terms of transcription factor activity, faces one challenge not encountered in yeast, in which many of these types of studies have been pioneered: the multicellular nature of plants. The profiles of gene expression of different cell types and cells are logically different, and so can be their responses to the activity of particular transcription factors. Thus, in many instances the transcriptome that is characterized is in fact the average of those of the different cell types included in the study. It is still technically challenging, although not impossible, to achieve cellular resolution in genome-wide studies in multicellular organisms.
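
The averaging effect can be made concrete with a small worked example. It assumes, purely for illustration, that a gene has the same baseline expression in every cell type of a sample and is induced tenfold in a cell type that makes up one tenth of the cells; the names and numbers are invented.

```python
def bulk_fold_change(cell_fractions, fold_changes):
    """Fold change observed in a mixed sample.

    Assumes equal baseline expression in all cell types, so the observed
    value is the average of the per-cell-type fold changes, weighted by
    the fraction of cells of each type.
    """
    return sum(cell_fractions[c] * fold_changes[c] for c in cell_fractions)

# Invented composition: 10% of the cells respond with a tenfold induction.
fractions = {"responding_cells": 0.1, "other_cells": 0.9}
folds = {"responding_cells": 10.0, "other_cells": 1.0}

observed = bulk_fold_change(fractions, folds)
```

Under these assumptions the tenfold induction appears as a change of about 1.9-fold in the whole sample, which shows how a strong cell-type-specific response can be diluted in the measured transcriptome.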

6.3 The transcription factor interactome map.

Genome-wide analyses of protein-protein interactions in eukaryotes have been pioneered for the proteomes of yeast and C. elegans using the yeast two-hybrid system. The general merits and problems of the approach in genome-wide screens have been discussed elsewhere (Riechmann and Ratcliffe, 2000; Hazbun and Fields, 2001; Legrain et al., 2001). For transcription factors, the two-hybrid system presents the added complication that their use as “baits” often requires the preparation of specialized constructs in which the sequences coding for activation domains have been removed (which is not a trivial hurdle if hundreds of proteins with presumed, but uncharacterized, activation domains have to be analyzed). This is because the read-out of the system consists of the transcriptional activation of reporter genes as a result of the interaction between the “bait” and “prey” fusion proteins. However, modifications of the two-hybrid system have already been devised (based on repression rather than on activation of transcription) that should be applicable for identifying interactions with transactivator proteins (for example: Hirst et al., 2001).

At present, there is very little data on the Arabidopsis transcription factor interactome, and only a small number of direct interactions between different plant transcription factors have been described (see above, and Singh, 1998). In addition to transcription factors directly interacting among themselves or with other components of the transcription machinery, they can also interact with other types of proteins, thus expanding the networks that would form the transcription factor interactome. Such interactions with proteins of other classes can be mechanistically important for the control of transcription, and they can also provide the link between transcription factor activity and signal transduction pathways, as for example in light- and disease-responses.

Arabidopsis perceives light using different types of light-absorbing photoreceptors, such as phytochromes (phyA through phyE), which absorb red and far-red light, and cryptochromes (cry1 and cry2), which absorb blue and UV-A light (for review, Nagy and Schäfer, 2000). Phytochrome- and cryptochrome-mediated light responses involve differential regulation of gene expression. PIF3 is a transcription factor of the bHLH family that is involved in phytochrome signal transduction, in particular signaling by phyB (Ni et al., 1998; Halliday et al., 1999). PIF3 binds to a cis-element present in several light-regulated promoters, and phyB (which is translocated to the nucleus in a light dependent manner: Kircher et al., 1999; Yamaguchi et al., 1999) reversibly binds to DNA-bound PIF3 upon the light-triggered conversion to its biologically active form (Ni et al., 1998; Martínez-García et al., 2000; Zhu et al., 2000). Thus, phytochromes might act as light-switchable components of transcription complexes, and their interaction with transcription factors might provide a short, direct pathway from light perception to photoresponsive nuclear gene expression (Martínez-García et al., 2000).

Another Arabidopsis transcription factor involved in light-mediated responses is HY5, which controls the photomorphogenic development that is undertaken by seedlings grown in the light. HY5 is a bZIP protein that binds to a cis-element present in several light-responsive promoters (Oyama et al., 1997; Chattopadhyay et al., 1998a). The regulation of HY5 activity by light involves its interaction with COP1, a RING-finger protein with WD-40 repeats whose subcellular localization is light-dependent (nuclear in the dark and cytoplasmic in the light), and that might target HY5 for proteasome-mediated degradation in the nucleus (Osterlund et al., 2000). In this case, it is the COP1 protein that interacts with light-activated photoreceptors (cryptochromes), which repress COP1 activity, thus permitting HY5 accumulation and induction of gene expression (Wang et al., 2001).

The Arabidopsis ankyrin repeat-containing protein NPR1 has been shown to interact with some bZIP transcription factors of the TGA subfamily, which have been implicated in the activation of salicylic acid (SA)-responsive genes (Zhang et al., 1999). NPR1 is required for the induction of systemic acquired resistance (SAR) responses, such as the expression of pathogenesis-related (PR) genes, and it has been shown to act downstream of SAR-inducing agents (SA and avirulent pathogens) (Cao et al., 1997). NPR1 enhances the DNA binding activity of the interacting TGA bZIP proteins, and the in vivo relevance of the protein-protein interaction is demonstrated by the observation that point mutations in NPR1 that abolish its function also disrupt the interaction with the TGA factors (Després et al., 2000; Zhou et al., 2000). NPR1 is localized in both the cytoplasm and the nucleus of unstimulated cells, but concentrates in the nucleus in response to SA and, in fact, nuclear localization of NPR1 is required for PR gene expression (Kinkema et al., 2000). Furthermore, using an in vivo protein fragment complementation assay, based on association of reconstituted murine dihydrofolate reductase (mDHFR) with a fluorescent probe to detect protein-protein interactions, it has been shown that the interaction between NPR1 and the bZIP factor TGA2 is itself induced by SA and localized predominantly in the nucleus (Subramaniam et al., 2001). Thus, the interaction of transcription factor(s) with NPR1 provides a link between an SAR-inducing agent, SA, and the changes in gene expression that are associated with SAR. The mechanism by which the SA signal is transduced to NPR1 still remains to be determined, but additional two-hybrid screens have identified novel proteins of still uncharacterized function that interact with NPR1 and also accumulate in the nucleus (Weigel et al., 2001).

Another example of how an external trigger for a defense response might modulate gene expression through the interaction between a transcription factor and a protein of a different type is provided by the tomato AP2/ERF protein Pti4. Pti4 was first identified in a two-hybrid screen by virtue of its interaction with Pto, a protein kinase that confers resistance to Pseudomonas syringae carrying the corresponding avirulence gene, AvrPto (Zhou et al., 1997). Pti4 is phosphorylated by Pto, which enhances the binding of Pti4 to the GCC-box present in the promoter of PR genes (Gu et al., 2000).

In summary, it is clear from these different examples and from the published literature that a comprehensive description of the transcription factor “interactome” will encompass, in addition to the more than 1,500 transcriptional regulators encoded by the Arabidopsis genome, many other proteins from a variety of functional classes. It is also apparent that such a comprehensive description will not be attained using only two-hybrid experiments, which will reveal only a subset of the interactions that occur in a cell, and that additional, alternative techniques are needed. The mDHFR-based protein fragment complementation assay mentioned above permits direct visualization (through spectroscopy, fluorescence-activated cell sorting, or fluorescence microscopy) of protein-protein interactions in living cells, and can be used to detect interactions in a quantitative manner and to follow the temporal pattern of the interaction (Subramaniam et al., 2001). Although the assay has so far been used in isolated protoplasts, and still needs to be developed to study interactions in whole plant tissues or organs, it represents a valuable alternative to the two-hybrid system for studying and dissecting signaling cascades, and could be developed into a high-throughput screening system for pathway and network mapping or for the identification of molecules that modify protein-protein interactions (Subramaniam et al., 2001). Additional techniques that will be useful to characterize the transcription factor interactome include fluorescence resonance energy transfer (FRET), which relies on protein fusions to spectral variants of the jellyfish green fluorescent protein (GFP) (Gadella et al., 1999; Shah et al., 2001).
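The examples discussed in this section can be collected into a small interaction graph, as a sketch of how an interactome map might be represented. The Python fragment below is an illustrative assumption about data layout, not an established format: each record names two interacting proteins and a broad class for each, and the helper functions (`build_graph`, `non_tf_partners`) are hypothetical names introduced here. Only interactions named in the text are listed; the Pti4-Pto pair is from tomato, the others from Arabidopsis.

```python
from collections import defaultdict

TF = "transcription factor"

# (protein_a, class_a, protein_b, class_b), taken from the examples in the text.
INTERACTIONS = [
    ("PIF3", TF, "phyB", "photoreceptor"),
    ("HY5", TF, "COP1", "RING-finger protein"),
    ("COP1", "RING-finger protein", "cryptochrome", "photoreceptor"),
    ("TGA2", TF, "NPR1", "ankyrin repeat protein"),
    ("Pti4", TF, "Pto", "protein kinase"),   # tomato
]


def build_graph(interactions):
    """Build an undirected adjacency map and a protein-to-class lookup."""
    graph = defaultdict(set)
    classes = {}
    for a, class_a, b, class_b in interactions:
        graph[a].add(b)
        graph[b].add(a)
        classes[a] = class_a
        classes[b] = class_b
    return graph, classes


def non_tf_partners(graph, classes):
    """For each transcription factor, list partners that belong to other classes."""
    return {
        protein: sorted(p for p in partners if classes[p] != TF)
        for protein, partners in graph.items()
        if classes[protein] == TF
    }


graph, classes = build_graph(INTERACTIONS)
links_to_signaling = non_tf_partners(graph, classes)
```

Even in this minimal form, every transcription factor in the list is connected to a protein of a different class, which reflects the point that the interactome extends well beyond the transcriptional regulators themselves.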

Conclusion

Genomics research, and functional genomics in particular, has often been hailed as the provider of a new paradigm in biology research; it has also sometimes been reviled as consisting of little more than “fishing” experiments. Both opposing views are based on one common premise: the contrast between a frequently hypothesis-free genomics and the hypothesis-driven research that has been so much favored in molecular biology over the past decades. But genomics might not fit either of these two disparate views. In many respects, genomics resembles classic biology disciplines, such as genetics (in which the whole genome is blindly mutagenized at random to “fish” for interesting, novel phenotypes), and taxonomy and phylogeny (in which lists of elements are compiled and the relationships between them have to be established). In many instances, it is the methodical collection of unanticipated data that allows the formulation of new hypotheses. However, if not radically different in concepts, genomics certainly changes the scale of biology research, and provides new dimensions to it. The relative simplicity of the Arabidopsis thaliana genome, together with the availability of many modern genetic and genomic research tools in that species, makes Arabidopsis a premier organism in which to elucidate the complex logic of transcription at a genome-wide level in multicellular eukaryotes.

Acknowledgments

I wish to acknowledge my colleagues at Mendel Biotechnology for their input and work in our transcription factor genomics research program, as well as for their discussions, insight, and comments. I also wish to acknowledge the work of all those who participated in the Arabidopsis Genome Initiative and sequenced the Arabidopsis genome.

References

1.

R. Aasland, T. J. Gibson, and A. F. Stewart. 1995. The PHD finger: implications for chromatin-mediated transcriptional regulation. Trends Biochem. Sci. 20:56–59.

2.

H. Abe, K. Yamaguchi-Shinozaki, T. Urao, T. Iwasaki, D. Hosokawa, and K. Shinozaki. 1997. Role of Arabidopsis MYC and MYB homologs in drought- and abscisic acid-regulated gene expression. Plant Cell 9:1859–1868.

3.

M. D. Adams, S. E. Celniker, R. A. Holt, C. A. Evans, J. D. Gocayne, P. G. Amanatides, S. E. Scherer, P. W. Li, R. A. Hoskins, and R. F. Galle. 2000. The genome sequence of Drosophila melanogaster. Science 287:2185–2195.

4.

T. Agalioti, S. Lomvardas, B. Parekh, J. Yie, T. Maniatis, and D. Thanos. 2000. Ordered recruitment of chromatin modifying and general transcription factors to the IFN-beta promoter. Cell 103:667–678.

5.

J. Ahringer. 2000. NuRD and SIN3 histone deacetylase complexes in development. Trends Genet. 16:351–356.

6.

A. Akhtar, D. Zink, and P. B. Becker. 2000. Chromodomains are protein-RNA interaction modules. Nature 407:405–409.

7.

D. Alabadí, T. Oyama, M. J. Yanovsky, F. G. Harmon, P. Más, and S. A. Kay. 2001. Reciprocal regulation between TOC1 and LHY/CCA1 within the Arabidopsis circadian clock. Science 293:880–883.

8.

R. B. Altman and S. Raychaudhuri. 2001. Whole-genome expression analysis: challenges beyond clustering. Curr. Opin. Struct. Biol. 11:340–347.

9.

P. Amedeo, Y. Habu, K. Afsar, O. M. Scheid, and J. Paszkowski. 2000. Disruption of the plant gene MOM releases transcriptional silencing of methylated genes. Nature 405:203–206.

10.

T. Aoyama. 1999. Glucocorticoid-inducible gene expression in plants. In Inducible gene expression in plants, Reynolds, P. H. S. (Ed), (Wallingford, UK: CABI Publishing), pp. 43–59.

11.

T. Aoyama, C-H. Dong, Y. Wu, M. Carabelli, G. Sessa, I. Ruberti, G. Morelli, and N-H. Chua. 1995. Ectopic expression of the Arabidopsis transcription activator Athb-1 alters leaf cell fate in tobacco. Plant Cell 7:1773–1785.

12.

Arabidopsis Genome Initiative. 2000. Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature 408:796–815.

13.

L. Aravind and E. V. Koonin. 1998. Second family of histone deacetylases. Science 280:1167a.

14.

L. Aravind and D. Landsman. 1998. AT-hook motifs identified in a wide variety of DNA-binding proteins. Nucleic Acids Res. 26:4413–4421.

15.

G. Argüello-Astorga and L. Herrera-Estrella. 1998. Evolution of light-regulated plant promoters. Annu. Rev. Plant Physiol. Plant Mol. Biol. 49:525–555.

16.

M. I. Arnone and E. H. Davidson. 1997. The hardwiring of development: organization and function of genomic regulatory systems. Development 124:1851–1864.

17.

D. Balciunas and H. Ronne. 2000. Evidence of domain swapping within the jumonji family of transcription factors. Trends Biochem. Sci. 25:274–276.

18.

A. J. Bannister, P. Zegerman, J. F. Partridge, E. A. Miska, J. O. Thomas, R. C. Allshire, and T. Kouzarides. 2001. Selective recognition of methylated lysine 9 on histone H3 by the HP1 chromo domain. Nature 410:120–124.

19.

D. R. Bastola, V. V. Pethe, and I. Winicov. 1998. Alfin1, a novel zinc-finger protein in alfalfa roots that binds to promoter elements in the salt-inducible MsPRP2 gene. Plant Mol. Biol. 38:1123–1135.

20.

D. C. Baulcombe. 1999. Fast forward genetics based on virus-induced gene silencing. Curr. Opin. Plant Biol. 2:109–113.

21.

P. Benfey and N. H. Chua. 1990. The Cauliflower Mosaic Virus 35S promoter: combinatorial regulation of transcription in plants. Science 250:959–966.

22.

P. N. Benfey, L. Ren, and N. H. Chua. 1990a. Combinatorial and synergistic properties of CaMV 35S enhancer subdomains. EMBO J. 9:1685–1696.

23.

P. N. Benfey, L. Ren, and N. H. Chua. 1990b. Tissue-specific expression from CaMV 35S enhancer subdomains in early stages of plant development. EMBO J. 9:1677–1684.

24.

P. N. Benfey and D. Weigel. 2001. Transcriptional networks controlling plant development. Plant Physiol. 125:109–111.

25.

M. D. Biggin and R. Tjian. 2001. Transcriptional regulation in Drosophila: the post genome challenge. Funct. Integr. Genomics 1:223–234.

26.

A. Birve, A. K. Sengupta, D. Beuchle, J. Larsson, J. A. Kennison, A. Rasmuson-Lestander, and J. Muller. 2001. Su(z)12, a novel Drosophila Polycomb group gene that is conserved in vertebrates and plants. Development 128:3371–3379.

27.

G. Blanc, A. Barakat, R. Guyot, R. Cooke, and M. Delseny. 2000. Extensive duplication and reshuffling in the Arabidopsis genome. Plant Cell 12:1093–1101.

28.

M. A. Blázquez and D. Weigel. 2000. Integration of floral inductive signals in Arabidopsis. Nature 404:889–892.

29.

T. J. Boggon, W. S. Shan, S. Santagata, S. C. Myers, and L. Shapiro. 1999. Implication of tubby proteins as transcription factors by structure-based functional analysis. Science 286:2119–2125.

30.

K. Bomblies, N. Dagenais, and D. Weigel. 1999. Redundant enhancers mediate transcriptional repression of AGAMOUS by APETALA2. Dev. Biol. 216:260–264.

31.

C. Bonifer. 2000. Developmental regulation of eukaryotic gene loci: which cis-regulatory information is required? Trends Genet. 16:310–315.

32.

L. Bordoli, M. Netsch, U. Luthi, W. Lutz, and R. Eckner. 2001. Plant orthologs of p300/CBP: conservation of a core domain in metazoan p300/CBP acetyltransferase-related proteins. Nucleic Acids Res. 29:589–597.

33.

M. J. Bottomley, M. W. Collard, J. I. Huggenvik, Z. Liu, T. J. Gibson, and M. Sattler. 2001. The SAND domain structure defines a novel DNA-binding fold in transcriptional regulation. Nature Struct. Biol. 8:626–633.

34.

N. Bouche and D. Bouchez. 2001. Arabidopsis gene knockout: phenotypes wanted. Curr. Opin. Plant Biol. 4:111–117.

35.

J. L. Bowman. 2000. The YABBY gene family and abaxial cell fate. Curr. Opin. Plant Biol. 3:17–22.

36.

J. L. Bowman, J. Alvarez, D. Weigel, E. M. Meyerowitz, and D. Smyth. 1993. Control of flower development in Arabidopsis thaliana by APETALA1 and interacting genes. Development 119:721–743.

37.

J. L. Bowman, Y. Eshed, S. Baum, J. F. Emery, S. K. Floyd, J. Alvarez, N. P. Hawker, J-Y. Lee, K. R. Siegfried, and R. Khodosh. 2001. The story of CRABS CLAW (or how we learned to love the mutagen). Flowering Newsletter 31:3–11.

38.

J. L. Bowman and D. R. Smyth. 1999. CRABS CLAW, a gene that regulates carpel and nectary development in Arabidopsis, encodes a novel protein with zinc finger and helix-loop-helix domains. Development 126:2387–2396.

39.

J. L. Bowman, D. R. Smyth, and E. M. Meyerowitz. 1991. Genetic interactions among floral homeotic genes of Arabidopsis. Development 112:1–20.

40.

S. V. Brasher, B. O. Smith, R. H. Fogh, D. Nietlispach, A. Thiru, P. R. Nielsen, R. W. Broadhurst, L. J. Ball, N. V. Murzina, and E. D. Laue. 2000. The structure of mouse HP1 suggests a unique mode of single peptide recognition by the shadow chromo domain dimer. EMBO J. 19:1587–1597.

41.

H. W. Brock and M. van Lohuizen. 2001. The Polycomb group-no longer an exclusive club? Curr. Opin. Genet. Dev. 11:175–181.

42.

C. E. Brown, L. Howe, K. Sousa, S. C. Alley, M. J. Carrozza, S. Tan, and J. L. Workman. 2001. Recruitment of HAT complexes by direct activator interactions with the ATM-related Tra1 subunit. Science 292:2333–2337.

43.

J. Brzeski, W. Podstolski, K. Olczak, and A. Jerzmanowski. 1999. Identification and analysis of the Arabidopsis thaliana BSH gene, a member of the SNF5 gene family. Nucleic Acids Res. 27:2393–2399.

44.

T. R. Bürglin. 1998. The PBC domain contains a MEINOX domain: coevolution of Hox and TALE homeobox genes? Dev. Genes Evol. 208:113–116.

45.

C. G. Burns, R. Ohi, A. R. Krainer, and K. L. Gould. 1999. Evidence that Myb-related CDC5 proteins are required for pre-mRNA splicing. Proc. Natl. Acad. Sci. U. S. A. 96:13789–13794.

46.

M. A. Busch, K. Bomblies, and D. Weigel. 1999. Activation of a floral homeotic gene in Arabidopsis. Science 285:585–587.

47.

M. V. Byzova, J. Franken, M. G. Aarts, J. de Almeida-Engler, G. Engler, C. Mariani, M. M. Van Lookeren Campagne, and G. C. Angenent. 1999. Arabidopsis STERILE APETALA, a multifunctional gene regulating inflorescence, flower, and ovule development. Genes Dev. 13:1002–1014.

48.

H. Cao, J. Glazebrook, J. D. Clarke, S. Volko, and X. Dong. 1997. The Arabidopsis NPR1 gene that controls systemic acquired resistance encodes a novel protein containing ankyrin repeats. Cell 88:57–63.

49.

A. D. Capili, D. C. Schultz, F. J. Rauscher, and K. L. B. Borden. 2001. Solution structure of the PHD domain from the KAP-1 corepressor: structural determinants for PHD, RING and LIM zinc-binding domains. EMBO J. 20:165–177.

50.

G. Cardon, S. Hohmann, J. Klein, K. Nettesheim, H. Saedler, and P. Huijser. 1999. Molecular characterisation of the Arabidopsis SBP-box genes. Gene 237:91–104.

51.

S. B. Carroll. 2000. Endless forms: the evolution of gene regulation and morphological diversity. Cell 101:577–580.

52.

R. L. Chan, G. M. Gago, C. M. Palena, and D. H. Gonzalez. 1998. Homeoboxes in plant development. Biochim. Biophys. Acta 1442:1–19.

53.

Q. Chao, M. Rothenberg, R. Solano, G. Roman, W. Terzaghi, and J. R. Ecker. 1997. Activation of the ethylene gas response pathway in Arabidopsis by the nuclear protein ETHYLENE-INSENSITIVE3 and related proteins. Cell 89:1133–1144.

54.

S. Chattopadhyay, L. H. Ang, P. Puente, X. W. Deng, and N. Wei. 1998a. Arabidopsis bZIP protein HY5 directly interacts with light-responsive promoters in mediating light control of gene expression. Plant Cell 10:673–683.

55.

S. Chattopadhyay, P. Puente, X. W. Deng, and N. Wei. 1998b. Combinatorial interaction of light-responsive elements plays a critical role in determining the response characteristics of light-regulated promoters in Arabidopsis. Plant J. 15:69–77.

56.

C. M. Chen, C. T. Wang, and C. H. Ho. 2001a. A plant gene encoding a myb-like protein that binds telomeric ggtttag repeats in vitro. J. Biol. Chem. 276:16511–16519.

57.

D. Chen, H. Ma, H. Hong, S. S. Koh, S-M. Huang, B. T. Schurter, D. W. Aswad, and M. R. Stallcup. 1999. Regulation of transcription by a protein methyltransferase. Science 284:2174–2177.

58.

H. Chen, M. Tini, and R. M. Evans. 2001b. HATs on and beyond chromatin. Curr. Opin. Cell Biol. 13:218–224.

59.

W. Chen, G. Chao, and K. B. Singh. 1996. The promoter of a H₂O₂-inducible, Arabidopsis glutathione S-transferase gene contains closely linked OBF- and OBP1-binding sites. Plant J. 10:955–966.

60.

R. J. Cho, M. J. Campbell, E. A. Winzeler, L. Steinmetz, A. Conway, L. Wodicka, T. G. Wolfsberg, A. E. Gabrielian, D. Landsman, and D. J. Lockhart. 1998. A genome-wide transcriptional analysis of the mitotic cell cycle. Mol. Cell 2:65–73.

61.

H. Christiansen, A. C. Hansen, I. Vijn, N. Pallisgaard, K. Larsen, W. C. Yang, T. Bisseling, K. A. Marcker, and E. O. Jensen. 1996. A novel type of DNA-binding protein interacts with a conserved sequence in an early nodulin ENOD12 promoter. Plant Mol. Biol. 32:809–821.

62.

C. F. Chuang and E. M. Meyerowitz. 2000. Specific and heritable genetic interference by double-stranded RNA in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U. S. A. 97:4985–4990.

63.

P. F. Cliften, L. W. Hillier, L. Fulton, T. Graves, T. Miner, W. R. Gish, R. H. Waterston, and M. Johnston. 2001. Surveying Saccharomyces genomes to identify functional elements by comparative DNA sequence analysis. Genome Res. 11:1175–1186.

64.

E. S. Coen and E. M. Meyerowitz. 1991. The war of the whorls: genetic interactions controlling flower development. Nature 353:31–37.

65.

C. Colbert, B. J. Till, R. Tompa, S. Reynolds, M. N. Steine, A. T. Yeung, C. M. McCallum, L. Comai, and S. Henikoff. 2001. High-throughput screening for induced point mutations. Plant Physiol. 126:480–484.

66.

J. Conner and Z. Liu. 2000. LEUNIG, a putative transcriptional corepressor that regulates AGAMOUS expression during flower development. Proc. Natl. Acad. Sci. U. S. A. 97:12902–12907.

67.

J. Cooke, M. A. Nowak, M. Boerlijst, and J. Maynard-Smith. 1997. Evolutionary origins and maintenance of redundant gene expression during metazoan development. Trends Genet. 13:360–364.

68.

M. P. Cosma, S. Panizza, and K. Nasmyth. 2001. Cdk1 triggers association of RNA polymerase to cell cycle promoters only after recruitment of the Mediator by SBF. Mol. Cell 7:1213–1220.

69.

M. P. Cosma, T. Tanaka, and K. Nasmyth. 1999. Ordered recruitment of transcription and chromatin remodeling factors to a cell cycle- and developmentally regulated promoter. Cell 97:299–311.

70.

P. Cramer, D. A. Bushnell, and R. D. Kornberg. 2001. Structural basis of transcription: RNA polymerase II at 2.8 A resolution. Science 19:1119.

71.

T. Cremer and C. Cremer. 2001. Chromosome territories, nuclear architecture and gene regulation in mammalian cells. Nat. Rev. Genet. 2:292–301.

72.

P. Cubas, N. Lauter, J. Doebley, and E. Coen. 1999a. The TCP domain: a motif found in proteins regulating plant growth and development. Plant J. 18:215–222.

73.

P. Cubas, C. Vincent, and E. Coen. 1999b. An epigenetic mutation responsible for natural variation in floral symmetry. Nature 401:157–161.

74.

C. Cvitanich, N. Pallisgaard, K. A. Nielsen, A. C. Hansen, K. Larsen, K. Pihakaski-Maunsbach, K. A. Marcker, and E. O. Jensen. 2000. CPP1, a DNA-binding protein involved in the expression of a soybean leghemoglobin c3 gene. Proc. Natl. Acad. Sci. U. S. A. 97:8163–8168.

75.

I. B. D'Agostino and J. J. Kieber. 1999. Phosphorelay signal transduction: the emerging family of plant response regulators. Trends Biochem. Sci. 24:452–456.

76.

M. Dangl, G. Brosch, H. Haas, P. Loidl, and A. Lusser. 2001. Comparative analysis of HD2 type histone deacetylases in higher plants. Planta 213:280–285.

77.

E. H. Davidson. 2001. Genomic regulatory systems. Development and evolution. (San Diego: Academic Press).

78.

S. M. de Jager, M. Menges, U-M. Bauer, and J. A. H. Murray. 2001. Arabidopsis E2F1 binds a sequence present in the promoter of S-phase-regulated gene AtCDC6 and is a member of a multigene family with differential activities. Plant Mol. Biol. 47:555–568.

79.

N. de Vetten, F. Quattrocchio, J. Mol, and R. Koes. 1997. The an11 locus controlling flower pigmentation in petunia encodes a novel WD-repeat protein conserved in yeast, plants, and animals. Genes Dev. 11:1422–1434.

80.

W. B. Derry, A. Putzke, and J. H. Rothman. 2001. Caenorhabditis elegans p53: role in apoptosis, meiosis, and stress resistance. Science 294:591–595.

81.

R. Desikan, S. A.-H.-Mackerness, J. T. Hancock, and S. J. Neill. 2001. Regulation of the Arabidopsis transcriptome by oxidative stress. Plant Physiol. 127:159–172.

82.

C. Després, C. DeLong, S. Glaze, E. Liu, and P. R. Fobert. 2000. The Arabidopsis NPR1/NIM1 protein enhances the DNA binding activity of a subgroup of the TGA family of bZIP transcription factors. Plant Cell 12:279–290.

83.

D. Desveaux, C. Després, A. Joyeux, R. Subramaniam, and N. Brisson. 2000. PBF-2 is a novel single-stranded DNA binding factor implicated in PR-10a gene activation in potato. Plant Cell 12:1477–1489.

84.

M. K. Deyholos and L. E. Sieburth. 2000. Separable whorl-specific expression and negative regulation by enhancer elements within the AGAMOUS second intron. Plant Cell 12:1799–1810.

85.

C. Dhalluin, J. E. Carlson, L. Zeng, C. He, A. K. Aggarwal, and M. M. Zhou. 1999. Structure and ligand of a histone acetyltransferase bromodomain. Nature 399:491–496.

86.

A. Dill and T-p. Sun. 2001. Synergistic derepression of gibberellin signaling by removing RGA and GAI function in Arabidopsis thaliana. Genetics 159:777–785.

87.

J. Doebley and L. Lukens. 1998. Transcriptional regulators and the evolution of plant form. Plant Cell 10:1075–1082.

88.

J. Doebley, A. Stec, and L. Hubbard. 1997. The evolution of apical dominance in maize. Nature 386:485–488.

89.

L. Du and Z. Chen. 2000. Identification of genes encoding receptor-like protein kinases as possible targets of pathogen- and salicylic acid-induced WRKY DNA-binding proteins in Arabidopsis. Plant J. 24:837–847.

90.

L. Duret and P. Bucher. 1997. Searching for regulatory elements in human noncoding sequences. Curr. Opin. Struct. Biol. 7:399–406.

91.

M. Egea-Cortines, H. Saedler, and H. Sommer. 1999. Ternary complex formation between the MADS-box proteins SQUAMOSA, DEFICIENS and GLOBOSA is involved in the control of floral architecture in Antirrhinum majus. EMBO J. 18:5370–5379.

92.

S. C. R. Elgin and J. L. Workman (Eds). 2000. Chromatin structure and gene expression. 2nd edn. (Oxford, UK: Oxford University Press).

93.

Y. Eshed, S. F. Baum, and J. L. Bowman. 1999. Distinct mechanisms promote polarity establishment in carpels of Arabidopsis. Cell 99:199–209.

94.

Y. Eshed, S. F. Baum, J. V. Perea, and J. L. Bowman. 2001. Establishment of polarity in lateral organs of plants. Curr. Biol. 11:1251–1260.

95.

T. Eulgem, P. J. Rushton, S. Robatzek, and I. E. Somssich. 2000. The WRKY superfamily of plant transcription factors. Trends Plant Sci. 5:199–206.

96.

C. Ferrándiz, Q. Gu, R. Martienssen, and M. F. Yanofsky. 2000. Redundant regulation of meristem identity and plant architecture by FRUITFULL, APETALA1 and CAULIFLOWER. Development 127:725–734.

97.

J. W. Fickett and W. W. Wasserman. 2000. Discovery and modeling of transcriptional regulatory regions. Curr. Opin. Biotechnol. 11:19–24.

98.

E. J. Finnegan, W. J. Peacock, and E. S. Dennis. 2000. DNA methylation, a key regulator of plant development and other processes. Curr. Opin. Genet. Dev. 10:217–223.

99.

A. Flaus and T. Owen-Hughes. 2001. Mechanisms for ATP-dependent chromatin remodelling. Curr. Opin. Genet. Dev. 11:148–154.

100.

R. Foster, T. Izawa, and N. H. Chua. 1994. Plant bZIP proteins gather at ACGT elements. FASEB J. 8:192–200.

101.

C. Francastel, D. Schubeler, D. I. Martin, and M. Groudine. 2000. Nuclear compartmentalization and gene activity. Nat. Rev. Mol. Cell Biol. 1:137–143.

102.

N. J. Francis and R. E. Kingston. 2001. Mechanisms of transcriptional memory. Nat. Rev. Mol. Cell Biol. 2:409–421.

103.

A. Frary, T. C. Nesbitt, A. Frary, S. Grandillo, E. van der Knaap, B. Cong, J. Liu, J. Meller, R. Elber, and K. B. Alpert. 2000. fw2.2: a quantitative trait locus key to the evolution of tomato fruit size. Science 289:85–88.

104.

A. G. Fraser, R. S. Kamath, P. Zipperlen, M. Martinez-Campos, M. Sohrmann, and J. Ahringer. 2000. Functional genomic analysis of C. elegans chromosome I by systematic RNA interference. Nature 408:325–330.

105.

I. Fridborg, S. Kuusk, M. Robertson, and E. Sundberg. 2001. The Arabidopsis protein SHI represses gibberellin responses in Arabidopsis and barley. Plant Physiol. 127:937–948.

106.

C. J. Fry and C. L. Peterson. 2001. Chromatin remodeling enzymes: who's on first? Curr. Biol. 11:R185–R197.

107.

H. Fu, W. Park, X. Yan, Z. Zheng, B. Shen, and H. K. Dooner. 2001. The highly recombinogenic bz locus lies in an unusually gene-rich region of the maize genome. Proc. Natl. Acad. Sci. U. S. A. 98:8903–8908.

108.

T. W. J. Gadella, G. N. M. van der Krogt, and T. Bisseling. 1999. GFP-based FRET microscopy in living plant cells. Trends Plant Sci. 4:287–291.

109.

A. P. Gasch, P. T. Spellman, C. M. Kao, O. Carmel-Harel, M. B. Eisen, G. Storz, D. Botstein, and P. O. Brown. 2000. Genomic expression programs in the response of yeast cells to environmental changes. Mol. Biol. Cell 11:4241–4257.

110.

W. J. Gehring, M. Affolter, and T. Bürglin. 1994. Homeodomain proteins. Annu. Rev. Biochem. 63:487–526.

111.

A. R. Gendall, Y. Y. Levy, A. Wilson, and C. Dean. 2001. The VERNALIZATION 2 gene mediates the epigenetic regulation of vernalization in Arabidopsis. Cell 107:525–535.

112.

P. Gil, E. Dewey, J. Friml, Y. Zhao, K. C. Snowden, J. Putterill, K. Palme, M. Estelle, and J. Chory. 2001. BIG: a calossin-like protein required for polar auxin transport in Arabidopsis. Genes Dev. 15:1985–1997.

113.

L. U. Gilliland, E. C. McKinney, M. A. Asmussen, and R. B. Meagher. 1998. Detection of deleterious genotypes in multigenerational studies. I. Disruptions in individual Arabidopsis actin genes. Genetics 149:717–725.

114.

T. Girke, J. Todd, S. Ruuska, J. White, C. Benning, and J. Ohlrogge. 2000. Microarray analysis of developing Arabidopsis seeds. Plant Physiol 124:1570–1581.

115.

A. Goffeau, R. Aert, M. L. Agostini-Carbone, A. Ahmed, M. Aigle, L. Alberghina, K. Albermann, M. Albers, M. Aldea, and D. Alexandraki. 1997. The yeast genome directory. Nature 387:5.

116.

P. Gönczy, G. Echeverri, K. Oegema, A. Coulson, S. J. Jones, R. R. Copley, J. Duperon, J. Oegema, M. Brehm, and E. Cassin. 2000. Functional genomic analysis of cell division in C. elegans using RNAi of genes on chromosome III. Nature 408:331–336.

117.

R. H. Goodman and S. Smolik. 2000. CBP/p300 in cell growth, transformation, and development. Genes Dev 14:1553–1577.

118.

J. Goodrich, P. Puangsomlee, M. Martin, D. Long, E. M. Meyerowitz, and G. Coupland. 1997. A Polycomb-group gene regulates homeotic gene expression in Arabidopsis. Nature 386:44–51.

119.

K. Goto, J. Kyozuka, and J. L. Bowman. 2001. Turning floral organs into leaves, leaves into floral organs. Curr. Opin. Genet. Dev 11:449–456.

120.

A. Gould. 1997. Functions of mammalian Polycomb group and trithorax group related genes. Curr. Opin. Genet. Dev 7:488–494.

121.

K. D. Grasser. 1998. HMG1 and HU proteins: architectural elements in plant chromatin. Trends Plant Sci 3:260–265.

122.

M. R. Green. 2000. TBP-associated factors (TAFIIs): multiple, selective transcriptional mediators in common complexes. Trends Biochem. Sci 25:59–63.

123.

D. Greenbaum, N. M. Luscombe, R. Jansen, J. Qian, and M. Gerstein. 2001. Interrelating different types of genomic data, from proteome to secretome: 'oming in on function. Genome Res 11:1463–1468.

124.

U. Grossniklaus, J. P. Vielle-Calzada, M. A. Hoeppner, and W. B. Gagliano. 1998. Maternal control of embryogenesis by MEDEA, a polycomb group gene in Arabidopsis. Science 280:446–450.

125.

E. Grotewold, M. B. Sainz, L. Tagliani, J. M. Hernandez, B. Bowen, and V. L. Chandler. 2000. Identification of the residues in the Myb domain of maize C1 that specify the interaction with the bHLH cofactor R. Proc. Natl. Acad. Sci. U. S. A 97:13579–13584.

126.

Y-Q. Gu, C. Yang, V. K. Thara, J. Zhou, and G. B. Martin. 2000. Pti4 is induced by ethylene and salicylic acid, and its product is phosphorylated by the Pto kinase. Plant Cell 12:771–785.

127.

T. J. Guilfoyle, T. Ulmasov, and G. Hagen. 1998. The ARF family of transcription factors and their role in plant hormone-responsive transcription. Cell. Mol. Life Sci 54:619–627.

128.

D. L. Gumucio, D. A. Shelton, W. J. Bailey, J. L. Slightom, and M. Goodman. 1993. Phylogenetic footprinting reveals unexpected complexity in trans factor binding upstream from the epsilon-globin gene. Proc. Natl. Acad. Sci. U. S. A 90:6018–6022.

129.

D. L. Gumucio, D. A. Shelton, W. Zhu, D. Millinoff, T. Gray, J. H. Bock, J. L. Slightom, and M. Goodman. 1996. Evolutionary strategies for the elucidation of cis and trans factors that regulate the developmental switching programs of the beta-like globin genes. Mol. Phylogenet. Evol 5:18–32.

130.

G. Gusmaroli, C. Tonelli, and R. Mantovani. 2001. Regulation of the CCAAT-Binding NF-Y subunits in Arabidopsis thaliana. Gene 264:173–185.

131.

Y. Habu, T. Kakutani, and J. Paszkowski. 2001. Epigenetic developmental mechanisms in plants: molecules and targets of plant epigenetic regulation. Curr. Opin. Genet. Dev 11:215–220.

132.

T. Halbach, N. Scheer, and W. Werr. 2000. Transcriptional activation by the PHD finger is inhibited through an adjacent leucine zipper that binds 14-3-3 proteins. Nucleic Acids Res 28:3542–3550.

133.

K. J. Halliday, M. Hudson, M. Ni, M. Qin, and P. H. Quail. 1999. poc1: an Arabidopsis mutant perturbed in phytochrome signaling because of a T DNA insertion in the promoter of PIF3, a gene encoding a phytochrome-interacting bHLH protein. Proc. Natl. Acad. Sci. U. S. A 96:5832–5837.

134.

S. M. Hammond, A. A. Caudy, and G. J. Hannon. 2001. Post-transcriptional gene silencing by double-stranded RNA. Nat. Rev. Genet 2:110–119.

135.

M. Hampsey and D. Reinberg. 1999. RNA polymerase II as a control panel for multiple coactivator complexes. Curr. Opin. Genet. Dev 9:132–139.

136.

R. C. Hardison, J. Oeltjen, and W. Miller. 1997. Long human-mouse sequence alignments reveal novel regulatory elements: a reason to sequence the mouse genome. Genome Res 7:959–966.

137.

S. L. Harmer, J. B. Hogenesch, M. Straume, H. S. Chang, B. Han, T. Zhu, X. Wang, J. A. Kreps, and S. A. Kay. 2000. Orchestrated transcription of key pathways in Arabidopsis by the circadian clock. Science 290:2110–2113.

138.

M. Hasebe, C. K. Wen, M. Kato, and J. A. Banks. 1998. Characterization of MADS homeotic genes in the fern Ceratopteris richardii. Proc. Natl. Acad. Sci. U. S. A 95:6222–6227.

139.

T. R. Hazbun and S. Fields. 2001. Networking proteins in yeast. Proc. Natl. Acad. Sci. U. S. A 98:4277–4278.

140.

T. Hirayama and K. Shinozaki. 1996. A cdc5+ homolog of a higher plant, Arabidopsis thaliana. Proc. Natl. Acad. Sci. U. S. A 93:13371–13376.

141.

M. Hirst, C. Ho, L. Sabourin, M. Rudnicki, L. Penn, and I. Sadowski. 2001. A two-hybrid system for transactivator bait proteins. Proc. Natl. Acad. Sci. U. S. A 98:8726–8731.

142.

T. Hobo, Y. Kowyama, and T. Hattori. 1999. A bZIP factor, TRAB1, interacts with VP1 and mediates abscisic acid-induced transcription. Proc. Natl. Acad. Sci. U. S. A 96:15348–15353.

143.

J. B. Hogenesch, K. A. Ching, S. Batalov, A. I. Su, J. R. Walker, Y. Zhou, S. A. Kay, P. G. Schultz, and M. P. Cooke. 2001. A comparison of the Celera and Ensembl predicted gene sets reveals little overlap in novel genes. Cell 106:413–415.

144.

T. Honma and K. Goto. 2001. Complexes of MADS-box proteins are sufficient to convert leaves into floral organs. Nature 409:525–529.

145.

H. Huang, M. Tudor, T. Su, Y. Zhang, Y. Hu, and H. Ma. 1996. DNA binding properties of two Arabidopsis MADS domain proteins: binding consensus and dimer formation. Plant Cell 8:81–94.

146.

J. D. Hughes, P. W. Estep, S. Tavazoie, and G. M. Church. 2000a. Computational identification of cis-regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae. J. Mol. Biol 296:1205–1214.

147.

T. R. Hughes, M. J. Marton, A. R. Jones, C. J. Roberts, R. Stoughton, C. D. Armour, H. A. Bennett, E. Coffey, H. Dai, and Y. D. He. 2000b. Functional discovery via a compendium of expression profiles. Cell 102:109–126.

148.

V. Hugouvieux, J. M. Kwak, and J. I. Schroeder. 2001. An mRNA cap binding protein, ABH1, modulates early abscisic acid signal transduction in Arabidopsis. Cell 106:477–487.

149.

I. Hwang and J. Sheen. 2001. Two-component circuitry in Arabidopsis cytokinin signal transduction. Nature 413:383–389.

150.

M. G. Hwang, I. K. Chung, B. G. Kang, and M. H. Cho. 2001. Sequence-specific binding property of Arabidopsis thaliana telomeric DNA binding protein 1 (AtTBP1). FEBS Lett 503:35–40.

151.

International Human Genome Sequencing Consortium. 2001. Initial sequencing and analysis of the human genome. Nature 409:860–921.

152.

T. Ito, T. Chiba, R. Ozawa, M. Yoshida, M. Hattori, and Y. Sakaki. 2001. A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl. Acad. Sci. U. S. A 98:4569–4574.

153.

V. R. Iyer, C. E. Horak, C. S. Scafe, D. Botstein, M. Snyder, and P. O. Brown. 2001. Genomic binding sites of the yeast cell-cycle transcription factors SBF and MBF. Nature 409:533–538.

154.

T. Jack. 2001. Relearning our ABCs: new twists on an old model. Trends Plant Sci 6:310–316.

155.

R. H. Jacobson, A. G. Ladurner, D. S. King, and R. Tjian. 2000. Structure and function of a human TAFII250 double bromodomain module. Science 288:1422–1425.

156.

L. M. Jakt, L. Cao, K. S. Cheah, and D. K. Smith. 2001. Assessing clusters and motifs from gene expression data. Genome Res 11:112–123.

157.

F. Jeanmougin, J-M. Wurtz, B. Le Douarin, P. Chambon, and R. Losson. 1997. The bromodomain revisited. Trends Biochem. Sci 22:151–153.

158.

J. A. Jeddeloh, T. L. Stokes, and E. J. Richards. 1999. Maintenance of genomic methylation requires a SWI2/SNF2-like protein. Nature Genet 22:94–97.

159.

T. Jenuwein. 2001. Re-SET-ting heterochromatin by histone methyltransferases. Trends Cell Biol 11:266–273.

160.

C. A. P. Joazeiro and A. M. Weissman. 2000. RING finger proteins: mediators of ubiquitin ligase activity. Cell 102:549–552.

161.

D. O. Jones, I. G. Cowell, and P. B. Singh. 2000. Mammalian chromodomain proteins: their role in genome organisation and expression. Bioessays 22:124–137.

162.

J. T. Kadonaga. 1998. Eukaryotic transcription: an interlaced network of transcription factors and chromatin-modifying machines. Cell 92:307–313.

163.

Y. Kagaya, K. Ohmiya, and T. Hattori. 1999. RAV1, a novel DNA-binding protein, binds to bipartite recognition sequence through two distinct DNA-binding domains uniquely found in higher plants. Nucleic Acids Res 27:470–478.

164.

A. Kawaoka, P. Kaothien, K. Yoshida, S. Endo, K. Yamada, and H. Ebinuma. 2000. Functional analysis of tobacco LIM protein Ntlim1 involved in lignin biosynthesis. Plant J 22:289–301.

165.

S. A. Kempin, B. Savidge, and M. F. Yanofsky. 1995. Molecular basis of the cauliflower phenotype in Arabidopsis. Science 267:522–525.

166.

R. A. Kerstetter, K. Bollman, R. A. Taylor, K. Bomblies, and R. S. Poethig. 2001. KANADI regulates organ polarity in Arabidopsis. Nature 411:706–709.

167.

S. Khochbin, A. Verdel, C. Lemercier, and D. Seigneurin-Berny. 2001. Functional significance of histone deacetylase diversity. Curr. Opin. Genet. Dev 11:162–166.

168.

J. C. Kim, S. H. Lee, Y. H. Cheong, C-M. Yoo, S. I. Lee, H. J. Chun, D-J. Yun, J. C. Hong, S. Y. Lee, and C. O. Lim. 2001. A novel cold-inducible zinc finger protein from soybean, SCOF-1, enhances cold tolerance in transgenic plants. Plant J 25:247–259.

169.

S. K. Kim. 2001. Http://C. elegans: mining the functional genomic landscape. Nat. Rev. Genet 2:681–689.

170.

K. E. King, T. Moritz, and N. P. Harberd. 2001. Gibberellins are not required for normal stem growth in Arabidopsis thaliana in the absence of GAI and RGA. Genetics 159:767–776.

171.

M. Kinkema, W. Fan, and X. Dong. 2000. Nuclear localization of NPR1 is required for activation of PR gene expression. Plant Cell 12:2339–2350.

172.

T. Kinoshita, R. Yadegari, J. J. Harada, R. B. Goldberg, and R. L. Fischer. 1999. Imprinting of the MEDEA polycomb gene in the Arabidopsis endosperm. Plant Cell 11:1945–1952.

173.

S. Kircher, L. Kozma-Bognar, L. Kim, E. Adam, K. Harter, E. Schäfer, and F. Nagy. 1999. Light quality-dependent nuclear import of the plant photoreceptors phytochrome A and B. Plant Cell 11:1445–1456.

174.

V. I. Klimyuk, F. Persello-Cartieaux, M. Havaux, P. Contard-David, D. Schuenemann, K. Meiherhoff, P. Gouet, J. D. Jones, N. E. Hoffman, and L. Nussaume. 1999. A chromodomain protein encoded by the Arabidopsis CAO gene is a plant-specific component of the chloroplast signal recognition particle pathway that is involved in LHCP targeting. Plant Cell 11:87–99.

175.

M. A. Koch, B. Weisshaar, J. Kroymann, B. Haubold, and T. Mitchell-Olds. 2001. Comparative genomics and regulatory evolution: conservation and function of the Chs and Apetala3 promoters. Mol. Biol. Evol 18:1882–1891.

176.

R. D. Kornberg. 1999. Eukaryotic transcriptional control. Trends Cell Biol 9:M46–49.

177.

R. D. Kortschak, P. W. Tucker, and R. Saint. 2000. ARID proteins come in from the desert. Trends Biochem. Sci 25:294–299.

178.

B. A. Krizek and E. M. Meyerowitz. 1996. The Arabidopsis homeotic genes APETALA3 and PISTILLATA are sufficient to provide the B class organ identity function. Development 122:11–22.

179.

B. A. Krizek, V. Prost, and A. Macias. 2000. AINTEGUMENTA promotes petal identity and acts as a negative regulator of AGAMOUS. Plant Cell 12:1357–1366.

180.

N. T. Krogan and N. W. Ashton. 2000. Ancestry of plant MADS-box genes revealed by bryophyte (Physcomitrella patens) homologues. New Phytol 147:505–517.

181.

P. J. Krysan, J. C. Young, and M. R. Sussman. 1999. T-DNA as an insertional mutagen in Arabidopsis. Plant Cell 11:2283–2290.

182.

M. Lachner, D. O'Carroll, S. Rea, K. Mechtler, and T. Jenuwein. 2001. Methylation of histone H3 lysine 9 creates a binding site for HP1 proteins. Nature 410:116–120.

183.

M. Ladomery. 1997. Multifunctional proteins suggest connections between transcriptional and post-transcriptional processes. Bioessays 19:903–909.

184.

J. C. Larkin, D. G. Oppenheimer, S. Pollock, and M. D. Marks. 1993. Arabidopsis GLABROUS1 gene requires downstream sequences for function. Plant Cell 5:1739–1748.

185.

D. S. Latchman. 1998. Eukaryotic transcription factors. 3rd edn. (San Diego: Academic Press).

186.

M. M. Lee and J. Schiefelbein. 1999. WEREWOLF, a MYB-related protein in Arabidopsis, is a position-dependent regulator of epidermal cell patterning. Cell 99:473–483.

187.

M. M. Lee and J. Schiefelbein. 2001. Developmentally distinct MYB genes encode functionally equivalent proteins in Arabidopsis. Development 128:1539–1546.

188.

T. I. Lee and R. A. Young. 2000. Transcription of eukaryotic protein-coding genes. Annu. Rev. Genet 34:77–137.

189.

P. Legrain, J. Wojcik, and J. Gauthier. 2001. Protein-protein interaction maps: a lead towards cellular functions. Trends Genet 17:346–352.

190.

X-H. Lei, X. Shen, X-Q. Xu, and H. S. Bernstein. 2000. Human Cdc5, a regulator of mitotic entry, can act as a site-specific DNA binding protein. J. Cell Sci 113:4523–4531.

191.

B. Lemon and R. Tjian. 2000. Orchestrated response: a symphony of transcription factors for gene control. Genes Dev 14:2551–2569.

192.

J. Z. Levin, A. J. de Framond, A. Tuttle, M. W. Bauer, and P. B. Heifetz. 2000. Methods of double-stranded RNA-mediated gene inactivation in Arabidopsis and their use to define an essential gene in methionine biosynthesis. Plant Mol. Biol 44:759–775.

193.

G. Li, K. J. Bishop, M. B. Chandrasekharan, and T. C. Hall. 1999. β-Phaseolin gene activation is a two-step process: PvALF-facilitated chromatin modification followed by abscisic acid-mediated gene activation. Proc. Natl. Acad. Sci. U. S. A 96:7104–7109.

194.

G. Li, S. P. Chandler, A. P. Wolffe, and T. C. Hall. 1998. Architectural specificity in chromatin structure at the TATA box in vivo: nucleosome displacement upon β-phaseolin gene activation. Proc. Natl. Acad. Sci. U. S. A 95:4772–4777.

195.

G. Li, M. B. Chandrasekharan, A. P. Wolffe, and T. C. Hall. 2001a. Chromatin structure and phaseolin gene regulation. Plant Mol. Biol 46:121–129.

196.

X. Li, Y. Song, K. Century, S. Straight, P. Ronald, X. Dong, M. Lassner, and Y. Zhang. 2001b. A fast neutron deletion mutagenesis-based reverse genetics system for plants. Plant J 27:235–242.

197.

Z. Li and T. L. Thomas. 1998. PEI1, an embryo-specific zinc finger protein gene required for heart-stage embryo formation in Arabidopsis. Plant Cell 10:383–398.

198.

J. D. Lieb, X. Liu, D. Botstein, and P. O. Brown. 2001. Promoter-specific binding of Rap1 revealed by genome-wide maps of protein-DNA association. Nature Genet 28:327–334.

199.

S. J. Liljegren, G. S. Ditta, Y. Eshed, B. Savidge, J. L. Bowman, and M. F. Yanofsky. 2000. SHATTERPROOF MADS-box genes control seed dispersal in Arabidopsis. Nature 404:766–770.

200.

X. Lin, S. Kaul, S. Rounsley, T. P. Shea, M. I. Benito, C. D. Town, C. Y. Fujii, T. Mason, C. L. Bowman, and M. Barnstead. 1999. Sequence and analysis of chromosome 2 of the plant Arabidopsis thaliana. Nature 402:761–768.

201.

F. J. Livesey, T. Furukawa, M. A. Steffen, G. M. Church, and C. L. Cepko. 2000. Microarray analysis of the transcriptional network controlled by the photoreceptor homeobox gene Crx. Curr. Biol 10:301–310.

202.

A. M. Lloyd, M. Schena, V. Walbot, and R. W. Davis. 1994. Epidermal cell fate determination in Arabidopsis: patterns defined by a steroid-inducible regulator. Science 266:436–439.

203.

D. J. Lockhart and E. A. Winzeler. 2000. Genomics, gene expression and DNA arrays. Nature 405:827–836.

204.

J. U. Lohmann, R. L. Hong, M. Hobe, M. A. Busch, F. Parcy, R. Simon, and D. Weigel. 2001. A molecular link between stem cell regulation and floral patterning in Arabidopsis. Cell 105:793–803.

205.

G. G. Loots, R. M. Locksley, C. M. Blankespoor, Z. E. Wang, W. Miller, E. M. Rubin, and K. A. Frazer. 2000. Identification of a coordinate regulator of interleukins 4, 13, and 5 by cross-species sequence comparisons. Science 288:136–140.

206.

T. Lotan, M. Ohto, K. M. Yee, M. A. West, R. Lo, R. W. Kwong, K. Yamagishi, R. L. Fischer, R. B. Goldberg, and J. J. Harada. 1998. Arabidopsis LEAFY COTYLEDON1 is sufficient to induce embryo development in vegetative cells. Cell 93:1195–1205.

207.

M. Luo, P. Bilodeau, E. S. Dennis, W. J. Peacock, and A. Chaudhury. 2000. Expression and parent-of-origin effects for FIS2, MEA, and FIE in the endosperm and embryo of developing Arabidopsis seeds. Proc. Natl. Acad. Sci. U. S. A 97:10637–10642.

208.

M. Luo, P. Bilodeau, A. Koltunow, E. S. Dennis, W. J. Peacock, and A. M. Chaudhury. 1999. Genes controlling fertilization-independent seed development in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U. S. A 96:296–301.

209.

N. M. Luscombe, S. E. Austin, H. M. Berman, and J. M. Thornton. 2000. An overview of the structures of protein-DNA complexes. Genome Biol 1:reviews001.001–001.037.

210.

A. Lusser, G. Brosch, A. Loidl, H. Haas, and P. Loidl. 1997. Identification of maize histone deacetylase HD2 as an acidic nucleolar phosphoprotein. Science 277:88–91.

211.

A. Lusser, D. Kolle, and P. Loidl. 2001. Histone acetylation: lessons from the plant kingdom. Trends Plant Sci 6:59–65.

212.

T. J. Lyons, A. P. Gasch, L. A. Gaither, D. Botstein, P. O. Brown, and D. J. Eide. 2000. Genome-wide characterization of the Zap1p zinc-responsive regulon in yeast. Proc. Natl. Acad. Sci. U. S. A 97:7957–7962.

213.

I. Maeda, Y. Kohara, M. Yamamoto, and A. Sugimoto. 2001. Large-scale analysis of gene function in Caenorhabditis elegans by high-throughput RNAi. Curr. Biol 11:171–176.

214.

T. Maes, P. De Keukeleire, and T. Gerats. 1999. Plant tagnology. Trends Plant Sci 4:90–96.

215.

T. Mahmoudi and C. P. Verrijzer. 2001. Chromatin silencing and activation by Polycomb and trithorax group proteins. Oncogene 20:3055–3066.

216.

K. Maleck, A. Levine, T. Eulgem, A. Morgan, J. Schmid, K. A. Lawton, J. L. Dangl, and R. A. Dietrich. 2000. The transcriptome of Arabidopsis thaliana during systemic acquired resistance. Nature Genet 26:403–410.

217.

S. Malik and R. G. Roeder. 2000. Transcriptional regulation through Mediator-like coactivators in yeast and metazoan cells. Trends Biochem. Sci 25:277–283.

218.

R. Marmorstein. 2001a. Protein modules that manipulate histone tails for chromatin regulation. Nat. Rev. Mol. Cell Biol 2:422–432.

219.

R. Marmorstein. 2001b. Structure of histone acetyltransferases. J. Mol. Biol 311:433–444.

220.

R. Marmorstein and S. Y. Roth. 2001. Histone acetyltransferases: function, structure, and catalysis. Curr. Opin. Genet. Dev 11:155–161.

221.

C. Martin and J. Paz-Ares. 1997. MYB transcription factors in plants. Trends Genet 13:67–73.

222.

J. F. Martínez-García, E. Huq, and P. H. Quail. 2000. Direct targeting of light signals to a promoter element-bound transcription factor. Science 288:859–863.

223.

T. Massingham, L. J. Davies, and P. Lio. 2001. Analysing gene function after duplication. Bioessays 23:873–876.

224.

K. Matsumoto and A. P. Wolffe. 1998. Gene regulation by Y-box proteins: coupling control of transcription and translation. Trends Cell Biol 8:318–323.

225.

K. Mayer, C. Schuller, R. Wambutt, G. Murphy, G. Volckaert, T. Pohl, A. Dusterhoft, W. Stiekema, K. D. Entian, and N. Terryn. 1999. Sequence and analysis of chromosome 4 of the plant Arabidopsis thaliana. Nature 402:769–777.

226.

R. C. Meissner, H. Jin, E. Cominelli, M. Denekamp, A. Fuertes, R. Greco, H. D. Kranz, S. Penfield, K. Petroni, and A. Urzainqui. 1999. Function search in a large transcription factor gene family in Arabidopsis: assessing the potential of reverse genetics to identify insertional mutations in R2R3 MYB genes. Plant Cell 11:1827–1840.

227.

M. Merika and D. Thanos. 2001. Enhanceosomes. Curr. Opin. Genet. Dev 11:205–208.

228.

T. Misteli. 2001. Protein dynamics: implications for nuclear architecture and gene expression. Science 291:843–847.

229.

A. Miura, S. Yonebayashi, K. Watanabe, T. Toyama, H. Shimada, and T. Kakutani. 2001. Mobilization of transposons by a mutation abolishing full DNA methylation in Arabidopsis. Nature 411:212–214.

230.

D. Moazed. 2001. Common themes in mechanisms of gene silencing. Mol. Cell 8:489–498.

231.

J. Mol, E. Grotewold, and R. Koes. 1998. How genes paint flowers and seeds. Trends Plant Sci 3:212–217.

232.

L. Molin, A. Mounsey, S. Aslam, P. Bauer, J. Young, M. James, A. Sharma-Oates, and I. A. Hope. 2000. Evolutionary conservation of redundancy between a diverged pair of forkhead transcription factor homologues. Development 127:4825–4835.

233.

T. Münster, J. Pahnke, A. Di Rosa, J. T. Kim, W. Martin, H. Saedler, and G. Theißen. 1997. Floral homeotic genes were recruited from homologous MADS-box genes preexisting in the common ancestor of ferns and seed plants. Proc. Natl. Acad. Sci. U. S. A 94:2415–2420.

234.

J. Murfett, X. J. Wang, G. Hagen, and T. J. Guilfoyle. 2001. Identification of Arabidopsis histone deacetylase HDA6 mutants that affect transgene expression. Plant Cell 13:1047–1061.

235.

Y. Nagano, H. Furuhashi, T. Inaba, and Y. Sasaki. 2001. A novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequences. Nucleic Acids Res 29:4097–4105.

236.

F. Nagy and E. Schäfer. 2000. Nuclear and cytosolic events of light-induced, phytochrome regulated signaling in higher plants. EMBO J 19:157–163.

237.

S. Nakamura, T. J. Lynch, and R. R. Finkelstein. 2001. Physical interactions between ABA response loci of Arabidopsis. Plant J 26:627–635.

238.

M. Nei, P. Xu, and G. Glazko. 2001. Estimation of divergence times from multiprotein sequences for a few mammalian species and several distantly related organisms. Proc. Natl. Acad. Sci. U. S. A 98:2497–2502.

239.

N. Nesi, I. Debeaujon, C. Jond, G. Pelletier, M. Caboche, and L. Lepiniec. 2000. The TT8 gene encodes a basic helix-loop-helix domain protein required for expression of DFR and BAN genes in Arabidopsis siliques. Plant Cell 12:1863–1878.

240.

M. Ng and M. F. Yanofsky. 2001. Function and evolution of the plant MADS-box gene family. Nat. Rev. Genet 2:186–195.

241.

M. Ni, J. M. Tepperman, and P. H. Quail. 1998. PIF3, a phytochrome-interacting factor necessary for normal photoinduced signal transduction, is a novel basic helix-loop-helix protein. Cell 95:657–667.

242.

D. Niessing, W. Driever, F. Sprenger, H. Taubert, H. Jackle, and R. Rivera-Pomar. 2000. Homeodomain position 54 specifies transcriptional versus translational control by bicoid. Mol. Cell 5:395–401.

243.

L. Nover, K. Bharti, P. Döring, S. K. Mishra, A. Ganguli, and K. D. Scharf. 2001. Arabidopsis and the heat stress transcription factor world: how many heat stress transcription factors do we need? Cell Stress Chaperones 6:177–189.

244.

J. Ogas, S. Kaufmann, J. Henderson, and C. Somerville. 1999. PICKLE is a CHD3 chromatin-remodeling factor that regulates the transition from embryonic to vegetative development in Arabidopsis. Proc. Natl. Acad. Sci. U. S. A 96:13839–13844.

245.

T. J. Oh and G. D. May. 2001. Oligonucleotide-directed plant gene targeting. Curr. Opin. Biotechnol 12:169–172.

246.

N. Ohad, R. Yadegari, L. Margossian, M. Hannon, D. Michaeli, J. J. Harada, R. B. Goldberg, and R. L. Fischer. 1999. Mutations in FIE, a WD polycomb group gene, allow endosperm development without fertilization. Plant Cell 11:407–416.

247.

M. Ohgishi, A. Oka, G. Morelli, I. Ruberti, and T. Aoyama. 2001. Negative autoregulation of the Arabidopsis homeobox gene ATHB-2. Plant J 25:389–398.

248.

R. Ohi, A. Feoktistova, S. McCann, V. Valentine, A. T. Look, J. S. Lipsick, and K. L. Gould. 1998. Myb-related Schizosaccharomyces pombe cdc5p is structurally and functionally conserved in eukaryotes. Mol. Cell. Biol 18:4097–4108.

249.

M. T. Osterlund, C. S. Hardtke, N. Wei, and X. W. Deng. 2000. Targeted destabilization of HY5 during light-regulated development of Arabidopsis. Nature 405:462–466.

250.

T. Oyama, Y. Shimura, and K. Okada. 1997. The Arabidopsis HY5 gene encodes a bZIP protein that regulates stimulus-induced development of root and hypocotyl. Genes Dev 11:2983–2995.

251.

S. Parinov and V. Sundaresan. 2000. Functional genomics in Arabidopsis: large-scale insertional mutagenesis complements the genome sequencing project. Curr. Opin. Biotechnol 11:157–161.

252.

V. Pautot, J. Dockx, O. Hamant, J. Kronenberger, O. Grandjean, D. Jublot, and J. Traas. 2001. KNAT2: Evidence for a link between Knotted-like genes and carpel development. Plant Cell 13:1719–1734.

253.

C. T. Payne, F. Zhang, and A. M. Lloyd. 2000. GL3 encodes a bHLH protein that regulates trichome development in Arabidopsis through interaction with GL1 and TTG1. Genetics 156:1349–1362.

254.

S. Pelaz, G. S. Ditta, E. Baumann, E. Wisman, and M. F. Yanofsky. 2000. B and C floral organ identity functions require SEPALLATA MADS-box genes. Nature 405:200–203.

255.

S. Pelaz, C. Gustafson-Brown, S. E. Kohalmi, W. L. Crosby, and M. F. Yanofsky. 2001a. APETALA1 and SEPALLATA3 interact to promote flower development. Plant J 26:385–394.

256.

S. Pelaz, R. Tapia-Lopez, E. R. Alvarez-Buylla, and M. F. Yanofsky. 2001b. Conversion of leaves into petals in Arabidopsis. Curr. Biol 11:182–184.

257.

J. Peng, D. E. Richards, N. M. Hartley, G. P. Murphy, K. M. Devos, J. E. Flintham, J. Beales, L. J. Fish, A. J. Worland, and F. Pelica. 1999. ‘Green revolution’ genes encode mutant gibberellin response modulators. Nature 400:256–261.

258.

M. Petersen, P. Brodersen, H. Naested, E. Andreasson, U. Lindhart, B. Johansen, H. B. Nielsen, M. Lacy, M. J. Austin, and J. E. Parker. 2000. Arabidopsis MAP kinase 4 negatively regulates systemic acquired resistance. Cell 103:1111–1120.

259.

H. Philippe, A. Germot, and D. Moreira. 2000. The new phylogeny of eukaryotes. Curr. Opin. Genet. Dev 10:596–601.

260.

D. Picard. 2000. Posttranslational regulation of proteins by fusions to steroid-binding domains. Methods Enzymol 327:385–401.

261.

F. B. Pickett and D. R. Meeks-Wagner. 1995. Seeing double: appreciating genetic redundancy. Plant Cell 7:1347–1356.

262.

C. P. Ponting and L. Aravind. 1999. START: a lipid-binding domain in StAR, HD-ZIP and signalling proteins. Trends Biochem. Sci 24:130–132.

263.

P. Puente, N. Wei, and X. W. Deng. 1996. Combinatorial interplay of promoter elements constitutes the minimal determinants for light and developmental control of gene expression in Arabidopsis. EMBO J 15:3732–3743.

264.

J. Putterill, F. Robson, K. J. Lee, R. Simon, and G. Coupland. 1995. The CONSTANS gene of Arabidopsis promotes flowering and encodes a protein showing similarities to zinc finger transcription factors. Cell 80:847–857.

265.

L. D. Pysh, J. W. Wysocka-Diller, C. Camilleri, D. Bouchez, and P. N. Benfey. 1999. The GRAS gene family in Arabidopsis: sequence characterization and basic expression analysis of the SCARECROW-LIKE genes. Plant J 18:111–119.

266.

J. Quackenbush. 2001. Computational analysis of microarray data. Nat. Rev. Genet 2:418–427.

267.

F. Quattrocchio, J. Wing, K. van der Woude, E. Souer, N. de Vetten, J. Mol, and R. Koes. 1999. Molecular analysis of the anthocyanin2 gene of petunia and its role in the evolution of flower color. Plant Cell 11:1433–1444.

268.

L. M. Raamsdonk, B. Teusink, D. Broadhurst, N. Zhang, A. Hayes, M. C. Walsh, J. A. Berden, K. M. Brindle, D. B. Kell, and J. J. Rowland. 2001. A functional genomics strategy that uses metabolome data to reveal the phenotype of silent mutations. Nature Biotechnol 19:45–50.

269.

P. D. Rabinowicz, E. L. Braun, A. D. Wolfe, B. Bowen, and E. Grotewold. 1999. Maize R2R3 Myb genes: Sequence analysis reveals amplification in the higher plants. Genetics 153:427–444.

270.

C. Rachez and L. P. Freedman. 2001. Mediator complexes and transcription. Curr. Opin. Cell. Biol 13:274–280.

271.

D. Raventos, K. Skriver, M. Schlein, K. Karnahl, S. W. Rogers, J. C. Rogers, and J. Mundy. 1998. HRT, a novel zinc finger, transcriptional repressor from barley. J. Biol. Chem 273:23313–23320.

272.

S. Raychaudhuri, P. D. Sutphin, J. T. Chang, and R. B. Altman. 2001. Basic microarray analysis: grouping and feature selection. Trends Biotech 19:189–193.

273.

S. Rea, F. Eisenhaber, D. O'Carroll, B. D. Strahl, Z. W. Sun, M. Schmid, S. Opravil, K. Mechtler, C. P. Ponting, and C. D. Allis. 2000. Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature 406:593–599.

274.

J. W. Reed. 2001. Roles and activities of Aux/IAA proteins in Arabidopsis. Trends Plant Sci 6:420–425.

275.

F. Regad, M. Lebas, and B. Lescure. 1994. Interstitial telomeric repeats within the Arabidopsis thaliana genome. J. Mol. Biol 239:163–169.

276.

J. L. Reid, V. R. Iyer, P. O. Brown, and K. Struhl. 2000. Coordinate regulation of yeast ribosomal protein genes is associated with targeted recruitment of Esa1 histone acetylase. Mol. Cell 6:1297–1307.

277.

B. Ren, F. Robert, J. J. Wyrick, O. Aparicio, E. G. Jennings, I. Simon, J. Zeitlinger, J. Schreiber, N. Hannett, and E. Kanin. 2000. Genome-wide location and function of DNA binding proteins. Science 290:2306–2309.

278.

P. Reymond, H. Weber, M. Damond, and E. E. Farmer. 2000. Differential gene expression in response to mechanical wounding and insect feeding in Arabidopsis. Plant Cell 12:707–720.

279.

J. C. Rice and C. D. Allis. 2001. Histone methylation versus histone acetylation: new insights into epigenetic regulation. Curr. Opin. Cell. Biol 13:263–273.

280.

D. E. Richards, J. Peng, and N. P. Harberd. 2000. Plant GRAS and metazoan STATs: one family? Bioessays 22:573–577.

281.

T. Richmond and S. Somerville. 2000. Chasing the dream: plant EST microarrays. Curr. Opin. Plant. Biol 3:108–116.

282.

J. L. Riechmann, J. Heard, G. Martin, L. Reuber, C. Jiang, J. Keddie, L. Adam, O. Pineda, O. J. Ratcliffe, and R. R. Samaha. 2000. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes. Science 290:2105–2110.

283.

J. L. Riechmann, B. A. Krizek, and E. M. Meyerowitz. 1996a. Dimerization specificity of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA, and AGAMOUS. Proc. Natl. Acad. Sci. U. S. A 93:4793–4798.

284.

J. L. Riechmann and E. M. Meyerowitz. 1997a. Determination of floral organ identity by Arabidopsis MADS domain homeotic proteins AP1, AP3, PI, and AG is independent of their DNA-binding specificity. Mol. Biol. Cell 8:1243–1259.

285.

J. L. Riechmann and E. M. Meyerowitz. 1997b. MADS domain proteins in plant development. Biol. Chem 378:1079–1101.

286.

J. L. Riechmann and E. M. Meyerowitz. 1998. The AP2/EREBP family of plant transcription factors. Biol. Chem 379:633–646.

287.

J. L. Riechmann and O. J. Ratcliffe. 2000. A genomic perspective on plant transcription factors. Curr. Opin. Plant. Biol 3:423–434.

288.

J. L. Riechmann, M. Wang, and E. M. Meyerowitz. 1996b. DNA-binding properties of Arabidopsis MADS domain homeotic proteins APETALA1, APETALA3, PISTILLATA and AGAMOUS. Nucleic Acids Res 24:3134–3141.

289.

K. Robison, A. M. McGuire, and G. M. Church. 1998. A comprehensive library of DNA-binding site matrices for 55 proteins applied to the complete Escherichia coli K-12 genome. J. Mol. Biol 284:241–254.

290.

P. Ross-Macdonald, P. S. Coelho, T. Roemer, S. Agarwal, A. Kumar, R. Jansen, K. H. Cheung, A. Sheehan, D. Symoniatis, and L. Umansky. 1999. Large-scale analysis of the yeast genome by transposon tagging and gene disruption. Nature 402:413–418.

291.

V. Rossi, H. Hartings, and M. Motto. 1998. Identification and characterization of an RPD3 homologue from maize (Zea mays L.) that is able to complement an rpd3 null mutant of Saccharomyces cerevisiae. Mol. Gen. Genet 258:288–296.

292.

L. Rossini, L. Cribb, D. J. Martin, and J. A. Langdale. 2001. The maize golden2 gene defines a novel class of transcriptional regulators in plants. Plant Cell 13:1231–1244.

293.

F. P. Roth, J. D. Hughes, P. W. Estep, and G. M. Church. 1998. Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitation. Nature Biotechnol 16:939–945.

294.

S. Y. Roth, J. M. Denu, and C. D. Allis. 2001. Histone acetyltransferases. Annu. Rev. Biochem 70:81–120.

295.

Y. Ruan, J. Gilmore, and T. Conner. 1998. Towards Arabidopsis genome analysis: monitoring expression profiles of 1400 genes using cDNA microarrays. Plant J 15:821–833.

296.

R. W. Sablowski and E. M. Meyerowitz. 1998. A homolog of NO APICAL MERISTEM is an immediate target of the floral homeotic genes APETALA3/PISTILLATA. Cell 92:93–103.

297.

H. Sakai, T. Honma, T. Aoyama, S. Sato, T. Kato, S. Tabata, and A. Oka. 2001. ARR1, a transcription factor for genes immediately responsive to cytokinins. Science 294:1519–1521.

298.

M. Salanoubat, K. Lemcke, M. Rieger, W. Ansorge, M. Unseld, B. Fartmann, G. Valle, H. Blocker, M. Perez-Alonso, and B. Obermaier. 2000. Sequence and analysis of chromosome 3 of the plant Arabidopsis thaliana. Nature 408:820–822.

299.

H. Salgado, A. Santos-Zavaleta, S. Gama-Castro, D. Millan-Zarate, E. Diaz-Peredo, F. Sanchez-Solano, E. Perez-Rueda, C. Bonavides-Martinez, and J. Collado-Vides. 2001. RegulonDB (version 3.2): transcriptional regulation and operon organization in Escherichia coli K-12. Nucleic Acids Res 29:72–74.

300.

A. Samach, H. Onouchi, S. E. Gold, G. S. Ditta, Z. Schwarz-Sommer, M. F. Yanofsky, and G. Coupland. 2000. Distinct roles of CONSTANS target genes in reproductive development of Arabidopsis. Science 288:1613–1616.

301.

I. Sánchez-García and T. H. Rabbitts. 1994. The LIM domain: a new structural motif found in zinc-finger-like proteins. Trends Genet 10:315–320.

302.

P. SanMiguel, A. Tikhonov, Y. K. Jin, N. Motchoulskaia, D. Zakharov, A. Melake-Berhan, P. S. Springer, K. J. Edwards, M. Lee, and Z. Avramova. 1996. Nested retrotransposons in the intergenic regions of the maize genome. Science 274:765–768.

303.

L. Savard, P. Li, S. H. Strauss, M. W. Chase, M. Michaud, and J. Bousquet. 1994. Chloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plants. Proc. Natl. Acad. Sci. U. S. A 91:5163–5167.

304.

S. Sawant, P. K. Singh, R. Madanala, and R. Tuli. 2001. Designing an artificial expression cassette for the high-level expression of transgenes in plants. Theor. Appl. Genet 102:635–644.

305.

R. Schaffer, J. Landgraf, M. Accerbi, V. V. Simon, M. Larson, and E. Wisman. 2001. Microarray analysis of diurnal and circadian-regulated genes in Arabidopsis. Plant Cell 13:113–123.

306.

L. Schauser, A. Roussis, J. Stiller, and J. Stougaard. 1999. A plant regulator controlling development of symbiotic root nodules. Nature 402:191–195.

307.

M. Schena, D. Shalon, R. W. Davis, and P. O. Brown. 1995. Quantitative monitoring of gene expression patterns with a complementary DNA microarray. Science 270:467–470.

308.

P. M. Schenk, K. Kazan, I. Wilson, J. P. Anderson, T. Richmond, S. C. Somerville, and J. M. Manners. 2000. Coordinated plant defense responses in Arabidopsis revealed by microarray analysis. Proc. Natl. Acad. Sci. U. S. A 97:11655–11660.

309.

M. Scherf, A. Klingenhoff, K. Frech, K. Quandt, R. Schneider, K. Grote, M. Frisch, V. Gailus-Durner, A. Seidel, and R. Brack-Werner. 2001. First pass annotation of promoters on human chromosome 22. Genome Res 11:333–340.

310.

U. Schiefthaler, S. Balasubramanian, P. Sieber, D. Chevalier, E. Wisman, and K. Schneitz. 1999. Molecular analysis of NOZZLE, a gene involved in pattern formation and early sporogenesis during sex organ development in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U. S. A 96:11664–11669.

311.

A. Schulze and J. Downward. 2001. Navigating gene expression using microarrays - a technology review. Nature Cell Biol 3:E190–E195.

312.

B. Schumacher, K. Hofmann, S. Boulton, and A. Gartner. 2001. The C. elegans homolog of the p53 tumor suppressor is required for DNA damage-induced apoptosis. Curr. Biol 11:1722–1727.

313.

M. P. Scott. 2000. Development: the natural history of genes. Cell 100:27–40.

314.

M. Seki, M. Narusaka, H. Abe, M. Kasuga, K. Yamaguchi-Shinozaki, P. Carninci, Y. Hayashizaki, and K. Shinozaki. 2001. Monitoring the expression pattern of 1300 Arabidopsis genes under drought and cold stresses by using a full-length cDNA microarray. Plant Cell 13:61–72.

315.

K. Shah, T. W. J. Gadella, H. van Erp, V. Hecht, and S. C. de Vries. 2001. Subcellular localization and oligomerization of the Arabidopsis thaliana Somatic Embryogenesis Receptor Kinase 1 protein. J. Mol. Biol 309:641–655.

316.

G. Sherlock. 2000. Analysis of large-scale gene expression data. Curr. Opin. Immunol 12:201–205.

317.

L. E. Sieburth and E. M. Meyerowitz. 1997. Molecular dissection of the AGAMOUS control region shows that cis elements for spatial regulation are located intragenically. Plant Cell 9:355–365.

318.

I. Simon, J. Barnett, N. Hannett, C. T. Harbison, N. J. Rinaldi, T. L. Volkert, J. J. Wyrick, J. Zeitlinger, D. K. Gifford, and T. S. Jaakkola. 2001. Serial regulation of transcriptional regulators in the yeast cell cycle. Cell 106:697–708.

319.

R. Simon, M. I. Igeño, and G. Coupland. 1996. Activation of floral meristem identity genes in Arabidopsis. Nature 384:59–62.

320.

T. Singer, C. Yordan, and R. A. Martienssen. 2001. Robertson's Mutator transposons in A. thaliana are regulated by the chromatin-remodeling gene Decrease in DNA Methylation (DDM1). Genes Dev 15:591–602.

321.

K. B. Singh. 1998. Transcriptional regulation in plants: the importance of combinatorial control. Plant Physiol 118:1111–1120.

322.

J. F. Smothers and S. Henikoff. 2000. The HP1 chromo shadow domain binds a consensus sequence pentamer. Curr. Biol 10:27–30.

323.

J. Sommerville. 1999. Activities of cold-shock domain proteins in translation control. Bioessays 21:319–325.

324.

P. T. Spellman, G. Sherlock, M. Q. Zhang, V. R. Iyer, K. Anders, M. B. Eisen, P. O. Brown, D. Botstein, and B. Futcher. 1998. Comprehensive identification of cell cycle-regulated genes of the yeast Saccharomyces cerevisiae by microarray hybridization. Mol. Biol. Cell 9:3273–3297.

325.

C. Spelt, F. Quattrocchio, J. N. Mol, and R. Koes. 2000. anthocyanin1 of petunia encodes a basic helix-loop-helix protein that directly activates transcription of structural anthocyanin genes. Plant Cell 12:1619–1632.

326.

C. Spillane, C. MacDougall, C. Stock, C. Kohler, J. P. Vielle-Calzada, S. M. Nunes, U. Grossniklaus, and J. Goodrich. 2000. Interaction of the Arabidopsis polycomb group proteins FIE and MEA mediates their common phenotypes. Curr. Biol 10:1535–1538.

327.

D. E. Sterner and S. L. Berger. 2000. Acetylation of histones and transcription-related factors. Microbiol. Mol. Biol. Rev 64:435–459.

328.

E. J. Stockinger, Y. Mao, M. K. Regier, S. J. Triezenberg, and M. F. Thomashow. 2001. Transcriptional adaptor and histone acetyltransferase proteins in Arabidopsis and their interactions with CBF1, a transcriptional activator involved in cold-regulated gene expression. Nucleic Acids Res 29:1524–1533.

329.

S. L. Stone, L. W. Kwong, K. M. Yee, J. Pelletier, L. Lepiniec, R. L. Fischer, R. B. Goldberg, and J. J. Harada. 2001. LEAFY COTYLEDON2 encodes a B3 domain transcription factor that induces embryo development. Proc. Natl. Acad. Sci. U. S. A 98:11806–11811.

330.

R. Stracke, M. Werber, and B. Weisshaar. 2001. The R2R3-MYB gene family in Arabidopsis thaliana. Curr. Opin. Plant. Biol 4:447–456.

331.

B. D. Strahl and C. D. Allis. 2000. The language of covalent histone modifications. Nature 403:41–45.

332.

K. Struhl. 1999. Fundamentally different logic of gene regulation in eukaryotes and prokaryotes. Cell 98:1–4.

333.

R. Subramaniam, D. Desveaux, C. Spickler, S. W. Michnick, and N. Brisson. 2001. Direct visualization of protein interactions in plant cells. Nature Biotechnol 19:769–772.

334.

M. E. Svensson, H. Johannesson, and P. Engstrom. 2000. The LAMB1 gene from the clubmoss, Lycopodium annotinum, is a divergent MADS-box gene, expressed specifically in sporogenic structures. Gene 253:31–43.

335.

E. Szathmáry, F. Jordán, and C. Pál. 2001. Can genes explain biological complexity? Science 292:1315–1316.

336.

S. Tabata, T. Kaneko, Y. Nakamura, H. Kotani, T. Kato, E. Asamizu, N. Miyajima, S. Sasamoto, T. Kimura, and T. Hosouchi. 2000. Sequence and analysis of chromosome 5 of the plant Arabidopsis thaliana. Nature 408:823–826.

337.

H. Takatsuji. 1998. Zinc-finger transcription factors in plants. Cell. Mol. Life Sci 54:582–596.

338.

H. Takatsuji. 1999. Zinc-finger proteins: the classical zinc finger emerges in contemporary plant science. Plant Mol. Biol 39:1073–1078.

339.

Y. Takeda, S. Hatano, N. Sentoku, and M. Matsuoka. 1999. Homologs of animal eyes absent (eya) genes are found in higher plants. Mol. Gen. Genet 262:131–138.

340.

W. P. Tansey. 2001. Transcriptional activation: risky business. Genes Dev 15:1045–1050.

341.

D. Tautz. 2000. Evolution of transcriptional regulation. Curr. Opin. Genet. Dev 10:575–579.

342.

R. Tavares, S. Aubourg, A. Lecharny, and M. Kreis. 2000. Organization and structural evolution of four multigene families in Arabidopsis thaliana: AtLCAD, AtLGT, AtMYST, and AtHD-GL2. Plant Mol. Biol 42:703–717.

343.

S. Tavazoie, J. D. Hughes, M. J. Campbell, R. J. Cho, and G. M. Church. 1999. Systematic determination of genetic network architecture. Nature Genet 22:281–285.

344.

J. M. Tepperman, T. Zhu, H. S. Chang, X. Wang, and P. H. Quail. 2001. Multiple transcription-factor genes are early targets of phytochrome A signaling. Proc. Natl. Acad. Sci. U. S. A 98:9437–9442.

345.

C. Thacker, M. A. Marra, A. Jones, D. L. Baillie, and A. M. Rose. 1999. Functional genomics in Caenorhabditis elegans: an approach involving comparisons of sequences from related nematodes. Genome Res 9:348–359.

346.

J. W. Thatcher, J. M. Shaw, and W. J. Dickinson. 1998. Marginal fitness contributions of nonessential genes in yeast. Proc. Natl. Acad. Sci. U. S. A 95:253–257.

347.

The C. elegans Sequencing Consortium. 1998. Genome sequence of the nematode C. elegans: a platform for investigating biology. Science 282:2012–2018.

348.

G. Theissen. 2001. Development of floral organ identity: stories from the MADS house. Curr. Opin. Plant. Biol 4:75–85.

349.

G. Theissen, A. Becker, A. Di Rosa, A. Kanno, J. T. Kim, T. Munster, K. U. Winter, and H. Saedler. 2000. A short history of MADS-box genes in plants. Plant Mol. Biol 42:115–149.

350.

A. Theologis, J. R. Ecker, C. J. Palm, N. A. Federspiel, S. Kaul, O. White, J. Alonso, H. Altafi, R. Araujo, and C. L. Bowman. 2000. Sequence and analysis of chromosome 1 of the plant Arabidopsis thaliana. Nature 408:816–820.

351.

D. Thieffry, A. M. Huerta, E. Pérez-Rueda, and J. Collado-Vides. 1998. From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. Bioessays 20:433–440.

352.

J. H. Thomas. 1993. Thinking about genetic redundancy. Trends Genet 9:395–399.

353.

L. Tian and Z. J. Chen. 2001. Blocking histone deacetylation in Arabidopsis induces pleiotropic effects on plant gene regulation and development. Proc. Natl. Acad. Sci. U. S. A 98:200–205.

354.

R. Tupler, G. Perini, and M. R. Green. 2001. Expressing the human genome. Nature 409:832–833.

355.

P. Uetz, L. Giot, G. Cagney, T. A. Mansfield, R. S. Judson, J. R. Knight, D. Lockshon, V. Narayan, M. Srinivasan, and P. Pochart. 2000. A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403:623–627.

356.

F. D. Urnov and A. P. Wolffe. 2001. Chromatin remodeling and transcriptional activation: the cast (in order of appearance). Oncogene 20:2991–3006.

357.

E. van der Knaap, J. H. Kim, and H. Kende. 2000. A novel gibberellin-induced gene from rice and its potential regulatory role in stem growth. Plant Physiol 122:695–705.

358.

B. van Steensel, J. Delrow, and S. Henikoff. 2001. Chromatin profiling using targeted DNA adenine methyltransferase. Nature Genet 27:304–308.

359.

B. van Steensel and S. Henikoff. 2000. Identification of in vivo DNA targets of chromatin proteins using tethered Dam methyltransferase. Nature Biotechnol 18:424–428.

360.

G. J. C. Veenstra and A. P. Wolffe. 2001. Gene-selective developmental roles of general transcription factors. Trends Biochem. Sci 26:665–671.

361.

J. C. Venter, M. D. Adams, E. W. Myers, P. W. Li, R. J. Mural, G. G. Sutton, H. O. Smith, M. Yandell, C. A. Evans, and R. A. Holt. 2001. The sequence of the human genome. Science 291:1304–1351.

362.

J. Vicente-Carbajosa, S. P. Moose, R. L. Parsons, and R. J. Schmidt. 1997. A maize zinc-finger protein binds the prolamin box in zein gene promoters and interacts with the basic leucine zipper transcriptional activator Opaque2. Proc. Natl. Acad. Sci. U. S. A 94:7685–7690.

363.

M. Vidal. 2001. A biological atlas of functional maps. Cell 104:333–339.

364.

M. Vignali, A. H. Hassan, K. E. Neely, and J. L. Workman. 2000. ATP-dependent chromatin remodeling complexes. Mol. Cell. Biol 20:1899–1910.

365.

T. J. Vision, D. G. Brown, and S. D. Tanksley. 2000. The origins of genomic duplications in Arabidopsis. Science 290:2114–2117.

366.

N. Vo and R. H. Goodman. 2001. CREB-binding protein and p300 in transcriptional regulation. J. Biol. Chem 276:13505–13508.

367.

D. Wagner, R. W. Sablowski, and E. M. Meyerowitz. 1999. Transcriptional activation of APETALA1 by LEAFY. Science 285:582–584.

368.

A. J. Walhout and M. Vidal. 2001. Protein interaction maps for model organisms. Nat. Rev. Mol. Cell Biol 2:55–62.

369.

D. Y. Wang, S. Kumar, and S. B. Hedges. 1999a. Divergence time estimates for the early history of animal phyla and the origin of plants, animals and fungi. Proc. R. Soc. Lond. B Biol. Sci 266:163–171.

370.

H. Wang, L-G. Ma, J-M. Li, H-Y. Zhao, and X. W. Deng. 2001. Direct interaction of Arabidopsis cryptochromes with COP1 in mediation of photomorphogenic development. Science 294:154–158.

371.

R. Wang, K. Guegler, S. T. LaBrie, and N. M. Crawford. 2000. Genomic analysis of a nutrient response in Arabidopsis reveals diverse expression patterns and novel metabolic and potential regulatory genes induced by nitrate. Plant Cell 12:1491–1509.

372.

R. L. Wang, A. Stec, J. Hey, L. Lukens, and J. Doebley. 1999b. The limits of selection during maize domestication. Nature 398:236–239.

373.

W. W. Wasserman, M. Palumbo, W. Thompson, J. W. Fickett, and C. E. Lawrence. 2000. Human-mouse genome comparisons to locate regulatory sites. Nature Genet 26:225–228.

374.

D. Weigel, J. Alvarez, D. R. Smyth, M. F. Yanofsky, and E. M. Meyerowitz. 1992. LEAFY controls floral meristem identity in Arabidopsis. Cell 69:843–859.

375.

R. R. Weigel, C. Bäuscher, A. J. P. Pfitzner, and U. M. Pfitzner. 2001. NIMIN-1, NIMIN-2 and NIMIN-3, members of a novel family of proteins from Arabidopsis that interact with NPR1/NIM1, a key regulator of systemic acquired resistance in plants. Plant Mol. Biol 46:143–160.

376.

S. V. Wesley, C. A. Helliwell, N. A. Smith, M. Wang, D. T. Rouse, Q. Liu, P. S. Gooding, S. P. Singh, D. Abbott, and P. A. Stoutjesdijk. 2001. Construct design for efficient, effective and high-throughput gene silencing in plants. Plant J 27:581–590.

377.

R. J. White. 2001. Gene transcription. Mechanisms and control. (Oxford: Blackwell Science).

378.

M. F. Wilkinson and A-B. Shyu. 2001. Multifunctional regulatory proteins that control gene expression in both the nucleus and the cytoplasm. Bioessays 23:775–787.

379.

A. Windhövel, I. Hein, R. Dabrowa, and J. Stockhaus. 2001. Characterization of a novel class of plant homeodomain proteins that bind to the C4 phosphoenolpyruvate carboxylase gene of Flaveria trinervia. Plant Mol. Biol 45:201–214.

380.

E. A. Winzeler, D. D. Shoemaker, A. Astromoff, H. Liang, K. Anderson, B. Andre, R. Bangham, R. Benito, J. D. Boeke, and H. Bussey. 1999. Functional characterization of the S. cerevisiae genome by gene deletion and parallel analysis. Science 285:901–906.

381.

T. G. Wolfsberg, A. E. Gabrielian, M. J. Campbell, R. J. Cho, J. L. Spouge, and D. Landsman. 1999. Candidate regulatory sequence elements for cell cycle-dependent transcription in Saccharomyces cerevisiae. Genome Res 9:775–792.

382.

F. A. Wright, W. Lemon, W. D. Zhao, R. Sears, D. Zhuo, J-P. Wang, H-Y. Yang, T. Baer, D. Stredney, and J. Spitzner. 2001. A draft annotation and overview of the human genome. Genome Biol. 2.research0025.0021-0025.0018.

383.

K. Wu, K. Malik, L. Tian, D. Brown, and B. Miki. 2000a. Functional analysis of a RPD3 histone deacetylase homologue in Arabidopsis thaliana. Plant Mol. Biol 44:167–176.

384.

K. Wu, L. Tian, K. Malik, D. Brown, and B. Miki. 2000b. Functional analysis of HD2 histone deacetylase homologues in Arabidopsis thaliana. Plant J 22:19–27.

385.

Q. Xie, G. Frugis, D. Colgan, and N. H. Chua. 2000. Arabidopsis NAC1 transduces auxin signal downstream of TIR1 to promote lateral root development. Genes Dev 14:3024–3036.

386.

T. Xu, M. Purcell, P. Zucchi, T. Helentjaris, and L. Bogorad. 2001. TRM1, a YY1-like suppressor of rbcS-m3 expression in maize mesophyll cells. Proc. Natl. Acad. Sci. U. S. A 98:2295–2300.

387.

R. Yadegari, T. Kinoshita, O. Lotan, G. Cohen, A. Katz, Y. Choi, K. Nakashima, J. J. Harada, R. B. Goldberg, and R. L. Fischer. 2000. Mutations in the FIE and MEA genes that encode interacting polycomb proteins cause parent-of-origin effects on seed development by distinct mechanisms. Plant Cell 12:2367–2382.

388.

R. Yamaguchi, M. Nakamura, N. Mochizuki, S. A. Kay, and A. Nagatani. 1999. Light-dependent translocation of a phytochrome B-GFP fusion protein to the nucleus in transgenic Arabidopsis. J. Cell Biol 145:437–445.

389.

S. Yanagisawa and R. J. Schmidt. 1999. Diversity and similarity among recognition sequences of Dof transcription factors. Plant J 17:209–214.

390.

N. Yoshida, Y. Yanai, L. Chen, Y. Kato, J. Hiratsuka, T. Miwa, Z. R. Sung, and S. Takahashi. 2001. EMBRYONIC FLOWER2, a novel polycomb group protein homolog, mediates shoot development and flowering in Arabidopsis. Plant Cell 13:2471–2481.

391.

J. C. Young, P. J. Krysan, and M. R. Sussman. 2001. Efficient screening of Arabidopsis T-DNA insertion lines using degenerate primers. Plant Physiol 125:513–518.

392.

R. A. Young. 2000. Biomedical discovery with DNA arrays. Cell 102:9–15.

393.

D. Yu, C. Chen, and Z. Chen. 2001. Evidence for an important role of WRKY DNA binding proteins in the regulation of NPR1 gene expression. Plant Cell 13:1527–1539.

394.

C. H. Yuh, H. Bolouri, and E. H. Davidson. 1998. Genomic cis-regulatory logic: experimental and computational analysis of a sea urchin gene. Science 279:1896–1902.

395.

C. H. Yuh, H. Bolouri, and E. H. Davidson. 2001. Cis-regulatory logic in the endo16 gene: switching from a specification to a differentiation mode of control. Development 128:617–629.

396.

Y. Zhang, W. Fan, M. Kinkema, X. Li, and X. Dong. 1999. Interaction of NPR1 with basic leucine zipper protein transcription factors that bind sequences required for salicylic acid induction of the PR-1 gene. Proc. Natl. Acad. Sci. U. S. A 96:6523–6528.

397.

H. Zhong, R. McCord, and A. K. Vershon. 1999. Identification of target sites of the α2-Mcm1 repressor complex in the yeast genome. Genome Res 9:1040–1047.

398.

D. X. Zhou. 1999. Regulatory mechanism of plant gene transcription by GT-elements and GT-factors. Trends Plant Sci 4:210–214.

399.

D. X. Zhou, C. Bisanz-Seyer, and R. Mache. 1995. Molecular cloning of a small DNA binding protein with specificity for a tissue-specific negative element within the rps1 promoter. Nucleic Acids Res 23:1165–1169.

400.

J. Zhou, X. Tang, and G. B. Martin. 1997. The Pto kinase conferring resistance to tomato bacterial speck disease interacts with proteins that bind a cis-element of pathogenesis-related genes. EMBO J 16:3207–3218.

401.

J-M. Zhou, Y. Trifa, H. Silva, D. Pontier, E. Lam, J. Shah, and D. F. Klessig. 2000. NPR1 differentially interacts with members of the TGA/OBF family of transcription factors that bind an element of the PR-1 gene required for induction by salicylic acid. Mol. Plant Microbe Interact 13:191–202.

402.

T. Zhu, P. Budworth, B. Han, D. Brown, H-S. Chang, G. Zou, and X. Wang. 2001. Toward elucidating the global gene expression patterns of developing Arabidopsis: parallel analysis of 8300 genes by a high-density oligonucleotide probe array. Plant Physiol. Biochem 39:221–242.

403.

Y. Zhu, J. M. Tepperman, C. D. Fairchild, and P. H. Quail. 2000. Phytochrome B binds with greater apparent affinity than phytochrome A to the basic helix-loop-helix factor PIF3 in a reaction requiring the PAS domain of PIF3. Proc. Natl. Acad. Sci. U. S. A 97:13419–13424.

404.

J. Zuo and N-H. Chua. 2000. Chemical-inducible systems for regulated gene expression of plant genes. Curr. Opin. Biotechnol 11:146–152.

Figure 1.

The Arabidopsis complement of transcription factors. Gene families are represented by circles, whose size is proportional to the number of members in the family. Domains that have been shuffled, and that therefore “connect” different groups of transcription factors are indicated with rectangles, whose size is proportional to the length of the domain. DNA binding domains are colored; other domains (usually protein-protein interaction domains) are shown with hatched patterns. Dashed lines indicate that a given domain is a characteristic of the family or subfamily to which it is connected. Gene names are written in italics. Whereas many of the indicated domain shuffling events are specific to plants, others likely predate the appearance of the three distinct eukaryotic kingdoms (for details, see Riechmann et al., 2000). This figure is an expanded and updated version of Figure 1 in Riechmann et al. (2000).

Figure 2.

Distribution of transcriptional regulators in eukaryotic organisms (A. thaliana, D. melanogaster, C. elegans, and S. cerevisiae). Transcriptional regulators are kingdom-specific, common to plants, animals, and fungi, or present in only two of the three kingdoms. Members of kingdom-specific families represent only 14% of the total in Drosophila because of its extensive use of the C2H2 zinc finger proteins. The data represented in this figure are from Riechmann et al. (2000).

Figure 3.

Content and distribution of transcriptional regulator genes in eukaryotic genomes. For each of the organisms considered (A. thaliana, D. melanogaster, C. elegans, and S. cerevisiae), the different families of transcription factors are ordered according to the number of members that they contain. The 10 largest families in each organism are identified. The names of those families that are specific to one kingdom are shown in color. The data represented in this figure are from Table 1 and from Riechmann et al. (2000). The number of genes in each of the genomes is given as an approximate number (Goffeau et al., 1997; The C. elegans Sequencing Consortium, 1998; Adams et al., 2000; Arabidopsis Genome Initiative, 2000), because the number of genes predicted at the time that a genome is sequenced is always an estimate that is refined over time. The number of genes that code for transcriptional regulators (TRs), and the percentage of the total number of genes that they represent, are indicated. (Zn) indicates a zinc-coordinating DNA binding motif.

Figure 4.

The Arabidopsis Homeobox (HB) and ZF-HB gene families. The Arabidopsis homeobox gene family can be subdivided into different groups according to the combinations of domains that the corresponding proteins contain, and to the phylogenetic analysis of the homeodomain. The number of members in each Arabidopsis homeobox gene subfamily is indicated (except for three genes, whose classification is unclear and that are not represented in this figure). Most of the combinations of a homeodomain with a domain of a different type (leucine zipper, PHD finger, START domain) are the result of domain shuffling events specific to the plant kingdom: those combinations are not found in Drosophila, C. elegans, or yeast homeodomain proteins. The only Arabidopsis homeodomain proteins that have an additional motif also found in animal homeodomain proteins are those of the KNOX class, which contain a MEINOX domain (Bürglin, 1998). Conversely, homeodomains in animals are associated with a large variety of motifs, such as the paired and POU-specific domains (which are themselves specific to animals), the LIM motif, or C2H2 zinc fingers, in combinations that are not present in Arabidopsis (for a similar depiction of the animal homeodomain proteins, see Gehring et al., 1994). The START domain is a lipid-binding domain that could provide regulation of HD-Zip class III protein function by sterols. It is found in a variety of eukaryotic proteins, but has been found associated with transcription factor domains only in this class of plant homeobox genes (Ponting and Aravind, 1999). Proteins of the ZF-HB family contain a homeodomain-related sequence that is more divergent from all the different groups of homeodomain sequences of the HB family than these are among themselves (Windhövel et al., 2001). In addition, ZF-HB proteins contain a plant-specific zinc-coordinating motif (Windhövel et al., 2001).