Identifying pattern-defined regulatory islands in mammalian genomes

Abstract
Identifying cis-regulatory regions in mammalian genomes is a key challenge toward understanding transcriptional regulation. However, identification and functional characterization of those regulatory elements governing differential gene expression has been hampered by the limited understanding of their organization and locations in genomes. We hypothesized that genes that are conserved across species will also display conservation at the level of their transcriptional regulation and that this will be reflected in the organization of cis-elements mediating this regulation. Using a computational approach, clusters of transcription factor binding sites that are absolutely conserved in order and in spacing across human, rat, and mouse genomes were identified. We term these regions pattern-defined regulatory islands (PRIs). We discovered that these sequences are frequently active sites of transcriptional regulation. These PRIs occur in approximately 1.1% of the half-billion base pairs covered in the search and are located mainly in noncoding regions of the genome. We show that the premise of PRIs can be used to identify previously known and novel cis-regulatory regions controlling genes regulated by myogenic differentiation. Thus, PRIs may represent a fundamental property of the architecture of cis-regulatory elements in mammalian genomes, and this feature can be exploited to pinpoint critical transcriptional regulatory elements governing cell type-specific gene expression.

Read More
Cheung TH, Barthel KK, Kwan YL, Liu X.
Identifying pattern-defined regulatory islands in mammalian genomes.
Proceedings of the National Academy of Sciences. 2007; 104(24):10116-21.