Description
i-cisTarget
enables:
- to dissect a set of co-expressed genes into direct target genes of different transcription factor motifs.
- to detect transcription factor motifs in a set of ChIP peaks.
- to detect overrepresented in vivo features (iVF), e.g. histone modifications, for gene signatures and ChIP peaks. These iVFs help to improve motif discovery and candidate target gene prediction.
Some of the key features of i-cisTarget are:
- over-represented motifs are predicted in the set of co-expressed genes, using entire intergenic and intronic sequences
- 12 Drosophila species are used
- For motif scoring, the Cluster-Buster algorithm is used; kindly provided by Martin Frith.
- For cross-species comparisons, the cluster-buster scores of orthologous regions are used independently (no requirement of base pair alignment of D. melanogaster motifs), as described previously here.
- For significant motifs determines the optimal subset of genes that are predicted as direct targets
- For the predicted direct targets, i-cisTarget presents the predicted enhancer and binding site locations in the UCSC Genome Browser.
- 6383 position weight matrices (PWM) are used, from various sources such as Transfac, Jaspar, FlyFactorSurvey, and motif collections from Stark et al., Elemento et al., and Down et al.
- 48 features from the BDTNP project.
- 455 features from the modENCODE project.
- 33 ChIP based features from Eileen Furlong's lab (Zinzen et al., Nature (2009)).
- 2 * 30 chromatin states (derived from the S2 and the BG3 cell lines) taken from modENCODE Consortium et al., Science (2010).
Citations
- Stein Aerts, Xiao-Jiang Quan, Annelies Claeys, Marina Naval Sanchez, Phillip Tate, Jiekun Yan, and Bassem Hassan Robust target gene discovery through transcriptome perturbations and genome-wide enhancer predictions in Drosophila uncovers a regulatory basis for sensory specification PLoS Biology 2010 Jul 27;8(7):e1000435.
- Delphine Potier, Zeynep Kalender Atak, Marina Naval Sanchez, Carl Herrmann, and Stein Aerts Using cisTargetX to Predict Transcriptional Targets and Networks in Drosophila Methods Mol Biol 2012; 786, 291–314.
Supported input formats
Report for a single gene signature
Gene signatures must be supplied as a list of gene identifiers, separated by newline characters.The following IDs are supported:
| FBgn numbers | CG numbers | Symbols |
FBgn0063210 |
CG1004 |
pros |
Batch support for multiple gene signatures
The GMT file format is a tab delimited file format that describes multiple gene sets. In the GMT format each row represents a gene set. Each gene set is described by a name, a description, and the genes in the gene set. These fields are separated by a TAB character; the gene identifiers need to be separated by a semicolon, colon or again a TAB character.
Signature1 Description CG13728,barr,pho,Trl,gkt,abd-A,lab,chm,bab1,Pc,bab2,Pcl,Gint3,Dfd |
ChIP peaks
ChIP peaks must be supplied as BED file entries, specifying the locations of these peaks in the Drosophila genome. FASTA file input is not supported.An example of a BED file:
chr2L 11224674 11228405 FBgn0000011 |
Examples
| Genes expressed in proneural clusters (Analysis based on extended collection of 4238 motifs). The PPV and TPR values are calculated using a set of known proneural CRMs from REDFly. |
Reeves & Posakony, Dev. Cell (2005) | Report |
| MEF2 ChIP peaks | Sandmann et al., Dev Cell (2006) | Report | FlyBase Termlink sets | Report |
Cases
Example sets used in Aerts et al., PLoS Biology 2010
- Dorsal downstream genes (Stathopoulos et al 2002) results
- Ato downstream genes (GOF), upregulated genes results
- Ato downstream genes (LOF), downregulated genes results
- Sens downstream genes (GOF), downregulated genes results
- Sens downstream genes (GOF), upregulated genes results
- Genes expressed in proneural clusters (Reeves & Posakony, Dev. Cell 2005) results
- Serpent GOF (Stramer et al 2008) results
- Ey GOF (Ato independent) (Ostrin et al 2006) results
- Biniou LOF + ChIP (Jakobsen et al 2007) results
- MEF2 downstream genes from LOF microarray data (Sandmann et al 2006) result
- Pointed GOF (Pointed Leading Edge PLE) (Philippakis et al 2006) results
- GO:0007350 (blastoderm segmentation) results
Example sets used in Naval Sanchez et al., in preparation
- Top 75 genes of cross species order statistics ranking results
- Top 100 genes of cross species order statistics ranking results
- Top 245 genes of cross species order statistics ranking results
- Top 545 genes of cross species order statistics ranking results