We then retrieved the sequence from ???2K upstream to the +1K downstream of the first exon of each gene.We regularly run the computational pipeline (once in 3 months) to query the PubMed, GenBank and other databases for retrieving the new nucleotide sequence records that contain information about experimentally validated promoters and TF-binding sites.