Building a dictionary for genomes: Identification of presumptive regulatory sites by statistical analysis