EPD Home page
EPD Sequence Download Page
This page lets you extract and download promoter sequences defined in EPD from corresponding EMBL sequences.
Sequence download is limited to segments smaller than 16kb in the range between -9999 and 10000 bp relative to the transcription start site. You may select several of the taxomic subsets proposed below. The sequence retrieval operation will be applied to the union of the selected subsets. Checking the box
"Representative sets of not closely related sequences"
causes extraction of a subset of promoters not sharing more than 50% sequence identity among each other. This is useful for statistical analyses where one wants to avoid bias by families of similar sequences. If an EMBL sequence entry does not cover the entire sequence range specified below, the returned sequence will be padded with n's at the beginning or at the end. This ensures that all extracted sequences will have the same length, with the transcription initiation sites being located at the same internal positions.
For download of the complete set of promoters, please use the
ftp site
.(files in FASTA format: [epd**.seq] or in EMBL format: [epd**.blk])
Select subsets:
All promoters (4809) ->Please use the
ftp site
Plant promoters (198)
Chromosomal genes (186)
Zea mays (maize) (21)
Prokaryotic plasmid DNA (8)
Viral genes (4)
Nematode promoters (26)
Arthropode promoters (2000)
Chromosomal genes (1991)
Drosophila melanogaster (fruit fly) (1926)
Transposable elements and retroviruses (5)
Viral genes (5)
Mollusc promoters (3)
Echinoderm promoters (44)
Vertebrate promoters (2540)
Chromosomal genes (2383)
Xenopus laevis (African clawed frog) (28)
Gallus gallus (chicken) (72)
Mus musculus (mouse) (196)
Rattus norvegicus (rat) (119)
Bos taurus (cattle) (24)
Homo sapiens (man) (1871)
Transposable elements and retroviruses (28)
Viral genes (129)
EBV (Human Epstein-Barr virus) (23)
HSV-1 (Human herpes simplex virus type 1) (48)
Preliminary EPD entries:
Oryza sativa (rice) (13046)
All promoters or
Representative set of not closely related sequences
Extract sequence segments
FROM
TO
relative to transcription start site.
Format:
Pearson/Fasta
EMBL
EPD Home page