KEGG   ORTHOLOGY: K21809
Entry
K21809                      KO                                     
Symbol
E7
Name
Alphapapillomavirus protein E7
Pathway
map05165  Human papillomavirus infection
map05203  Viral carcinogenesis
Network
nt06166  Human papillomavirus (HPV)
nt06516  TNF signaling
nt06517  TLR signaling
  Element
N00356  HPV E7 to PP2A-AKT signaling pathway
N00359  HPV E7 to p27-cell cycle G1/S
N00360  HPV E7 to p27-cell cycle G1/S
N00361  HPV E7 to cell cycle G1/S
N00365  HPV E7 to cell cycle G1/S
N00371  HPV E7 to pyruvate generation
N00372  HPV E7 to p300-p21-Cell cycle G1/S
N00375  HPV E7 to TNF-IRF1 signaling pathway
N00376  HPV E7 to TBP1-mediated transcription
N00379  HPV E7 to IFN signaling pathway
Brite
KEGG Orthology (KO) [BR:ko00001]
 09160 Human Diseases
  09161 Cancer: overview
   05203 Viral carcinogenesis
    K21809  E7; Alphapapillomavirus protein E7
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    K21809  E7; Alphapapillomavirus protein E7
 09180 Brite Hierarchies
  09185 Viral protein families
   03200 Viral proteins
    K21809  E7; Alphapapillomavirus protein E7
Viral proteins [BR:ko03200]
 dsDNA viruses
  Human papillomavirus
   K21809  E7; Alphapapillomavirus protein E7
Genes
VG: 12218717(E7) 1403313(E7) 1489007(E7) 1489079(E7) 1489089(E7) 1489226(E7) 1489369(E7) 1489376(E7) 1489425(E7) 1489432(E7) 1489465(E7) 1489473(E7) 1496947(E7) 1497433(E7) 37619470(E7) 37627439(CBR63_gp2) 41702610(E7) 41702622(E7) 41702906(E7) 951485(E7)
Reference
PMID:2836062
  Authors
Phelps WC, Yee CL, Munger K, Howley PM
  Title
The human papillomavirus type 16 E7 gene encodes transactivation and transformation functions similar to those of adenovirus E1A.
  Journal
Cell 53:539-47 (1988)
DOI:10.1016/0092-8674(88)90570-3
  Sequence
[vg:1489079]

KEGG   Homo sapiens (human): 3065
Entry
3065              CDS       T01001                                 
Symbol
HDAC1, GON-10, HD1, KDAC1, RPD3, RPD3L1
Name
(RefSeq) histone deacetylase 1
  KO
K06067  histone deacetylase 1/2 [EC:3.5.1.98]
Organism
hsa  Homo sapiens (human)
Pathway
hsa03082  ATP-dependent chromatin remodeling
hsa03083  Polycomb repressive complex
hsa04110  Cell cycle
hsa04213  Longevity regulating pathway - multiple species
hsa04330  Notch signaling pathway
hsa04350  TGF-beta signaling pathway
hsa04613  Neutrophil extracellular trap formation
hsa04919  Thyroid hormone signaling pathway
hsa05016  Huntington disease
hsa05031  Amphetamine addiction
hsa05034  Alcoholism
hsa05165  Human papillomavirus infection
hsa05169  Epstein-Barr virus infection
hsa05200  Pathways in cancer
hsa05202  Transcriptional misregulation in cancer
hsa05203  Viral carcinogenesis
hsa05206  MicroRNAs in cancer
hsa05220  Chronic myeloid leukemia
Network
nt06166  Human papillomavirus (HPV)
nt06240  Transcription (cancer)
nt06461  Huntington disease
nt06516  TNF signaling
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N00118  TEL-AML1 fusion to transcriptional repression
N00375  HPV E7 to TNF-IRF1 signaling pathway
N00980  Mutation-caused aberrant Htt to REST-mediated transcriptional repression
N01614  Activation of PRC2.2 by ubiquitination of H2AK119 in germline genes
Drug target
Abexinostat (DG01402): D10060 D10084
Belinostat: D08870<US>
Bocodepsin (DG03260): D12551 D12552
Entinostat: D09338
Fimepinostat: D11319
Givinostat (DG03284): D12742 D12743<US>
Mocetinostat (DG01404): D09357 D09641
Nanatinostat: D11442
Panobinostat (DG01403): D10019 D10319
Quisinostat (DG01407): D10321 D10322
Remetinostat: D10977
Romidepsin: D06637<JP/US>
Tacedinaline: D05988
Tinostamustine: D11182
Tucidinostat: D10993<JP>
Vorinostat: D06320<JP/US>
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03082 ATP-dependent chromatin remodeling
    3065 (HDAC1)
   03083 Polycomb repressive complex
    3065 (HDAC1)
 09130 Environmental Information Processing
  09132 Signal transduction
   04330 Notch signaling pathway
    3065 (HDAC1)
   04350 TGF-beta signaling pathway
    3065 (HDAC1)
 09140 Cellular Processes
  09143 Cell growth and death
   04110 Cell cycle
    3065 (HDAC1)
 09150 Organismal Systems
  09151 Immune system
   04613 Neutrophil extracellular trap formation
    3065 (HDAC1)
  09152 Endocrine system
   04919 Thyroid hormone signaling pathway
    3065 (HDAC1)
  09149 Aging
   04213 Longevity regulating pathway - multiple species
    3065 (HDAC1)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    3065 (HDAC1)
   05202 Transcriptional misregulation in cancer
    3065 (HDAC1)
   05206 MicroRNAs in cancer
    3065 (HDAC1)
   05203 Viral carcinogenesis
    3065 (HDAC1)
  09162 Cancer: specific types
   05220 Chronic myeloid leukemia
    3065 (HDAC1)
  09172 Infectious disease: viral
   05169 Epstein-Barr virus infection
    3065 (HDAC1)
   05165 Human papillomavirus infection
    3065 (HDAC1)
  09164 Neurodegenerative disease
   05016 Huntington disease
    3065 (HDAC1)
  09165 Substance dependence
   05031 Amphetamine addiction
    3065 (HDAC1)
   05034 Alcoholism
    3065 (HDAC1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    3065 (HDAC1)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.5  Acting on carbon-nitrogen bonds, other than peptide bonds
   3.5.1  In linear amides
    3.5.1.98  histone deacetylase
     3065 (HDAC1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HDACs (histone deacetylases)
    Class I HDACs
     3065 (HDAC1)
   HDAC complexes
    Sin3A-HDAC complex
     3065 (HDAC1)
    BRAF-HDAC complex
     3065 (HDAC1)
    REST complex
     3065 (HDAC1)
    SHIP complex
     3065 (HDAC1)
    MiDAC complex
     3065 (HDAC1)
   Polycomb repressive complex (PRC) and associated proteins
    Noncanonical PRC1 (PRC1.6)
     3065 (HDAC1)
  Heterochromatin formation proteins
   Other heterochromatin formation proteins
    3065 (HDAC1)
  Chromatin remodeling factors
   NuRD complex
    3065 (HDAC1)
Motif
Pfam: Hist_deacetyl
Other DBs
NCBI-GeneID: 3065
NCBI-ProteinID: NP_004955
OMIM: 601241
HGNC: 4852
Ensembl: ENSG00000116478
UniProt: Q13547 Q6IT96
Position
1:32292083..32333626
AA seq 482 aa
MAQTQGTRRKVCYYYDGDVGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKAN
AEEMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVAS
AVKLNKQQTDIAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHHG
DGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNYPLRDGIDDESYEAI
FKPVMSKVMEMFQPSAVVLQCGSDSLSGDRLGCFNLTIKGHAKCVEFVKSFNLPMLMLGG
GGYTIRNVARCWTYETAVALDTEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTNEYLE
KIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESGDEDEDDPDKRISICSSDKRIACEEEF
SDSEEEGEGGRKNSSNFKKAKRVKTEDEKEKDPEEKKEVTEEEKTKEEKPEAKGVKEEVK
LA
NT seq 1449 nt
atggcgcagacgcagggcacccggaggaaagtctgttactactacgacggggatgttgga
aattactattatggacaaggccacccaatgaagcctcaccgaatccgcatgactcataat
ttgctgctcaactatggtctctaccgaaaaatggaaatctatcgccctcacaaagccaat
gctgaggagatgaccaagtaccacagcgatgactacattaaattcttgcgctccatccgt
ccagataacatgtcggagtacagcaagcagatgcagagattcaacgttggtgaggactgt
ccagtattcgatggcctgtttgagttctgtcagttgtctactggtggttctgtggcaagt
gctgtgaaacttaataagcagcagacggacatcgctgtgaattgggctgggggcctgcac
catgcaaagaagtccgaggcatctggcttctgttacgtcaatgatatcgtcttggccatc
ctggaactgctaaagtatcaccagagggtgctgtacattgacattgatattcaccatggt
gacggcgtggaagaggccttctacaccacggaccgggtcatgactgtgtcctttcataag
tatggagagtacttcccaggaactggggacctacgggatatcggggctggcaaaggcaag
tattatgctgttaactacccgctccgagacgggattgatgacgagtcctatgaggccatt
ttcaagccggtcatgtccaaagtaatggagatgttccagcctagtgcggtggtcttacag
tgtggctcagactccctatctggggatcggttaggttgcttcaatctaactatcaaagga
cacgccaagtgtgtggaatttgtcaagagctttaacctgcctatgctgatgctgggaggc
ggtggttacaccattcgtaacgttgcccggtgctggacatatgagacagctgtggccctg
gatacggagatccctaatgagcttccatacaatgactactttgaatactttggaccagat
ttcaagctccacatcagtccttccaatatgactaaccagaacacgaatgagtacctggag
aagatcaaacagcgactgtttgagaaccttagaatgctgccgcacgcacctggggtccaa
atgcaggcgattcctgaggacgccatccctgaggagagtggcgatgaggacgaagacgac
cctgacaagcgcatctcgatctgctcctctgacaaacgaattgcctgtgaggaagagttc
tccgattctgaagaggagggagaggggggccgcaagaactcttccaacttcaaaaaagcc
aagagagtcaaaacagaggatgaaaaagagaaagacccagaggagaagaaagaagtcacc
gaagaggagaaaaccaaggaggagaagccagaagccaaaggggtcaaggaggaggtcaag
ttggcctga

KEGG   Homo sapiens (human): 3066
Entry
3066              CDS       T01001                                 
Symbol
HDAC2, HD2, KDAC2, RPD3, YAF1
Name
(RefSeq) histone deacetylase 2
  KO
K06067  histone deacetylase 1/2 [EC:3.5.1.98]
Organism
hsa  Homo sapiens (human)
Pathway
hsa03082  ATP-dependent chromatin remodeling
hsa03083  Polycomb repressive complex
hsa04110  Cell cycle
hsa04213  Longevity regulating pathway - multiple species
hsa04330  Notch signaling pathway
hsa04350  TGF-beta signaling pathway
hsa04613  Neutrophil extracellular trap formation
hsa04919  Thyroid hormone signaling pathway
hsa05016  Huntington disease
hsa05031  Amphetamine addiction
hsa05034  Alcoholism
hsa05165  Human papillomavirus infection
hsa05169  Epstein-Barr virus infection
hsa05200  Pathways in cancer
hsa05202  Transcriptional misregulation in cancer
hsa05203  Viral carcinogenesis
hsa05206  MicroRNAs in cancer
hsa05220  Chronic myeloid leukemia
Network
nt06166  Human papillomavirus (HPV)
nt06240  Transcription (cancer)
nt06461  Huntington disease
nt06516  TNF signaling
nt06523  Epigenetic regulation by Polycomb complexes
  Element
N00118  TEL-AML1 fusion to transcriptional repression
N00375  HPV E7 to TNF-IRF1 signaling pathway
N00980  Mutation-caused aberrant Htt to REST-mediated transcriptional repression
N01614  Activation of PRC2.2 by ubiquitination of H2AK119 in germline genes
Drug target
Abexinostat (DG01402): D10060 D10084
Belinostat: D08870<US>
Bocodepsin (DG03260): D12551 D12552
Entinostat: D09338
Fimepinostat: D11319
Givinostat (DG03284): D12742 D12743<US>
Mocetinostat (DG01404): D09357 D09641
Nanatinostat: D11442
Panobinostat (DG01403): D10019 D10319
Quisinostat (DG01407): D10321 D10322
Remetinostat: D10977
Romidepsin: D06637<JP/US>
Tinostamustine: D11182
Tucidinostat: D10993<JP>
Vorinostat: D06320<JP/US>
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03082 ATP-dependent chromatin remodeling
    3066 (HDAC2)
   03083 Polycomb repressive complex
    3066 (HDAC2)
 09130 Environmental Information Processing
  09132 Signal transduction
   04330 Notch signaling pathway
    3066 (HDAC2)
   04350 TGF-beta signaling pathway
    3066 (HDAC2)
 09140 Cellular Processes
  09143 Cell growth and death
   04110 Cell cycle
    3066 (HDAC2)
 09150 Organismal Systems
  09151 Immune system
   04613 Neutrophil extracellular trap formation
    3066 (HDAC2)
  09152 Endocrine system
   04919 Thyroid hormone signaling pathway
    3066 (HDAC2)
  09149 Aging
   04213 Longevity regulating pathway - multiple species
    3066 (HDAC2)
 09160 Human Diseases
  09161 Cancer: overview
   05200 Pathways in cancer
    3066 (HDAC2)
   05202 Transcriptional misregulation in cancer
    3066 (HDAC2)
   05206 MicroRNAs in cancer
    3066 (HDAC2)
   05203 Viral carcinogenesis
    3066 (HDAC2)
  09162 Cancer: specific types
   05220 Chronic myeloid leukemia
    3066 (HDAC2)
  09172 Infectious disease: viral
   05169 Epstein-Barr virus infection
    3066 (HDAC2)
   05165 Human papillomavirus infection
    3066 (HDAC2)
  09164 Neurodegenerative disease
   05016 Huntington disease
    3066 (HDAC2)
  09165 Substance dependence
   05031 Amphetamine addiction
    3066 (HDAC2)
   05034 Alcoholism
    3066 (HDAC2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    3066 (HDAC2)
Enzymes [BR:hsa01000]
 3. Hydrolases
  3.5  Acting on carbon-nitrogen bonds, other than peptide bonds
   3.5.1  In linear amides
    3.5.1.98  histone deacetylase
     3066 (HDAC2)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HDACs (histone deacetylases)
    Class I HDACs
     3066 (HDAC2)
   HDAC complexes
    Sin3A-HDAC complex
     3066 (HDAC2)
    BRAF-HDAC complex
     3066 (HDAC2)
    REST complex
     3066 (HDAC2)
    SHIP complex
     3066 (HDAC2)
    MiDAC complex
     3066 (HDAC2)
   Polycomb repressive complex (PRC) and associated proteins
    Noncanonical PRC1 (PRC1.6)
     3066 (HDAC2)
  Heterochromatin formation proteins
   Other heterochromatin formation proteins
    3066 (HDAC2)
  Chromatin remodeling factors
   NuRD complex
    3066 (HDAC2)
Motif
Pfam: Hist_deacetyl
Other DBs
NCBI-GeneID: 3066
NCBI-ProteinID: NP_001518
OMIM: 605164
HGNC: 4853
Ensembl: ENSG00000196591
UniProt: Q92769
Position
6:complement(113933028..113971148)
AA seq 488 aa
MAYSQGGGKKKVCYYYDGDIGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKA
TAEEMTKYHSDEYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVA
GAVKLNRQQTDMAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHH
GDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNFPMRDGIDDESYGQ
IFKPIISKVMEMYQPSAVVLQCGADSLSGDRLGCFNLTVKGHAKCVEVVKTFNLPLLMLG
GGGYTIRNVARCWTYETAVALDCEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTPEYM
EKIKQRLFENLRMLPHAPGVQMQAIPEDAVHEDSGDEDGEDPDKRISIRASDKRIACDEE
FSDSEDEGEGGRRNVADHKKGAKKARIEEDKKETEDKKTDVKEEDKSKDNSGEKTDTKGT
KSEQLSNP
NT seq 1467 nt
atggcgtacagtcaaggaggcggcaaaaaaaaagtctgctactactacgacggtgatatt
ggaaattattattatggacagggtcatcccatgaagcctcatagaatccgcatgacccat
aacttgctgttaaattatggcttatacagaaaaatggaaatatataggccccataaagcc
actgccgaagaaatgacaaaatatcacagtgatgagtatatcaaatttctacggtcaata
agaccagataacatgtctgagtatagtaagcagatgcagagatttaatgttggagaagat
tgtccagtgtttgatggactctttgagttttgtcagctctcaactggcggttcagttgct
ggagctgtgaagttaaaccgacaacagactgatatggctgttaattgggctggaggatta
catcatgctaagaaatcagaagcatcaggattctgttacgttaatgatattgtgcttgcc
atccttgaattactaaagtatcatcagagagtcttatatattgatatagatattcatcat
ggtgatggtgttgaagaagctttttatacaacagatcgtgtaatgacggtatcattccat
aaatatggggaatactttcctggcacaggagacttgagggatattggtgctggaaaaggc
aaatactatgctgtcaattttccaatgagagatggtatagatgatgagtcatatgggcag
atatttaagcctattatctcaaaggtgatggagatgtatcaacctagtgctgtggtatta
cagtgtggtgcagactcattatctggtgatagactgggttgtttcaatctaacagtcaaa
ggtcatgctaaatgtgtagaagttgtaaaaacttttaacttaccattactgatgcttgga
ggaggtggctacacaatccgtaatgttgctcgatgttggacatatgagactgcagttgcc
cttgattgtgagattcccaatgagttgccatataatgattactttgagtattttggacca
gacttcaaactgcatattagtccttcaaacatgacaaaccagaacactccagaatatatg
gaaaagataaaacagcgtttgtttgaaaatttgcgcatgttacctcatgcacctggtgtc
cagatgcaagctattccagaagatgctgttcatgaagacagtggagatgaagatggagaa
gatccagacaagagaatttctattcgagcatcagacaagcggatagcttgtgatgaagaa
ttctcagattctgaggatgaaggagaaggaggtcgaagaaatgtggctgatcataagaaa
ggagcaaagaaagctagaattgaagaagataagaaagaaacagaggacaaaaaaacagac
gttaaggaagaagataaatccaaggacaacagtggtgaaaaaacagataccaaaggaacc
aaatcagaacagctcagcaacccctga

KEGG   Homo sapiens (human): 1108
Entry
1108              CDS       T01001                                 
Symbol
CHD4, CHD-4, Mi-2b, Mi2-BETA, SIHIWES
Name
(RefSeq) chromodomain helicase DNA binding protein 4
  KO
K11643  chromodomain-helicase-DNA-binding protein 4 [EC:5.6.2.-]
Organism
hsa  Homo sapiens (human)
Pathway
hsa03082  ATP-dependent chromatin remodeling
hsa05165  Human papillomavirus infection
hsa05203  Viral carcinogenesis
Network
nt06166  Human papillomavirus (HPV)
nt06516  TNF signaling
  Element
N00375  HPV E7 to TNF-IRF1 signaling pathway
Disease
H02328  Sifrim-Hitz-Weiss syndrome
H02616  Neurodevelopmental disorder with macrocephaly
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03082 ATP-dependent chromatin remodeling
    1108 (CHD4)
 09160 Human Diseases
  09161 Cancer: overview
   05203 Viral carcinogenesis
    1108 (CHD4)
  09172 Infectious disease: viral
   05165 Human papillomavirus infection
    1108 (CHD4)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    1108 (CHD4)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Heterochromatin formation proteins
   Other heterochromatin formation proteins
    1108 (CHD4)
  Chromatin remodeling factors
   NuRD complex
    1108 (CHD4)
Motif
Pfam: SNF2-rel_dom CHDCT2 CHDII_SANT-like Helicase_C CHDNT DUF1087 PHD Chromo ResIII HDA2-3 PHD_2 DEAD zf-PHD-like
Other DBs
NCBI-GeneID: 1108
NCBI-ProteinID: NP_001264
OMIM: 603277
HGNC: 1919
Ensembl: ENSG00000111642
UniProt: Q14839
Position
12:complement(6570082..6607379)
AA seq 1912 aa
MASGLGSPSPCSAGSEEEDMDALLNNSLPPPHPENEEDPEEDLSETETPKLKKKKKPKKP
RDPKIPKSKRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALRSDSEGSDYTPGKKKKKKL
GPKKEKKSKSKRKEEEEEEDDDDDSKEPKSSAQLLEDWGMEDIDHVFSEEDYRTLTNYKA
FSQFVRPLIAAKNPKIAVSKMMMVLGAKWREFSTNNPFKGSSGASVAAAAAAAVAVVESM
VTATEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKGSPRVPDAKKPKPKKVAPLKIKLGG
FGSKRKRSSSEDDDLDVESDFDDASINSYSVSDGSTSRSSRSRKKLRTTKKKKKGEEEVT
AVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQ
WEAKEDNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPL
PEIPNGEWLCPRCTCPALKGKVQKILIWKWGQPPSPTPVPRPPDADPNTPSPKPLEGRPE
RQFFVKWQGMSYWHCSWVSELQLELHCQVMFRNYQRKNDMDEPPSGDFGGDEEKSRKRKN
KDPKFAEMEERFYRYGIKPEWMMIHRILNHSVDKKGHVHYLIKWRDLPYDQASWESEDVE
IQDYDLFKQSYWNHRELMRGEEGRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLD
ATGGTLHPYQMEGLNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLV
SAPLSTIINWEREFEMWAPDMYVVTYVGDKDSRAIIRENEFSFEDNAIRGGKKASRMKKE
ASVKFHVLLTSYELITIDMAILGSIDWACLIVDEAHRLKNNQSKFFRVLNGYSLQHKLLL
TGTPLQNNLEELFHLLNFLTPERFHNLEGFLEEFADIAKEDQIKKLHDMLGPHMLRRLKA
DVFKNMPSKTELIVRVELSPMQKKYYKYILTRNFEALNARGGGNQVSLLNVVMDLKKCCN
HPYLFPVAAMEAPKMPNGMYDGSALIRASGKLLLLQKMLKNLKEGGHRVLIFSQMTKMLD
LLEDFLEHEGYKYERIDGGITGNMRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADT
VIIYDSDWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKKKMMLTHLVVR
PGLGSKTGSMSKQELDDILKFGTEELFKDEATDGGGDNKEGEDSSVIHYDDKAIERLLDR
NQDETEDTELQGMNEYLSSFKVAQYVVREEEMGEEEEVEREIIKQEESVDPDYWEKLLRH
HYEQQQEDLARNLGKGKRIRKQVNYNDGSQEDRDWQDDQSDNQSDYSVASEEGDEDFDER
SEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNARQRKAFLNAIMRYGMPPQDAFT
TQWLVRDLRGKSEKEFKAYVSLFMRHLCEPGADGAETFADGVPREGLSRQHVLTRIGVMS
LIRKKVQEFEHVNGRWSMPELAEVEENKKMSQPGSPSPKTPTPSTPGDTQPNTPAPVPPA
EDGIKIEENSLKEEESIEGEKEVKSTAPETAIECTQAPAPASEDEKVVVEPPEGEEKVEK
AEVKERTEEPMETEPKGAADVEKVEEKSAIDLTPIVVEDKEEKKEEEEKKEVMLQNGETP
KDLNDEKQKKNIKQRFMFNIADGGFTELHSLWQNEERAATVTKKTYEIWHRRHDYWLLAG
IINHGYARWQDIQNDPRYAILNEPFKGEMNRGNFLEIKNKFLARRFKLLEQALVIEEQLR
RAAYLNMSEDPSHPSMALNTRFAEVECLAESHQHLSKESMAGNKPANAVLHKVLKQLEEL
LSDMKADVTRLPATIARIPPVAVRLQMSERNILSRLANRAPEPTPQQVAQQQ
NT seq 5739 nt
atggcgtcgggcctgggctccccgtccccctgctcggcgggcagtgaggaggaggatatg
gatgcacttttgaacaacagcctgcccccaccccacccagaaaatgaagaggacccagaa
gaggatttgtcagaaacagagactccaaagctcaagaagaagaaaaagcctaagaaacct
cgggaccctaaaatccctaagagcaagcgccaaaaaaaggagcgtatgctcttatgccgg
cagctgggggacagctctggggaggggccagagtttgtggaggaggaggaagaggtggct
ctgcgctcagacagtgagggcagcgactatactcctggcaagaagaagaagaagaagctt
ggacctaagaaagagaagaagagcaaatccaagcggaaggaggaggaggaggaggaggat
gatgatgatgattcaaaggagcctaaatcatctgctcagctcctggaagactggggcatg
gaagacattgaccacgtgttctcagaggaggattatcgaaccctcaccaactacaaggcc
ttcagccagtttgtcagacccctcattgctgccaaaaatcccaagattgctgtctccaag
atgatgatggttttgggtgcaaaatggcgggagttcagtaccaataaccccttcaaaggc
agttctggggcatcagtggcagctgcggcagcagcagcggtagctgtggtggagagcatg
gtgacagccactgaggttgcaccaccacctccccctgtggaggtgcctatccgcaaggcc
aagaccaaggagggcaaaggtcccaatgctcggaggaagcccaagggcagccctcgtgta
cctgatgccaagaagcctaaacccaagaaagtagctcccctgaaaatcaagctgggaggt
tttggttccaagcgtaagagatcctcgagtgaggatgatgacttagatgtggaatctgac
ttcgatgatgccagtatcaatagctattctgtttctgatggttccaccagccgtagtagc
cgcagccgcaagaaactccgaaccactaaaaagaaaaagaaaggcgaggaggaggtgact
gctgtggatggttatgagacagaccaccaggactattgcgaggtgtgccagcaaggcggt
gagatcatcctgtgtgatacctgtccccgtgcttaccacatggtctgcctggatcccgac
atggagaaggctcccgagggcaagtggagctgcccacactgcgagaaggaaggcatccag
tgggaagctaaagaggacaattcggagggtgaggagatcctggaagaggttgggggagac
ctcgaagaggaggatgaccaccatatggaattctgtcgggtctgcaaggatggtggggaa
ctgctctgctgtgatacctgtccttcttcctaccacatccactgcctgaatcccccactt
ccagagatccccaacggtgaatggctctgtccccgttgtacgtgtccagctctgaagggc
aaagtgcagaagatcctaatctggaagtggggtcagccaccatctcccacaccagtgcct
cggcctccagatgctgatcccaacacgccctccccaaagcccttggaggggcggccagag
cggcagttctttgtgaaatggcaaggcatgtcttactggcactgctcctgggtttctgaa
ctgcagctggagctgcactgtcaggtgatgttccgaaactatcagcggaagaatgatatg
gatgagccaccttctggggactttggtggtgatgaagagaaaagccgaaagcgaaagaac
aaggaccctaaatttgcagagatggaggaacgcttctatcgctatgggataaaacccgag
tggatgatgatccaccgaatcctcaaccacagtgtggacaagaagggccacgtccactac
ttgatcaagtggcgggacttaccttacgatcaggcttcttgggagagtgaggatgtggag
atccaggattacgacctgttcaagcagagctattggaatcacagggagttaatgaggggt
gaggaaggccgaccaggcaagaagctcaagaaggtgaagcttcggaagttggagaggcct
ccagaaacgccaacagttgatccaacagtgaagtatgagcgacagccagagtacctggat
gctacaggtggaaccctgcacccctatcaaatggagggcctgaattggttgcgcttctcc
tgggctcagggcactgacaccatcttggctgatgagatgggccttgggaaaactgtacag
acagcagtcttcctgtattccctttacaaggagggtcattccaaaggccccttcctagtg
agcgcccctctttctaccatcatcaactgggagcgggagtttgaaatgtgggctccagac
atgtatgtcgtaacctatgtgggtgacaaggacagccgtgccatcatccgagagaatgag
ttctcctttgaagacaatgccattcgtggtggcaagaaggcctcccgcatgaagaaagag
gcatctgtgaaattccatgtgctgctgacatcctatgaattgatcaccattgacatggct
attttgggctctattgattgggcctgcctcatcgtggatgaagcccatcggctgaagaac
aatcagtctaagttcttccgggtattgaatggttactcactccagcacaagctgttgctg
actgggacaccattacaaaacaatctggaagagttgtttcatctgctcaactttctcacc
cccgagaggttccacaatttggaaggttttttggaggagtttgctgacattgccaaggag
gaccagataaaaaaactgcatgacatgctggggccgcacatgttgcggcggctcaaagcc
gatgtgttcaagaacatgccctccaagacagaactaattgtgcgtgtggagctgagccct
atgcagaagaaatactacaagtacatcctcactcgaaattttgaagcactcaatgcccga
ggtggtggcaaccaggtgtctctgctgaatgtggtgatggatcttaagaagtgctgcaac
catccatacctcttccctgtggctgcaatggaagctcctaagatgcctaatggcatgtat
gatggcagtgccctaatcagagcatctgggaaattattgctgctgcagaaaatgctcaag
aaccttaaggagggtgggcatcgtgtactcatcttttcccagatgaccaagatgctagac
ctgctagaggatttcttggaacatgaaggttataaatacgaacgcatcgatggtggaatc
actgggaacatgcggcaagaggccattgaccgcttcaatgcaccgggtgctcagcagttc
tgcttcttgctttccactcgagctgggggccttggaatcaatctggccactgctgacaca
gttattatctatgactctgactggaacccccataatgacattcaggcctttagcagagct
caccggattgggcaaaataaaaaggtaatgatctaccggtttgtgacccgtgcgtcagtg
gaggagcgcatcacgcaggtggcaaagaagaaaatgatgctgacgcatctagtggtgcgg
cctgggctgggctccaagactggatctatgtccaaacaggagcttgatgatatcctcaaa
tttggcactgaggaactattcaaggatgaagccactgatggaggaggagacaacaaagag
ggagaagatagcagtgttatccactacgatgataaggccattgaacggctgctagaccgt
aaccaggatgagactgaagacacagaattgcagggcatgaatgaatatttgagctcattc
aaagtggcccagtatgtggtacgggaagaagaaatgggggaggaagaggaggtagaacgg
gaaatcattaaacaggaagaaagtgtggatcctgactactgggagaaattgctgcggcac
cattatgagcagcagcaagaagatctagcccgaaatctgggcaaaggaaaaagaatccgt
aaacaggtcaactacaatgatggctcccaggaggaccgagattggcaggacgaccagtcc
gacaaccagtccgattactcagtggcttcagaggaaggtgatgaagactttgatgaacgt
tcagaagctccccgtaggcccagtcgtaagggcctgcggaatgataaagataagccattg
cctcctctgttggcccgtgttggtgggaatattgaagtacttggttttaatgctcgtcag
cgaaaagcctttcttaatgcaattatgcgatatggtatgccacctcaggatgcttttact
acccagtggcttgtaagagacctgcgaggcaaatcagagaaagagttcaaggcatatgtc
tctcttttcatgcggcatttatgtgagccgggggcagatggggctgagacctttgctgat
ggtgtcccccgagaaggcctgtctcgccagcatgtccttactagaattggtgttatgtct
ttgattcgcaagaaggttcaggagtttgaacatgttaatgggcgctggagcatgcctgaa
ctggctgaggtggaggaaaacaagaagatgtcccagccagggtcaccctccccaaaaact
cctacaccctccactccaggggacacgcagcccaacactcctgcacctgtcccacctgct
gaagatgggataaaaatagaggaaaatagcctcaaagaagaagagagcatagaaggagaa
aaggaggttaaatctacagcccctgagactgccattgagtgtacacaggcccctgcccct
gcctcagaggatgaaaaggtcgttgttgaaccccctgagggagaggagaaagtggaaaag
gcagaggtgaaggagagaacagaggaacctatggagacagagcccaaaggtgctgctgat
gtagagaaggtggaggaaaagtcagcaatagatctgacccctattgtggtagaagacaaa
gaagagaagaaagaagaagaagagaaaaaagaggtgatgcttcagaatggagagaccccc
aaggacctgaatgatgagaaacagaagaaaaatattaaacaacgtttcatgtttaacatt
gcagatggtggttttactgagttgcactccctttggcagaatgaagagcgggcagccaca
gttaccaagaagacttatgagatctggcatcgacggcatgactactggctgctagccggc
attataaaccatggctatgcccggtggcaagacatccagaatgacccacgctatgccatc
ctcaatgagcctttcaagggtgaaatgaaccgtggcaatttcttagagatcaagaataaa
tttctagctcgaaggtttaagctcttagaacaagctctggtgattgaggaacagctgcgc
cgggctgcttacttgaacatgtcagaagacccttctcacccttccatggccctcaacacc
cgctttgctgaggtggagtgtttggcggaaagtcatcagcacctgtccaaggagtcaatg
gcaggaaacaagccagccaatgcagtcctgcacaaagttctgaaacagctggaagaactg
ctgagtgacatgaaagctgatgtgactcgactcccagctaccattgcccgaattccccca
gttgctgtgaggttacagatgtcagagcgtaacattctcagccgcctggcaaaccgggca
cccgaacctaccccacagcaggtagcccagcagcagtga
