ORTHOLOGY: K21809
Help
Entry
K21809 KO
Symbol
E7
Name
Alphapapillomavirus protein E7
Pathway
map05165
Human papillomavirus infection
map05203
Viral carcinogenesis
Network
nt06166
Human papillomavirus (HPV)
nt06516
TNF signaling
nt06517
TLR signaling
Element
N00356
HPV E7 to PP2A-AKT signaling patyway
N00359
HPV E7 to p27-cell cycle G1/S
N00360
HPV E7 to p27-cell cycle G1/S
N00361
HPV E7 to cell cycle G1/S
N00365
HPV E7 to cell cycle G1/S
N00371
HPV E7 to pyruvate generation
N00372
HPV E7 to p300-p21-Cell cycle G1/S
N00375
HPV E7 to TNF-IRF1 signaling pathway
N00376
HPV E7 to TBP1-mediated transcription
N00379
HPV E7 to IFN signaling pathway
Brite
KEGG Orthology (KO) [BR:
ko00001
]
09160 Human Diseases
09161 Cancer: overview
05203 Viral carcinogenesis
K21809 E7; Alphapapillomavirus protein E7
09172 Infectious disease: viral
05165 Human papillomavirus infection
K21809 E7; Alphapapillomavirus protein E7
09180 Brite Hierarchies
09185 Viral protein families
03200 Viral proteins
K21809 E7; Alphapapillomavirus protein E7
Viral proteins [BR:
ko03200
]
dsDNA viruses
Human papillomavirus
K21809 E7; Alphapapillomavirus protein E7
BRITE hierarchy
Genes
VG:
12218717
(E7)
1403313
(E7)
1489007
(E7)
1489079
(E7)
1489089
(E7)
1489226
(E7)
1489369
(E7)
1489376
(E7)
1489425
(E7)
1489432
(E7)
1489465
(E7)
1489473
(E7)
1496947
(E7)
1497433
(E7)
37619470
(E7)
37627439
(CBR63_gp2)
41702610
(E7)
41702622
(E7)
41702906
(E7)
951485
(E7)
Virus taxonomy
UniProt
Reference
PMID:
2836062
Authors
Phelps WC, Yee CL, Munger K, Howley PM
Title
The human papillomavirus type 16 E7 gene encodes transactivation and transformation functions similar to those of adenovirus E1A.
Journal
Cell 53:539-47 (1988)
DOI:
10.1016/0092-8674(88)90570-3
Sequence
[vg:
1489079
]
LinkDB
All DBs
Homo sapiens (human): 3065
Help
Entry
3065 CDS
T01001
Symbol
HDAC1, GON-10, HD1, KDAC1, RPD3, RPD3L1
Name
(RefSeq) histone deacetylase 1
KO
K06067
histone deacetylase 1/2 [EC:
3.5.1.98
]
Organism
hsa
Homo sapiens (human)
Pathway
hsa03082
ATP-dependent chromatin remodeling
hsa03083
Polycomb repressive complex
hsa04110
Cell cycle
hsa04213
Longevity regulating pathway - multiple species
hsa04330
Notch signaling pathway
hsa04350
TGF-beta signaling pathway
hsa04613
Neutrophil extracellular trap formation
hsa04919
Thyroid hormone signaling pathway
hsa05016
Huntington disease
hsa05031
Amphetamine addiction
hsa05034
Alcoholism
hsa05165
Human papillomavirus infection
hsa05169
Epstein-Barr virus infection
hsa05200
Pathways in cancer
hsa05202
Transcriptional misregulation in cancer
hsa05203
Viral carcinogenesis
hsa05206
MicroRNAs in cancer
hsa05220
Chronic myeloid leukemia
Network
nt06166
Human papillomavirus (HPV)
nt06240
Transcription (cancer)
nt06461
Huntington disease
nt06516
TNF signaling
nt06523
Epigenetic regulation by Polycomb complexes
Element
N00118
TEL-AML1 fusion to transcriptional repression
N00375
HPV E7 to TNF-IRF1 signaling pathway
N00980
Mutation-caused aberrant Htt to REST-mediated transcriptional repression
N01614
Activation of PRC2.2 by ubiquitination of H2AK119 in germline genes
Drug target
Abexinostat (
DG01402
):
D10060
D10084
Belinostat:
D08870
<US>
Bocodepsin (
DG03260
):
D12551
D12552
Entinostat:
D09338
Fimepinostat:
D11319
Givinostat (
DG03284
):
D12742
D12743
<US>
Mocetinostat (
DG01404
):
D09357
D09641
Nanatinostat:
D11442
Panobinostat (
DG01403
):
D10019
D10319
Quisinostat (
DG01407
):
D10321
D10322
Remetinostat:
D10977
Romidepsin:
D06637
<JP/US>
Tacedinaline:
D05988
Tinostamustine:
D11182
Tucidinostat:
D10993
<JP>
Vorinostat:
D06320
<JP/US>
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09120 Genetic Information Processing
09126 Chromosome
03082 ATP-dependent chromatin remodeling
3065 (HDAC1)
03083 Polycomb repressive complex
3065 (HDAC1)
09130 Environmental Information Processing
09132 Signal transduction
04330 Notch signaling pathway
3065 (HDAC1)
04350 TGF-beta signaling pathway
3065 (HDAC1)
09140 Cellular Processes
09143 Cell growth and death
04110 Cell cycle
3065 (HDAC1)
09150 Organismal Systems
09151 Immune system
04613 Neutrophil extracellular trap formation
3065 (HDAC1)
09152 Endocrine system
04919 Thyroid hormone signaling pathway
3065 (HDAC1)
09149 Aging
04213 Longevity regulating pathway - multiple species
3065 (HDAC1)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
3065 (HDAC1)
05202 Transcriptional misregulation in cancer
3065 (HDAC1)
05206 MicroRNAs in cancer
3065 (HDAC1)
05203 Viral carcinogenesis
3065 (HDAC1)
09162 Cancer: specific types
05220 Chronic myeloid leukemia
3065 (HDAC1)
09172 Infectious disease: viral
05169 Epstein-Barr virus infection
3065 (HDAC1)
05165 Human papillomavirus infection
3065 (HDAC1)
09164 Neurodegenerative disease
05016 Huntington disease
3065 (HDAC1)
09165 Substance dependence
05031 Amphetamine addiction
3065 (HDAC1)
05034 Alcoholism
3065 (HDAC1)
09180 Brite Hierarchies
09182 Protein families: genetic information processing
03036 Chromosome and associated proteins [BR:
hsa03036
]
3065 (HDAC1)
Enzymes [BR:
hsa01000
]
3. Hydrolases
3.5 Acting on carbon-nitrogen bonds, other than peptide bonds
3.5.1 In linear amides
3.5.1.98 histone deacetylase
3065 (HDAC1)
Chromosome and associated proteins [BR:
hsa03036
]
Eukaryotic type
Histone modification proteins
HDACs (histone deacetylases)
Class I HDACs
3065 (HDAC1)
HDAC complexes
Sin3A-HDAC complex
3065 (HDAC1)
BRAF-HDAC complex
3065 (HDAC1)
REST complex
3065 (HDAC1)
SHIP complex
3065 (HDAC1)
MiDAC complex
3065 (HDAC1)
Polycomb repressive complex (PRC) and associated proteins
Noncanonical PRC1 (PRC1.6)
3065 (HDAC1)
Heterochromatin formation proteins
Other heterochromatin formation proteins
3065 (HDAC1)
Chromatin remodeling factors
NuRD complex
3065 (HDAC1)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Hist_deacetyl
Motif
Other DBs
NCBI-GeneID:
3065
NCBI-ProteinID:
NP_004955
OMIM:
601241
HGNC:
4852
Ensembl:
ENSG00000116478
UniProt:
Q13547
Q6IT96
Structure
PDB
LinkDB
All DBs
Position
1:32292083..32333626
Genome browser
AA seq
482 aa
AA seq
DB search
MAQTQGTRRKVCYYYDGDVGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKAN
AEEMTKYHSDDYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVAS
AVKLNKQQTDIAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHHG
DGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNYPLRDGIDDESYEAI
FKPVMSKVMEMFQPSAVVLQCGSDSLSGDRLGCFNLTIKGHAKCVEFVKSFNLPMLMLGG
GGYTIRNVARCWTYETAVALDTEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTNEYLE
KIKQRLFENLRMLPHAPGVQMQAIPEDAIPEESGDEDEDDPDKRISICSSDKRIACEEEF
SDSEEEGEGGRKNSSNFKKAKRVKTEDEKEKDPEEKKEVTEEEKTKEEKPEAKGVKEEVK
LA
NT seq
1449 nt
NT seq
+upstream
nt +downstream
nt
atggcgcagacgcagggcacccggaggaaagtctgttactactacgacggggatgttgga
aattactattatggacaaggccacccaatgaagcctcaccgaatccgcatgactcataat
ttgctgctcaactatggtctctaccgaaaaatggaaatctatcgccctcacaaagccaat
gctgaggagatgaccaagtaccacagcgatgactacattaaattcttgcgctccatccgt
ccagataacatgtcggagtacagcaagcagatgcagagattcaacgttggtgaggactgt
ccagtattcgatggcctgtttgagttctgtcagttgtctactggtggttctgtggcaagt
gctgtgaaacttaataagcagcagacggacatcgctgtgaattgggctgggggcctgcac
catgcaaagaagtccgaggcatctggcttctgttacgtcaatgatatcgtcttggccatc
ctggaactgctaaagtatcaccagagggtgctgtacattgacattgatattcaccatggt
gacggcgtggaagaggccttctacaccacggaccgggtcatgactgtgtcctttcataag
tatggagagtacttcccaggaactggggacctacgggatatcggggctggcaaaggcaag
tattatgctgttaactacccgctccgagacgggattgatgacgagtcctatgaggccatt
ttcaagccggtcatgtccaaagtaatggagatgttccagcctagtgcggtggtcttacag
tgtggctcagactccctatctggggatcggttaggttgcttcaatctaactatcaaagga
cacgccaagtgtgtggaatttgtcaagagctttaacctgcctatgctgatgctgggaggc
ggtggttacaccattcgtaacgttgcccggtgctggacatatgagacagctgtggccctg
gatacggagatccctaatgagcttccatacaatgactactttgaatactttggaccagat
ttcaagctccacatcagtccttccaatatgactaaccagaacacgaatgagtacctggag
aagatcaaacagcgactgtttgagaaccttagaatgctgccgcacgcacctggggtccaa
atgcaggcgattcctgaggacgccatccctgaggagagtggcgatgaggacgaagacgac
cctgacaagcgcatctcgatctgctcctctgacaaacgaattgcctgtgaggaagagttc
tccgattctgaagaggagggagaggggggccgcaagaactcttccaacttcaaaaaagcc
aagagagtcaaaacagaggatgaaaaagagaaagacccagaggagaagaaagaagtcacc
gaagaggagaaaaccaaggaggagaagccagaagccaaaggggtcaaggaggaggtcaag
ttggcctga
Homo sapiens (human): 3066
Help
Entry
3066 CDS
T01001
Symbol
HDAC2, HD2, KDAC2, RPD3, YAF1
Name
(RefSeq) histone deacetylase 2
KO
K06067
histone deacetylase 1/2 [EC:
3.5.1.98
]
Organism
hsa
Homo sapiens (human)
Pathway
hsa03082
ATP-dependent chromatin remodeling
hsa03083
Polycomb repressive complex
hsa04110
Cell cycle
hsa04213
Longevity regulating pathway - multiple species
hsa04330
Notch signaling pathway
hsa04350
TGF-beta signaling pathway
hsa04613
Neutrophil extracellular trap formation
hsa04919
Thyroid hormone signaling pathway
hsa05016
Huntington disease
hsa05031
Amphetamine addiction
hsa05034
Alcoholism
hsa05165
Human papillomavirus infection
hsa05169
Epstein-Barr virus infection
hsa05200
Pathways in cancer
hsa05202
Transcriptional misregulation in cancer
hsa05203
Viral carcinogenesis
hsa05206
MicroRNAs in cancer
hsa05220
Chronic myeloid leukemia
Network
nt06166
Human papillomavirus (HPV)
nt06240
Transcription (cancer)
nt06461
Huntington disease
nt06516
TNF signaling
nt06523
Epigenetic regulation by Polycomb complexes
Element
N00118
TEL-AML1 fusion to transcriptional repression
N00375
HPV E7 to TNF-IRF1 signaling pathway
N00980
Mutation-caused aberrant Htt to REST-mediated transcriptional repression
N01614
Activation of PRC2.2 by ubiquitination of H2AK119 in germline genes
Drug target
Abexinostat (
DG01402
):
D10060
D10084
Belinostat:
D08870
<US>
Bocodepsin (
DG03260
):
D12551
D12552
Entinostat:
D09338
Fimepinostat:
D11319
Givinostat (
DG03284
):
D12742
D12743
<US>
Mocetinostat (
DG01404
):
D09357
D09641
Nanatinostat:
D11442
Panobinostat (
DG01403
):
D10019
D10319
Quisinostat (
DG01407
):
D10321
D10322
Remetinostat:
D10977
Romidepsin:
D06637
<JP/US>
Tinostamustine:
D11182
Tucidinostat:
D10993
<JP>
Vorinostat:
D06320
<JP/US>
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09120 Genetic Information Processing
09126 Chromosome
03082 ATP-dependent chromatin remodeling
3066 (HDAC2)
03083 Polycomb repressive complex
3066 (HDAC2)
09130 Environmental Information Processing
09132 Signal transduction
04330 Notch signaling pathway
3066 (HDAC2)
04350 TGF-beta signaling pathway
3066 (HDAC2)
09140 Cellular Processes
09143 Cell growth and death
04110 Cell cycle
3066 (HDAC2)
09150 Organismal Systems
09151 Immune system
04613 Neutrophil extracellular trap formation
3066 (HDAC2)
09152 Endocrine system
04919 Thyroid hormone signaling pathway
3066 (HDAC2)
09149 Aging
04213 Longevity regulating pathway - multiple species
3066 (HDAC2)
09160 Human Diseases
09161 Cancer: overview
05200 Pathways in cancer
3066 (HDAC2)
05202 Transcriptional misregulation in cancer
3066 (HDAC2)
05206 MicroRNAs in cancer
3066 (HDAC2)
05203 Viral carcinogenesis
3066 (HDAC2)
09162 Cancer: specific types
05220 Chronic myeloid leukemia
3066 (HDAC2)
09172 Infectious disease: viral
05169 Epstein-Barr virus infection
3066 (HDAC2)
05165 Human papillomavirus infection
3066 (HDAC2)
09164 Neurodegenerative disease
05016 Huntington disease
3066 (HDAC2)
09165 Substance dependence
05031 Amphetamine addiction
3066 (HDAC2)
05034 Alcoholism
3066 (HDAC2)
09180 Brite Hierarchies
09182 Protein families: genetic information processing
03036 Chromosome and associated proteins [BR:
hsa03036
]
3066 (HDAC2)
Enzymes [BR:
hsa01000
]
3. Hydrolases
3.5 Acting on carbon-nitrogen bonds, other than peptide bonds
3.5.1 In linear amides
3.5.1.98 histone deacetylase
3066 (HDAC2)
Chromosome and associated proteins [BR:
hsa03036
]
Eukaryotic type
Histone modification proteins
HDACs (histone deacetylases)
Class I HDACs
3066 (HDAC2)
HDAC complexes
Sin3A-HDAC complex
3066 (HDAC2)
BRAF-HDAC complex
3066 (HDAC2)
REST complex
3066 (HDAC2)
SHIP complex
3066 (HDAC2)
MiDAC complex
3066 (HDAC2)
Polycomb repressive complex (PRC) and associated proteins
Noncanonical PRC1 (PRC1.6)
3066 (HDAC2)
Heterochromatin formation proteins
Other heterochromatin formation proteins
3066 (HDAC2)
Chromatin remodeling factors
NuRD complex
3066 (HDAC2)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
Hist_deacetyl
Motif
Other DBs
NCBI-GeneID:
3066
NCBI-ProteinID:
NP_001518
OMIM:
605164
HGNC:
4853
Ensembl:
ENSG00000196591
UniProt:
Q92769
Structure
PDB
LinkDB
All DBs
Position
6:complement(113933028..113971148)
Genome browser
AA seq
488 aa
AA seq
DB search
MAYSQGGGKKKVCYYYDGDIGNYYYGQGHPMKPHRIRMTHNLLLNYGLYRKMEIYRPHKA
TAEEMTKYHSDEYIKFLRSIRPDNMSEYSKQMQRFNVGEDCPVFDGLFEFCQLSTGGSVA
GAVKLNRQQTDMAVNWAGGLHHAKKSEASGFCYVNDIVLAILELLKYHQRVLYIDIDIHH
GDGVEEAFYTTDRVMTVSFHKYGEYFPGTGDLRDIGAGKGKYYAVNFPMRDGIDDESYGQ
IFKPIISKVMEMYQPSAVVLQCGADSLSGDRLGCFNLTVKGHAKCVEVVKTFNLPLLMLG
GGGYTIRNVARCWTYETAVALDCEIPNELPYNDYFEYFGPDFKLHISPSNMTNQNTPEYM
EKIKQRLFENLRMLPHAPGVQMQAIPEDAVHEDSGDEDGEDPDKRISIRASDKRIACDEE
FSDSEDEGEGGRRNVADHKKGAKKARIEEDKKETEDKKTDVKEEDKSKDNSGEKTDTKGT
KSEQLSNP
NT seq
1467 nt
NT seq
+upstream
nt +downstream
nt
atggcgtacagtcaaggaggcggcaaaaaaaaagtctgctactactacgacggtgatatt
ggaaattattattatggacagggtcatcccatgaagcctcatagaatccgcatgacccat
aacttgctgttaaattatggcttatacagaaaaatggaaatatataggccccataaagcc
actgccgaagaaatgacaaaatatcacagtgatgagtatatcaaatttctacggtcaata
agaccagataacatgtctgagtatagtaagcagatgcagagatttaatgttggagaagat
tgtccagtgtttgatggactctttgagttttgtcagctctcaactggcggttcagttgct
ggagctgtgaagttaaaccgacaacagactgatatggctgttaattgggctggaggatta
catcatgctaagaaatcagaagcatcaggattctgttacgttaatgatattgtgcttgcc
atccttgaattactaaagtatcatcagagagtcttatatattgatatagatattcatcat
ggtgatggtgttgaagaagctttttatacaacagatcgtgtaatgacggtatcattccat
aaatatggggaatactttcctggcacaggagacttgagggatattggtgctggaaaaggc
aaatactatgctgtcaattttccaatgagagatggtatagatgatgagtcatatgggcag
atatttaagcctattatctcaaaggtgatggagatgtatcaacctagtgctgtggtatta
cagtgtggtgcagactcattatctggtgatagactgggttgtttcaatctaacagtcaaa
ggtcatgctaaatgtgtagaagttgtaaaaacttttaacttaccattactgatgcttgga
ggaggtggctacacaatccgtaatgttgctcgatgttggacatatgagactgcagttgcc
cttgattgtgagattcccaatgagttgccatataatgattactttgagtattttggacca
gacttcaaactgcatattagtccttcaaacatgacaaaccagaacactccagaatatatg
gaaaagataaaacagcgtttgtttgaaaatttgcgcatgttacctcatgcacctggtgtc
cagatgcaagctattccagaagatgctgttcatgaagacagtggagatgaagatggagaa
gatccagacaagagaatttctattcgagcatcagacaagcggatagcttgtgatgaagaa
ttctcagattctgaggatgaaggagaaggaggtcgaagaaatgtggctgatcataagaaa
ggagcaaagaaagctagaattgaagaagataagaaagaaacagaggacaaaaaaacagac
gttaaggaagaagataaatccaaggacaacagtggtgaaaaaacagataccaaaggaacc
aaatcagaacagctcagcaacccctga
Homo sapiens (human): 1108
Help
Entry
1108 CDS
T01001
Symbol
CHD4, CHD-4, Mi-2b, Mi2-BETA, SIHIWES
Name
(RefSeq) chromodomain helicase DNA binding protein 4
KO
K11643
chromodomain-helicase-DNA-binding protein 4 [EC:5.6.2.-]
Organism
hsa
Homo sapiens (human)
Pathway
hsa03082
ATP-dependent chromatin remodeling
hsa05165
Human papillomavirus infection
hsa05203
Viral carcinogenesis
Network
nt06166
Human papillomavirus (HPV)
nt06516
TNF signaling
Element
N00375
HPV E7 to TNF-IRF1 signaling pathway
Disease
H02328
Sifrim-Hitz-Weiss syndrome
H02616
Neurodevelopmental disorder with macrocephaly
Brite
KEGG Orthology (KO) [BR:
hsa00001
]
09120 Genetic Information Processing
09126 Chromosome
03082 ATP-dependent chromatin remodeling
1108 (CHD4)
09160 Human Diseases
09161 Cancer: overview
05203 Viral carcinogenesis
1108 (CHD4)
09172 Infectious disease: viral
05165 Human papillomavirus infection
1108 (CHD4)
09180 Brite Hierarchies
09182 Protein families: genetic information processing
03036 Chromosome and associated proteins [BR:
hsa03036
]
1108 (CHD4)
Chromosome and associated proteins [BR:
hsa03036
]
Eukaryotic type
Heterochromatin formation proteins
Other heterochromatin formation proteins
1108 (CHD4)
Chromatin remodeling factors
NuRD complex
1108 (CHD4)
BRITE hierarchy
SSDB
Ortholog
Paralog
Gene cluster
GFIT
Motif
Pfam:
SNF2-rel_dom
CHDCT2
CHDII_SANT-like
Helicase_C
CHDNT
DUF1087
PHD
Chromo
ResIII
HDA2-3
PHD_2
DEAD
zf-PHD-like
Motif
Other DBs
NCBI-GeneID:
1108
NCBI-ProteinID:
NP_001264
OMIM:
603277
HGNC:
1919
Ensembl:
ENSG00000111642
UniProt:
Q14839
Structure
PDB
LinkDB
All DBs
Position
12:complement(6570082..6607379)
Genome browser
AA seq
1912 aa
AA seq
DB search
MASGLGSPSPCSAGSEEEDMDALLNNSLPPPHPENEEDPEEDLSETETPKLKKKKKPKKP
RDPKIPKSKRQKKERMLLCRQLGDSSGEGPEFVEEEEEVALRSDSEGSDYTPGKKKKKKL
GPKKEKKSKSKRKEEEEEEDDDDDSKEPKSSAQLLEDWGMEDIDHVFSEEDYRTLTNYKA
FSQFVRPLIAAKNPKIAVSKMMMVLGAKWREFSTNNPFKGSSGASVAAAAAAAVAVVESM
VTATEVAPPPPPVEVPIRKAKTKEGKGPNARRKPKGSPRVPDAKKPKPKKVAPLKIKLGG
FGSKRKRSSSEDDDLDVESDFDDASINSYSVSDGSTSRSSRSRKKLRTTKKKKKGEEEVT
AVDGYETDHQDYCEVCQQGGEIILCDTCPRAYHMVCLDPDMEKAPEGKWSCPHCEKEGIQ
WEAKEDNSEGEEILEEVGGDLEEEDDHHMEFCRVCKDGGELLCCDTCPSSYHIHCLNPPL
PEIPNGEWLCPRCTCPALKGKVQKILIWKWGQPPSPTPVPRPPDADPNTPSPKPLEGRPE
RQFFVKWQGMSYWHCSWVSELQLELHCQVMFRNYQRKNDMDEPPSGDFGGDEEKSRKRKN
KDPKFAEMEERFYRYGIKPEWMMIHRILNHSVDKKGHVHYLIKWRDLPYDQASWESEDVE
IQDYDLFKQSYWNHRELMRGEEGRPGKKLKKVKLRKLERPPETPTVDPTVKYERQPEYLD
ATGGTLHPYQMEGLNWLRFSWAQGTDTILADEMGLGKTVQTAVFLYSLYKEGHSKGPFLV
SAPLSTIINWEREFEMWAPDMYVVTYVGDKDSRAIIRENEFSFEDNAIRGGKKASRMKKE
ASVKFHVLLTSYELITIDMAILGSIDWACLIVDEAHRLKNNQSKFFRVLNGYSLQHKLLL
TGTPLQNNLEELFHLLNFLTPERFHNLEGFLEEFADIAKEDQIKKLHDMLGPHMLRRLKA
DVFKNMPSKTELIVRVELSPMQKKYYKYILTRNFEALNARGGGNQVSLLNVVMDLKKCCN
HPYLFPVAAMEAPKMPNGMYDGSALIRASGKLLLLQKMLKNLKEGGHRVLIFSQMTKMLD
LLEDFLEHEGYKYERIDGGITGNMRQEAIDRFNAPGAQQFCFLLSTRAGGLGINLATADT
VIIYDSDWNPHNDIQAFSRAHRIGQNKKVMIYRFVTRASVEERITQVAKKKMMLTHLVVR
PGLGSKTGSMSKQELDDILKFGTEELFKDEATDGGGDNKEGEDSSVIHYDDKAIERLLDR
NQDETEDTELQGMNEYLSSFKVAQYVVREEEMGEEEEVEREIIKQEESVDPDYWEKLLRH
HYEQQQEDLARNLGKGKRIRKQVNYNDGSQEDRDWQDDQSDNQSDYSVASEEGDEDFDER
SEAPRRPSRKGLRNDKDKPLPPLLARVGGNIEVLGFNARQRKAFLNAIMRYGMPPQDAFT
TQWLVRDLRGKSEKEFKAYVSLFMRHLCEPGADGAETFADGVPREGLSRQHVLTRIGVMS
LIRKKVQEFEHVNGRWSMPELAEVEENKKMSQPGSPSPKTPTPSTPGDTQPNTPAPVPPA
EDGIKIEENSLKEEESIEGEKEVKSTAPETAIECTQAPAPASEDEKVVVEPPEGEEKVEK
AEVKERTEEPMETEPKGAADVEKVEEKSAIDLTPIVVEDKEEKKEEEEKKEVMLQNGETP
KDLNDEKQKKNIKQRFMFNIADGGFTELHSLWQNEERAATVTKKTYEIWHRRHDYWLLAG
IINHGYARWQDIQNDPRYAILNEPFKGEMNRGNFLEIKNKFLARRFKLLEQALVIEEQLR
RAAYLNMSEDPSHPSMALNTRFAEVECLAESHQHLSKESMAGNKPANAVLHKVLKQLEEL
LSDMKADVTRLPATIARIPPVAVRLQMSERNILSRLANRAPEPTPQQVAQQQ
NT seq
5739 nt
NT seq
+upstream
nt +downstream
nt
atggcgtcgggcctgggctccccgtccccctgctcggcgggcagtgaggaggaggatatg
gatgcacttttgaacaacagcctgcccccaccccacccagaaaatgaagaggacccagaa
gaggatttgtcagaaacagagactccaaagctcaagaagaagaaaaagcctaagaaacct
cgggaccctaaaatccctaagagcaagcgccaaaaaaaggagcgtatgctcttatgccgg
cagctgggggacagctctggggaggggccagagtttgtggaggaggaggaagaggtggct
ctgcgctcagacagtgagggcagcgactatactcctggcaagaagaagaagaagaagctt
ggacctaagaaagagaagaagagcaaatccaagcggaaggaggaggaggaggaggaggat
gatgatgatgattcaaaggagcctaaatcatctgctcagctcctggaagactggggcatg
gaagacattgaccacgtgttctcagaggaggattatcgaaccctcaccaactacaaggcc
ttcagccagtttgtcagacccctcattgctgccaaaaatcccaagattgctgtctccaag
atgatgatggttttgggtgcaaaatggcgggagttcagtaccaataaccccttcaaaggc
agttctggggcatcagtggcagctgcggcagcagcagcggtagctgtggtggagagcatg
gtgacagccactgaggttgcaccaccacctccccctgtggaggtgcctatccgcaaggcc
aagaccaaggagggcaaaggtcccaatgctcggaggaagcccaagggcagccctcgtgta
cctgatgccaagaagcctaaacccaagaaagtagctcccctgaaaatcaagctgggaggt
tttggttccaagcgtaagagatcctcgagtgaggatgatgacttagatgtggaatctgac
ttcgatgatgccagtatcaatagctattctgtttctgatggttccaccagccgtagtagc
cgcagccgcaagaaactccgaaccactaaaaagaaaaagaaaggcgaggaggaggtgact
gctgtggatggttatgagacagaccaccaggactattgcgaggtgtgccagcaaggcggt
gagatcatcctgtgtgatacctgtccccgtgcttaccacatggtctgcctggatcccgac
atggagaaggctcccgagggcaagtggagctgcccacactgcgagaaggaaggcatccag
tgggaagctaaagaggacaattcggagggtgaggagatcctggaagaggttgggggagac
ctcgaagaggaggatgaccaccatatggaattctgtcgggtctgcaaggatggtggggaa
ctgctctgctgtgatacctgtccttcttcctaccacatccactgcctgaatcccccactt
ccagagatccccaacggtgaatggctctgtccccgttgtacgtgtccagctctgaagggc
aaagtgcagaagatcctaatctggaagtggggtcagccaccatctcccacaccagtgcct
cggcctccagatgctgatcccaacacgccctccccaaagcccttggaggggcggccagag
cggcagttctttgtgaaatggcaaggcatgtcttactggcactgctcctgggtttctgaa
ctgcagctggagctgcactgtcaggtgatgttccgaaactatcagcggaagaatgatatg
gatgagccaccttctggggactttggtggtgatgaagagaaaagccgaaagcgaaagaac
aaggaccctaaatttgcagagatggaggaacgcttctatcgctatgggataaaacccgag
tggatgatgatccaccgaatcctcaaccacagtgtggacaagaagggccacgtccactac
ttgatcaagtggcgggacttaccttacgatcaggcttcttgggagagtgaggatgtggag
atccaggattacgacctgttcaagcagagctattggaatcacagggagttaatgaggggt
gaggaaggccgaccaggcaagaagctcaagaaggtgaagcttcggaagttggagaggcct
ccagaaacgccaacagttgatccaacagtgaagtatgagcgacagccagagtacctggat
gctacaggtggaaccctgcacccctatcaaatggagggcctgaattggttgcgcttctcc
tgggctcagggcactgacaccatcttggctgatgagatgggccttgggaaaactgtacag
acagcagtcttcctgtattccctttacaaggagggtcattccaaaggccccttcctagtg
agcgcccctctttctaccatcatcaactgggagcgggagtttgaaatgtgggctccagac
atgtatgtcgtaacctatgtgggtgacaaggacagccgtgccatcatccgagagaatgag
ttctcctttgaagacaatgccattcgtggtggcaagaaggcctcccgcatgaagaaagag
gcatctgtgaaattccatgtgctgctgacatcctatgaattgatcaccattgacatggct
attttgggctctattgattgggcctgcctcatcgtggatgaagcccatcggctgaagaac
aatcagtctaagttcttccgggtattgaatggttactcactccagcacaagctgttgctg
actgggacaccattacaaaacaatctggaagagttgtttcatctgctcaactttctcacc
cccgagaggttccacaatttggaaggttttttggaggagtttgctgacattgccaaggag
gaccagataaaaaaactgcatgacatgctggggccgcacatgttgcggcggctcaaagcc
gatgtgttcaagaacatgccctccaagacagaactaattgtgcgtgtggagctgagccct
atgcagaagaaatactacaagtacatcctcactcgaaattttgaagcactcaatgcccga
ggtggtggcaaccaggtgtctctgctgaatgtggtgatggatcttaagaagtgctgcaac
catccatacctcttccctgtggctgcaatggaagctcctaagatgcctaatggcatgtat
gatggcagtgccctaatcagagcatctgggaaattattgctgctgcagaaaatgctcaag
aaccttaaggagggtgggcatcgtgtactcatcttttcccagatgaccaagatgctagac
ctgctagaggatttcttggaacatgaaggttataaatacgaacgcatcgatggtggaatc
actgggaacatgcggcaagaggccattgaccgcttcaatgcaccgggtgctcagcagttc
tgcttcttgctttccactcgagctgggggccttggaatcaatctggccactgctgacaca
gttattatctatgactctgactggaacccccataatgacattcaggcctttagcagagct
caccggattgggcaaaataaaaaggtaatgatctaccggtttgtgacccgtgcgtcagtg
gaggagcgcatcacgcaggtggcaaagaagaaaatgatgctgacgcatctagtggtgcgg
cctgggctgggctccaagactggatctatgtccaaacaggagcttgatgatatcctcaaa
tttggcactgaggaactattcaaggatgaagccactgatggaggaggagacaacaaagag
ggagaagatagcagtgttatccactacgatgataaggccattgaacggctgctagaccgt
aaccaggatgagactgaagacacagaattgcagggcatgaatgaatatttgagctcattc
aaagtggcccagtatgtggtacgggaagaagaaatgggggaggaagaggaggtagaacgg
gaaatcattaaacaggaagaaagtgtggatcctgactactgggagaaattgctgcggcac
cattatgagcagcagcaagaagatctagcccgaaatctgggcaaaggaaaaagaatccgt
aaacaggtcaactacaatgatggctcccaggaggaccgagattggcaggacgaccagtcc
gacaaccagtccgattactcagtggcttcagaggaaggtgatgaagactttgatgaacgt
tcagaagctccccgtaggcccagtcgtaagggcctgcggaatgataaagataagccattg
cctcctctgttggcccgtgttggtgggaatattgaagtacttggttttaatgctcgtcag
cgaaaagcctttcttaatgcaattatgcgatatggtatgccacctcaggatgcttttact
acccagtggcttgtaagagacctgcgaggcaaatcagagaaagagttcaaggcatatgtc
tctcttttcatgcggcatttatgtgagccgggggcagatggggctgagacctttgctgat
ggtgtcccccgagaaggcctgtctcgccagcatgtccttactagaattggtgttatgtct
ttgattcgcaagaaggttcaggagtttgaacatgttaatgggcgctggagcatgcctgaa
ctggctgaggtggaggaaaacaagaagatgtcccagccagggtcaccctccccaaaaact
cctacaccctccactccaggggacacgcagcccaacactcctgcacctgtcccacctgct
gaagatgggataaaaatagaggaaaatagcctcaaagaagaagagagcatagaaggagaa
aaggaggttaaatctacagcccctgagactgccattgagtgtacacaggcccctgcccct
gcctcagaggatgaaaaggtcgttgttgaaccccctgagggagaggagaaagtggaaaag
gcagaggtgaaggagagaacagaggaacctatggagacagagcccaaaggtgctgctgat
gtagagaaggtggaggaaaagtcagcaatagatctgacccctattgtggtagaagacaaa
gaagagaagaaagaagaagaagagaaaaaagaggtgatgcttcagaatggagagaccccc
aaggacctgaatgatgagaaacagaagaaaaatattaaacaacgtttcatgtttaacatt
gcagatggtggttttactgagttgcactccctttggcagaatgaagagcgggcagccaca
gttaccaagaagacttatgagatctggcatcgacggcatgactactggctgctagccggc
attataaaccatggctatgcccggtggcaagacatccagaatgacccacgctatgccatc
ctcaatgagcctttcaagggtgaaatgaaccgtggcaatttcttagagatcaagaataaa
tttctagctcgaaggtttaagctcttagaacaagctctggtgattgaggaacagctgcgc
cgggctgcttacttgaacatgtcagaagacccttctcacccttccatggccctcaacacc
cgctttgctgaggtggagtgtttggcggaaagtcatcagcacctgtccaaggagtcaatg
gcaggaaacaagccagccaatgcagtcctgcacaaagttctgaaacagctggaagaactg
ctgagtgacatgaaagctgatgtgactcgactcccagctaccattgcccgaattccccca
gttgctgtgaggttacagatgtcagagcgtaacattctcagccgcctggcaaaccgggca
cccgaacctaccccacagcaggtagcccagcagcagtga
DBGET
integrated database retrieval system