KEGG   ORTHOLOGY: K19444
Entry
K19444                      KO                                     
Symbol
UL48
Name
Simplexvirus tegument protein VP16
Pathway
map05168  Herpes simplex virus 1 infection
Network
nt06168  Herpes simplex virus 1 (HSV-1)
  Element
N00588  HSV VP16 to Oct-1-mediated transcription
Brite
KEGG Orthology (KO) [BR:ko00001]
 09160 Human Diseases
  09172 Infectious disease: viral
   05168 Herpes simplex virus 1 infection
    K19444  UL48; Simplexvirus tegument protein VP16
 09180 Brite Hierarchies
  09185 Viral protein families
   03200 Viral proteins
    K19444  UL48; Simplexvirus tegument protein VP16
Viral proteins [BR:ko03200]
 dsDNA viruses
  Human herpes simplex virus 1/2
   K19444  UL48; Simplexvirus tegument protein VP16
Genes
VG: 1487335(UL48) 1487443(UL48) 18533940(UL48) 19621693(UL48) 24271473(UL48) 3190305(UL48) 3850231(UL48) 9829309(UL48)
Reference
PMID:8139019
  Authors
Smibert CA, Popova B, Xiao P, Capone JP, Smiley JR
  Title
Herpes simplex virus VP16 forms a complex with the virion host shutoff protein vhs.
  Journal
J Virol 68:2339-46 (1994)
DOI:10.1128/JVI.68.4.2339-2346.1994

KEGG   Homo sapiens (human): 3054
Entry
3054              CDS       T01001                                 
Symbol
HCFC1, CFF, HCF, HCF-1, HCF1, HFC1, MAHCX, MRX3, PPP1R89, VCAF, XLID3
Name
(RefSeq) host cell factor C1
  KO
K14966  host cell factor 1
Organism
hsa  Homo sapiens (human)
Pathway
hsa03083  Polycomb repressive complex
hsa04980  Cobalamin transport and metabolism
hsa05168  Herpes simplex virus 1 infection
Network
nt06168  Herpes simplex virus 1 (HSV-1)
nt06523  Epigenetic regulation by Polycomb complexes
nt06538  Cobalamin transport and metabolism
  Element
N00588  HSV VP16 to Oct-1-mediated transcription
N01585  Deubiquitination of H2AK119
N01810  Regulation of MMACHC expression
Disease
H00174  Methylmalonic aciduria
H00480  X-linked intellectual developmental disorder
H02222  Methylmalonic acidemia and hyperhomocysteinemia, cblX type
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09120 Genetic Information Processing
  09126 Chromosome
   03083 Polycomb repressive complex
    3054 (HCFC1)
 09150 Organismal Systems
  09154 Digestive system
   04980 Cobalamin transport and metabolism
    3054 (HCFC1)
 09160 Human Diseases
  09172 Infectious disease: viral
   05168 Herpes simplex virus 1 infection
    3054 (HCFC1)
 09180 Brite Hierarchies
  09181 Protein families: metabolism
   01009 Protein phosphatases and associated proteins [BR:hsa01009]
    3054 (HCFC1)
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    3054 (HCFC1)
   03029 Mitochondrial biogenesis [BR:hsa03029]
    3054 (HCFC1)
Protein phosphatases and associated proteins [BR:hsa01009]
 Protein serine/threonine phosphatases
  Phosphoprotein phosphatases (PPPs)
   Protein phosphatase-1
    PP1-interacting proteins (PIPs)
     3054 (HCFC1)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HAT complexes
    NSL complex
     3054 (HCFC1)
   HMT complexes
    COMPASS/SET1 complex
     3054 (HCFC1)
    MLL-HCF complex
     3054 (HCFC1)
   Polycomb repressive complex (PRC) and associated proteins
    PR-DUB complex
     3054 (HCFC1)
Mitochondrial biogenesis [BR:hsa03029]
 Mitochondrial quality control factors
  Regulator of mitochondrial biogenesis
   Other regulator of mitochondrial biogenesis
    3054 (HCFC1)
Motif
Pfam: Kelch_5 Kelch_3 Kelch_1 Kelch_4 Kelch_6 Kelch_2 fn3
Other DBs
NCBI-GeneID: 3054
NCBI-ProteinID: NP_005325
OMIM: 300019
HGNC: 4839
Ensembl: ENSG00000172534
UniProt: P51610
Position
X:complement(153947557..153971818)
AA seq 2035 aa
MASAVSPANLPAVLLQPRWKRVVGWSGPVPRPRHGHRAVAIKELIVVFGGGNEGIVDELH
VYNTATNQWFIPAVRGDIPPGCAAYGFVCDGTRLLVFGGMVEYGKYSNDLYELQASRWEW
KRLKAKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNIPRYLNDLYILELRP
GSGVVAWDIPITYGVLPPPRESHTAVVYTEKDNKKSKLVIYGGMSGCRLGDLWTLDIDTL
TWNKPSLSGVAPLPRSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTNTLACLN
LDTMAWETILMDTLEDNIPRARAGHCAVAINTRLYIWSGRDGYRKAWNNQVCCKDLWYLE
TEKPPPPARVQLVRANTNSLEVSWGAVATADSYLLQLQKYDIPATAATATSPTPNPVPSV
PANPPKSPAPAAAAPAVQPLTQVGITLLPQAAPAPPTTTTIQVLPTVPGSSISVPTAART
QGVPAVLKVTGPQATTGTPLVTMRPASQAGKAPVTVTSLPAGVRMVVPTQSAQGTVIGSS
PQMSGMAALAAAAAATQKIPPSSAPTVLSVPAGTTIVKTMAVTPGTTTLPATVKVASSPV
MVSNPATRMLKTAAAQVGTSVSSATNTSTRPIITVHKSGTVTVAQQAQVVTTVVGGVTKT
ITLVKSPISVPGGSALISNLGKVMSVVQTKPVQTSAVTGQASTGPVTQIIQTKGPLPAGT
ILKLVTSADGKPTTIITTTQASGAGTKPTILGISSVSPSTTKPGTTTIIKTIPMSAIITQ
AGATGVTSSPGIKSPITIITTKVMTSGTGAPAKIITAVPKIATGHGQQGVTQVVLKGAPG
QPGTILRTVPMGGVRLVTPVTVSAVKPAVTTLVVKGTTGVTTLGTVTGTVSTSLAGAGGH
STSASLATPITTLGTIATLSSQVINPTAITVSAAQTTLTAAGGLTTPTITMQPVSQPTQV
TLITAPSGVEAQPVHDLPVSILASPTTEQPTATVTIADSGQGDVQPGTVTLVCSNPPCET
HETGTTNTATTTVVANLGGHPQPTQVQFVCDRQEAAASLVTSTVGQQNGSVVRVCSNPPC
ETHETGTTNTATTATSNMAGQHGCSNPPCETHETGTTNTATTAMSSVGANHQRDARRACA
AGTPAVIRISVATGALEAAQGSKSQCQTRQTSATSTTMTVMATGAPCSAGPLLGPSMARE
PGGRSPAFVQLAPLSSKVRLSSPSIKDLPAGRHSHAVSTAAMTRSSVGAGEPRMAPVCES
LQGGSPSTTVTVTALEALLCPSATVTQVCSNPPCETHETGTTNTATTSNAGSAQRVCSNP
PCETHETGTTHTATTATSNGGTGQPEGGQQPPAGRPCETHQTTSTGTTMSVSVGALLPDA
TSSHRTVESGLEVAAAPSVTPQAGTALLAPFPTQRVCSNPPCETHETGTTHTATTVTSNM
SSNQDPPPAASDQGEVESTQGDSVNITSSSAITTTVSSTLTRAVTTVTQSTPVPGPSVPP
PEELQVSPGPRQQLPPRQLLQSASTALMGESAEVLSASQTPELPAAVDLSSTGEPSSGQE
SAGSAVVATVVVQPPPPTQSEVDQLSLPQELMAEAQAGTTTLMVTGLTPEELAVTAAAEA
AAQAAATEEAQALAIQAVLQAAQQAVMGTGEPMDTSEAAATVTQAELGHLSAEGQEGQAT
TIPIVLTQQELAALVQQQQLQEAQAQQQHHHLPTEALAPADSLNDPAIESNCLNELAGTV
PSTVALLPSTATESLAPSNTFVAPQPVVVASPAKLQAAATLTEVANGIESLGVKPDLPPP
PSKAPMKKENQWFDVGVIKGTNVMVTHYFLPPDDAVPSDDDLGTVPDYNQLKKQELQPGT
AYKFRVAGINACGRGPFSEISAFKTCLPGFPGAPCAIKISKSPDGAHLTWEPPSVTSGKI
IEYSVYLAIQSSQAGGELKSSTPAQLAFMRVYCGPSPSCLVQSSSLSNAHIDYTTKPAII
FRIAARNEKGYGPATQVRWLQETSKDSSGTKPANKRPMSSPEMKSAPKKSKADGQ
NT seq 6108 nt
atggcttcggccgtgtcgcccgccaacttgccagcggtgcttctgcagccccgctggaag
cgagtggtgggctggtcgggtccggtgccacggccccgccacggccaccgcgccgtggcc
atcaaggagctcatcgtggtgtttggcggcggcaacgagggaatagtggacgaactgcac
gtgtacaacacggcaaccaaccagtggttcatcccagccgtgaggggggacattccccct
gggtgtgcagcctatggcttcgtgtgtgacgggactcgcctcctggtgtttggtgggatg
gtggagtatgggaaatacagcaatgacctctacgaactccaggcgagccggtgggagtgg
aagagactcaaagcaaagacgcccaaaaacgggccccctccgtgtcctcgactcgggcac
agcttctcccttgtgggcaacaaatgctacctgtttgggggtctggccaatgatagcgag
gacccaaagaacaacattccaaggtacctgaatgacttatatatcctggaattacggcca
ggctctggagtggtagcctgggacattcccatcacttacggggtcctaccaccaccccgg
gagtcacatactgccgtggtctacaccgaaaaagacaataagaagtccaagctggtgatc
tacggcgggatgagtggctgcaggctgggggacctgtggaccctagatattgacaccctg
acgtggaataagcccagtctcagcggggtggcgcctcttcctcgcagtctccactcggca
accaccatcggaaataaaatgtacgtgtttggtggctgggtgcctctcgtcatggatgac
gtcaaagtggccacacacgagaaggagtggaagtgtaccaacacgctggcttgtctcaac
ctggataccatggcctgggagaccatcctgatggatacactggaggacaacatcccccgt
gctcgggctggccactgcgcagtcgccatcaacacccgcctgtacatttggagtgggcgt
gacggctaccgcaaggcctggaacaaccaggtctgctgcaaggacctctggtacctagag
acagaaaagccaccacccccagcccgagtacaactggtacgcgccaacaccaactccctg
gaggtgagctggggggcagtggcaacagccgacagctaccttctccagctccagaaatat
gacattcctgccacggctgctactgccacctcccctacacccaatccggtcccatctgtg
cctgccaaccctcccaagagccctgccccagcagcagccgcacctgctgtgcagccgctg
acccaagtaggcatcacgctcctgccccaggctgcccccgcacccccgaccaccaccacc
atccaggtcttgccaacggtgcctggcagctccatttctgtgcccaccgcagccaggact
caaggtgtccctgctgttctcaaagtgaccggtcctcaggctacaacaggaactccattg
gtcaccatgcgacctgccagccaggctgggaaagcccctgtcaccgtgacctcccttccc
gccggagtgcggatggttgtgccaacacagagtgcccagggaacggtgattggcagtagc
ccacagatgagtgggatggccgcactggccgctgcggccgctgccacccagaagatcccc
ccttcctcggcacccacggtgctgagtgtcccagcgggtaccaccatcgtgaagaccatg
gctgtgacacctggcactaccaccctcccagccactgtgaaggtggcctcctcgccagtc
atggtgagcaaccctgccactcgcatgctgaagactgcagccgcccaggtggggacatcg
gtttcctccgccaccaacacgtctacccgccctatcatcacagtgcacaagtcaggcact
gtgacagtggcccagcaagcccaggtggtgaccacagttgtgggcggggtcaccaagacc
atcaccctggtgaagagccccatctctgtcccaggaggcagtgctctgatttccaatctg
ggcaaagtgatgtcggtggtccagaccaaaccagttcagacttcagcagtcacaggccag
gcgtccacgggtcctgtgactcagatcatccagaccaaagggcccctgccagcgggaaca
atcctgaagctggtgacctcagcagatggcaagcccaccaccatcatcactaccacgcag
gccagtggggcggggaccaagcccaccatcctgggcatcagcagcgtctcccccagtacc
accaagcccggcacgaccaccatcatcaaaaccatccccatgtcggccatcatcacccag
gcgggcgccacgggtgtgaccagcagtcctggcatcaagtcccccatcaccatcatcacc
accaaggtgatgacttcaggaactggagcacctgcgaaaatcatcactgctgtccccaaa
attgccactggccacgggcagcagggagtgacccaggtggtgcttaagggggccccggga
cagccaggcaccatcctccgcactgtgcccatggggggtgttcgcctggtcacacccgtc
accgtctccgccgtcaagccagccgtcaccacgttggttgtgaaaggcaccacaggtgtc
acgaccctaggcacagtgacaggcaccgtctccaccagccttgccggggcggggggccac
agcactagtgcttccctggccacgcccatcaccaccttgggcaccattgccaccctctca
agccaggtgatcaaccccactgccatcactgtgtcggccgcacagaccacgctgacagcg
gcaggcgggctcacaaccccaaccatcaccatgcagcccgtgtcccagcccacccaggta
actctgatcacggcacctagtggggtggaggcccagcctgtgcatgacctccctgtgtcc
attctggcctccccgactacagaacagcccaccgccacagttaccatcgccgactcaggc
cagggtgatgtgcagcctggcactgtcaccttggtgtgctccaacccaccctgtgagacc
cacgagactggcaccaccaacacggccaccactactgttgtggctaaccttgggggacac
ccccagcccacccaagtgcagttcgtctgtgacagacaggaggcagctgcttctcttgtg
acctcgactgtgggccagcagaatggtagcgtggtccgagtctgttcgaacccgccctgc
gagacccacgagacgggcaccaccaacaccgccaccaccgccacctccaacatggccggg
cagcatggctgctcaaacccaccctgcgagacccacgagacgggcaccaccaacactgcc
actacagccatgtcgagcgtcggcgccaaccaccagcgagatgcccgtcgggcctgtgca
gctggcacccctgccgtgatccggatcagtgtggccactggggcgctggaggcagcccag
ggctctaagtcccagtgccaaacccgccagaccagcgcgaccagcaccaccatgactgtg
atggccaccggggccccgtgctcggccggcccactccttgggccgagcatggcacgggag
cccgggggccgcagccctgcttttgtgcagttggcccctctgagcagcaaagtcaggctg
agcagcccaagcattaaggaccttcctgcggggcgccacagccatgcggtcagcaccgct
gccatgacccgttccagcgtgggtgctggggagccccgcatggcacctgtgtgcgagagc
ctccagggtggctcgcccagcaccacagtgactgtgacagccctggaggcactgctgtgc
ccctcggccaccgtgacccaagtctgctccaacccaccatgtgagacccacgagacaggc
accaccaacaccgccactacctcgaatgcaggcagcgcccagagggtgtgctccaacccg
ccatgcgagacccacgagacgggcaccacccacacggccaccaccgctacttcaaacggg
ggcacgggccagcccgagggtgggcagcagccccctgctggtcgcccctgtgagacacac
cagaccacttccactggcaccaccatgtcggtcagcgtgggtgccctgcttcccgacgcc
acttcttcccacaggaccgtggagtctggcctagaggtggcggcggcacccagcgtcacc
ccccaggctggcaccgcgctgctggctcctttcccaacacagagggtgtgctccaacccc
ccctgtgagacccacgagacgggcaccactcacacggccaccactgtcacttccaacatg
agttcaaaccaagaccccccacctgctgccagcgatcagggagaggtggagagcacccag
ggcgacagcgtgaacatcaccagctccagtgccatcacgacaaccgtgtcctccacactg
acgcgggctgtgaccaccgtgacgcagtccacaccggtcccgggcccctctgtgccgccc
ccagaggaactccaggtgtcgccaggtcctcgccagcagctgccgccacggcagcttctg
cagtcggcttccacagccctgatgggggagtccgccgaggtcctgtcagcctcccagacc
cctgagctcccggccgccgtggatctgagcagcacaggggagccatcttcgggccaggag
tctgccggctctgcggtggtggccactgtggtggtccagccacccccacccacacagtcc
gaagtagaccagttatcacttccccaagagctaatggccgaggcccaagctggcaccacc
accctcatggtaacggggctcacccccgaggagctggcagtgacggctgctgcagaagca
gctgcccaggccgcagccacggaggaagcccaggccctggccatccaggcggtgctccag
gccgcgcagcaggccgtcatgggcaccggcgagcccatggacacctccgaggcagcagca
accgtgactcaggcggagctggggcacctgtcggccgagggtcaggagggccaggccacc
accatacccattgtgctgacacagcaggagctggctgccctggtgcagcagcagcagctg
caggaggcccaggcccagcagcagcatcaccacctccccactgaggccctggcccctgcc
gacagtctcaacgacccagccattgagagcaattgcctcaatgagctggccggcacggtc
cccagcactgtggcgctgctgccctcaacggccactgagagcctggctccatccaacaca
tttgtggccccccagccggttgtggtggccagcccagccaagctgcaggctgcagctacc
ctgaccgaagtggccaatggcatcgagtccctgggtgtgaagccagacctgccgccccca
cccagcaaagcccccatgaagaaggaaaaccagtggtttgatgtgggagtcattaagggc
accaatgtaatggtgacacactatttcctgccaccagatgatgctgtcccatcagacgat
gatttgggcaccgtccctgactataaccagctgaagaagcaggagctgcagccaggcaca
gcctataagtttcgtgttgccggaatcaatgcctgtggccgggggcccttcagcgaaatc
tcagcctttaagacgtgcctgcctggtttcccaggggccccttgtgccattaaaatcagc
aaaagtccggatggtgctcacctcacctgggagccaccctctgtgacctccggcaagatt
atcgagtactccgtgtacctggccatccagagctcacaggctgggggcgagctcaagagc
tccaccccggcccagctggccttcatgcgggtgtactgcgggcccagcccctcctgcctg
gtgcagtcctccagcctttccaacgcccacatcgactacaccaccaagcccgccatcatc
ttccgcatcgccgcccgcaatgagaagggctatggcccggccacacaagtgaggtggctg
caggaaaccagtaaagacagctctggcaccaagccagccaacaagcggcccatgtcctct
ccagaaatgaaatctgctccaaagaaatctaaggccgatggtcagtga

KEGG   Homo sapiens (human): 29915
Entry
29915             CDS       T01001                                 
Symbol
HCFC2, HCF-2, HCF2
Name
(RefSeq) host cell factor C2
  KO
K27390  host cell factor 2
Organism
hsa  Homo sapiens (human)
Pathway
hsa05168  Herpes simplex virus 1 infection
Network
nt06168  Herpes simplex virus 1 (HSV-1)
  Element
N00588  HSV VP16 to Oct-1-mediated transcription
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09172 Infectious disease: viral
   05168 Herpes simplex virus 1 infection
    29915 (HCFC2)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03036 Chromosome and associated proteins [BR:hsa03036]
    29915 (HCFC2)
   03029 Mitochondrial biogenesis [BR:hsa03029]
    29915 (HCFC2)
Chromosome and associated proteins [BR:hsa03036]
 Eukaryotic type
  Histone modification proteins
   HMT complexes
    MLL-HCF complex
     29915 (HCFC2)
Mitochondrial biogenesis [BR:hsa03029]
 Mitochondrial quality control factors
  Regulator of mitochondrial biogenesis
   Other regulator of mitochondrial biogenesis
    29915 (HCFC2)
Motif
Pfam: Kelch_5 Kelch_3 Kelch_1 Kelch_4 Kelch_6 Kelch_2 fn3 DUF5944
Other DBs
NCBI-GeneID: 29915
NCBI-ProteinID: NP_037452
OMIM: 607926
HGNC: 24972
Ensembl: ENSG00000111727
UniProt: Q9Y5Z7
Position
12:104064531..104106524
AA seq 792 aa
MAAPSLLNWRRVSSFTGPVPRARHGHRAVAIRELMIIFGGGNEGIADELHVYNTATNQWF
LPAVRGDIPPGCAAHGFVCDGTRILVFGGMVEYGRYSNELYELQASRWLWKKVKPHPPPS
GLPPCPRLGHSFSLYGNKCYLFGGLANESEDSNNNVPRYLNDFYELELQHGSGVVGWSIP
VTKGVVPSPRESHTAVIYCKKDSGSPKMYVFGGMCGARLDDLWQLDLETMSWSKPETKGT
VPLPRSLHTASVIGNKMYIFGGWVPHKGENTETSPHDCEWRCTSSFSYLNLDTTEWTTLV
SDSQEDKKNSRPRPRAGHCAVAIGTRLYFWSGRDGYKKALNSQVCCKDLWYLDTEKPPAP
SQVQLIKATTNSFHVKWDEVSTVEGYLLQLSTDLPYQAASSDSSAAPNMQGVRMDPHRQG
SNNIVPNSINDTINSTKTEQPATKETSMKNKPDFKALTDSNAILYPSLASNASNHNSHVV
DMLRKNEGPHTSANVGVLSSCLDVRTVIPETSVSSTVSSTQTMVTQQTIKTESSSTNGAV
VKDETSLTTFSTKSEVDETYALPATKISRVETHATATPFSKETPSNPVATVKAGERQWCD
VGIFKNNTALVSQFYLLPKGKQSISKVGNADVPDYSLLKKQDLVPGTGYRFRVAAINGCG
IGPFSKISEFKTCIPGFPGAPSAVRISKNVEGIHLSWEPPTSPSGNILEYSAYLAIRTAQ
IQDNPSQLVFMRIYCGLKTSCIVTAGQLANAHIDYTSRPAIVFRISAKNEKGYGPATQVR
WLQGNNKKAPLN
NT seq 2379 nt
atggcggctcccagcctcctcaactggaggcgagtttcttccttcacggggccggtcccc
cgcgcccggcacggacaccgagcggtggccatccgggagctgatgatcatctttggaggg
ggaaatgagggcatcgcggatgagctgcacgtctacaacacggctacgaatcagtggttt
ctgccagctgttagaggagatatccctccaggctgtgctgcccatggatttgtctgtgat
ggtaccagaatattagtatttgggggaatggttgaatatggaagatacagcaatgagtta
tatgagttacaagcaagtcgttggttatggaaaaaagtgaaaccccatccccctccttct
ggtttacctccttgtcctcggcttggacatagcttctctttatatggtaacaaatgctat
ttgtttggtggcctggcaaacgaaagcgaagattcaaacaataatgttcccagatattta
aatgatttttatgagttggagctacagcatggctctggtgttgtgggttggagcattcca
gtgactaaaggggttgtgccttctccaagagaatcccacacagctgttatatattgcaaa
aaagattctggaagtcctaaaatgtatgtttttggtggaatgtgtggtgctcgcctggat
gacctatggcagcttgacttagaaactatgtcatggtcaaaaccagaaactaaagggaca
gtgccacttccacgaagccttcatacagccagtgttataggaaacaagatgtacattttt
ggtggatgggtcccacataagggggaaaatactgagacttcacctcatgattgtgaatgg
agatgtaccagttcattttcttacctaaatctggatacaacagagtggaccaccctagta
tcagattctcaggaagataaaaaaaattcaagaccaagaccaagagctggccactgtgct
gttgcaatcggcactcgattgtatttttggagtggaagagatggctacaaaaaagcactg
aatagtcaagtttgctgcaaggatctttggtatcttgatactgagaaaccaccggcacca
tctcaagtacagctgatcaaagccactaccaactcctttcatgtcaagtgggatgaagtg
tctacagttgagggctatcttttgcagttgagtacagacttgccataccaagctgcatca
tcagattcttcagcagcaccaaatatgcaaggagtcaggatggaccctcacagacaaggc
agtaataacatcgttcctaacagtatcaatgatacaataaacagcacaaaaactgaacag
ccagccacaaaagaaacttcaatgaaaaacaaaccagactttaaagcactgacggattct
aatgccattttatatccatctttggcatcaaatgcttctaatcataatagtcatgtggtg
gatatgctaaggaaaaatgaaggtcctcacacttcagcaaatgtaggtgttctaagtagt
tgcctggatgtaagaacagtaattcctgaaacatctgtatccagtactgtttccagcaca
caaactatggtaacccagcagaccattaaaactgaatcatccagtacaaatggggcagtt
gttaaagatgaaacttcactaacaacattcagtaccaaatctgaagttgatgaaacatat
gcactgcctgcaacgaagatcagccgtgtagagacacatgctacagcaacgccgttttct
aaagagactccttcaaatccagtggccacagtgaaagcgggagaacgacaatggtgtgat
gtgggaatttttaaaaataatacagctttggtgagccagttttatttgctgccaaaaggg
aagcaaagcatctcaaaggtaggaaatgcagatgtacctgactacagcttgcttaagaaa
caagatcttgttccaggcacaggatacagattcagggttgctgcaatcaatggttgtggg
ataggtcctttcagcaaaatcagtgaatttaaaacttgtattcctggttttcctggagct
ccttctgcagtcagaatttcaaagaatgttgaaggtatccacctttcctgggaacctcca
acctcaccttctggaaatattttggaatattcagcctacttggctatccgcacagcacag
atacaagataatccaagtcaacttgtgttcatgaggatttattgtggtcttaagacatca
tgtatagtaactgctgggcaacttgcaaatgcacatattgattatacatccaggcctgcc
attgtgttcaggatatcagcaaagaatgaaaagggatatggaccagctacacaagttcgg
tggcttcaaggtaacaataagaaagcacctttaaattga

KEGG   Homo sapiens (human): 5451
Entry
5451              CDS       T01001                                 
Symbol
POU2F1, OCT1, OTF1, Oct1Z, oct-1B
Name
(RefSeq) POU class 2 homeobox 1
  KO
K09364  POU domain transcription factor, class 2
Organism
hsa  Homo sapiens (human)
Pathway
hsa05168  Herpes simplex virus 1 infection
hsa05417  Lipid and atherosclerosis
Network
nt06168  Herpes simplex virus 1 (HSV-1)
  Element
N00588  HSV VP16 to Oct-1-mediated transcription
Brite
KEGG Orthology (KO) [BR:hsa00001]
 09160 Human Diseases
  09172 Infectious disease: viral
   05168 Herpes simplex virus 1 infection
    5451 (POU2F1)
  09166 Cardiovascular disease
   05417 Lipid and atherosclerosis
    5451 (POU2F1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03000 Transcription factors [BR:hsa03000]
    5451 (POU2F1)
   03021 Transcription machinery [BR:hsa03021]
    5451 (POU2F1)
Transcription factors [BR:hsa03000]
 Eukaryotic type
  Helix-turn-helix
   Homeo domain POU
    5451 (POU2F1)
Transcription machinery [BR:hsa03021]
 Eukaryotic type
  RNA polymerase III system
   Other transcription-related factors
    Octamer transcription factors
     5451 (POU2F1)
Motif
Pfam: POU2F1_C Pou Homeodomain Homeobox_KN HTH_31
Other DBs
NCBI-GeneID: 5451
NCBI-ProteinID: NP_001185712
OMIM: 164175
HGNC: 9212
Ensembl: ENSG00000143190
UniProt: P14859
Position
1:167220885..167427345
AA seq 755 aa
MLDCSDYVLDSRMNNPSETSKPSMESGDGNTGTQTNGLDFQKQPVPVGGAISTAQAQAFL
GHLHQVQLAGTSLQAAAQSLNVQSKSNEESGDSQQPSQPSQQPSVQAAIPQTQLMLAGGQ
ITGLTLTPAQQQLLLQQAQAQAQLLAAAVQQHSASQQHSAAGATISASAATPMTQIPLSQ
PIQIAQDLQQLQQLQQQNLNLQQFVLVHPTTNLQPAQFIISQTPQGQQGLLQAQNLLTQL
PQQSQANLLQSQPSITLTSQPATPTRTIAATPIQTLPQSQSTPKRIDTPSLEEPSDLEEL
EQFAKTFKQRRIKLGFTQGDVGLAMGKLYGNDFSQTTISRFEALNLSFKNMCKLKPLLEK
WLNDAENLSSDSSLSSPSALNSPGIEGLSRRRKKRTSIETNIRVALEKSFLENQKPTSEE
ITMIADQLNMEKEVIRVWFCNRRQKEKRINPPSSGGTSSSPIKAIFPSPTSLVATTPSLV
TSSAATTLTVSPVLPLTSAAVTNLSVTGTSDTTSNNTATVISTAPPASSAVTSPSLSPSP
SASASTSEASSASETSTTQTTSTPLSSPLGTSQVMVTASGLQTAAAAALQGAAQLPANAS
LAAMAAAAGLNPSLMAPSQFAAGGALLSLNPGTLSGALSPALMSNSTLATIQALASGGSL
PITSLDATGNLVFANAGGAPNIVTAPLFLNPQNLSLLTSNPVSLVSAAAASAGNSAPVAS
LHATSTSAESIQNSLFTVASASGAASTTTTASKAQ
NT seq 2268 nt
atgctggactgcagtgactatgttctagactcaagaatgaacaatccgtcagaaaccagt
aaaccatctatggagagtggagatggcaacacaggcacacaaaccaatggtctggacttt
cagaagcagcctgtgcctgtaggaggagcaatctcaacagcccaggcgcaggctttcctt
ggacatctccatcaggtccaactcgctggaacaagtttacaggctgctgctcagtcttta
aatgtacagtctaaatctaatgaagaatcgggggattcgcagcagccaagccagccttcc
cagcagccttcagtgcaggcagccattccccagacccagcttatgctagctggaggacag
ataactgggcttactttgacgcctgcccagcaacagttactactccagcaggcacaggca
caggcacagctgctggctgctgcagtgcagcagcactccgccagccagcagcacagtgct
gctggagccaccatctccgcctctgctgccacgcccatgacgcagatccccctgtctcag
cccatacagatcgcacaggatcttcaacaactgcaacagcttcaacagcagaatctcaac
ctgcaacagtttgtgttggtgcatccaaccaccaatttgcagccagcgcagtttatcatc
tcacagacgccccagggccagcagggtctcctgcaagcgcaaaatcttctaacgcaacta
cctcagcaaagccaagccaacctcctacagtcgcagccaagcatcaccctcacctcccag
ccagcaaccccaacacgcacaatagcagcaaccccaattcagacacttccacagagccag
tcaacaccaaagcgaattgatactcccagcttggaggagcccagtgaccttgaggagctt
gagcagtttgccaagaccttcaaacaaagacgaatcaaacttggattcactcagggtgat
gttgggctcgctatggggaaactatatggaaatgacttcagccaaactaccatctctcga
tttgaagccttgaacctcagctttaagaacatgtgcaagttgaagccacttttagagaag
tggctaaatgatgcagagaacctctcatctgattcgtccctctccagcccaagtgccctg
aattctccaggaattgagggcttgagccgtaggaggaagaaacgcaccagcatagagacc
aacatccgtgtggccttagagaagagtttcttggagaatcaaaagcctacctcggaagag
atcactatgattgctgatcagctcaatatggaaaaagaggtgattcgtgtttggttctgt
aaccgccgccagaaagaaaaaagaatcaacccaccaagcagtggtgggaccagcagctca
cctattaaagcaattttccccagcccaacttcactggtggcgaccacaccaagccttgtg
actagcagtgcagcaactaccctcacagtcagccctgtcctccctctgaccagtgctgct
gtgacgaatctttcagttacaggcacttcagacaccacctccaacaacacagcaaccgtg
atttccacagcgcctccagcttcctcagcagtcacgtccccctctctgagtccctcccct
tctgcctcagcctccacctccgaggcatccagtgccagtgagaccagcacaacacagacc
acctccactcctttgtcctcccctcttgggaccagccaggtgatggtgacagcatcaggt
ttgcaaacagcagcagctgctgcccttcaaggagctgcacagttgccagcaaatgccagt
cttgctgccatggcagctgctgcaggactaaacccaagcctgatggcaccctcacagttt
gcggctggaggtgccttactcagtctgaatccagggaccctgagcggtgctctcagccca
gctctaatgagcaacagtacactggcaactattcaagctcttgcttctggtggctctctt
ccaataacatcacttgatgcaactgggaacctggtatttgccaatgcgggaggagccccc
aacatcgtgactgcccctctgttcctgaaccctcagaacctctctctgctcaccagcaac
cctgttagcttggtctctgccgccgcagcatctgcagggaactctgcacctgtagccagc
cttcacgccacctccacctctgctgagtccatccagaactctctcttcacagtggcctct
gccagcggggctgcgtccaccaccaccaccgcctccaaggcacagtga
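
Note on sequence lengths: in each of the three gene entries above, the length given on the "NT seq" header should equal three bases per residue of the "AA seq" length plus one stop codon, and each coding sequence ends in tga. The sketch below is a minimal consistency check in Python, assuming the lengths are copied by hand from those header lines; the names ENTRIES, cds_length_consistent, and check_all are illustrative and are not part of any KEGG tool or API.

```python
# Lengths copied from the "AA seq" and "NT seq" header lines of the entries above.
# The dictionary keys are informal labels, not KEGG API identifiers.
ENTRIES = {
    "hsa:3054 (HCFC1)": (2035, 6108),
    "hsa:29915 (HCFC2)": (792, 2379),
    "hsa:5451 (POU2F1)": (755, 2268),
}


def cds_length_consistent(aa_len: int, nt_len: int) -> bool:
    """Return True when nt_len is three bases per residue plus one stop codon."""
    return nt_len == 3 * (aa_len + 1)


def check_all(entries: dict) -> dict:
    """Map each entry label to the result of the length check."""
    return {
        name: cds_length_consistent(aa_len, nt_len)
        for name, (aa_len, nt_len) in entries.items()
    }


if __name__ == "__main__":
    for name, ok in check_all(ENTRIES).items():
        print(f"{name}: {'consistent' if ok else 'MISMATCH'}")
```

A mismatch would usually point to a truncated copy of the sequence or to a header taken from a different isoform, so this check is a cheap first step before any downstream use of the sequences.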
