KEGG   Thalassiosira pseudonana: THAPSDRAFT_168
Entry
THAPSDRAFT_168    CDS       T01078                                 
Symbol
RPB1
Name
(RefSeq) DNA directed RNA polymerase II, largest subunit
KO
K03006  DNA-directed RNA polymerase II subunit RPB1 [EC:2.7.7.6]
Organism
tps  Thalassiosira pseudonana
Pathway
tps03020  RNA polymerase
tps03420  Nucleotide excision repair
Brite
KEGG Orthology (KO) [BR:tps00001]
 09120 Genetic Information Processing
  09121 Transcription
   03020 RNA polymerase
    THAPSDRAFT_168 (RPB1)
  09124 Replication and repair
   03420 Nucleotide excision repair
    THAPSDRAFT_168 (RPB1)
 09180 Brite Hierarchies
  09182 Protein families: genetic information processing
   03021 Transcription machinery [BR:tps03021]
    THAPSDRAFT_168 (RPB1)
   03400 DNA repair and recombination proteins [BR:tps03400]
    THAPSDRAFT_168 (RPB1)
Enzymes [BR:tps01000]
 2. Transferases
  2.7  Transferring phosphorus-containing groups
   2.7.7  Nucleotidyltransferases
    2.7.7.6  DNA-directed RNA polymerase
     THAPSDRAFT_168 (RPB1)
Transcription machinery [BR:tps03021]
 Eukaryotic type
  RNA polymerase II system
   RNA polymerase II
    Core subunits
     THAPSDRAFT_168 (RPB1)
DNA repair and recombination proteins [BR:tps03400]
 Eukaryotic type
  SSBR (single strand breaks repair)
   NER (nucleotide excision repair)
    TCR (transcription coupled repair) factors
     DNA-directed RNA polymerase II complex
      THAPSDRAFT_168 (RPB1)
Motif
Pfam: RNA_pol_Rpb1_5 RNA_pol_Rpb1_1 RNA_pol_Rpb1_2 RNA_pol_Rpb1_6 RNA_pol_Rpb1_3 RNA_pol_Rpb1_7 RNA_pol_Rpb1_4 RNA_pol_Rpb1_R
Other DBs
NCBI-GeneID: 7453440
NCBI-ProteinID: XP_002286060
JGI: 168
UniProt: B8BQ02
Position
1:692424..697888
AA seq 1627 aa
MAPRKDGTAREAGAISLSQASTNFGHSSARLRRIKKLQFGIINPEELRQYSVTQAITVNG
RKIPAGVTRYETYMSGQPVYGGVNDPRLGDLHDKSDPGYFGHIELARPVYHQGFIDVTLK
ALRCVCFHCSRITMEDTEYKFQRARQIRNRKRRLDAMHALIRPKKKCDHCNGYQPKYTKV
GLHVEIEYADEMERVPGSSGDKKQFMSAQKAVDIFKKMRDDEVKALGLDVTWARPEWMCV
SVMPVPPLHVRPSVVMGGGAQSSEDDLTHQLVNIVKSNIALKTAIQNGEPNIIVEQFEQA
LQHNVAAFVNNEMRGMPQITQRSGRPLKTLAQRLKAKEGRIRGNLMGKRVDFSARTVITA
DPNLGIHQVGVPRSVAMNLTVPTRVTPFNIQELSALVANGPTEHPGAKHIIRSDGLRIDL
RYVKNKSDLLLANGWIVERHLRDGDIVLFNRQPSLHKMSIMGHMAKVLDWSTFRLNLSCT
SPYNADFDGDEMNLHVPQSLPARAEAELMMHSPRVIVSGQSNRPVMGIVQDSLLAVQKMT
KRDVFVKKDLMMNILMWVEDWDGRIPPPAIYRPEELWTGKQIMSMILPKINLTGKANNGG
PGPNTFNAYDNLVRVMEGELIEGTIDKKTIGSGMGGLIHTAWLDVGHEDTARFMNQTQVV
TNYWVLQSSFSIGVCDTIADFATMEQIASTINKAKLQVLDLVRQGQRGELETQPGRTMIE
SFEQFVNKVLNTARDHAGKSAQASLDETNSVKAMVTAGSKGSFINISQIIACVGQQNVEG
NRIPYGFKRRTLPHFSKDDLGPESRGFVENSYLSGLTPQEFFFHAMGGREGLIDTACKTA
ETGYIQRRLVKAMETVMARYDGTLRTSGGQIVQFLYGEDGMDAVWIERQSFDSLTLNKRE
FDERYLLHSDDPDFGYDDQNIPFLEAEVIEDCRHNPEVQQMLDREMEVLKEDQAMLRIIM
ANREAGRESDVNSYAPGNVKRVIQNALRQFQIDKGLPSNLHPKDVIEKIEAMLRRLVVVV
GDDLLSVEAQNNATTLYRILIRSYLASKRVLREYRLSEAALIWVLGEIEARFHHAKVSPG
EMAGVLAAQSIGEPATQMTLNTFHYAGVSAKNVTLGVPRLKEIINVAKTVKTPGLTIYLQ
NHVSGDADVAKLVHSMIEYTVLTDLTKLTEIYYDPDPVNTIVAKDREFVKEYYEMGEETE
EDLRRLSPWVLRIELNQVVMVDKKIKMNEIAAEIANEYGSDLNVLVSDDNADDLVARIRI
VNDLPMVQGMDEDGNPIMTDDDVELGQEDDVFLKRLEKNMLHTLKLRGVDDVKKVFMRGG
AKKTVWDDEKGFGITDEWVLETDGTNLMSVLGIDYVDATRTISNDIVEVFMVLGIEGVRA
AILSELRNVISFDGSYVNYRHLACLVDVMTMHGHLMAVDRHGINRVESGPLLRCSFEETV
DMLNDAACFAEEEVLRGVTENIMMGQLARVGTGDMDLLLDEQKVLLWMDLRQTRNWGSWA
ALVCQHRMLPLHSRLLKCLHLLMLVDSVLRLGDSPLVHTATSGKLNDSWYIPRHALSSFF
VSNLLLRCPSLQTGIFSNSNKSCIFTNKPCLQPYVTCVQSYDPTSPAYSPTSPAYSPTSP
VSLLFVC
NT seq 4884 nt
atggcacccagaaaagacggcaccgcccgcgaagccggagccatctccctctcccaagca
tccaccaacttcggacactcctccgctcgtctccgtcgtatcaagaaattgcaatttggt
attattaatccggaggagttgaggcagtattcggtgacgcaggcgattacggtgaatggg
aggaagattccggcgggggttacgcggtatgagacctacatgtcaggacaacccgtttac
ggaggggtcaacgaccctcgtctcggtgatctccacgacaagtccgaccctggatacttt
ggtcacattgagcttgctcgtcctgtctatcatcaggggtttattgatgttactctcaag
gctttgaggtgtgtttgctttcactgtagtaggattacgatggaggatacggagtataag
tttcagagggcgaggcagattaggaataggaagaggaggctggatgcaatgcatgctcta
attcgccccaaaaagaagtgcgatcactgcaatgggtatcagccaaagtataccaaggtg
ggactgcacgtggaaattgagtatgctgatgagatggaacgagttccagggagtagtggg
gataagaagcaattcatgagtgctcaaaaggccgttgatatcttcaaaaagatgcgggat
gacgaggttaaagctcttggactcgacgttacttgggctcgtcctgagtggatgtgcgtc
tctgtgatgcccgtgcctcctcttcacgtccgtccctctgtcgtcatgggagggggtgct
caatcgtctgaggatgatctcacccatcagcttgtcaacattgtcaagtccaacattgct
ctcaaaactgctatccagaacggggaacccaacattatcgttgagcagtttgagcaggca
cttcagcacaacgtggctgcattcgtcaacaatgaaatgaggggaatgcctcaaatcact
cagaggagtggacgtcctctcaaaacgttggcacaaaggctcaaggccaaggagggacgt
attcgtggtaacttgatggggaagcgtgtggatttctcggctcgtactgtcattacggcg
gatcccaacctgggaattcatcaagtgggtgtgccgaggagtgttgccatgaacttgact
gtgccgactcgtgtgacaccgttcaacattcaagagttgagtgccttggtggccaacgga
cccacggagcacccaggagcaaagcacatcattcgcagtgacggattgcgtatcgacttg
cgatatgtcaagaataagagtgacttgctgttggcgaatggttggattgtggagcgtcac
ttgagggatggggatattgtgcttttcaatcgtcagccgagtcttcacaaaatgtccatc
atgggacacatggccaaagtgttggattggagtacctttcgtttgaacctcagttgtact
tcgccgtacaatgctgacttcgatggagatgagatgaacttgcacgttccacagagtttg
cctgctcgtgctgaggctgagttgatgatgcacagtcctcgagtgattgttagtgggcaa
tccaatcgtccagtgatgggtatcgtccaggattcgttgttggctgtgcagaagatgaca
aagcgcgatgtgttcgtgaagaaggatttgatgatgaacattctaatgtgggtggaagat
tgggatgggcgaatccctccaccagctatctacaggcccgaggaactttggacgggaaag
caaatcatgtcaatgattctacccaagataaacctcactggcaaggcgaataatggggga
cctggacccaacacattcaatgcatatgacaaccttgtcagagtaatggagggggagcta
attgaaggaacaatcgacaagaagaccattggatcaggaatggggggtctcatccacaca
gcatggcttgacgtggggcacgaggatacagctcgcttcatgaatcaaactcaggtggtc
accaactactgggtgttgcagagcagttttagtattggagtttgtgatacgatagctgac
tttgccaccatggagcagattgcgagtaccatcaacaaagccaaacttcaagtacttgat
ctcgttcgccagggtcaacgtggagagttggagactcagccaggacgaacaatgattgag
tcttttgagcagtttgtcaacaaagtcctcaatacagcacgtgatcacgcgggtaaatca
gcccaggccagtttggacgagactaacagtgttaaggccatggtgactgcaggttccaag
ggctccttcatcaacatctctcagattatcgcctgtgtcggacagcagaacgttgagggg
aaccgtatcccctacggatttaaacgtaggacgttgccccacttctccaaggatgatttg
ggacctgagtcgcgaggattcgtcgagaactcgtacctcagtggattgactcctcaagag
ttcttcttccatgcaatgggtggtcgtgagggtctgattgataccgcctgtaagacagcc
gagacgggttacatccaacgacgtcttgtgaaagcaatggagactgttatggcgaggtat
gacggtactttgcgtacatctggagggcagattgttcagtttctttatggagaggatggt
atggatgctgtctggattgaacgtcaatcctttgactcgctgactcttaacaagcgtgag
tttgatgaacgctacctactccattcggatgacccagactttggctatgacgatcagaac
attcctttccttgaggcagaggtgattgaagattgtcgtcacaaccccgaggtgcaacag
atgctggaccgtgagatggaagtgctgaaggaagatcaggcaatgcttcgtattatcatg
gccaatcgtgaggctggacgagagtcggatgtgaactcctacgcccccggtaacgtcaaa
cgtgtcattcaaaatgccctccgccagttccaaattgataaggggcttccttcaaacttg
catcccaaggatgtcattgaaaagattgaggcaatgctccgtcgtcttgtggtggtggtt
ggtgatgacttgttgagtgttgaagctcaaaacaatgcaacgacgctgtaccgtatcttg
attcgcagttatcttgcgagcaaacgtgttctcagggaataccgtttgagcgaggcggct
ttgatttgggtgcttggagagattgaggcacgattccatcacgccaaggtcagccctgga
gaaatggctggcgtccttgcagctcagagtattggggagccagccactcagatgacactg
aacacattccactacgccggagtgtccgcgaagaatgtaactttgggagtgcctcgtctg
aaggagattatcaacgttgccaaaacggtcaagactccaggtctaacgatctacttgcaa
aatcatgtcagtggcgatgcggatgtcgccaagttggtgcactccatgattgaatacacc
gtgcttactgatttgaccaaactgacagagatttactacgatccagaccctgtcaacacc
attgtcgccaaggatcgtgagtttgtcaaggaatactacgagatgggagaggagaccgaa
gaggatctccgacgtttgtcaccgtgggtccttcgcattgagttgaaccaagttgtgatg
gtggataagaagatcaaaatgaatgagattgcagcagagattgccaacgagtacggaagt
gacttgaatgtgcttgtttcggatgataacgctgatgatttggtggctcgcattcgcatt
gtcaacgatctacccatggtgcaaggaatggatgaagatggcaaccccatcatgacagac
gacgacgtggaacttggccaagaagatgatgtgttcttgaagcgattggagaagaacatg
cttcatactctaaagcttcgtggagtggatgacgtcaagaaggtcttcatgcgtggtgga
gcgaagaagacagtgtgggacgacgagaaaggcttcggaatcacagacgagtgggttctt
gagactgatggaacgaatctcatgtctgtgcttggtattgactacgttgacgcgactagg
acaatctctaatgacattgtggaggtgtttatggtgcttggtattgagggtgtacgtgct
gctattctcagtgagctgcgaaacgtcatttcttttgatggttcctacgtcaactatcgc
catctcgcttgtcttgtggacgtgatgacaatgcatggccacttgatggcagtggatcgt
cacggtattaaccgtgtcgagtctggtcccttgctgcgttgttcgttcgaagagacggtg
gacatgttgaacgatgcagcttgcttcgctgaagaggaggtccttagaggagttactgag
aacattatgatggggcagcttgctcgagttggtactggtgacatggacctgttgcttgat
gagcaaaaggttttgttgtggatggatttgagacagacaaggaattggggctcgtgggca
gcattggtatgccaacaccgtatgcttccactccattcgcgtctattgaaatgtctccat
ttgctgatgctggtggattcagtcctgcggttgggggattctcccctggttcatacggca
acatccggtaagttgaatgattcttggtatattcctcgtcatgcattatcttcattcttc
gtttctaatcttctgcttcgctgtccttcattgcaaacaggcatattctccaactccaac
aagtcctgcatattcaccaacaagccctgcctacagccctacgtcacctgcgtacagtcc
tacgatcctacgagtccagcatattctccaacgagtcctgcttactctcccaccagtcca
gtgagtctgttgtttgtgtgctag
