Ref: Lucas,S. et al. unpublished

REBASE ref # 11232

Complete sequence: 4,669,813 bp

GenBank #: CP001701 (NC_013161)

REBASE acronym: Csp8802

Org_num: 5525

 

All begin Cyan8802_

Type I
ORF Gene Most similar Specificity Name
 
724 TPR repeat-containing protein
725 R DspB1220ORF15210P (42% identity) Csp8802ORF725P
726 transcriptional regulator,
727 S S.Asp27244ORF1181P (38% identity) S.Csp8802ORF725P
728 chromate transporter, chromate ion transporter
 
2272 transcriptional regulator, MerR family
2273 M M.Csp8801ORF2210P (98% identity) M.Csp8802ORF2273P
2274 protein of unknown function DUF559
2275 72 aa conserved hypothetical protein
2276 YcfA family protein
2277 S S.CspNS01ORF1471P (39% identity) S.Csp8802ORF2273P
2278 86 aa hypothetical protein
2279 83 aa hypothetical protein
2280 Nucleotide binding protein PINc
2281 R Csp8801ORF2210P (97% identity) Csp8802ORF2273P
2282 protein of unknown function DUF45
 
3131 PilT protein domain protein
3132 S S1.Fpe1ORF11930P (38% identity) S.Csp8802ORF3132P
3133 PilT protein domain protein
 
3141 protein of unknown function DUF86
3142 R Mae88ORF34170P (83% identity) Csp8802ORF3142P
3143 protein of unknown function DUF45
 
3763 70 aa hypothetical protein
3764 S S.Csp8801ORF3712P (53% identity) S.Csp8802ORF3764P
3765 147 aa hypothetical protein
 
3983 circadian clock protein, KaiC
3984 M M.Csp8801ORF3935P (99% identity) M.Csp8802ORF3984P
3985 95 aa conserved hypothetical protein
3986 75 aa conserved hypothetical protein
3987 87 aa hypothetical protein
3988 PilT protein domain protein
3989 S S.Ssp1002ORF1870P (39% identity) S.Csp8802ORF3984P
3990 75 aa hypothetical protein
3991 PilT protein domain protein
3992 R Fsp4106ORF35660P (69% identity) Csp8802ORF3984P
3993 PEP-utilising protein mobile region
Type II
ORF Gene Most similar Specificity Name
 
81 466 aa hypothetical protein
82 M M.Csp8801ORF84P (97% identity) GGNCC M.Csp8802ORF82P
83 carbohydrate kinase, YjeF related protein
 
653 70 aa hypothetical protein
655 M M.PspETS05ORFBP (59% identity) GGTNACC M.Csp8802ORF655P
656 R Ebo30301ORF4310P (68% identity) GGTNACC Csp8802ORF655P
657 30 aa hypothetical protein
 
1017 exodeoxyribonuclease III
1018 M M.Csp8801ORF989P (100% identity) CGATCG M.Csp8802ORF1018P
1019 75 aa hypothetical protein
 
1679 protein of unknown function UPF0150
1680 RM Ned1411ORF27825P (65% identity) Csp8802ORF1680P
1681 protein of unknown function DUF820
 
2015 protein of unknown function DUF29
2016 M M.Csp8801ORF1989P (99% identity) GGCC M.Csp8802ORF2016P
2017 86 aa hypothetical protein
 
2206 XisH protein
??? RM DspCY41ORFAP (33% identity) Csp8802ORF2207P
2209 206 aa hypothetical protein
 
2359 257 aa hypothetical protein
2360 M M.Csp68KORF3196P (69% identity) RGCGCY M.Csp8802ORF2360P
2361 glutamyl-tRNA synthetase
 
2665 sodium/hydrogen exchanger
2666 M M.Csp8801ORF3450P (100% identity) GGWCC M.Csp8802ORF2666P
2667 TrkA-N domain protein
 
3306 TrkA-N domain protein
3307 M M.Csp8801ORF2793P (44% identity) GDGCHC M.Csp8802ORF3307P
3308 protein of unknown function DUF1130
 
3361 149 aa hypothetical protein
3362 R Csp8801ORF2739P (99% identity) GATC Csp8802ORF3363P
3363 M M.Csp8801ORF2739P (100% identity) GATC M.Csp8802ORF3363P
3364 glucokinase
 
3597 binding-protein-dependent transport systems
??? M M.Csp8801ORF2506P (98% identity) YACGTR M.Csp8802ORF3598P
3601 pentapeptide repeat protein
 
3821 35 aa hypothetical protein
3822 M M.Csp8801ORF3772P (100% identity) TTCGAA M.Csp8802ORF3822P
3823 R Csp8801ORF3772P (99% identity) TTCGAA Csp8802ORF3822P
3824 37 aa hypothetical protein
 
3919 protein of unknown function UPF0102
3920 C C.Csp8801ORF3873P (100% identity) C.Csp8802ORF3922P
3921 R Csp8801ORF3873P (100% identity) ACRYGT Csp8802ORF3922P
3922 M M.Csp8801ORF3873P (100% identity) ACRYGT M.Csp8802ORF3922P
3923 pentapeptide repeat protein
 
4075 HAD-superfamily hydrolase, subfamily IA, variant
4077 M M.LspO77ORF2643P (65% identity) M.Csp8802ORF4077P
4078 R LspO77ORF2643P (68% identity) Csp8802ORF4077P
4079 phosphoenolpyruvate carboxykinase (ATP)
 
4252 adenosine/AMP deaminase
4254 M M.Csp8801ORF4215P (100% identity) CMGCKG M.Csp8802ORF4254P
4255 R Csp8801ORF4215P (99% identity) CMGCKG Csp8802ORF4254P
4256 nicotinate-nucleotide pyrophosphorylase
 
4355 2-C-methyl-D-erythritol 4-phosphate
4356 M M.Csp8801ORF4296P (98% identity) GGWCC M.Csp8802ORF4356P
4357 R Cep9333ORF3880P (70% identity) GGWCC Csp8802ORF4356P
4358 metallophosphoesterase
Type IV
ORF Gene Most similar Specificity Name
 
1089 TPR repeat-containing protein
1090 R Csp8801MrrP (99% identity) Csp8802MrrP
1091 143 aa hypothetical protein
 
3743 pseudogene
3744 R Csp8801Mrr2P (99% identity) Csp8802Mrr2P
3745 protein of unknown function DUF29
Tech Support Feedback NEB Overview Site Map Trademarks Legal and Disclaimers Privacy Cookie Policy Terms of Use
© Copyright 2023 New England Biolabs. All Rights Reserved.