|
Putative Cyanothece species PCC 8801 RM systems
Ref: Lucas,S. et al. unpublished
REBASE ref # 10856
Complete sequence: 4,679,413 bp
GenBank #: CP001287 (NC_011726)
REBASE acronym: Csp8801
Org_num: 5257
All begin PCC8801_
Type I | ||||
---|---|---|---|---|
ORF | Gene | Most similar | Specificity | Name |
2209 | transcriptional regulator, MerR family | |||
2210 | M | M.Csp8802ORF2273P (98% identity) | M.Csp8801ORF2210P | |
2211 | protein of unknown function DUF559 | |||
2212 | 72 aa conserved hypothetical protein | |||
2213 | YcfA family protein | |||
2214 | S | S.Mko49807ORF6120P (37% identity) | S.Csp8801ORF2210P | |
2215 | 86 aa hypothetical protein | |||
2216 | 83 aa conserved hypothetical protein | |||
2217 | Nucleotide binding protein PINc | |||
2218 | R | Csp8802ORF2273P (97% identity) | Csp8801ORF2210P | |
2219 | protein of unknown function DUF45 | |||
??? | R | HsdR family type I site-specific deoxyribonuclease | Csp8801ORF2980AP | |
2961 | pseudogene | |||
2962 | RNA-directed DNA polymerase (Reverse | |||
2963 | 18 aa hypothetical protein | |||
2964 | RNA-directed DNA polymerase (Reverse | |||
2966 | 133 aa hypothetical protein | |||
??? | R | HsdR family type I site-specific deoxyribonuclease | Csp8801ORF2980BP | |
??? | R | Csp8802ORF3142P (45% identity) | Csp8801ORF2980CP | |
2968 | protein of unknown function DUF86 | |||
2969 | DNA polymerase beta domain protein region | |||
2970 | protein of unknown function UPF0175 | |||
2971 | death-on-curing family protein | |||
2972 | transcriptional regulator/antitoxin, MazE | |||
2973 | protein of unknown function DUF86 | |||
2974 | DNA polymerase beta domain protein region | |||
2975 | 79 aa conserved hypothetical protein | |||
2976 | S | S.Sce26ORF54400P (24% identity) | S.Csp8801ORF2980AP | |
2977 | transposase IS204/IS1001/IS1096/IS1165 family | |||
2978 | S | restriction modification system DNA specificity | S.Csp8801ORF2980BP | |
2979 | 107 aa hypothetical protein | |||
2980 | M | M.Mae9806ORF70004P (27% identity) | M.Csp8801ORF2980P | |
2981 | Transposase-like Mu | |||
2987 | N-6 DNA methylase | |||
2988 | S | M.Csp7822ORF4712P (18% identity) | S.Csp8801ORF2989P | |
2989 | M | M.Csp7424ORF2087P (88% identity) | M.Csp8801ORF2989P | |
2990 | R | Nsp543ORF22070P (70% identity) | Csp8801ORF2989P | |
2991 | 58 aa conserved hypothetical protein | |||
3710 | 70 aa hypothetical protein | |||
3711 | S | 92 aa hypothetical protein | S.Csp8801ORF3712P | |
3712 | M | type I restriction-modification system, M subunit protein | M.Csp8801ORF3712P | |
3713 | 147 aa conserved hypothetical protein | |||
3934 | circadian clock protein, KaiC | |||
3935 | M | M.Csp8802ORF3984P (99% identity) | M.Csp8801ORF3935P | |
3936 | 95 aa conserved hypothetical protein | |||
3937 | 75 aa conserved hypothetical protein | |||
3938 | 87 aa hypothetical protein | |||
3939 | PilT protein domain protein | |||
3940 | S | S.Mae2481ORF1685P (48% identity) | S.Csp8801ORF3935P | |
3941 | 75 aa conserved hypothetical protein | |||
3942 | PilT protein domain protein | |||
3943 | R | Csp8802ORF3984P (24% identity) | Csp8801ORF3935P | |
3944 | Pyruvate, water dikinase | |||
Type II | ||||
ORF | Gene | Most similar | Specificity | Name |
83 | 466 aa conserved hypothetical protein | |||
84 | M | M.Csp8802ORF82P (97% identity) | GGNCC | M.Csp8801ORF84P |
85 | R | Gsp3708ORF110P (54% identity) | GGWCC | Csp8801ORF84P |
86 | carbohydrate kinase, YjeF related protein | |||
988 | exodeoxyribonuclease III | |||
989 | M | M.Csp8802ORF1018P (100% identity) | CGATCG | M.Csp8801ORF989P |
990 | 75 aa conserved hypothetical protein | |||
1661 | protein of unknown function UPF0150 | |||
1662 | RM | Mae88ORF32740P (78% identity) | Csp8801ORF1662P | |
1663 | protein of unknown function DUF820 | |||
2307 | 257 aa conserved hypothetical protein | |||
??? | R | type II restriction enzyme HaeII | RGCGCY | Csp8801ORF1P |
2309 | glutamyl-tRNA synthetase | |||
2307 | 257 aa conserved hypothetical protein | |||
2308 | R | type II restriction enzyme HaeII | RGCGCY | Csp8801ORF2308P |
2309 | glutamyl-tRNA synthetase | |||
2505 | pentapeptide repeat protein | |||
??? | M | M.Csp8802ORF3598P (98% identity) | YACGTR | M.Csp8801ORF2506P |
2509 | binding-protein-dependent transport systems | |||
2738 | glucokinase | |||
2739 | M | M.Csp8802ORF3363P (100% identity) | GATC | M.Csp8801ORF2739P |
2740 | R | Csp8802ORF3363P (99% identity) | GATC | Csp8801ORF2739P |
2741 | 149 aa conserved hypothetical protein | |||
2792 | transposase IS4 family protein | |||
2793 | M | M.Csp8802ORF3307P (44% identity) | GDGCHC | M.Csp8801ORF2793P |
2794 | TrkA-N domain protein | |||
3449 | TrkA-N domain protein | |||
3450 | M | M.Csp8802ORF2666P (100% identity) | GGWCC | M.Csp8801ORF3450P |
3451 | sodium/hydrogen exchanger | |||
3771 | MutS2 family protein | |||
3772 | M | M.Csp8802ORF3822P (100% identity) | TTCGAA | M.Csp8801ORF3772P |
3773 | R | Csp8802ORF3822P (99% identity) | TTCGAA | Csp8801ORF3772P |
3774 | 225 aa conserved hypothetical protein | |||
3870 | protein of unknown function UPF0102 | |||
3871 | C | C.Csp8802ORF3922P (100% identity) | C.Csp8801ORF3873P | |
3872 | R | Csp8802ORF3922P (100% identity) | ACRYGT | Csp8801ORF3873P |
3873 | M | M.Csp8802ORF3922P (100% identity) | ACRYGT | M.Csp8801ORF3873P |
3874 | pentapeptide repeat protein | |||
4213 | adenosine/AMP deaminase | |||
4215 | M | M.Csp8802ORF4254P (100% identity) | CMGCKG | M.Csp8801ORF4215P |
4216 | R | Csp8802ORF4254P (99% identity) | CMGCKG | Csp8801ORF4215P |
4217 | nicotinate-nucleotide pyrophosphorylase | |||
4295 | 2-C-methyl-D-erythritol 4-phosphate | |||
4296 | M | M.Csp8802ORF4356P (98% identity) | GGWCC | M.Csp8801ORF4296P |
4297 | R | Csp8802ORF4356P (67% identity) | GGWCC | Csp8801ORF4296P |
4298 | type II restriction endonuclease | |||
Type IV | ||||
ORF | Gene | Most similar | Specificity | Name |
1060 | TPR repeat-containing protein | |||
1061 | R | Csp8802MrrP (99% identity) | Csp8801MrrP | |
1062 | 143 aa conserved hypothetical protein | |||
3689 | pseudogene | |||
3690 | R | Csp8802Mrr2P (99% identity) | Csp8801Mrr2P | |
3691 | protein of unknown function DUF29 | |||
Orphan M | ||||
ORF | Gene | Most similar | Specificity | Name |
1988 | protein of unknown function DUF29 | |||
1989 | M | M.Csp8802ORF2016P (99% identity) | GGCC | M.Csp8801ORF1989P |
1990 | 86 aa conserved hypothetical protein |