Ref: Lucas,S. et al. unpublished

REBASE ref # 10671

Complete sequence: 2,736,403 bp

GenBank #: CP001101 (NC_010831)

REBASE acronym: CphB

Org_num: 4402

 

All begin Cphamn1_

Type I
ORF Gene Most similar Specificity Name
 
332 short-chain dehydrogenase/reductase SDR
333 M M.PaeDORF306P (91% identity) M.CphBORF333P
334 S S.DthMLF1ORF775P (48% identity) S.CphBORF333P
335 591 aa conserved hypothetical protein
336 Piwi domain protein
337 R PaeDORF306P (89% identity) CphBORF333P
338 protein of unknown function DUF45
 
??? adenine-specific DNA-methyltransferase
749 R MbaWORF3703P (39% identity) CphBORF751P
750 S S.Tbr16975I (46% identity) S.CphBORF751P
751 M M.NspENR4ORF968P (55% identity) M.CphBORF751P
752 203 aa conserved hypothetical protein
 
2542 protein of unknown function DUF45
2543 R PspZM2ORF2640P (85% identity) CphBORF2550P
2544 127 aa conserved hypothetical protein
2545 filamentation induced by cAMP protein Fic
2546 277 aa conserved hypothetical protein
2547 77 aa conserved hypothetical protein
2548 S S.Mps15ORF2244P (47% identity) S1.CphBORF2550P
2549 279 aa conserved hypothetical protein
2550 M M.GsuPLORF8925P (88% identity) M.CphBORF2550P
2551 133 aa conserved hypothetical protein
2552 S S2.GsuPLORF8925P (67% identity) S2.CphBORF2550P
2553 77 aa hypothetical protein
Type II
ORF Gene Most similar Specificity Name
 
40 sodium:neurotransmitter symporter
41 M M.PspGSB1ORF360P (79% identity) CTAG M.CphBORF40P
42 R PspGSB1ORF360P (57% identity) CTAG CphBORF40P
43 transposase IS4 family protein
 
116 pseudogene
??? M M.GbaPX52ORF3096P (63% identity) CTCGAG M.CphBORF117P
??? R Lil3055ORFIP (48% identity) CTCGAG CphBORF117P
119 pseudogene
 
744 932 aa conserved hypothetical protein, putative DNA
745 RM AcrJFORF2421P (80% identity) CphBORF745P
746 350 aa conserved hypothetical protein
 
746 350 aa conserved hypothetical protein
??? RM CteTORF1159P (53% identity) CphBORF747P
749 type III restriction protein res subunit
 
872 protein of unknown function DUF86
873 R PmeK22ORF707P (73% identity) CphBORF874P
874 M M.PmeK22ORF707P (63% identity) M.CphBORF874P
876 pseudogene
 
883 pseudogene
884 RM Cph266ORF1300P (85% identity) CphBORF884P
885 pseudogene
 
1060 nitrogenase-associated protein
1062 M M.Cfe13031ORF340P (73% identity) CCCGGG M.CphBI
1063 R Cph266ORF2524P (73% identity) CCCGGG CphBI
1064 pentapeptide repeat protein
 
1156 465 aa conserved hypothetical protein
1157 M M.Mda10076ORF14160P (85% identity) GAATTC M.CphBORF1157P
1158 HNH endonuclease
 
1245 109 aa hypothetical protein
1246 M M.Cli245ORF789P (84% identity) CAGCTG M.CphBORF1246P
1247 R Cph266ORF1518P (72% identity) CAGCTG CphBORF1246P
1248 59 aa hypothetical protein
 
1411 HpcH/HpaI aldolase
1412 V V.DdeEORF12855P (61% identity) V.CphBORF1413P
1413 M M.DdeEORF12855P (75% identity) M.CphBORF1413P
1414 635 aa conserved hypothetical protein
 
1676 PglZ domain protein
1677 RM AspSHORF9255P (54% identity) CphBORF1677P
1678 174 aa conserved hypothetical protein
 
2495 ATPase (AAA+ superfamily)-like protein
2496 M M.CliFORF3910P (87% identity) M.CphBORF2496P
2497 protein of unknown function DUF1016
Type III
ORF Gene Most similar Specificity Name
 
704 465 aa conserved hypothetical protein
705 M M.DspX2ORF1802P (48% identity) M.CphBORF705P
706 protein of unknown function DUF1016
707 R CliFORF9615P (94% identity) CphBORF705P
708 443 aa conserved hypothetical cytosolic protein
 
1147 212 aa conserved hypothetical protein
1148 M M.Dli5ac10ORF21590P (63% identity) M.CphBORF1148P
1149 R Dli5ac10ORF21590P (87% identity) CphBORF1148P
1150 protein of unknown function DUF262
 
2420 192 aa conserved hypothetical protein
2421 R PspHL130ORF10575P (94% identity) CphBORF2417P
2422 UDP-N-acetylenolpyruvoylglucosamine reductase