|
Putative Solitalea canadensis DSM 3403 RM systems
Ref: Lucas,S. et al. unpublished
REBASE ref # 14225
Complete sequence: 5,202,069 bp
GenBank #: CP003349
REBASE acronym: Sca3403
Org_num: 8634
All begin Solca_
Type I | ||||
---|---|---|---|---|
ORF | Gene | Most similar | Specificity | Name |
1965 | pseudogene | |||
1966 | R | CcoA37T2ORF102274P (91% identity) | CAGNNNNNNRTGG | Sca3403IP |
1967 | virulence protein | |||
1968 | S | S1.Pga1313ORF4740P (52% identity) | CAGNNNNNNRTGG | S.Sca3403I |
1969 | TIGR02436 family protein | |||
1970 | pseudogene | |||
1971 | M | M.CcoA37T2ORF102274P (90% identity) | CAGNNNNNNRTGG | M.Sca3403I |
1972 | C | C.SspNJ44ORFDP (57% identity) | C.Sca3403IP | |
1973 | pseudogene | |||
Type II | ||||
ORF | Gene | Most similar | Specificity | Name |
189 | Protein of unknown function DUF262 | |||
190 | M | M.Ave16549ORFDP (59% identity) | CTGCAG | M.Sca3403II |
191 | transposase | |||
1818 | 304 aa hypothetical protein | |||
1819 | R | Csp1310ORF33320P (53% identity) | Sca3403ORF1820P | |
1820 | M | M.Csp1310ORF33320P (71% identity) | M.Sca3403ORF1820P | |
1821 | A/G-specific DNA glycosylase | |||
3696 | 205 aa hypothetical protein | |||
3697 | RM | Afl52984ORF6275P (70% identity) | Sca3403ORF3697P | |
3700 | protein containing C-terminal region/beta chain | |||
3831 | 284 aa hypothetical protein | |||
3832 | RM | Sba196ORFDP (47% identity) | Sca3403ORF3832P | |
3833 | 306 aa hypothetical protein | |||
Type IV | ||||
ORF | Gene | Most similar | Specificity | Name |
3832 | type II restriction endonuclease | |||
3833 | R | LpnLMrrP (52% identity) | Sca3403MrrP | |
3834 | 280 aa hypothetical protein |