|
Putative Mixed sample RM systems
Ref: Beaulaurier,J.A. et al. Nat. Biotechnol. (2017) In press
REBASE ref # 25194
GenBank #: PDZQ00000000
REBASE acronym: Msa17
Org_num: 27261
All begin ORF_
Type I | ||||
---|---|---|---|---|
ORF | Gene | Most similar | Specificity | Name |
6 | 125 aa hypothetical protein | |||
7 | R | Cae25986ORF5215P (86% identity) | Msa17ORFT1P | |
11 | M | M.Caewa7ORFBP (100% identity) | M.Msa17ORFT1P | |
14 | S | S2.Cae25986ORF5215P (100% identity) | S.Msa17ORFT1P | |
18 | integrase | |||
54 | 122 aa hypothetical protein | |||
56 | M | M.PmeBin87ORFFP (100% identity) | M.Msa17ORFZP | |
58 | 201 aa hypothetical protein | |||
59 | S | S.PmeBin87ORFFP (100% identity) | S.Msa17ORFZP | |
60 | 259 aa hypothetical protein | |||
61 | 67 aa hypothetical protein | |||
62 | 65 aa hypothetical protein bolteae] | |||
63 | 91 aa hypothetical protein | |||
64 | 118 aa hypothetical protein | |||
65 | R | PmeBin87ORFFP (100% identity) | Msa17ORFZP | |
70 | C | C.PmeBin87ORFFP (100% identity) | C.Msa17ORFZP | |
71 | 152 aa hypothetical protein | |||
98 | XRE family transcriptional regulator | |||
99 | R | Ebo1231ORF14870P (100% identity) | Msa17ORFS1P | |
102 | M | M.Ebo1231ORF14870P (100% identity) | M.Msa17ORFS1P | |
104 | S | S1.Ebo1231ORF14870P (100% identity) | S1.Msa17ORFS1P | |
106 | S | S2.Ebo1231ORF14870P (100% identity) | S2.Msa17ORFS1P | |
108 | 67 aa hypothetical protein | |||
261 | 504 aa hypothetical protein | |||
262 | R | BccORFDP (100% identity) | Msa17ORFF1P | |
268 | S | S2.Bca43185aORF2045P (100% identity) | S1.Msa17ORFF1P | |
271 | integrase | |||
272 | divergent AAA domain protein | |||
273 | 109 aa hypothetical protein | |||
274 | 136 aa hypothetical protein | |||
275 | S | S1.Bca43195FORF18200P (100% identity) | S2.Msa17ORFF1P | |
276 | M | M2.Bov179ORF20415P (100% identity) | M2.Msa17ORFF1P | |
278 | M | M1.BccORFDP (100% identity) | M1.Msa17ORFF1P | |
281 | XRE family transcriptional regulator | |||
278 | SAM-dependent DNA methyltransferase | |||
281 | C | C.Bth576ORF2630P (100% identity) | C.Msa17ORFG1P | |
282 | 60 aa hypothetical protein | |||
980 | 134 aa hypothetical protein | |||
983 | S | S4.Bca43195FORF1915P (100% identity) | S1.Msa17ORFW1P | |
984 | Integrase/recombinase | |||
986 | S | S6.BccORFAP (44% identity) | S2.Msa17ORFW1P | |
987 | S | S2.Bca43195FORF1915P (100% identity) | S3.Msa17ORFW1P | |
988 | S | S1.BccORFAP (100% identity) | S4.Msa17ORFW1P | |
989 | M | M.BccORFAP (100% identity) | M.Msa17ORFW1P | |
991 | HNH endonuclease | |||
992 | R | BccORFAP (100% identity) | Msa17ORFW1P | |
997 | C | C.BccORFAP (100% identity) | C.Msa17ORFW1P | |
998 | 319 aa hypothetical protein C825_00593 | |||
1996 | restriction endonuclease subunit S | |||
1998 | R | Cae25986ORF10000P (100% identity) | Msa17ORFG2P | |
2004 | S | S3.Cae25986ORF10000P (100% identity) | S1.Msa17ORFG2P | |
2007 | 139 aa hypothetical protein | |||
2008 | tyrosine-type recombinase/integrase | |||
2009 | S | S1.Cae25986ORF10000P (81% identity) | S2.Msa17ORFG2P | |
2014 | M | M.Cae25986ORF10000P (100% identity) | M.Msa17ORFG2P | |
2022 | GIY-YIG nuclease family protein | |||
1164 | uncharacterized protein | |||
1170 | S | S.PmeBin13ORFGP (100% identity) | S.Msa17ORFSP | |
1172 | M | M.PmeBin13ORFGP (100% identity) | M.Msa17ORFSP | |
1173 | 77 aa hypothetical protein | |||
1252 | 155 aa hypothetical protein | |||
1253 | R | Bov975ORF2541P (100% identity) | Msa17ORFJ1P | |
1255 | M | M.BspD2aORF12550P (100% identity) | M.Msa17ORFJ1P | |
1256 | S | S1.Bov8483FORF11440P (100% identity) | S1.Msa17ORFJ1P | |
1257 | S | S2.Bov1235ORF21635P (91% identity) | S2.Msa17ORFJ1P | |
1258 | S | S4.Bov1516ORF20510P (53% identity) | S3.Msa17ORFJ1P | |
1259 | S | S1.Bov8483aORF12415P (93% identity) | S4.Msa17ORFJ1P | |
1260 | site-specific integrase | |||
1518 | ATP/GTP-binding protein | |||
??? | M | M1.Msa17ORFWP (99% identity) | M.Msa17ORFUP | |
1867 | DUF262 domain-containing protein | |||
1868 | R | BthVORF4518P (100% identity) | Msa17ORFY1P | |
1871 | M | M.BthVORF4518P (100% identity) | M.Msa17ORFY1P | |
1874 | S | S2.BthVORF4518P (100% identity) | S1.Msa17ORFY1P | |
1875 | S | S3.BthVORF4518P (83% identity) | S2.Msa17ORFY1P | |
1877 | S | S4.BthVORF4518P (100% identity) | S3.Msa17ORFY1P | |
1879 | type I restriction endonuclease EcoR124II | |||
1901 | XRE family transcriptional regulator | |||
1902 | R | BthVORF4538P (100% identity) | Msa17ORFB2P | |
1906 | 307 aa hypothetical protein | |||
1907 | 567 aa hypothetical protein CR159_21190 | |||
1908 | M | M.BthVORF4538P (100% identity) | M.Msa17ORFB2P | |
1911 | toxin-antitoxin system toxin component Fic family | |||
1913 | S | S.BthVORF4538AP (99% identity) | S1.Msa17ORFB2P | |
1915 | S | S.BthVORF4538BP (100% identity) | S2.Msa17ORFB2P | |
1916 | S | S.BthVORF4538CP (100% identity) | S3.Msa17ORFB2P | |
1918 | S | S.BthVORF4538DP (100% identity) | S4.Msa17ORFB2P | |
1919 | site-specific recombinase phage integrase family | |||
3361 | methylated adenine and cytosine restriction protein | |||
3362 | R | EcotolCORF4050P (100% identity) | AACNNNNNNGTGC | Msa17ORFC2P |
3370 | M | M.SenHNK130ORF17125P (100% identity) | AACNNNNNNGTGC | M.Msa17ORFC2P |
3374 | S | S.SenHNK130ORF17125P (100% identity) | AACNNNNNNGTGC | S.Msa17ORFC2P |
3375 | 114 aa hypothetical protein | |||
2922 | DUF262 domain-containing protein | |||
2923 | R | BccORFFP (100% identity) | Msa17ORFWP | |
2924 | S | S2.Bov8483aORF21285P (94% identity) | S1.Msa17ORFWP | |
2927 | integrase | |||
2928 | S | S1.Bun901ORF13185P (100% identity) | S2.Msa17ORFWP | |
??? | M | M2.Bun901ORF13185P (80% identity) | M2.Msa17ORFWP | |
2934 | M | M1.Bun901ORF13185P (100% identity) | M1.Msa17ORFWP | |
2935 | ATP/GTP-binding protein | |||
3970 | ATP-dependent RecD-like DNA helicase | |||
3974 | R | Pvu1098ORF11445P (100% identity) | Msa17ORFX1P | |
3977 | DUF262 domain-containing protein | |||
3978 | 114 aa hypothetical protein | |||
3979 | S | S.Pvu1098ORF11445P (100% identity) | S.Msa17ORFX1P | |
3980 | M | M.Pvu1098ORF11445P (100% identity) | M.Msa17ORFX1P | |
3982 | 128 aa hypothetical protein | |||
3983 | C | C.Bvu8492ORF9340P (100% identity) | C.Msa17ORFX1P | |
3986 | 86 aa hypothetical protein | |||
6864 | 382 aa hypothetical protein | |||
6865 | S | S2.Cae25986ORF5215P (100% identity) | S.Msa17ORFT1P | |
6868 | M | M.Caewa7ORFBP (100% identity) | M.Msa17ORFT1P | |
6873 | R | Cae25986ORF5215P (86% identity) | Msa17ORFT1P | |
6877 | 121 aa hypothetical protein | |||
4659 | 108 aa hypothetical protein | |||
4660 | S | S5.Pvu1098ORF10145P (94% identity) | S1.Msa17ORFA2P | |
4661 | IS66 family insertion sequence hypothetical protein | |||
4662 | 63 aa hypothetical protein | |||
4663 | IS66 Orf2 like protein | |||
4664 | IS66 family transposase | |||
4665 | 150 aa hypothetical protein | |||
4666 | 104 aa hypothetical protein | |||
4667 | S | S4.Bvu8492ORF2385P (100% identity) | S2.Msa17ORFA2P | |
4668 | S | S2.BcaH1617ORFCP (90% identity) | S3.Msa17ORFA2P | |
4670 | S | S4.Bvu8482ORF3687P (96% identity) | S4.Msa17ORFA2P | |
4672 | S | S1.Bvu8492ORF2385P (100% identity) | S5.Msa17ORFA2P | |
4673 | toxin-antitoxin system toxin component Fic family | |||
4674 | 102 aa hypothetical protein | |||
4676 | M | M1.Pvu1098ORF10145P (100% identity) | M.Msa17ORFA2P | |
4677 | R | Pvu1098ORF10145P (100% identity) | Msa17ORFA2P | |
4680 | C | C.Pvu1098ORF10145P (100% identity) | C.Msa17ORFA2P | |
4681 | 454 aa hypothetical protein | |||
5919 | 67 aa hypothetical protein HMPREF1065_01803 | |||
5920 | S | S5.Pvu1098ORF7795P (100% identity) | S1.Msa17ORFH2P | |
5922 | Integrase/recombinase | |||
5923 | type I restriction endonuclease subunit S | |||
5924 | S | S3.PvuC06ORF3690P (100% identity) | S2.Msa17ORFH2P | |
5925 | S | S2.Pvu1098ORF7795P (100% identity) | S3.Msa17ORFH2P | |
5927 | 376 aa hypothetical protein BACDOR_00640 | |||
5929 | M | M.Pvu201ORF22395P (100% identity) | M.Msa17ORFH2P | |
5931 | R | PvuC06ORF3690P (100% identity) | Msa17ORFH2P | |
5934 | C | C.Pvu201ORF22395P (100% identity) | C.Msa17ORFH2P | |
5935 | uncharacterized protein BN496_01499 | |||
Type II | ||||
ORF | Gene | Most similar | Specificity | Name |
7 | 147 aa hypothetical protein QAU_1028 | |||
9 | M | M.PmeBin85ORFGP (100% identity) | M.Msa17ORFDP | |
13 | 112 aa hypothetical protein BN3662_02704 | |||
21 | 77 aa hypothetical protein | |||
22 | M | M.Ebo1231ORF17050P (100% identity) | M.Msa17ORFMP | |
28 | 60 aa hypothetical protein | |||
48 | 149 aa hypothetical protein bolteae] | |||
51 | M | M.Ebo1231ORF6970P (100% identity) | M.Msa17ORFRP | |
54 | 66 aa hypothetical protein | |||
83 | 60 aa hypothetical protein | |||
84 | M | M.Cbo613ORF8710P (100% identity) | M.Msa17ORFPP | |
87 | 389 aa hypothetical protein | |||
63 | 80 aa hypothetical protein | |||
64 | R | Ebo2ORF19175P (94% identity) | Msa17ORFF2P | |
66 | M | M1.Ebo1231ORF8265P (100% identity) | M2.Msa17ORFF2P | |
67 | M | M2.Ebo1231ORF8265P (96% identity) | M1.Msa17ORFF2P | |
68 | DUF305 domain-containing protein | |||
69 | 477 aa hypothetical protein | |||
??? | RM | Ebo1231ORF19465P (100% identity) | Msa17ORFL1P | |
73 | S | S.Ebo1231ORF19465P (100% identity) | S.Msa17ORFL1P | |
74 | 67 aa hypothetical protein | |||
79 | 74 aa hypothetical protein CLOBOL_03905 bolteae ATCC BAA-613] | |||
??? | M | M.Ebo1231ORF30215P (100% identity) | M.Msa17ORFI1P | |
87 | 79 aa hypothetical protein | |||
77 | PrgI family protein dolichum] | |||
80 | M | M.PmeBin390ORFAP (76% identity) | M.Msa17ORFR1P | |
83 | group II intron reverse transcriptase/maturase gnavus] | |||
147 | 118 aa hypothetical protein AS222_06105 | |||
148 | M | M.Cbo613FORF10550P (100% identity) | M.Msa17ORFOP | |
150 | PrgI family protein | |||
190 | 143 aa hypothetical protein CLOSYM_00892 symbiosum ATCC 14940] | |||
194 | M | M.Ebo1231ORF20380P (100% identity) | M.Msa17ORFQP | |
196 | 95 aa hypothetical protein | |||
201 | 102 aa hypothetical protein | |||
202 | M | M.Ebo2ORF12460P (100% identity) | M.Msa17ORFC1P | |
205 | 303 aa hypothetical protein | |||
211 | 61 aa hypothetical protein | |||
213 | M | M.Ebo2ORF12480P (99% identity) | M.Msa17ORFD1P | |
215 | 67 aa hypothetical protein | |||
232 | 79 aa hypothetical protein | |||
236 | M | M.PmeBin61ORFCP (98% identity) | M.Msa17ORFLP | |
240 | 319 aa hypothetical protein | |||
271 | 235 aa hypothetical protein HMPREF0127_05255 | |||
274 | RM | PmeBin96ORFBP (99% identity) | Msa17ORFE1P | |
275 | 79 aa hypothetical protein | |||
430 | DNA-binding protein | |||
431 | M | M.Rgns29149ORF255P (100% identity) | M.Msa17ORFP1P | |
432 | GTPase subunit of restriction endonuclease | |||
345 | 161 aa hypothetical protein | |||
347 | M | M.Bov8483aORF1960P (100% identity) | M.Msa17ORFXP | |
348 | histidine kinase | |||
492 | 30S ribosomal protein S27e | |||
494 | M | M.Rgns29149ORF375P (100% identity) | M1.Msa17ORFQ1P | |
497 | M | M.Rgns29149ORF380P (100% identity) | M2.Msa17ORFQ1P | |
501 | 275 aa hypothetical protein C806_00750 | |||
458 | 462 aa hypothetical protein HMPREF2141_01675 | |||
459 | M | M.Pvu626ORF9540P (100% identity) | M.Msa17ORFU1P | |
460 | peptide ABC transporter ATP-binding protein | |||
464 | 101 aa hypothetical protein | |||
466 | M | M.BthI5482ORF9380P (100% identity) | M.Msa17ORFV1P | |
467 | site-specific integrase | |||
458 | 292 aa hypothetical protein | |||
459 | M | M.PmeBin96ORFEP (100% identity) | M.Msa17ORFJP | |
460 | DGQHR domain-containing protein | |||
891 | 71 aa hypothetical protein | |||
895 | RM | CaeORF1256P (100% identity) | Msa17ORFFP | |
900 | DUF262 domain-containing protein | |||
606 | 111 aa hypothetical protein | |||
607 | M | M.BccORFGP (100% identity) | M.Msa17ORFI2P | |
608 | 182 aa hypothetical protein | |||
620 | 184 aa hypothetical protein | |||
621 | M | M.BccORFHP (100% identity) | M.Msa17ORFJ2P | |
622 | phosphoadenosine phosphosulfate reductase | |||
884 | 93 aa hypothetical protein | |||
885 | M | M.PvuC06ORF3565P (99% identity) | M.Msa17ORFIP | |
887 | 73 aa hypothetical protein | |||
931 | 725 aa hypothetical protein A3D31_06155 | |||
932 | M | M.BthVORF2355P (100% identity) | M.Msa17ORFTP | |
933 | transcriptional regulator, AraC family, partial | |||
1173 | 137 aa hypothetical protein | |||
??? | M | M.Rgns29149ORF7275P (100% identity) | M.Msa17ORFK2P | |
1178 | 94 aa hypothetical protein | |||
805 | adenine-specific methyltransferase EcoRI family protein | |||
807 | M | M.BcaC61ORF867P (100% identity) | M.Msa17ORFHP | |
809 | MORN repeat protein | |||
911 | hybrid sensor histidine kinase/response regulator | |||
912 | M | M1.PmeBin7ORFAP (100% identity) | M1.Msa17ORFYP | |
913 | R | PmeBin7ORFAP (100% identity) | Msa17ORFYP | |
915 | M | M2.PmeBin7ORFAP (100% identity) | M2.Msa17ORFYP | |
916 | rubrerythrin | |||
1066 | 127 aa hypothetical protein | |||
1067 | RM | Bov8483FORF11035P (100% identity) | Msa17ORFH1P | |
1069 | 310 aa hypothetical protein BACOV975_02621 | |||
1615 | 80 aa hypothetical protein | |||
1616 | RM | BvuVIC01ORF2016P (100% identity) | Msa17ORFEP | |
1619 | DNA-processing protein DprA | |||
2125 | 142 aa hypothetical protein | |||
2126 | M | PxyORF2592P (94% identity) | M2.Msa17ORFE2P | |
2127 | RM | BthI5482ORF12450P (100% identity) | Msa17ORFE2P | |
2131 | SNF2/RAD54 family helicase | |||
2174 | integrase | |||
2175 | R | BccI (100% identity) | CCATC | Msa17ORFZ1P |
2176 | M | M2.BccI (100% identity) | CCATC | M2.Msa17ORFZ1P |
2177 | M | M1.PmeBin35ORFAP (100% identity) | CCATC | M1.Msa17ORFZ1P |
2178 | 72 aa hypothetical protein | |||
2432 | XRE family transcriptional regulator | |||
2433 | M | M.BthVORF4754P (100% identity) | M.Msa17ORFGP | |
2434 | histidine kinase | |||
2457 | site-specific integrase | |||
2459 | M | M.Bov8483aORF10050P (100% identity) | M.Msa17ORFM1P | |
2462 | 421 aa hypothetical protein BGL_2c26870 | |||
2462 | 421 aa hypothetical protein BGL_2c26870 | |||
2463 | RM | Bov8483FORF13845P (100% identity) | Msa17ORFN1P | |
2465 | site-specific recombinase, phage integrase family | |||
2530 | 201 aa hypothetical protein | |||
2533 | M | M.BunC05ORF4092P (100% identity) | M.Msa17ORFO1P | |
2535 | 283 aa hypothetical protein HMPREF0127_00400 | |||
4818 | 1222 aa hypothetical protein | |||
4820 | M | M.Pvu1098ORF9885P (100% identity) | M.Msa17ORFD2P | |
4822 | 134 aa hypothetical protein | |||
6054 | cell division protein DamX | |||
6057 | M | M.UbaC1152DamP (100% identity) | GATC | M.Msa17DamP |
6059 | Tryptophan--tRNA ligase | |||
6365 | 59 aa hypothetical protein SFxv_3614 | |||
6366 | M | M.SflSTLE4ORF2240P (100% identity) | ATGCAT | M.Msa17ORFBP |
6368 | tRNA-dihydrouridine synthase B | |||
9844 | HD domain protein | |||
9845 | M | M.SflLIN6DcmP (100% identity) | CCWGG | M.Msa17DcmP |
9847 | V | V.EcoTX1999Dcm2P (100% identity) | V.Msa17DcmP | |
9850 | YedA | |||
Type III | ||||
ORF | Gene | Most similar | Specificity | Name |
27 | TIR domain | |||
30 | M | M.Ebo1231ORF11650P (100% identity) | M.Msa17ORFB1P | |
32 | 413 aa hypothetical protein bolteae] | |||
33 | 109 aa hypothetical protein | |||
34 | R | Ebo1231ORF11650P (100% identity) | Msa17ORFB1P | |
39 | 117 aa hypothetical protein CLOBOL_05630 bolteae ATCC BAA-613] | |||
42 | 115 aa hypothetical protein | |||
43 | M | M.Cbo613ORF23625P (100% identity) | M.Msa17ORFNP | |
45 | DUF2634 domain-containing protein | |||
46 | 61 aa hypothetical protein | |||
47 | R | Ebo1231ORF21160P (100% identity) | Msa17ORFNP | |
52 | ISL3 family transposase | |||
Type IV | ||||
ORF | Gene | Most similar | Specificity | Name |
36 | anti-adapter protein IraM | |||
37 | R | SsoSE61ORF22640P (100% identity) | YCGR | Msa17ORFAP |
38 | pinE invertase/site-specific DNA recombinase | |||
76 | 85 aa hypothetical protein | |||
77 | R | Ebo1231MrrP (98% identity) | Msa17MrrP | |
79 | ABC transporter ATP-binding protein clostridioforme] | |||
200 | glutamate binding periplasmic protein | |||
201 | R | Rgns29149ORF3980P (100% identity) | Msa17ORFVP | |
202 | 73 aa hypothetical protein | |||
1502 | 80 aa hypothetical protein | |||
1503 | R | BccMcrBP (100% identity) | Msa17McrBP | |
1505 | R | PmeBin35McrCP (100% identity) | Msa17McrCP | |
1506 | uncharacterized protein BN535_01935 | |||
3377 | 100 aa hypothetical protein Ec53638_1442 | |||
3378 | R | EcoW12McrBP (100% identity) | Msa17McrB2P | |
3379 | R | EcoZK126McrCP (100% identity) | Msa17McrC2P | |
3380 | 931 aa hypothetical protein | |||
11881 | anti-adapter protein IraM | |||
11882 | R | SsoSE61ORF22640P (100% identity) | YCGR | Msa17ORFCP |
11883 | pinE invertase/site-specific DNA recombinase |