Distribution of restriction sites in the human genome

Enzyme:  MwoI               Longest uncut segments
Specificity:  GCNNNNNNNGC               Repeats in uncut segments
Number of sites:  6098164               Genes in uncut segments
Mean distance between sites:  469 base pairs
Standard deviation:  642 base pairs
Site density2131.2 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   489092  chr15  NT_037852.6  1397712-1886804    0.06 % in   4 repeats    0.00 % in 0 genes
2   402127  chr6  NT_167244.1  2359591-2761718    0.08 % in   1 repeats    0.00 % in 0 genes
3   208788  chr6  NT_167244.1  4389911-4598699    0.42 % in   5 repeats    0.00 % in 0 genes
4   180843  chr6  NT_167244.1  3790330-3971173    0.30 % in   3 repeats    0.00 % in 0 genes
5   175064  chr6  NT_167244.1  3180232-3355296    0.02 % in   2 repeats    0.00 % in 0 genes
6   172120  chr6  NT_167247.1  4422134-4594254    0.00 % in   1 repeats    100.00 % in 1 genes
7   166299  chr6  NT_167247.1  1560812-1727111    0.62 % in   3 repeats    1.29 % in 1 genes
8   165802  chr6  NT_167249.1  2137913-2303715    0.35 % in   6 repeats    0.00 % in 0 genes
9   159431  chr6  NT_167248.1  521850-681281    0.08 % in   2 repeats    0.00 % in 0 genes
10   151547  chr9  NT_008470.19  21692760-21844307    0.37 % in   3 repeats    0.00 % in 0 genes
11   143306  chr6  NT_167244.1  2894615-3037921    0.33 % in   5 repeats    0.00 % in 0 genes
12   118159  chr6  NT_167245.1  2605940-2724099    0.53 % in   3 repeats    0.00 % in 0 genes
13   116031  chr6  NT_167247.1  1176190-1292221    1.27 % in   2 repeats    0.00 % in 0 genes
14   113849  chr6  NT_167246.1  3261038-3374887    0.15 % in   2 repeats    0.00 % in 0 genes
15   108796  chr6  NT_167245.1  138027-246823    0.78 % in   4 repeats    0.00 % in 0 genes
16   106698  chr6  NT_167244.1  588591-695289    2.17 % in   7 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
489092  chr15  NT_037852.6  1397712-1886804    4       MIRc (1)  MIRb (1)  L1M3 (1) 
402127  chr6  NT_167244.1  2359591-2761718    1       AluSp (1) 
208788  chr6  NT_167244.1  4389911-4598699    4       AluSx (2)  L1MC (1)  AluSg/x (1) 
180843  chr6  NT_167244.1  3790330-3971173    3       MLT1H-int (1)  MER52D (1)  AluSc (1) 
175064  chr6  NT_167244.1  3180232-3355296    2       GC_rich (1)  AluSp (1) 
172120  chr6  NT_167247.1  4422134-4594254    1       AluSc (1) 
166299  chr6  NT_167247.1  1560812-1727111    3       MIRc (1)  L1MC3 (1)  A-rich (1) 
165802  chr6  NT_167249.1  2137913-2303715    4       L1MB8 (2)  AluSx (2)  L1MC4a (1) 
159431  chr6  NT_167248.1  521850-681281    2       L1PREC2 (1)  HERVH-int (1) 
10  151547  chr9  NT_008470.19  21692760-21844307    3       MIR3 (1)  LTR67B (1)  L1M5 (1) 
11  143306  chr6  NT_167244.1  2894615-3037921    5       L1MC5 (1)  AluY (1)  AluSp (1) 
12  118159  chr6  NT_167245.1  2605940-2724099    3       MLT1E2 (1)  L2a (1)  L2 (1) 
13  116031  chr6  NT_167247.1  1176190-1292221    1       ERV3-16A3_I-int (2) 
14  113849  chr6  NT_167246.1  3261038-3374887    2       MIRb (1)  AluSx (1) 
15  108796  chr6  NT_167245.1  138027-246823    4       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
16  106698  chr6  NT_167244.1  588591-695289    5       L1MA9 (3)  L1PB1 (1)  L1P5 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
6   172120       chr6  NT_167247.1  4422134-4594254    LOC100507722  hypothetical_protein_LOC100507722
7   166299       chr6  NT_167247.1  1560812-1727111    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011