Distribution of restriction sites in the human genome

Enzyme:  SuaI               Longest uncut segments
Specificity:  GGCC               Repeats in uncut segments
Number of sites:  8350301               Genes in uncut segments
Mean distance between sites:  342 base pairs
Standard deviation:  478 base pairs
Site density2918.3 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487930  chr15  NT_037852.6  1397711-1885641    0.06 % in   4 repeats    0.00 % in 0 genes
2   401264  chr6  NT_167244.1  2359970-2761234    0.00 % in   1 repeats    0.00 % in 0 genes
3   209576  chr6  NT_167244.1  4388350-4597926    0.81 % in   6 repeats    0.00 % in 0 genes
4   159314  chr6  NT_167248.1  521896-681210    0.00 % in   1 repeats    0.00 % in 0 genes
5   150684  chr9  NT_008470.19  21693060-21843744    0.18 % in   2 repeats    0.00 % in 0 genes
6   142914  chr6  NT_167244.1  2894569-3037483    0.06 % in   2 repeats    0.00 % in 0 genes
7   107909  chr6  NT_167245.1  138051-245960    0.00 % in   1 repeats    0.00 % in 0 genes
8   104920  chr6  NT_167244.1  1451577-1556497    0.31 % in   2 repeats    0.00 % in 0 genes
9   103039  chr6  NT_167244.1  3491062-3594101    0.00 % in   1 repeats    0.00 % in 0 genes
10   101780  chr9  NT_008470.19  21506738-21608518    1.10 % in   6 repeats    0.00 % in 0 genes
11   101317  chr7  NT_007933.15  68186605-68287922    0.55 % in   1 repeats    0.00 % in 0 genes
12   100616  chr6  NT_025741.15  72111287-72211903    0.36 % in   2 repeats    0.00 % in 0 genes
13   100318  chr5  NW_003315917.1  1145801-1246119    0.28 % in   2 repeats    0.00 % in 0 genes
14   100245  chr1  NT_167185.1  38191-138436    0.10 % in   2 repeats    0.00 % in 0 genes
15   99456  chr6  NT_167247.1  2712447-2811903    0.01 % in   1 repeats    0.00 % in 0 genes
16   93672  chrY  NT_011875.12  8558349-8652021    46.56 % in   5 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
487930  chr15  NT_037852.6  1397711-1885641    4       MIRc (1)  MIRb (1)  L1M3 (1) 
401264  chr6  NT_167244.1  2359970-2761234    1       AluSp (1) 
209576  chr6  NT_167244.1  4388350-4597926    5       MER57-int (2)  (TTCC)n (1)  AluY (1) 
159314  chr6  NT_167248.1  521896-681210    1       HERVH-int (1) 
150684  chr9  NT_008470.19  21693060-21843744    2       MIR3 (1)  L1M5 (1) 
142914  chr6  NT_167244.1  2894569-3037483    2       AluY (1)  AluSg1 (1) 
107909  chr6  NT_167245.1  138051-245960    1       LTR12C (1) 
104920  chr6  NT_167244.1  1451577-1556497    2       ERV3-16A3_I-int (1)  AluSg1 (1) 
103039  chr6  NT_167244.1  3491062-3594101    1       AluS (1) 
10  101780  chr9  NT_008470.19  21506738-21608518    6       L2a (1)  L1ME1 (1)  Charlie4a (1) 
11  101317  chr7  NT_007933.15  68186605-68287922    1       L1PB1 (1) 
12  100616  chr6  NT_025741.15  72111287-72211903    2       L1ME1 (1)  (CCA)n (1) 
13  100318  chr5  NW_003315917.1  1145801-1246119    1       AluSg (2) 
14  100245  chr1  NT_167185.1  38191-138436    2       AluY (1)  AluSg (1) 
15  99456  chr6  NT_167247.1  2712447-2811903    1       AluJb (1) 
16  93672  chrY  NT_011875.12  8558349-8652021    1       LTR12B (5) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 



Posfai@neb.com
May 11, 2011