Distribution of restriction sites in the human genome

Enzyme:  BspCNI               Longest uncut segments
Specificity:  CTCAG               Repeats in uncut segments
Number of sites:  9393164               Genes in uncut segments
Mean distance between sites:  304 base pairs
Standard deviation:  357 base pairs
Site density3282.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   401340  chr6  NT_167244.1  2359954-2761294    0.01 % in   1 repeats    0.00 % in 0 genes
2   207927  chr6  NT_167244.1  4390013-4597940    0.03 % in   2 repeats    0.00 % in 0 genes
3   180951  chr6  NT_167244.1  3789706-3970657    0.18 % in   3 repeats    0.00 % in 0 genes
4   175433  chr6  NT_167244.1  3179953-3355386    0.11 % in   4 repeats    0.16 % in 1 genes
5   172389  chr6  NT_167247.1  4421988-4594377    0.07 % in   2 repeats    100.00 % in 1 genes
6   165307  chr6  NT_167249.1  2138069-2303376    0.09 % in   3 repeats    0.00 % in 0 genes
7   159622  chr6  NT_167248.1  521633-681255    0.20 % in   2 repeats    0.00 % in 0 genes
8   150753  chr9  NT_008470.19  21692871-21843624    0.26 % in   2 repeats    0.00 % in 0 genes
9   143652  chr6  NT_167244.1  2894138-3037790    0.35 % in   5 repeats    0.00 % in 0 genes
10   117945  chr6  NT_167245.1  2605856-2723801    0.38 % in   2 repeats    0.00 % in 0 genes
11   108380  chr6  NT_167245.1  138028-246408    0.44 % in   3 repeats    0.00 % in 0 genes
12   105714  chr6  NT_167244.1  588594-694308    1.26 % in   7 repeats    0.00 % in 0 genes
13   104642  chr6  NT_167244.1  1451534-1556176    0.05 % in   2 repeats    0.00 % in 0 genes
14   104503  chr6  NT_167244.1  1833243-1937746    0.27 % in   3 repeats    0.00 % in 0 genes
15   104359  chr6  NT_167244.1  3490097-3594456    1.15 % in   6 repeats    0.00 % in 0 genes
16   103152  chr7  NT_007933.15  68185868-68289020    1.84 % in   5 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
401340  chr6  NT_167244.1  2359954-2761294    1       AluSp (1) 
207927  chr6  NT_167244.1  4390013-4597940    2       AluSg/x (1)  AluJo (1) 
180951  chr6  NT_167244.1  3789706-3970657    3       MLT1H-int (1)  MER52D (1)  AluJb (1) 
175433  chr6  NT_167244.1  3179953-3355386    3       GC_rich (2)  (CCG)n (1)  AluSp (1) 
172389  chr6  NT_167247.1  4421988-4594377    2       MER11A (1)  AluSc (1) 
165307  chr6  NT_167249.1  2138069-2303376    3       L1MB8 (1)  AT_rich (1)  AluSx (1) 
159622  chr6  NT_167248.1  521633-681255    2       L1PREC2 (1)  HERVH-int (1) 
150753  chr9  NT_008470.19  21692871-21843624    2       MIR3 (1)  L1M5 (1) 
143652  chr6  NT_167244.1  2894138-3037790    5       L1MC5 (1)  AluY (1)  AluSg1 (1) 
10  117945  chr6  NT_167245.1  2605856-2723801    2       L2a (1)  L2 (1) 
11  108380  chr6  NT_167245.1  138028-246408    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
12  105714  chr6  NT_167244.1  588594-694308    5       L1MA9 (3)  L1PB1 (1)  L1P5 (1) 
13  104642  chr6  NT_167244.1  1451534-1556176    2       ERV3-16A3_I-int (1)  AluSg1 (1) 
14  104503  chr6  NT_167244.1  1833243-1937746    3       (TATG)n (1)  MIR (1)  AluSx (1) 
15  104359  chr6  NT_167244.1  3490097-3594456    5       L1M2 (2)  LTR78B (1)  AluSx (1) 
16  103152  chr7  NT_007933.15  68185868-68289020    4       L1PB1 (2)  (TC)n (1)  (TA)n (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   175433       chr6  NT_167244.1  3179953-3355386    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   172389       chr6  NT_167247.1  4421988-4594377    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011