Distribution of restriction sites in the human genome

Enzyme:  NcuI               Longest uncut segments
Specificity:  GAAGA               Repeats in uncut segments
Number of sites:  8787331               Genes in uncut segments
Mean distance between sites:  325 base pairs
Standard deviation:  371 base pairs
Site density3071.0 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487025  chr15  NT_037852.6  1398804-1885829    0.01 % in   1 repeats    0.00 % in 0 genes
2   401951  chr6  NT_167244.1  2359394-2761345    0.08 % in   2 repeats    0.00 % in 0 genes
3   209355  chr6  NT_167244.1  4389012-4598367    0.69 % in   7 repeats    0.00 % in 0 genes
4   180876  chr6  NT_167244.1  3789766-3970642    0.17 % in   3 repeats    0.00 % in 0 genes
5   176360  chr6  NT_167244.1  3179754-3356114    0.26 % in   6 repeats    0.27 % in 1 genes
6   172821  chr6  NT_167247.1  4422148-4594969    0.41 % in   2 repeats    100.00 % in 1 genes
7   164874  chr6  NT_167249.1  2138175-2303049    0.03 % in   1 repeats    0.00 % in 0 genes
8   164615  chr6  NT_167247.1  1562129-1726744    0.05 % in   2 repeats    0.50 % in 1 genes
9   159837  chr6  NT_167248.1  521675-681512    0.33 % in   2 repeats    0.00 % in 0 genes
10   156851  chr6  NT_167244.1  2007747-2164598    0.79 % in   5 repeats    0.00 % in 0 genes
11   143883  chr6  NT_167244.1  2894522-3038405    0.65 % in   7 repeats    0.00 % in 0 genes
12   118165  chr6  NT_167245.1  2606157-2724322    0.54 % in   3 repeats    0.00 % in 0 genes
13   114843  chr6  NT_167247.1  1177599-1292442    0.06 % in   1 repeats    0.00 % in 0 genes
14   114111  chr6  NT_167246.1  3261136-3375247    0.41 % in   3 repeats    0.00 % in 0 genes
15   110130  chr6  NT_167245.1  137191-247321    1.86 % in   5 repeats    0.00 % in 0 genes
16   106022  chr6  NT_167244.1  1451111-1557133    1.26 % in   5 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
487025  chr15  NT_037852.6  1398804-1885829    1       AT_rich (1) 
401951  chr6  NT_167244.1  2359394-2761345    2       L4 (1)  AluSp (1) 
209355  chr6  NT_167244.1  4389012-4598367    6       MER57-int (2)  (TTCC)n (1)  AluY (1) 
180876  chr6  NT_167244.1  3789766-3970642    3       MLT1H-int (1)  MER52D (1)  AluJb (1) 
176360  chr6  NT_167244.1  3179754-3356114    4       GC_rich (3)  Charlie4a (1)  (CCG)n (1) 
172821  chr6  NT_167247.1  4422148-4594969    2       MER11A (1)  AluSc (1) 
164874  chr6  NT_167249.1  2138175-2303049    1       AT_rich (1) 
164615  chr6  NT_167247.1  1562129-1726744    2       L1MC3 (1)  A-rich (1) 
159837  chr6  NT_167248.1  521675-681512    2       L1PREC2 (1)  HERVH-int (1) 
10  156851  chr6  NT_167244.1  2007747-2164598    4       AluSx (2)  MIRb (1)  MIR (1) 
11  143883  chr6  NT_167244.1  2894522-3038405    5       L1MC5 (2)  AluJo (2)  AluY (1) 
12  118165  chr6  NT_167245.1  2606157-2724322    3       MLT1E2 (1)  L2a (1)  L2 (1) 
13  114843  chr6  NT_167247.1  1177599-1292442    1       ERV3-16A3_I-int (1) 
14  114111  chr6  NT_167246.1  3261136-3375247    2       MIRb (2)  AluSx (1) 
15  110130  chr6  NT_167245.1  137191-247321    5       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
16  106022  chr6  NT_167244.1  1451111-1557133    5       L1MA1 (1)  ERV3-16A3_I-int (1)  AT_rich (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   176360       chr6  NT_167244.1  3179754-3356114    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   172821       chr6  NT_167247.1  4422148-4594969    LOC100507722  hypothetical_protein_LOC100507722
8   164615       chr6  NT_167247.1  1562129-1726744    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011