Distribution of restriction sites in the human genome

Enzyme:  HpyHI
Specificity:  CTNAG
Number of sites:  13860143
Mean distance between sites:  206 base pairs
Standard deviation:  222 base pairs
Site density:  4843.9 per megabase
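
The summary figures are tied together by simple arithmetic: a density of 4843.9 sites per megabase corresponds to a mean spacing of about 10^6 / 4843.9 ≈ 206 base pairs. The sketch below shows one way such statistics could be computed for the CTNAG pattern on a single sequence. It is only an illustration: the function names are hypothetical, distances are measured between the start positions of consecutive recognition sites, and the population standard deviation is used. How the report itself measures distances and handles scaffold boundaries is not stated here. Because CTNAG is its own reverse complement, scanning one strand is enough.

```python
import re
import statistics

# CTNAG, where N is any of the four bases (assumption: unambiguous A/C/G/T only)
SITE = re.compile(r"CT[ACGT]AG")

def site_positions(seq):
    """Return the 0-based start position of every CTNAG site in seq."""
    return [m.start() for m in SITE.finditer(seq.upper())]

def site_stats(seq):
    """Summarize site count, spacing, and density for one sequence."""
    positions = site_positions(seq)
    # Distances between consecutive site starts
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "sites": len(positions),
        "mean_gap": statistics.mean(gaps) if gaps else None,
        "std_gap": statistics.pstdev(gaps) if gaps else None,
        "density_per_mb": len(positions) / len(seq) * 1_000_000 if seq else 0.0,
    }

if __name__ == "__main__":
    demo = "CTAAG" + "A" * 5 + "CTGAG" + "A" * 15 + "CTTAG"
    print(site_stats(demo))
```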


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   401340  chr6  NT_167244.1  2359954-2761294    0.01 % in   1 repeats    0.00 % in 0 genes
2   207927  chr6  NT_167244.1  4390013-4597940    0.03 % in   2 repeats    0.00 % in 0 genes
3   180408  chr6  NT_167244.1  3790249-3970657    0.07 % in   2 repeats    0.00 % in 0 genes
4   175433  chr6  NT_167244.1  3179953-3355386    0.11 % in   4 repeats    0.16 % in 1 genes
5   172373  chr6  NT_167247.1  4421988-4594361    0.06 % in   2 repeats    100.00 % in 1 genes
6   165307  chr6  NT_167249.1  2138069-2303376    0.09 % in   3 repeats    0.00 % in 0 genes
7   159459  chr6  NT_167248.1  521796-681255    0.09 % in   2 repeats    0.00 % in 0 genes
8   143652  chr6  NT_167244.1  2894138-3037790    0.35 % in   5 repeats    0.00 % in 0 genes
9   117945  chr6  NT_167245.1  2605856-2723801    0.38 % in   2 repeats    0.00 % in 0 genes
10   108380  chr6  NT_167245.1  138028-246408    0.44 % in   3 repeats    0.00 % in 0 genes
11   104655  chr6  NT_167244.1  588631-693286    0.30 % in   2 repeats    0.00 % in 0 genes
12   104642  chr6  NT_167244.1  1451534-1556176    0.05 % in   2 repeats    0.00 % in 0 genes
13   104503  chr6  NT_167244.1  1833243-1937746    0.27 % in   3 repeats    0.00 % in 0 genes
14   104029  chr6  NT_167244.1  3490427-3594456    0.83 % in   4 repeats    0.00 % in 0 genes
15   100705  chr9  NT_008470.19  21507722-21608427    0.63 % in   4 repeats    0.00 % in 0 genes
16   100541  chr6  NT_025741.15  72111419-72211960    0.23 % in   1 repeats    0.00 % in 0 genes
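
In the table above, each length equals the difference between the two coordinates (for example, 2761294 - 2359954 = 401340). A minimal sketch of how a ranking like this could be derived from a list of site positions follows. The function name and the treatment of the sequence ends as segment boundaries are assumptions made for illustration, not a description of how the report was generated.

```python
def longest_uncut(positions, seq_len, top=16):
    """Return up to `top` (length, start, end) tuples, longest first.

    positions: 0-based site positions on one sequence (any order)
    seq_len:   total length of that sequence
    Assumption: the sequence start and end also bound a segment.
    """
    bounds = [0] + sorted(positions) + [seq_len]
    segments = [
        (end - start, start, end)
        for start, end in zip(bounds, bounds[1:])
        if end > start  # skip empty segments, e.g. a site at position 0
    ]
    return sorted(segments, reverse=True)[:top]

if __name__ == "__main__":
    for length, start, end in longest_uncut([0, 10, 30], seq_len=35, top=2):
        print(length, f"{start}-{end}")
```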


Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats: Total  Distinct  Most  Second  Third
1   401340  chr6  NT_167244.1  2359954-2761294    1       AluSp (1) 
2   207927  chr6  NT_167244.1  4390013-4597940    2       AluSg/x (1)  AluJo (1) 
3   180408  chr6  NT_167244.1  3790249-3970657    2       MLT1H-int (1)  MER52D (1) 
4   175433  chr6  NT_167244.1  3179953-3355386    3       GC_rich (2)  (CCG)n (1)  AluSp (1) 
5   172373  chr6  NT_167247.1  4421988-4594361    2       MER11A (1)  AluSc (1) 
6   165307  chr6  NT_167249.1  2138069-2303376    3       L1MB8 (1)  AT_rich (1)  AluSx (1) 
7   159459  chr6  NT_167248.1  521796-681255    2       L1PREC2 (1)  HERVH-int (1) 
8   143652  chr6  NT_167244.1  2894138-3037790    5       L1MC5 (1)  AluY (1)  AluSg1 (1) 
9   117945  chr6  NT_167245.1  2605856-2723801    2       L2a (1)  L2 (1) 
10  108380  chr6  NT_167245.1  138028-246408    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
11  104655  chr6  NT_167244.1  588631-693286    2       L1PB1 (1)  L1MA9 (1) 
12  104642  chr6  NT_167244.1  1451534-1556176    2       ERV3-16A3_I-int (1)  AluSg1 (1) 
13  104503  chr6  NT_167244.1  1833243-1937746    3       (TATG)n (1)  MIR (1)  AluSx (1) 
14  104029  chr6  NT_167244.1  3490427-3594456    4       LTR78B (1)  L1M2 (1)  AluSg (1) 
15  100705  chr9  NT_008470.19  21507722-21608427    4       L1ME1 (1)  Charlie4a (1)  AluSx (1) 
16  100541  chr6  NT_025741.15  72111419-72211960    1       (CCA)n (1) 


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4        175433       chr6  NT_167244.1  3179953-3355386    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
5        172373       chr6  NT_167247.1  4421988-4594361    LOC100507722  hypothetical protein LOC100507722



Posfai@neb.com
May 11, 2011