Distribution of restriction sites in the human genome

Enzyme:  EcoT38I               Longest uncut segments
Specificity:  GRGCYC               Repeats in uncut segments
Number of sites:  2330338               Genes in uncut segments
Mean distance between sites:  1227 base pairs
Standard deviation:  1701 base pairs
Site density 814.4 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   490935  chr15  NT_037852.6  1397012-1887947    0.27 % in   9 repeats    0.00 % in 0 genes
2   401904  chr6  NT_167244.1  2359617-2761521    0.08 % in   1 repeats    0.00 % in 0 genes
3   213597  chr6  NT_167244.1  4384459-4598056    1.83 % in   11 repeats    0.00 % in 0 genes
4   184245  chr6  NT_167244.1  3787380-3971625    0.99 % in   10 repeats    0.06 % in 1 genes
5   175598  chr6  NT_167244.1  3180070-3355668    0.23 % in   5 repeats    0.09 % in 1 genes
6   174670  chr6  NT_167247.1  4421944-4596614    1.07 % in   6 repeats    100.00 % in 1 genes
7   168522  chr6  NT_167249.1  2137109-2305631    1.69 % in   11 repeats    0.00 % in 0 genes
8   166459  chr6  NT_167247.1  1562927-1729386    0.60 % in   6 repeats    0.02 % in 1 genes
9   159612  chr6  NT_167248.1  521723-681335    0.19 % in   2 repeats    0.00 % in 0 genes
10   156069  chr6  NT_167244.1  2008868-2164937    0.18 % in   2 repeats    0.00 % in 0 genes
11   151927  chr9  NT_008470.19  21691699-21843626    0.78 % in   5 repeats    0.00 % in 0 genes
12   144045  chr6  NT_167244.1  2893724-3037769    0.48 % in   6 repeats    0.00 % in 0 genes
13   119119  chr6  NT_167245.1  2604987-2724106    1.27 % in   4 repeats    0.00 % in 0 genes
14   116804  chr6  NT_167247.1  1175457-1292261    1.88 % in   3 repeats    0.00 % in 0 genes
15   115061  chr6  NT_167246.1  3260330-3375391    0.56 % in   4 repeats    0.00 % in 0 genes
16   110846  chr6  NT_167245.1  137961-248807    2.01 % in   10 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
490935  chr15  NT_037852.6  1397012-1887947    9       MLT1L (1)  MIRc (1)  MIRb (1) 
401904  chr6  NT_167244.1  2359617-2761521    1       AluSp (1) 
213597  chr6  NT_167244.1  4384459-4598056    11  8       MER57-int (3)  AluY (2)  (TTTTA)n (1) 
184245  chr6  NT_167244.1  3787380-3971625    10  9       AT_rich (2)  MLT1H-int (1)  MIR (1) 
175598  chr6  NT_167244.1  3180070-3355668    4       GC_rich (2)  Charlie4a (1)  (CCG)n (1) 
174670  chr6  NT_167247.1  4421944-4596614    6       (TTAAA)n (1)  MLT1J (1)  MER11A (1) 
168522  chr6  NT_167249.1  2137109-2305631    11  7       L1MB8 (3)  AluSx (3)  MIR (1) 
166459  chr6  NT_167247.1  1562927-1729386    4       MIR (2)  L1MEe (2)  (GGAA)n (1) 
159612  chr6  NT_167248.1  521723-681335    2       L1PREC2 (1)  HERVH-int (1) 
10  156069  chr6  NT_167244.1  2008868-2164937    2       MIRb (1)  MIR (1) 
11  151927  chr9  NT_008470.19  21691699-21843626    4       LTR67B (2)  MSTA (1)  MIR3 (1) 
12  144045  chr6  NT_167244.1  2893724-3037769    6       (TCC)n (1)  L1MC5 (1)  AluY (1) 
13  119119  chr6  NT_167245.1  2604987-2724106    4       MLT1E2 (1)  MER5A1 (1)  L2a (1) 
14  116804  chr6  NT_167247.1  1175457-1292261    2       ERV3-16A3_I-int (2)  LTR16B2 (1) 
15  115061  chr6  NT_167246.1  3260330-3375391    3       MIRb (2)  MIR3 (1)  AluSx (1) 
16  110846  chr6  NT_167245.1  137961-248807    10  9       L2c (2)  (TTTC)n (1)  MLT1F (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   184245       chr6  NT_167244.1  3787380-3971625    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
5   175598       chr6  NT_167244.1  3180070-3355668    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   174670       chr6  NT_167247.1  4421944-4596614    LOC100507722  hypothetical_protein_LOC100507722
8   166459       chr6  NT_167247.1  1562927-1729386    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011