Distribution of restriction sites in the human genome

Enzyme:  TspGWI               Longest uncut segments
Specificity:  ACGGA               Repeats in uncut segments
Number of sites:  994136               Genes in uncut segments
Mean distance between sites:  2878 base pairs
Standard deviation:  3333 base pairs
Site density 347.4 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   490468  chr15  NT_037852.6  1398150-1888618    0.23 % in   6 repeats    0.00 % in 0 genes
2   404337  chr6  NT_167244.1  2359452-2763789    0.12 % in   2 repeats    0.00 % in 0 genes
3   210506  chr6  NT_167244.1  4389533-4600039    0.87 % in   11 repeats    0.00 % in 0 genes
4   182926  chr6  NT_167244.1  3787887-3970813    0.58 % in   7 repeats    0.00 % in 0 genes
5   180555  chr6  NT_167247.1  1561002-1741557    6.03 % in   36 repeats    1.08 % in 1 genes
6   177253  chr6  NT_167244.1  3179984-3357237    0.53 % in   8 repeats    0.73 % in 2 genes
7   174632  chr6  NT_167247.1  4421976-4596608    1.07 % in   6 repeats    100.00 % in 1 genes
8   169186  chr6  NT_167248.1  521561-690747    3.11 % in   8 repeats    0.67 % in 1 genes
9   165601  chr6  NT_167249.1  2137730-2303331    0.22 % in   4 repeats    0.00 % in 0 genes
10   157608  chr6  NT_167244.1  2007908-2165516    0.77 % in   5 repeats    0.00 % in 0 genes
11   156024  chr9  NT_008470.19  21692565-21848589    1.40 % in   10 repeats    0.00 % in 0 genes
12   150232  chr6  NT_167244.1  2889278-3039510    4.00 % in   30 repeats    0.00 % in 0 genes
13   120835  chr6  NT_167245.1  2605231-2726066    2.32 % in   5 repeats    0.00 % in 0 genes
14   120428  chr6  NT_167246.1  3259303-3379731    2.21 % in   16 repeats    0.00 % in 0 genes
15   115357  chr6  NT_167244.1  3485060-3600417    5.93 % in   22 repeats    0.00 % in 0 genes
16   115120  chr6  NT_167247.1  1177330-1292450    0.29 % in   1 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
490468  chr15  NT_037852.6  1398150-1888618    5       L2a (2)  (TA)n (1)  MLT1L (1) 
404337  chr6  NT_167244.1  2359452-2763789    2       L1MEg (1)  AluSp (1) 
210506  chr6  NT_167244.1  4389533-4600039    11  10       AluSx (2)  (TTCC)n (1)  MER57-int (1) 
182926  chr6  NT_167244.1  3787887-3970813    6       AT_rich (2)  MLT1H-int (1)  MIR (1) 
180555  chr6  NT_167247.1  1561002-1741557    36  23       L1PB2 (4)  L1MEf (3)  MSTB (2) 
177253  chr6  NT_167244.1  3179984-3357237    6       GC_rich (2)  AluSp (2)  L2c (1) 
174632  chr6  NT_167247.1  4421976-4596608    6       (TTAAA)n (1)  MLT1J (1)  MER11A (1) 
169186  chr6  NT_167248.1  521561-690747    5       AT_rich (4)  MLT1G3 (1)  L1PREC2 (1) 
165601  chr6  NT_167249.1  2137730-2303331    4       L1MC4a (1)  L1MB8 (1)  AT_rich (1) 
10  157608  chr6  NT_167244.1  2007908-2165516    4       AluSx (2)  MIRb (1)  MIR (1) 
11  156024  chr9  NT_008470.19  21692565-21848589    10  8       MIRb (2)  L2 (2)  MLT1A (1) 
12  150232  chr6  NT_167244.1  2889278-3039510    30  16       L1MC5 (4)  AluY (4)  AluSc (3) 
13  120835  chr6  NT_167245.1  2605231-2726066    4       L2 (2)  MLT1E2 (1)  L2a (1) 
14  120428  chr6  NT_167246.1  3259303-3379731    16  11       L1MC5 (3)  AluSx (3)  MIRb (2) 
15  115357  chr6  NT_167244.1  3485060-3600417    22  14       L1M2 (4)  AluSx (3)  AluSg (3) 
16  115120  chr6  NT_167247.1  1177330-1292450    1       ERV3-16A3_I-int (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   180555       chr6  NT_167247.1  1561002-1741557    LOC100421582  tripartite_motif-containing_protein_26
6   177253       chr6  NT_167244.1  3179984-3357237    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
7   174632       chr6  NT_167247.1  4421976-4596608    LOC100507722  hypothetical_protein_LOC100507722
8   169186       chr6  NT_167248.1  521561-690747    OR12D1P 



Posfai@neb.com
May 11, 2011