Distribution of restriction sites in the human genome

Enzyme:  TspRI               Longest uncut segments
Specificity:  CASTG               Repeats in uncut segments
Number of sites:  8668095               Genes in uncut segments
Mean distance between sites:  330 base pairs
Standard deviation:  361 base pairs
Site density3029.4 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   401287  chr6  NT_167244.1  2359960-2761247    0.00 % in   1 repeats    0.00 % in 0 genes
2   208192  chr6  NT_167244.1  4389976-4598168    0.14 % in   2 repeats    0.00 % in 0 genes
3   180429  chr6  NT_167244.1  3790331-3970760    0.09 % in   2 repeats    0.00 % in 0 genes
4   176170  chr6  NT_167244.1  3179257-3355427    0.15 % in   5 repeats    0.55 % in 1 genes
5   173398  chr6  NT_167247.1  4421676-4595074    0.47 % in   2 repeats    100.00 % in 1 genes
6   165344  chr6  NT_167249.1  2137959-2303303    0.07 % in   3 repeats    0.00 % in 0 genes
7   164723  chr6  NT_167247.1  1562058-1726781    0.09 % in   2 repeats    0.54 % in 1 genes
8   159420  chr6  NT_167248.1  521833-681253    0.07 % in   2 repeats    0.00 % in 0 genes
9   150710  chr9  NT_008470.19  21692905-21843615    0.24 % in   1 repeats    0.00 % in 0 genes
10   143266  chr6  NT_167244.1  2894483-3037749    0.24 % in   4 repeats    0.00 % in 0 genes
11   118255  chr6  NT_167245.1  2606211-2724466    0.61 % in   4 repeats    0.00 % in 0 genes
12   114886  chr6  NT_167247.1  1177604-1292490    0.05 % in   1 repeats    0.00 % in 0 genes
13   108504  chr6  NT_167245.1  137608-246112    0.55 % in   2 repeats    0.00 % in 0 genes
14   105238  chr6  NT_167244.1  588032-693270    0.46 % in   4 repeats    0.00 % in 0 genes
15   104668  chr6  NT_167244.1  1451522-1556190    0.07 % in   2 repeats    0.00 % in 0 genes
16   104264  chr6  NT_167244.1  1833640-1937904    0.21 % in   1 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
401287  chr6  NT_167244.1  2359960-2761247    1       AluSp (1) 
208192  chr6  NT_167244.1  4389976-4598168    2       AluSg/x (1)  AluJo (1) 
180429  chr6  NT_167244.1  3790331-3970760    2       MLT1H-int (1)  MER52D (1) 
176170  chr6  NT_167244.1  3179257-3355427    3       GC_rich (3)  (CCG)n (1)  AluSp (1) 
173398  chr6  NT_167247.1  4421676-4595074    2       MER11A (1)  AluSc (1) 
165344  chr6  NT_167249.1  2137959-2303303    3       L1MC4a (1)  L1MB8 (1)  AT_rich (1) 
164723  chr6  NT_167247.1  1562058-1726781    2       L1MC3 (1)  A-rich (1) 
159420  chr6  NT_167248.1  521833-681253    2       L1PREC2 (1)  HERVH-int (1) 
150710  chr9  NT_008470.19  21692905-21843615    1       L1M5 (1) 
10  143266  chr6  NT_167244.1  2894483-3037749    4       L1MC5 (1)  AluY (1)  AluSg1 (1) 
11  118255  chr6  NT_167245.1  2606211-2724466    3       L2 (2)  MLT1E2 (1)  L2a (1) 
12  114886  chr6  NT_167247.1  1177604-1292490    1       ERV3-16A3_I-int (1) 
13  108504  chr6  NT_167245.1  137608-246112    2       MLT1E2 (1)  LTR12C (1) 
14  105238  chr6  NT_167244.1  588032-693270    4       L1PB1 (1)  L1ME3D (1)  L1MA9 (1) 
15  104668  chr6  NT_167244.1  1451522-1556190    2       ERV3-16A3_I-int (1)  AluSg1 (1) 
16  104264  chr6  NT_167244.1  1833640-1937904    1       AluSx (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   176170       chr6  NT_167244.1  3179257-3355427    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   173398       chr6  NT_167247.1  4421676-4595074    LOC100507722  hypothetical_protein_LOC100507722
7   164723       chr6  NT_167247.1  1562058-1726781    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011