Distribution of restriction sites in the human genome

Enzyme:  Tsp509I               Longest uncut segments
Specificity:  AATT               Repeats in uncut segments
Number of sites:  21456481               Genes in uncut segments
Mean distance between sites:  133 base pairs
Standard deviation:  181 base pairs
Site density7498.7 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   401765  chr6  NT_167244.1  2359644-2761409    0.08 % in   1 repeats    0.00 % in 0 genes
2   208006  chr6  NT_167244.1  4389990-4597996    0.07 % in   2 repeats    0.00 % in 0 genes
3   180446  chr6  NT_167244.1  3790207-3970653    0.07 % in   2 repeats    0.00 % in 0 genes
4   175904  chr6  NT_167244.1  3179432-3355336    0.10 % in   5 repeats    0.45 % in 1 genes
5   172743  chr6  NT_167247.1  4421552-4594295    0.02 % in   1 repeats    100.00 % in 1 genes
6   164648  chr6  NT_167249.1  2138383-2303031    0.01 % in   1 repeats    0.00 % in 0 genes
7   164419  chr6  NT_167247.1  1562415-1726834    0.02 % in   1 repeats    0.33 % in 1 genes
8   160412  chr6  NT_167248.1  521569-681981    0.69 % in   2 repeats    0.00 % in 0 genes
9   143501  chr6  NT_167244.1  2894088-3037589    0.25 % in   4 repeats    0.00 % in 0 genes
10   118175  chr6  NT_167245.1  2605686-2723861    0.55 % in   3 repeats    0.00 % in 0 genes
11   115853  chr6  NT_167247.1  1177429-1293282    0.20 % in   1 repeats    0.00 % in 0 genes
12   108729  chr6  NT_167245.1  137292-246021    0.76 % in   2 repeats    0.00 % in 0 genes
13   105074  chr6  NT_167244.1  1451169-1556243    0.44 % in   3 repeats    0.00 % in 0 genes
14   104524  chr6  NT_167244.1  1833290-1937814    0.34 % in   3 repeats    0.00 % in 0 genes
15   104452  chr6  NT_167244.1  588667-693119    0.14 % in   1 repeats    0.00 % in 0 genes
16   103276  chr6  NT_167244.1  3490830-3594106    0.23 % in   2 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
401765  chr6  NT_167244.1  2359644-2761409    1       AluSp (1) 
208006  chr6  NT_167244.1  4389990-4597996    2       AluSg/x (1)  AluJo (1) 
180446  chr6  NT_167244.1  3790207-3970653    2       MLT1H-int (1)  MER52D (1) 
175904  chr6  NT_167244.1  3179432-3355336    3       GC_rich (3)  (CCG)n (1)  AluSp (1) 
172743  chr6  NT_167247.1  4421552-4594295    1       AluSc (1) 
164648  chr6  NT_167249.1  2138383-2303031    1       AT_rich (1) 
164419  chr6  NT_167247.1  1562415-1726834    1       A-rich (1) 
160412  chr6  NT_167248.1  521569-681981    2       L1PREC2 (1)  HERVH-int (1) 
143501  chr6  NT_167244.1  2894088-3037589    4       L1MC5 (1)  AluY (1)  AluSg1 (1) 
10  118175  chr6  NT_167245.1  2605686-2723861    3       MLT1E2 (1)  L2a (1)  L2 (1) 
11  115853  chr6  NT_167247.1  1177429-1293282    1       ERV3-16A3_I-int (1) 
12  108729  chr6  NT_167245.1  137292-246021    2       MLT1E2 (1)  LTR12C (1) 
13  105074  chr6  NT_167244.1  1451169-1556243    3       ERV3-16A3_I-int (1)  AluY (1)  AluSg1 (1) 
14  104524  chr6  NT_167244.1  1833290-1937814    3       (TATG)n (1)  MIR (1)  AluSx (1) 
15  104452  chr6  NT_167244.1  588667-693119    1       L1MA9 (1) 
16  103276  chr6  NT_167244.1  3490830-3594106    2       L1M2 (1)  AluS (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   175904       chr6  NT_167244.1  3179432-3355336    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   172743       chr6  NT_167247.1  4421552-4594295    LOC100507722  hypothetical_protein_LOC100507722
7   164419       chr6  NT_167247.1  1562415-1726834    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011