Distribution of restriction sites in the human genome

Enzyme:  ApoI
Specificity:  RAATTY
Number of sites:  6220571
Mean distance between sites:  459 base pairs
Standard deviation:  592 base pairs
Site density:  2174.0 per megabase
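
The summary figures above can be reproduced in miniature. The following is a minimal sketch, not the method used to generate this report: it assumes sites are located by scanning for the degenerate pattern RAATTY (R = A or G, Y = C or T), that the distance between sites is measured from one site start to the next, and that the standard deviation is the population form. The function names and the toy sequence are hypothetical. As a consistency check on the reported values, 1,000,000 / 459 is about 2179 sites per megabase, close to the reported 2174.0.

```python
import re
import statistics

# ApoI recognition sequence RAATTY in IUPAC notation:
# R = A or G, Y = C or T. RAATTY cannot overlap itself,
# so a plain (non-lookahead) pattern finds every occurrence.
APOI_PATTERN = re.compile(r"[AG]AATT[CT]")


def site_positions(seq):
    """Return 0-based start positions of every RAATTY match in seq."""
    return [m.start() for m in APOI_PATTERN.finditer(seq.upper())]


def spacing_stats(positions, seq_length):
    """Summarise site spacing (assumed start-to-start) for one sequence."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "sites": len(positions),
        "mean_distance": statistics.mean(gaps),
        # Population standard deviation; the report does not say which form it uses.
        "std_dev": statistics.pstdev(gaps),
        "density_per_mb": len(positions) * 1_000_000 / seq_length,
    }


# Hypothetical toy sequence containing GAATTC, AAATTT and GAATTT.
toy = "GAATTCACGTAAATTTGGGGAATTT"
toy_positions = site_positions(toy)
toy_stats = spacing_stats(toy_positions, len(toy))
```

On a real chromosome the same functions would be applied per scaffold, with the gaps pooled before computing the mean and standard deviation.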


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within a distance of 7 standard deviations]


Longest uncut segments
#   Length (bp)  Chr   Scaffold      Coordinates        Repeat content          Gene content
1   401765       chr6  NT_167244.1   2359643-2761408    0.08 % in 1 repeats     0.00 % in 0 genes
2   208433       chr6  NT_167244.1   4389562-4597995    0.27 % in 4 repeats     0.00 % in 0 genes
3   180740       chr6  NT_167244.1   3789912-3970652    0.18 % in 3 repeats     0.00 % in 0 genes
4   177924       chr6  NT_167244.1   3179104-3357028    0.42 % in 7 repeats     1.10 % in 2 genes
5   172858       chr6  NT_167247.1   4421551-4594409    0.09 % in 2 repeats     100.00 % in 1 genes
6   165388       chr6  NT_167249.1   2138278-2303666    0.27 % in 5 repeats     0.00 % in 0 genes
7   164646       chr6  NT_167247.1   1562414-1727060    0.02 % in 1 repeats     0.33 % in 1 genes
8   160914       chr6  NT_167248.1   521066-681980      1.00 % in 2 repeats     0.00 % in 0 genes
9   157267       chr6  NT_167244.1   2008493-2165760    0.41 % in 4 repeats     0.00 % in 0 genes
10  150178       chr9  NT_008470.19  21693150-21843328  0.08 % in 1 repeats     0.00 % in 0 genes
11  145700       chr6  NT_167244.1   2891979-3037679    1.29 % in 9 repeats     0.00 % in 0 genes
12  118699       chr6  NT_167245.1   2605329-2724028    0.99 % in 3 repeats     0.00 % in 0 genes
13  117251       chr6  NT_167247.1   1176380-1293631    1.10 % in 1 repeats     0.00 % in 0 genes
14  108729       chr6  NT_167245.1   137291-246020      0.76 % in 2 repeats     0.00 % in 0 genes
15  108394       chr6  NT_167244.1   1832967-1941361    1.63 % in 8 repeats     0.00 % in 0 genes
16  105509       chr6  NT_167244.1   1451035-1556544    0.73 % in 3 repeats     0.00 % in 0 genes
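
Each row's length can be checked against its coordinates, since an uncut segment's length should equal end minus start. The sketch below assumes the whitespace-separated row layout shown in the table; the regular expression and the `parse_segment` helper are hypothetical, and only two rows are copied in for illustration.

```python
import re

# One row of the "Longest uncut segments" table, tolerant of variable spacing.
ROW = re.compile(
    r"^\s*(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>chr\S+)\s+(?P<scaffold>\S+)"
    r"\s+(?P<start>\d+)-(?P<end>\d+)"
    r"\s+(?P<repeat_pct>[\d.]+)\s*%\s+in\s+(?P<repeats>\d+)\s+repeats"
    r"\s+(?P<gene_pct>[\d.]+)\s*%\s+in\s+(?P<genes>\d+)\s+genes"
)


def parse_segment(line):
    """Parse one table row into a dict; return None if the line does not match."""
    m = ROW.match(line)
    if m is None:
        return None
    row = m.groupdict()
    for key in ("rank", "length", "start", "end", "repeats", "genes"):
        row[key] = int(row[key])
    for key in ("repeat_pct", "gene_pct"):
        row[key] = float(row[key])
    return row


# Two rows copied from the table above.
sample_rows = [
    "1   401765  chr6  NT_167244.1  2359643-2761408    0.08 % in   1 repeats    0.00 % in 0 genes",
    "10   150178  chr9  NT_008470.19  21693150-21843328    0.08 % in   1 repeats    0.00 % in 0 genes",
]
segments = [parse_segment(r) for r in sample_rows]

# Consistency check: reported length equals end - start for every parsed row.
lengths_consistent = all(s["end"] - s["start"] == s["length"] for s in segments)
```

Note that the check uses end minus start rather than end minus start plus one, because that is the convention the reported lengths follow.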


Repeats in longest uncut segments
#   Length (bp)  Chr   Scaffold      Coordinates        Total  Distinct  Most                 Second         Third
1   401765       chr6  NT_167244.1   2359643-2761408    1      1         AluSp (1)
2   208433       chr6  NT_167244.1   4389562-4597995    4      4         (TTCC)n (1)          MER57-int (1)  AluSg/x (1)
3   180740       chr6  NT_167244.1   3789912-3970652    3      3         MLT1H-int (1)        MER52D (1)     AluJb (1)
4   177924       chr6  NT_167244.1   3179104-3357028    7      5         GC_rich (3)          Charlie4a (1)  (CCG)n (1)
5   172858       chr6  NT_167247.1   4421551-4594409    2      2         MER11A (1)           AluSc (1)
6   165388       chr6  NT_167249.1   2138278-2303666    5      3         L1MB8 (2)            AluSx (2)      AT_rich (1)
7   164646       chr6  NT_167247.1   1562414-1727060    1      1         A-rich (1)
8   160914       chr6  NT_167248.1   521066-681980      2      2         L1PREC2 (1)          HERVH-int (1)
9   157267       chr6  NT_167244.1   2008493-2165760    4      4         MIRb (1)             MIR (1)        AluY (1)
10  150178       chr9  NT_008470.19  21693150-21843328  1      1         L1M5 (1)
11  145700       chr6  NT_167244.1   2891979-3037679    9      8         AluY (2)             (TG)n (1)      (TCC)n (1)
12  118699       chr6  NT_167245.1   2605329-2724028    3      3         MLT1E2 (1)           L2a (1)        L2 (1)
13  117251       chr6  NT_167247.1   1176380-1293631    1      1         ERV3-16A3_I-int (1)
14  108729       chr6  NT_167245.1   137291-246020      2      2         MLT1E2 (1)           LTR12C (1)
15  108394       chr6  NT_167244.1   1832967-1941361    8      7         AluSx (2)            (TATG)n (1)    MIR (1)
16  105509       chr6  NT_167244.1   1451035-1556544    3      3         ERV3-16A3_I-int (1)  AluY (1)       AluSg1 (1)

Total is the repeat count listed for the same segment under "Longest uncut segments"; Distinct is the number of different repeat types. Most, Second and Third are the three most frequent repeat types, with occurrence counts in parentheses.


Genes in longest uncut segments
Sgmnt  Length (bp)  Chr   Scaffold     Coordinates      Gene symbol   Gene function
4      177924       chr6  NT_167244.1  3179104-3357028  EHMT2         histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
                                                        TNXB          tenascin-X_isoform_1_precursor
5      172858       chr6  NT_167247.1  4421551-4594409  LOC100507722  hypothetical_protein_LOC100507722
7      164646       chr6  NT_167247.1  1562414-1727060  LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011