Distribution of restriction sites in the human genome

Enzyme:  TdeIII               Longest uncut segments
Specificity:  GGNCC               Repeats in uncut segments
Number of sites:  6290916               Genes in uncut segments
Mean distance between sites:  454 base pairs
Standard deviation:  700 base pairs
Site density2198.6 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   488935  chr15  NT_037852.6  1396668-1885603    0.18 % in   6 repeats    0.00 % in 0 genes
2   401786  chr6  NT_167244.1  2359504-2761290    0.08 % in   1 repeats    0.00 % in 0 genes
3   209499  chr6  NT_167244.1  4389632-4599131    0.71 % in   8 repeats    0.00 % in 0 genes
4   181578  chr6  NT_167244.1  3790033-3971611    0.65 % in   6 repeats    0.00 % in 0 genes
5   175828  chr6  NT_167244.1  3180230-3356058    0.20 % in   3 repeats    0.00 % in 0 genes
6   172529  chr6  NT_167247.1  4421893-4594422    0.10 % in   2 repeats    100.00 % in 1 genes
7   167369  chr6  NT_167249.1  2138463-2305832    1.30 % in   9 repeats    0.00 % in 0 genes
8   159436  chr6  NT_167248.1  521878-681314    0.08 % in   2 repeats    0.00 % in 0 genes
9   151037  chr9  NT_008470.19  21692640-21843677    0.45 % in   3 repeats    0.00 % in 0 genes
10   143521  chr6  NT_167244.1  2894289-3037810    0.29 % in   4 repeats    0.00 % in 0 genes
11   118344  chr6  NT_167245.1  2605743-2724087    0.69 % in   3 repeats    0.00 % in 0 genes
12   114796  chr6  NT_167247.1  1177472-1292268    0.17 % in   1 repeats    0.00 % in 0 genes
13   113970  chr6  NT_167246.1  3261230-3375200    0.41 % in   3 repeats    0.00 % in 0 genes
14   108104  chr6  NT_167245.1  137962-246066    0.19 % in   2 repeats    0.00 % in 0 genes
15   106210  chr6  NT_167244.1  587381-693591    1.16 % in   6 repeats    0.00 % in 0 genes
16   105683  chr5  NW_003315917.1  1145801-1251484    1.63 % in   10 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
488935  chr15  NT_037852.6  1396668-1885603    6       MIRc (1)  MIRb (1)  L1M3 (1) 
401786  chr6  NT_167244.1  2359504-2761290    1       AluSp (1) 
209499  chr6  NT_167244.1  4389632-4599131    7       AluSx (2)  (TTCC)n (1)  MER57-int (1) 
181578  chr6  NT_167244.1  3790033-3971611    6       MLT1H-int (1)  MER52D (1)  LTR19B (1) 
175828  chr6  NT_167244.1  3180230-3356058    3       GC_rich (1)  Charlie4a (1)  AluSp (1) 
172529  chr6  NT_167247.1  4421893-4594422    2       MER11A (1)  AluSc (1) 
167369  chr6  NT_167249.1  2138463-2305832    4       L1MB8 (3)  AluSx (3)  Charlie2b (2) 
159436  chr6  NT_167248.1  521878-681314    2       L1PREC2 (1)  HERVH-int (1) 
151037  chr9  NT_008470.19  21692640-21843677    3       MIR3 (1)  LTR67B (1)  L1M5 (1) 
10  143521  chr6  NT_167244.1  2894289-3037810    4       L1MC5 (1)  AluY (1)  AluSg1 (1) 
11  118344  chr6  NT_167245.1  2605743-2724087    3       MLT1E2 (1)  L2a (1)  L2 (1) 
12  114796  chr6  NT_167247.1  1177472-1292268    1       ERV3-16A3_I-int (1) 
13  113970  chr6  NT_167246.1  3261230-3375200    2       MIRb (2)  AluSx (1) 
14  108104  chr6  NT_167245.1  137962-246066    2       MLT1E2 (1)  LTR12C (1) 
15  106210  chr6  NT_167244.1  587381-693591    5       L1MA9 (2)  MER77 (1)  L1PB1 (1) 
16  105683  chr5  NW_003315917.1  1145801-1251484    10  8       AT_rich (2)  AluSg (2)  MLT1F1 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
6   172529       chr6  NT_167247.1  4421893-4594422    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011