Distribution of restriction sites in the human genome

Enzyme:  HpyF17I
Specificity:  TCNGA
Number of sites:  8531532
Mean distance between sites:  335 base pairs
Standard deviation:  362 base pairs
Site density:  2981.6 per megabase
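
The summary figures above are related: a mean spacing of 335 base pairs corresponds to roughly 1,000,000 / 335, or about 2985 sites per megabase, close to the reported density of 2981.6. The Python sketch below shows one way such statistics could be computed for a single sequence. It is an illustration only, not the method used to produce this report: the function name site_stats is hypothetical, N is assumed to mean any of A, C, G or T, and the population standard deviation is assumed because the report does not say which form it uses.

```python
import re
from statistics import mean, pstdev

# TCNGA, with N taken to mean any base (assumption). The pattern is its own
# reverse complement, so scanning one strand finds every site.
SITE = re.compile(r"TC[ACGT]GA")

def site_stats(seq):
    """Return (site count, mean gap, SD of gaps, sites per megabase) for seq.

    Hypothetical helper for illustration. Gaps are measured between the start
    positions of consecutive sites.
    """
    positions = [m.start() for m in SITE.finditer(seq.upper())]
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    density = len(positions) / len(seq) * 1_000_000
    return (
        len(positions),
        mean(gaps) if gaps else None,
        pstdev(gaps) if gaps else None,  # population SD (assumption)
        density,
    )

# Toy sequence with sites at positions 0, 10 and 30.
toy = "TCAGA" + "A" * 5 + "TCGGA" + "A" * 15 + "TCTGA"
count, mean_gap, sd_gap, per_mb = site_stats(toy)
```

On a real genome the same scan would be run per scaffold and the gaps pooled before taking the mean and standard deviation.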


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   401445  chr6  NT_167244.1  2359906-2761351    0.02 % in   1 repeats    0.00 % in 0 genes
2   207978  chr6  NT_167244.1  4389940-4597918    0.06 % in   2 repeats    0.00 % in 0 genes
3   180949  chr6  NT_167244.1  3789707-3970656    0.18 % in   3 repeats    0.00 % in 0 genes
4   176501  chr6  NT_167244.1  3179383-3355884    0.26 % in   6 repeats    0.48 % in 1 genes
5   174151  chr6  NT_167247.1  4420225-4594376    0.38 % in   6 repeats    100.00 % in 1 genes
6   166748  chr6  NT_167249.1  2138088-2304836    0.74 % in   8 repeats    0.00 % in 0 genes
7   159656  chr6  NT_167248.1  521758-681414    0.22 % in   2 repeats    0.00 % in 0 genes
8   150432  chr9  NT_008470.19  21693191-21843623    0.05 % in   1 repeats    0.00 % in 0 genes
9   143805  chr6  NT_167244.1  2893870-3037675    0.40 % in   4 repeats    0.00 % in 0 genes
10   117602  chr6  NT_167245.1  2606198-2723800    0.09 % in   2 repeats    0.00 % in 0 genes
11   115242  chr6  NT_167247.1  1177527-1292769    0.12 % in   1 repeats    0.00 % in 0 genes
12   108121  chr6  NT_167245.1  137914-246035    0.20 % in   2 repeats    0.00 % in 0 genes
13   106787  chr6  NT_167244.1  1451108-1557895    1.96 % in   7 repeats    0.00 % in 0 genes
14   105444  chr6  NT_167244.1  588595-694039    1.01 % in   7 repeats    0.00 % in 0 genes
15   105005  chr6  NT_167244.1  1833261-1938266    0.66 % in   4 repeats    0.00 % in 0 genes
16   103244  chr6  NT_167244.1  3491010-3594254    0.08 % in   2 repeats    0.00 % in 0 genes
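
In every row of the table above, the Length column equals the difference between the end and start coordinates (for segment 1, 2761351 - 2359906 = 401445). The Python sketch below parses one row and checks this relationship. The function name parse_segment and the dictionary keys are hypothetical, and the row is assumed to be whitespace-separated as printed here.

```python
def parse_segment(row):
    """Parse the first five fields of one table row into a dict.

    Hypothetical helper: assumes fields are separated by whitespace and that
    coordinates are written as start-end.
    """
    rank, length, chrom, scaffold, coords = row.split()[:5]
    start, end = (int(x) for x in coords.split("-"))
    return {
        "rank": int(rank),
        "length": int(length),
        "chr": chrom,
        "scaffold": scaffold,
        "start": start,
        "end": end,
    }

row = ("1   401445  chr6  NT_167244.1  2359906-2761351    "
       "0.02 % in   1 repeats    0.00 % in 0 genes")
seg = parse_segment(row)
length_matches = seg["end"] - seg["start"] == seg["length"]
```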


Repeats in longest uncut segments
#  Length  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most  Second  Third
1  401445  chr6  NT_167244.1  2359906-2761351    1  1    AluSp (1)
2  207978  chr6  NT_167244.1  4389940-4597918    2  2    AluSg/x (1)  AluJo (1)
3  180949  chr6  NT_167244.1  3789707-3970656    3  3    MLT1H-int (1)  MER52D (1)  AluJb (1)
4  176501  chr6  NT_167244.1  3179383-3355884    6  4    GC_rich (3)  Charlie4a (1)  (CCG)n (1)
5  174151  chr6  NT_167247.1  4420225-4594376    6  6    MIR (1)  MER11A (1)  L2b (1)
6  166748  chr6  NT_167249.1  2138088-2304836    8  4    L1MB8 (3)  AluSx (3)  Charlie2b (1)
7  159656  chr6  NT_167248.1  521758-681414    2  2    L1PREC2 (1)  HERVH-int (1)
8  150432  chr9  NT_008470.19  21693191-21843623    1  1    L1M5 (1)
9  143805  chr6  NT_167244.1  2893870-3037675    4  4    L1MC5 (1)  AluY (1)  AluSg1 (1)
10  117602  chr6  NT_167245.1  2606198-2723800    2  2    L2a (1)  L2 (1)
11  115242  chr6  NT_167247.1  1177527-1292769    1  1    ERV3-16A3_I-int (1)
12  108121  chr6  NT_167245.1  137914-246035    2  2    MLT1E2 (1)  LTR12C (1)
13  106787  chr6  NT_167244.1  1451108-1557895    7  6    L1MA1 (2)  ERV3-16A3_I-int (1)  AT_rich (1)
14  105444  chr6  NT_167244.1  588595-694039    7  5    L1MA9 (3)  L1PB1 (1)  L1P5 (1)
15  105005  chr6  NT_167244.1  1833261-1938266    4  4    (TATG)n (1)  MIR (1)  AluSx (1)
16  103244  chr6  NT_167244.1  3491010-3594254    2  2    LTR78B (1)  AluS (1)


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
4   176501       chr6  NT_167244.1  3179383-3355884    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
5   174151       chr6  NT_167247.1  4420225-4594376    LOC100507722  hypothetical protein LOC100507722



Posfai@neb.com
May 11, 2011