Distribution of restriction sites in the human genome

Enzyme:  HpyAV               Longest uncut segments
Specificity:  CCTTC               Repeats in uncut segments
Number of sites:  6876025               Genes in uncut segments
Mean distance between sites:  416 base pairs
Standard deviation:  499 base pairs
Site density2403.1 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487036  chr15  NT_037852.6  1398276-1885312    0.01 % in   1 repeats    0.00 % in 0 genes
2   401477  chr6  NT_167244.1  2359784-2761261    0.05 % in   1 repeats    0.00 % in 0 genes
3   208003  chr6  NT_167244.1  4389912-4597915    0.07 % in   2 repeats    0.00 % in 0 genes
4   180974  chr6  NT_167244.1  3790179-3971153    0.33 % in   4 repeats    0.00 % in 0 genes
5   175418  chr6  NT_167244.1  3180163-3355581    0.20 % in   4 repeats    0.04 % in 1 genes
6   173640  chr6  NT_167247.1  4422116-4595756    0.77 % in   4 repeats    100.00 % in 1 genes
7   164657  chr6  NT_167247.1  1562479-1727136    0.02 % in   1 repeats    0.29 % in 1 genes
8   159637  chr6  NT_167248.1  521685-681322    0.20 % in   2 repeats    0.00 % in 0 genes
9   150619  chr9  NT_008470.19  21693062-21843681    0.17 % in   2 repeats    0.00 % in 0 genes
10   143985  chr6  NT_167244.1  2894374-3038359    0.64 % in   6 repeats    0.00 % in 0 genes
11   117652  chr6  NT_167245.1  2606119-2723771    0.13 % in   2 repeats    0.00 % in 0 genes
12   114975  chr6  NT_167247.1  1177645-1292620    0.02 % in   1 repeats    0.00 % in 0 genes
13   108947  chr6  NT_167245.1  137385-246332    0.96 % in   3 repeats    0.00 % in 0 genes
14   106024  chr6  NT_167244.1  1451145-1557169    1.29 % in   5 repeats    0.00 % in 0 genes
15   105845  chr6  NT_167244.1  1832379-1938224    0.65 % in   4 repeats    0.00 % in 0 genes
16   105085  chr6  NT_167244.1  588279-693364    0.46 % in   3 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
487036  chr15  NT_037852.6  1398276-1885312    1       AT_rich (1) 
401477  chr6  NT_167244.1  2359784-2761261    1       AluSp (1) 
208003  chr6  NT_167244.1  4389912-4597915    2       AluSg/x (1)  AluJo (1) 
180974  chr6  NT_167244.1  3790179-3971153    4       MLT1H-int (1)  MER52D (1)  AluSc (1) 
175418  chr6  NT_167244.1  3180163-3355581    4       GC_rich (1)  Charlie4a (1)  (CCG)n (1) 
173640  chr6  NT_167247.1  4422116-4595756    4       (TTAAA)n (1)  MER11A (1)  AluSg/x (1) 
164657  chr6  NT_167247.1  1562479-1727136    1       A-rich (1) 
159637  chr6  NT_167248.1  521685-681322    2       L1PREC2 (1)  HERVH-int (1) 
150619  chr9  NT_008470.19  21693062-21843681    2       MIR3 (1)  L1M5 (1) 
10  143985  chr6  NT_167244.1  2894374-3038359    5       AluJo (2)  L1MC5 (1)  AluY (1) 
11  117652  chr6  NT_167245.1  2606119-2723771    2       L2a (1)  L2 (1) 
12  114975  chr6  NT_167247.1  1177645-1292620    1       ERV3-16A3_I-int (1) 
13  108947  chr6  NT_167245.1  137385-246332    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
14  106024  chr6  NT_167244.1  1451145-1557169    5       L1MA1 (1)  ERV3-16A3_I-int (1)  AT_rich (1) 
15  105845  chr6  NT_167244.1  1832379-1938224    4       (TATG)n (1)  MIR (1)  AluSx (1) 
16  105085  chr6  NT_167244.1  588279-693364    3       L1PB1 (1)  L1ME3D (1)  L1MA9 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   175418       chr6  NT_167244.1  3180163-3355581    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   173640       chr6  NT_167247.1  4422116-4595756    LOC100507722  hypothetical_protein_LOC100507722
7   164657       chr6  NT_167247.1  1562479-1727136    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011