Distribution of restriction sites in the human genome

Enzyme:  AluI
Specificity:  AGCT
Number of sites:  12810528
Mean distance between sites:  223 base pairs
Standard deviation:  236 base pairs
Site density:  4477.1 per megabase
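
The summary figures above are mutually consistent: 1,000,000 / 4477.1 is about 223, the reported mean distance in base pairs. As a minimal sketch of how such statistics could be computed from a sequence, the Python below locates every occurrence of the AGCT site and summarizes the spacing. The function names `find_sites` and `spacing_stats` and the toy sequence are hypothetical and are not taken from the original analysis; the use of the population standard deviation is likewise an assumption, since the page does not say which estimator was used.

```python
import re
from statistics import mean, pstdev


def find_sites(seq, site="AGCT"):
    """Return 0-based start positions of every occurrence of `site` in `seq`."""
    # A lookahead keeps the scan correct for recognition sequences that can
    # overlap themselves (AGCT cannot, but other sites can).
    pattern = re.compile("(?=" + re.escape(site) + ")")
    return [m.start() for m in pattern.finditer(seq.upper())]


def spacing_stats(positions, seq_len):
    """Summarize distances between consecutive sites (needs >= 2 sites)."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "n_sites": len(positions),
        "mean_gap": mean(gaps),
        # Assumption: population standard deviation.
        "std_gap": pstdev(gaps),
        "density_per_mb": len(positions) * 1_000_000 / seq_len,
    }


# Toy sequence for illustration only (not genomic data).
toy = "AGCTTTAGCTGGGGGGAGCT"
positions = find_sites(toy)
stats = spacing_stats(positions, len(toy))
print(positions, stats)
```

A genome-scale run would stream each chromosome or scaffold separately rather than hold one string in memory, and would need a policy for distances that span scaffold boundaries.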


Distribution of closely spaced sites

Distribution of sites within 7 standard deviations (STD) of the mean distance
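
One possible reading of the plot title above is a histogram of inter-site distances truncated at the mean plus seven standard deviations; with the reported values that cutoff would be 223 + 7 × 236 = 1875 base pairs. The sketch below bins distances under that assumption. The function name `gap_histogram`, the bin width, and the sample distances are all hypothetical, and the cutoff rule is an interpretation rather than something the page states.

```python
from collections import Counter
from statistics import mean, pstdev


def gap_histogram(gaps, bin_width=50, k=7):
    """Bin inter-site distances no larger than mean + k * STD.

    Returns (cutoff, {bin_start: count}) with bins sorted by start.
    """
    # Assumption: cutoff is measured upward from the mean distance.
    cutoff = mean(gaps) + k * pstdev(gaps)
    kept = [g for g in gaps if g <= cutoff]
    counts = Counter((g // bin_width) * bin_width for g in kept)
    return cutoff, dict(sorted(counts.items()))


# Made-up distances for illustration; 5000 plays the role of a long outlier.
gaps = [10, 20, 60, 70, 120, 5000]
cutoff_k1, hist_k1 = gap_histogram(gaps, bin_width=50, k=1)
cutoff_k7, hist_k7 = gap_histogram(gaps, bin_width=50, k=7)
print(hist_k1)
print(hist_k7)
```

In a sample this small the outlier inflates the standard deviation so much that only a tight cutoff excludes it, which is why the example also runs with k=1.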


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   401811  chr6  NT_167244.1  2359795-2761606    0.04 % in   1 repeats    0.00 % in 0 genes
2   208354  chr6  NT_167244.1  4389954-4598308    0.21 % in   3 repeats    0.00 % in 0 genes
3   180820  chr6  NT_167244.1  3790292-3971112    0.29 % in   3 repeats    0.00 % in 0 genes
4   175146  chr6  NT_167244.1  3180194-3355340    0.06 % in   3 repeats    0.02 % in 1 genes
5   172251  chr6  NT_167247.1  4422048-4594299    0.03 % in   1 repeats    100.00 % in 1 genes
6   164988  chr6  NT_167249.1  2138237-2303225    0.03 % in   1 repeats    0.00 % in 0 genes
7   159588  chr6  NT_167248.1  521748-681336    0.17 % in   2 repeats    0.00 % in 0 genes
8   150494  chr9  NT_008470.19  21693198-21843692    0.09 % in   2 repeats    0.00 % in 0 genes
9   143621  chr6  NT_167244.1  2894124-3037745    0.33 % in   5 repeats    0.00 % in 0 genes
10   117773  chr6  NT_167245.1  2605939-2723712    0.25 % in   1 repeats    0.00 % in 0 genes
11   115436  chr6  NT_167247.1  1177641-1293077    0.02 % in   1 repeats    0.00 % in 0 genes
12   107941  chr6  NT_167245.1  138026-245967    0.03 % in   2 repeats    0.00 % in 0 genes
13   105171  chr6  NT_167244.1  1451349-1556520    0.55 % in   3 repeats    0.00 % in 0 genes
14   104685  chr6  NT_167244.1  588599-693284    0.32 % in   3 repeats    0.00 % in 0 genes
15   104048  chr6  NT_167244.1  1833802-1937850    0.16 % in   1 repeats    0.00 % in 0 genes
16   103461  chr6  NT_167244.1  3490649-3594110    0.40 % in   2 repeats    0.00 % in 0 genes
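
In the table above, each Length equals the difference of the two coordinates, for example 2761606 - 2359795 = 401811. A minimal sketch of how the longest uncut segments could be extracted from a sorted list of site positions on one scaffold follows. The function name `longest_uncut` and the sample positions are hypothetical; the three checked rows are copied from the table.

```python
import heapq


def longest_uncut(positions, top=3):
    """Return the `top` largest gaps between consecutive sorted site
    positions as (length, start, end) tuples, longest first."""
    segments = ((b - a, a, b) for a, b in zip(positions, positions[1:]))
    return heapq.nlargest(top, segments)


# Made-up site positions on a single scaffold, for illustration only.
sites = [0, 100, 150, 900, 1000]
top_two = longest_uncut(sites, top=2)
print(top_two)

# Consistency check on the first three rows of the table:
# (length, start coordinate, end coordinate).
rows = [
    (401811, 2359795, 2761606),
    (208354, 4389954, 4598308),
    (180820, 3790292, 3971112),
]
rows_consistent = all(end - start == length for length, start, end in rows)
print(rows_consistent)
```

This sketch ignores scaffold ends and any runs of undetermined bases (N), both of which a real analysis would have to treat explicitly.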


Repeats in longest uncut segments
#  Length  Chr  Scaffold  Coordinates  Total  Distinct  Most  Second  Third
1   401811  chr6  NT_167244.1  2359795-2761606    1       AluSp (1) 
2   208354  chr6  NT_167244.1  4389954-4598308    3       AluSx (1)  AluSg/x (1)  AluJo (1) 
3   180820  chr6  NT_167244.1  3790292-3971112    3       MLT1H-int (1)  MER52D (1)  AluSc (1) 
4   175146  chr6  NT_167244.1  3180194-3355340    3       GC_rich (1)  (CCG)n (1)  AluSp (1) 
5   172251  chr6  NT_167247.1  4422048-4594299    1       AluSc (1) 
6   164988  chr6  NT_167249.1  2138237-2303225    1       AT_rich (1) 
7   159588  chr6  NT_167248.1  521748-681336    2       L1PREC2 (1)  HERVH-int (1) 
8   150494  chr9  NT_008470.19  21693198-21843692    2       MIR3 (1)  L1M5 (1) 
9   143621  chr6  NT_167244.1  2894124-3037745    5       L1MC5 (1)  AluY (1)  AluSg1 (1) 
10  117773  chr6  NT_167245.1  2605939-2723712    1       L2a (1) 
11  115436  chr6  NT_167247.1  1177641-1293077    1       ERV3-16A3_I-int (1) 
12  107941  chr6  NT_167245.1  138026-245967    2       MLT1E2 (1)  LTR12C (1) 
13  105171  chr6  NT_167244.1  1451349-1556520    3       ERV3-16A3_I-int (1)  AluY (1)  AluSg1 (1) 
14  104685  chr6  NT_167244.1  588599-693284    3       L1PB1 (1)  L1ME3D (1)  L1MA9 (1) 
15  104048  chr6  NT_167244.1  1833802-1937850    1       AluSx (1) 
16  103461  chr6  NT_167244.1  3490649-3594110    2       L1M2 (1)  AluS (1) 


Genes in longest uncut segments
Segment Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   175146       chr6  NT_167244.1  3180194-3355340    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   172251       chr6  NT_167247.1  4422048-4594299    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011