Distribution of restriction sites in the human genome

Enzyme:  RsaI               Longest uncut segments
Specificity:  GTAC               Repeats in uncut segments
Number of sites:  5046774               Genes in uncut segments
Mean distance between sites:  566 base pairs
Standard deviation:  606 base pairs
Site density1763.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   488573  chr15  NT_037852.6  1396500-1885073    0.20 % in   6 repeats    0.00 % in 0 genes
2   401528  chr6  NT_167244.1  2359716-2761244    0.06 % in   1 repeats    0.00 % in 0 genes
3   209581  chr6  NT_167244.1  4388993-4598574    0.79 % in   8 repeats    0.00 % in 0 genes
4   181283  chr6  NT_167244.1  3789345-3970628    0.18 % in   5 repeats    0.00 % in 0 genes
5   178351  chr6  NT_167244.1  3177746-3356097    0.64 % in   12 repeats    1.39 % in 1 genes
6   172621  chr6  NT_167247.1  4421776-4594397    0.08 % in   2 repeats    100.00 % in 1 genes
7   160458  chr6  NT_167248.1  520945-681403    0.72 % in   2 repeats    0.00 % in 0 genes
8   145109  chr6  NT_167244.1  2893992-3039101    1.31 % in   11 repeats    0.00 % in 0 genes
9   120178  chr6  NT_167245.1  2604491-2724669    1.88 % in   6 repeats    0.00 % in 0 genes
10   115781  chr6  NT_167247.1  1177215-1292996    0.39 % in   1 repeats    0.00 % in 0 genes
11   113583  chr6  NT_167246.1  3261196-3374779    0.05 % in   1 repeats    0.00 % in 0 genes
12   111133  chr6  NT_167245.1  136159-247292    2.74 % in   7 repeats    0.00 % in 0 genes
13   105991  chr6  NT_167244.1  1450439-1556430    0.78 % in   4 repeats    0.00 % in 0 genes
14   105432  chr6  NT_167244.1  587671-693103    0.43 % in   4 repeats    0.00 % in 0 genes
15   104533  chr6  NT_167244.1  1833466-1937999    0.37 % in   4 repeats    0.00 % in 0 genes
16   104322  chr6  NT_167244.1  3490233-3594555    1.11 % in   5 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
488573  chr15  NT_037852.6  1396500-1885073    6       MIRc (1)  MIRb (1)  L1M3 (1) 
401528  chr6  NT_167244.1  2359716-2761244    1       AluSp (1) 
209581  chr6  NT_167244.1  4388993-4598574    7       MER57-int (2)  (TTCC)n (1)  L1MC (1) 
181283  chr6  NT_167244.1  3789345-3970628    5       MLT1H-int (1)  MIR (1)  MER52D (1) 
178351  chr6  NT_167244.1  3177746-3356097    12  9       GC_rich (3)  LTR23 (2)  MER66C (1) 
172621  chr6  NT_167247.1  4421776-4594397    2       MER11A (1)  AluSc (1) 
160458  chr6  NT_167248.1  520945-681403    2       L1PREC2 (1)  HERVH-int (1) 
145109  chr6  NT_167244.1  2893992-3039101    11  6       L1MC5 (3)  AluSc (3)  AluJo (2) 
120178  chr6  NT_167245.1  2604491-2724669    5       L2 (2)  MLT1E2 (1)  MER5B (1) 
10  115781  chr6  NT_167247.1  1177215-1292996    1       ERV3-16A3_I-int (1) 
11  113583  chr6  NT_167246.1  3261196-3374779    1       MIRb (1) 
12  111133  chr6  NT_167245.1  136159-247292    7       MLT1F (1)  MLT1E2 (1)  MER6 (1) 
13  105991  chr6  NT_167244.1  1450439-1556430    4       ERV3-16A3_I-int (1)  AluY (1)  AluSg/x (1) 
14  105432  chr6  NT_167244.1  587671-693103    4       MER77 (1)  L1ME3D (1)  L1MA9 (1) 
15  104533  chr6  NT_167244.1  1833466-1937999    4       (TATG)n (1)  MIR (1)  AluSx (1) 
16  104322  chr6  NT_167244.1  3490233-3594555    4       L1M2 (2)  LTR78B (1)  AluSg (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   178351       chr6  NT_167244.1  3177746-3356097    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   172621       chr6  NT_167247.1  4421776-4594397    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011