Distribution of restriction sites in the human genome

Enzyme:  TthHB27I               Longest uncut segments
Specificity:  CAARCA               Repeats in uncut segments
Number of sites:  4551460               Genes in uncut segments
Mean distance between sites:  628 base pairs
Standard deviation:  674 base pairs
Site density1590.7 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487618  chr15  NT_037852.6  1397542-1885160    0.08 % in   3 repeats    0.00 % in 0 genes
2   404619  chr6  NT_167244.1  2358969-2763588    0.22 % in   4 repeats    0.00 % in 0 genes
3   208208  chr6  NT_167244.1  4389923-4598131    0.17 % in   2 repeats    0.00 % in 0 genes
4   180898  chr6  NT_167244.1  3790327-3971225    0.33 % in   3 repeats    0.00 % in 0 genes
5   176335  chr6  NT_167244.1  3179066-3355401    0.13 % in   5 repeats    0.66 % in 1 genes
6   173784  chr6  NT_167247.1  4420741-4594525    0.35 % in   6 repeats    100.00 % in 1 genes
7   165582  chr6  NT_167249.1  2137491-2303073    0.34 % in   2 repeats    0.00 % in 0 genes
8   160837  chr6  NT_167248.1  520460-681297    0.95 % in   2 repeats    0.00 % in 0 genes
9   155987  chr6  NT_167244.1  2009145-2165132    0.07 % in   1 repeats    0.00 % in 0 genes
10   150980  chr9  NT_008470.19  21692408-21843388    0.44 % in   2 repeats    0.00 % in 0 genes
11   143312  chr6  NT_167244.1  2894543-3037855    0.32 % in   5 repeats    0.00 % in 0 genes
12   118923  chr6  NT_167245.1  2605235-2724158    1.17 % in   3 repeats    0.00 % in 0 genes
13   116068  chr6  NT_167247.1  1177021-1293089    0.55 % in   1 repeats    0.00 % in 0 genes
14   114094  chr6  NT_167246.1  3260712-3374806    0.08 % in   2 repeats    0.00 % in 0 genes
15   109107  chr6  NT_167245.1  136904-246011    1.10 % in   2 repeats    0.00 % in 0 genes
16   106350  chr6  NT_167244.1  1450027-1556377    0.73 % in   4 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
487618  chr15  NT_037852.6  1397542-1885160    3       MIRc (1)  MIRb (1)  L1M3 (1) 
404619  chr6  NT_167244.1  2358969-2763588    4       L4 (1)  L1MEg (1)  AluSp (1) 
208208  chr6  NT_167244.1  4389923-4598131    2       AluSg/x (1)  AluJo (1) 
180898  chr6  NT_167244.1  3790327-3971225    3       MLT1H-int (1)  MER52D (1)  AluSc (1) 
176335  chr6  NT_167244.1  3179066-3355401    3       GC_rich (3)  (CCG)n (1)  AluSp (1) 
173784  chr6  NT_167247.1  4420741-4594525    6       MIR (1)  MER11A (1)  L2b (1) 
165582  chr6  NT_167249.1  2137491-2303073    2       L1MC4a (1)  AT_rich (1) 
160837  chr6  NT_167248.1  520460-681297    2       L1PREC2 (1)  HERVH-int (1) 
155987  chr6  NT_167244.1  2009145-2165132    1       MIRb (1) 
10  150980  chr9  NT_008470.19  21692408-21843388    2       LTR67B (1)  L1M5 (1) 
11  143312  chr6  NT_167244.1  2894543-3037855    5       L1MC5 (1)  AluY (1)  AluSp (1) 
12  118923  chr6  NT_167245.1  2605235-2724158    3       MLT1E2 (1)  L2a (1)  L2 (1) 
13  116068  chr6  NT_167247.1  1177021-1293089    1       ERV3-16A3_I-int (1) 
14  114094  chr6  NT_167246.1  3260712-3374806    2       MIRb (1)  AluSx (1) 
15  109107  chr6  NT_167245.1  136904-246011    2       MLT1E2 (1)  LTR12C (1) 
16  106350  chr6  NT_167244.1  1450027-1556377    4       ERV3-16A3_I-int (1)  AluY (1)  AluSg/x (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   176335       chr6  NT_167244.1  3179066-3355401    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   173784       chr6  NT_167247.1  4420741-4594525    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011