Distribution of restriction sites in the human genome

Enzyme:  BsmAI               Longest uncut segments
Specificity:  GTCTC               Repeats in uncut segments
Number of sites:  6569189               Genes in uncut segments
Mean distance between sites:  435 base pairs
Standard deviation:  534 base pairs
Site density2295.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487991  chr15  NT_037852.6  1398765-1886756    0.01 % in   1 repeats    0.00 % in 0 genes
2   401802  chr6  NT_167244.1  2359449-2761251    0.08 % in   1 repeats    0.00 % in 0 genes
3   207882  chr6  NT_167244.1  4390009-4597891    0.01 % in   1 repeats    0.00 % in 0 genes
4   181144  chr6  NT_167244.1  3789878-3971022    0.36 % in   4 repeats    0.00 % in 0 genes
5   176934  chr6  NT_167244.1  3179536-3356470    0.26 % in   6 repeats    0.55 % in 2 genes
6   172254  chr6  NT_167247.1  4422019-4594273    0.01 % in   1 repeats    100.00 % in 1 genes
7   164911  chr6  NT_167249.1  2138273-2303184    0.03 % in   1 repeats    0.00 % in 0 genes
8   164531  chr6  NT_167247.1  1562896-1727427    0.13 % in   2 repeats    0.03 % in 1 genes
9   160256  chr6  NT_167248.1  521286-681542    0.59 % in   2 repeats    0.00 % in 0 genes
10   150433  chr9  NT_008470.19  21693044-21843477    0.15 % in   1 repeats    0.00 % in 0 genes
11   143461  chr6  NT_167244.1  2894233-3037694    0.22 % in   5 repeats    0.00 % in 0 genes
12   118202  chr6  NT_167245.1  2605422-2723624    0.69 % in   1 repeats    0.00 % in 0 genes
13   114162  chr6  NT_167246.1  3260666-3374828    0.10 % in   2 repeats    0.00 % in 0 genes
14   108171  chr6  NT_167245.1  137819-245990    0.25 % in   2 repeats    0.00 % in 0 genes
15   105908  chr6  NT_167244.1  588207-694115    1.14 % in   7 repeats    0.00 % in 0 genes
16   104764  chr6  NT_167244.1  1451566-1556330    0.17 % in   2 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
487991  chr15  NT_037852.6  1398765-1886756    1       AT_rich (1) 
401802  chr6  NT_167244.1  2359449-2761251    1       AluSp (1) 
207882  chr6  NT_167244.1  4390009-4597891    1       AluSg/x (1) 
181144  chr6  NT_167244.1  3789878-3971022    4       MLT1H-int (1)  MER52D (1)  AluSc (1) 
176934  chr6  NT_167244.1  3179536-3356470    4       GC_rich (3)  Charlie4a (1)  (CCG)n (1) 
172254  chr6  NT_167247.1  4422019-4594273    1       AluSc (1) 
164911  chr6  NT_167249.1  2138273-2303184    1       AT_rich (1) 
164531  chr6  NT_167247.1  1562896-1727427    2       MIR (1)  AluSq (1) 
160256  chr6  NT_167248.1  521286-681542    2       L1PREC2 (1)  HERVH-int (1) 
10  150433  chr9  NT_008470.19  21693044-21843477    1       L1M5 (1) 
11  143461  chr6  NT_167244.1  2894233-3037694    5       L1MC5 (1)  AluY (1)  AluSg1 (1) 
12  118202  chr6  NT_167245.1  2605422-2723624    1       L2a (1) 
13  114162  chr6  NT_167246.1  3260666-3374828    2       MIRb (1)  AluSx (1) 
14  108171  chr6  NT_167245.1  137819-245990    2       MLT1E2 (1)  LTR12C (1) 
15  105908  chr6  NT_167244.1  588207-694115    5       L1MA9 (3)  L1PB1 (1)  L1P5 (1) 
16  104764  chr6  NT_167244.1  1451566-1556330    2       ERV3-16A3_I-int (1)  AluSg1 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   176934       chr6  NT_167244.1  3179536-3356470    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
6   172254       chr6  NT_167247.1  4422019-4594273    LOC100507722  hypothetical_protein_LOC100507722
8   164531       chr6  NT_167247.1  1562896-1727427    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011