Distribution of restriction sites in the human genome

Enzyme:  BstXI               Longest uncut segments
Specificity:  CCANNNNNNTGG               Repeats in uncut segments
Number of sites:  1804908               Genes in uncut segments
Mean distance between sites:  1585 base pairs
Standard deviation:  1952 base pairs
Site density 630.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   494303  chr15  NT_037852.6  1397170-1891473    0.79 % in   18 repeats    0.00 % in 0 genes
2   401348  chr6  NT_167244.1  2359932-2761280    0.01 % in   1 repeats    0.00 % in 0 genes
3   209069  chr6  NT_167244.1  4389017-4598086    0.57 % in   6 repeats    0.00 % in 0 genes
4   182278  chr6  NT_167244.1  3788696-3970974    0.50 % in   8 repeats    0.00 % in 0 genes
5   179060  chr6  NT_167247.1  4418867-4597927    1.86 % in   14 repeats    100.00 % in 1 genes
6   176506  chr6  NT_167244.1  3179887-3356393    0.24 % in   5 repeats    0.31 % in 2 genes
7   167810  chr6  NT_167247.1  1559567-1727377    1.18 % in   9 repeats    2.02 % in 1 genes
8   166942  chr6  NT_167249.1  2136636-2303578    0.79 % in   6 repeats    0.00 % in 0 genes
9   164886  chr7  NT_023603.5  29557-194443    100.00 % in   5 repeats    0.00 % in 0 genes
10   163769  chr6  NT_167248.1  521239-685008    2.72 % in   2 repeats    0.00 % in 0 genes
11   156187  chr6  NT_167244.1  2008755-2164942    0.25 % in   3 repeats    0.00 % in 0 genes
12   153654  chr9  NT_008470.19  21690987-21844641    1.09 % in   9 repeats    0.00 % in 0 genes
13   142935  chr6  NT_167244.1  2894599-3037534    0.08 % in   2 repeats    0.00 % in 0 genes
14   119813  chr6  NT_167245.1  2603875-2723688    1.62 % in   5 repeats    0.00 % in 0 genes
15   117087  chr6  NT_167247.1  1177313-1294400    0.30 % in   1 repeats    0.00 % in 0 genes
16   116547  chr6  NT_167246.1  3258522-3375069    0.46 % in   3 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
494303  chr15  NT_037852.6  1397170-1891473    18  15       L2a (3)  L1M5 (2)  U2 (1) 
401348  chr6  NT_167244.1  2359932-2761280    1       AluSp (1) 
209069  chr6  NT_167244.1  4389017-4598086    5       MER57-int (2)  (TTCC)n (1)  AluY (1) 
182278  chr6  NT_167244.1  3788696-3970974    7       AT_rich (2)  MLT1H-int (1)  MIR (1) 
179060  chr6  NT_167247.1  4418867-4597927    14  12       MLT1J (2)  AluSx (2)  (TTAAA)n (1) 
176506  chr6  NT_167244.1  3179887-3356393    4       GC_rich (2)  Charlie4a (1)  (CCG)n (1) 
167810  chr6  NT_167247.1  1559567-1727377    8       Tigger7 (2)  MIRc (1)  MIR (1) 
166942  chr6  NT_167249.1  2136636-2303578    6       MamGypLTR1b (1)  L1MC4a (1)  L1MB8 (1) 
164886  chr7  NT_023603.5  29557-194443    2       L1PA2 (3)  ALR/Alpha (2) 
10  163769  chr6  NT_167248.1  521239-685008    2       L1PREC2 (1)  HERVH-int (1) 
11  156187  chr6  NT_167244.1  2008755-2164942    3       MIRb (1)  MIR (1)  AluSx (1) 
12  153654  chr9  NT_008470.19  21690987-21844641    7       LTR67B (2)  L2 (2)  MSTA (1) 
13  142935  chr6  NT_167244.1  2894599-3037534    2       AluY (1)  AluSg1 (1) 
14  119813  chr6  NT_167245.1  2603875-2723688    5       MLT1N2 (1)  MER5B (1)  MER5A1 (1) 
15  117087  chr6  NT_167247.1  1177313-1294400    1       ERV3-16A3_I-int (1) 
16  116547  chr6  NT_167246.1  3258522-3375069    3       MIRb (1)  MIR3 (1)  AluSx (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   179060       chr6  NT_167247.1  4418867-4597927    LOC100507722  hypothetical_protein_LOC100507722
6   176506       chr6  NT_167244.1  3179887-3356393    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
7   167810       chr6  NT_167247.1  1559567-1727377    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011