Distribution of restriction sites in the human genome

Enzyme:  BseYIB
Specificity:  CCCAGC
Number of sites:  3670849
Mean distance between sites:  779 base pairs
Standard deviation:  1212 base pairs
Site density:  1282.9 per megabase
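
The summary figures above are linked by simple arithmetic, which the following minimal sketch illustrates on a toy sequence. It assumes that sites are counted on both strands (CCCAGC is not palindromic, so its reverse complement GCTGGG is a separate pattern) and that distances are measured between consecutive site start positions. Neither assumption is confirmed by the report, and the function names and the toy sequence are hypothetical.

```python
import re
from statistics import mean, pstdev

SITE = "CCCAGC"  # recognition sequence given in the report


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(seq, site=SITE):
    """Sorted 0-based start positions of the site on either strand (assumption)."""
    positions = set()
    for pattern in {site, reverse_complement(site)}:
        # A lookahead reports overlapping occurrences as well.
        for match in re.finditer(f"(?={pattern})", seq):
            positions.add(match.start())
    return sorted(positions)


def spacing_stats(positions):
    """Mean and population standard deviation of distances between neighbours."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)


def density_per_megabase(n_sites, length_bp):
    """Number of sites per 1,000,000 bp of sequence."""
    return n_sites / length_bp * 1_000_000


# Hypothetical toy sequence: forward sites at 2 and 20, reverse-strand site at 12.
toy = "AACCCAGCTTTTGCTGGGAACCCAGC"
sites = find_sites(toy)
mean_gap, sd_gap = spacing_stats(sites)
```

As a rough consistency check on the reported values, 1,000,000 / 779 is about 1283.7 sites per megabase, close to the reported 1282.9. The small difference would be expected if the mean distance was rounded to whole base pairs.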


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD (standard deviation) distance]


Longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   489734  chr15  NT_037852.6  1397304-1887038    0.14 % in   6 repeats    0.00 % in 0 genes
2   401806  chr6  NT_167244.1  2359939-2761745    0.01 % in   1 repeats    0.00 % in 0 genes
3   299467  chrY  NT_011875.12  8417624-8717091    83.16 % in   28 repeats    0.00 % in 0 genes
4   211023  chr6  NT_167244.1  4386944-4597967    1.49 % in   9 repeats    0.00 % in 0 genes
5   181117  chr6  NT_167244.1  3789992-3971109    0.40 % in   4 repeats    0.00 % in 0 genes
6   175116  chr6  NT_167244.1  3180225-3355341    0.04 % in   2 repeats    0.00 % in 1 genes
7   172374  chr6  NT_167247.1  4421926-4594300    0.03 % in   1 repeats    100.00 % in 1 genes
8   167706  chr6  NT_167247.1  1559642-1727348    1.12 % in   9 repeats    1.97 % in 1 genes
9   165734  chr6  NT_167249.1  2137706-2303440    0.30 % in   4 repeats    0.00 % in 0 genes
10   161216  chr7  NT_023603.5  32834-194050    100.00 % in   4 repeats    0.00 % in 0 genes
11   159887  chr6  NT_167248.1  521390-681277    0.36 % in   2 repeats    0.00 % in 0 genes
12   155947  chr6  NT_167244.1  2008762-2164709    0.24 % in   3 repeats    0.00 % in 0 genes
13   152810  chr9  NT_008470.19  21692117-21844927    0.83 % in   6 repeats    0.00 % in 0 genes
14   142943  chr6  NT_167244.1  2894598-3037541    0.08 % in   2 repeats    0.00 % in 0 genes
15   128670  chr1  NT_077389.3  261552-390222    97.78 % in   60 repeats    0.00 % in 0 genes
16   117759  chr6  NT_167245.1  2606034-2723793    0.22 % in   2 repeats    0.00 % in 0 genes
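
In the table above, each length equals the end coordinate minus the start coordinate (for segment 1, 1887038 - 1397304 = 489734). The sketch below shows one way such a ranking and the repeat or gene percentages could be computed. It assumes half-open intervals, takes gaps between consecutive site positions as the uncut segments, and merges overlapping annotations so they are not counted twice. The function names are hypothetical, and the tool's actual method is not described in the report.

```python
def longest_uncut_segments(positions, top_n=16):
    """Return (length, start, end) for the longest gaps between sorted sites."""
    gaps = [(b - a, a, b) for a, b in zip(positions, positions[1:])]
    return sorted(gaps, reverse=True)[:top_n]


def percent_covered(start, end, features):
    """Percentage of [start, end) covered by the union of (s, e) intervals."""
    # Clip every overlapping feature to the segment, then sweep left to right.
    clipped = sorted(
        (max(s, start), min(e, end))
        for s, e in features
        if s < end and e > start
    )
    covered = 0
    current_end = start
    for s, e in clipped:
        if e > current_end:
            covered += e - max(s, current_end)
            current_end = e
    return 100.0 * covered / (end - start)


# Hypothetical site positions and annotations, for illustration only.
example_sites = [0, 10, 15, 40]
top_two = longest_uncut_segments(example_sites, top_n=2)
coverage = percent_covered(0, 100, [(10, 20), (15, 30), (90, 120)])
```

Here the two overlapping features (10-20 and 15-30) contribute 20 bp together and the third is clipped to 90-100, so 30 of 100 bp are covered.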


Repeats in longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1   489734  chr15  NT_037852.6  1397304-1887038    6  6       MLT1L (1)  MIRc (1)  MIRb (1)
2   401806  chr6  NT_167244.1  2359939-2761745    1  1       AluSp (1)
3   299467  chrY  NT_011875.12  8417624-8717091    28  10       LTR12B (17)  LTR12D (2)  L1PA16 (2)
4   211023  chr6  NT_167244.1  4386944-4597967    9  7       MER57-int (3)  (TTTTA)n (1)  (TTCC)n (1)
5   181117  chr6  NT_167244.1  3789992-3971109    4  4       MLT1H-int (1)  MER52D (1)  AluSc (1)
6   175116  chr6  NT_167244.1  3180225-3355341    2  2       GC_rich (1)  AluSp (1)
7   172374  chr6  NT_167247.1  4421926-4594300    1  1       AluSc (1)
8   167706  chr6  NT_167247.1  1559642-1727348    9  8       Tigger7 (2)  MIRc (1)  MIR (1)
9   165734  chr6  NT_167249.1  2137706-2303440    4  4       L1MC4a (1)  L1MB8 (1)  AT_rich (1)
10  161216  chr7  NT_023603.5  32834-194050    4  2       L1PA2 (3)  ALR/Alpha (1)
11  159887  chr6  NT_167248.1  521390-681277    2  2       L1PREC2 (1)  HERVH-int (1)
12  155947  chr6  NT_167244.1  2008762-2164709    3  3       MIRb (1)  MIR (1)  AluSx (1)
13  152810  chr9  NT_008470.19  21692117-21844927    6  4       LTR67B (2)  L2 (2)  MIR3 (1)
14  142943  chr6  NT_167244.1  2894598-3037541    2  2       AluY (1)  AluSg1 (1)
15  128670  chr1  NT_077389.3  261552-390222    60  8       ALR/Alpha (52)  MLT1J (2)  L2c (1)
16  117759  chr6  NT_167245.1  2606034-2723793    2  2       L2a (1)  L2 (1)
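
The total, distinct and most-frequent repeat columns can be derived from the list of repeat names that overlap a segment. The sketch below is a minimal illustration using Python's `collections.Counter`. The input list is hypothetical: it was constructed to match segment 10 (four repeats in two families, L1PA2 three times and ALR/Alpha once), and the order of names within it is arbitrary.

```python
from collections import Counter


def summarize_repeats(names, top=3):
    """Total count, number of distinct families, and the most frequent ones."""
    counts = Counter(names)
    return {
        "total": sum(counts.values()),
        "distinct": len(counts),
        "top": counts.most_common(top),
    }


# Hypothetical list of repeat names for segment 10 (chr7, 32834-194050).
segment_10 = ["L1PA2", "ALR/Alpha", "L1PA2", "L1PA2"]
summary = summarize_repeats(segment_10)
```

With this input the summary reports 4 repeats in total, 2 distinct families, and L1PA2 (3) ahead of ALR/Alpha (1).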


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
6   175116       chr6  NT_167244.1  3180225-3355341    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
7   172374       chr6  NT_167247.1  4421926-4594300    LOC100507722  hypothetical protein LOC100507722
8   167706       chr6  NT_167247.1  1559642-1727348    LOC100421582  tripartite motif-containing protein 26



Posfai@neb.com
May 11, 2011