Distribution of restriction sites in the human genome

Enzyme:  BsrI
Specificity:  ACTGG
Number of sites:  5679785
Mean distance between sites:  503 base pairs
Standard deviation:  543 base pairs
Site density:  1985.0 per megabase
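
The summary figures are related by simple arithmetic: a density of 1985.0 sites per megabase corresponds to 10^6 / 1985.0, or about 504 bp between sites, in line with the reported mean of 503 bp. The sketch below shows one way such statistics could be computed for an arbitrary sequence. It is not the code behind this page. It assumes that the non-palindromic site ACTGG is counted on both strands (that is, ACTGG or its reverse complement CCAGT), that distances are measured between site start positions, and that the standard deviation is the population form; the function names are hypothetical.

```python
import re
from statistics import mean, pstdev

SITE = "ACTGG"  # BsrI specificity as listed in the summary


def revcomp(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def site_positions(seq, site=SITE):
    """Sorted 0-based start positions of the site on either strand (assumption)."""
    hits = set()
    for pat in {site, revcomp(site)}:
        # A lookahead reports overlapping occurrences as well.
        hits.update(m.start() for m in re.finditer(f"(?={pat})", seq))
    return sorted(hits)


def summarize(seq, site=SITE):
    """Site count, mean and standard deviation of spacing, and density per Mb."""
    pos = site_positions(seq, site)
    gaps = [b - a for a, b in zip(pos, pos[1:])]
    return {
        "sites": len(pos),
        "mean_gap": mean(gaps) if gaps else None,
        "std_gap": pstdev(gaps) if gaps else None,
        "density_per_mb": len(pos) / len(seq) * 1e6,
    }
```

Calling `summarize` on a chromosome-sized string read from a FASTA file would give per-sequence figures comparable in kind to those listed above, subject to the stated assumptions.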


Figure: Distribution of closely spaced sites

Figure: Distribution of sites within 7 STD distance


Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487617  chr15  NT_037852.6  1398808-1886425    0.01 % in   1 repeats    0.00 % in 0 genes
2   401630  chr6  NT_167244.1  2359650-2761280    0.08 % in   1 repeats    0.00 % in 0 genes
3   208218  chr6  NT_167244.1  4389949-4598167    0.15 % in   2 repeats    0.00 % in 0 genes
4   182141  chr6  NT_167244.1  3788712-3970853    0.45 % in   7 repeats    0.00 % in 0 genes
5   175806  chr6  NT_167244.1  3179564-3355370    0.12 % in   5 repeats    0.38 % in 1 genes
6   172919  chr6  NT_167247.1  4421453-4594372    0.08 % in   3 repeats    100.00 % in 1 genes
7   165227  chr6  NT_167247.1  1562369-1727596    0.25 % in   3 repeats    0.35 % in 1 genes
8   165096  chr6  NT_167249.1  2138208-2303304    0.05 % in   2 repeats    0.00 % in 0 genes
9   159459  chr6  NT_167248.1  521832-681291    0.09 % in   2 repeats    0.00 % in 0 genes
10   151721  chr9  NT_008470.19  21692847-21844568    0.38 % in   3 repeats    0.00 % in 0 genes
11   143912  chr6  NT_167244.1  2894406-3038318    0.64 % in   6 repeats    0.00 % in 0 genes
12   118761  chr6  NT_167245.1  2606164-2724925    1.03 % in   4 repeats    0.00 % in 0 genes
13   115509  chr6  NT_167247.1  1177468-1292977    0.17 % in   1 repeats    0.00 % in 0 genes
14   114275  chr6  NT_167246.1  3261120-3375395    0.41 % in   3 repeats    0.00 % in 0 genes
15   108627  chr6  NT_167245.1  137647-246274    0.67 % in   3 repeats    0.00 % in 0 genes
16   107243  chr6  NT_167244.1  587710-694953    2.08 % in   9 repeats    0.00 % in 0 genes
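
In every row above, the Length value equals the difference between the two coordinates (for segment 1, 1886425 - 1398808 = 487617). The following sketch illustrates how a ranking of this kind could be derived from a sorted list of site positions on one scaffold. It is an illustration under assumptions, not the method used to build the table: the function names are hypothetical, and treating the scaffold ends as segment boundaries is a choice made here for simplicity.

```python
import heapq


def span(coords):
    """Length implied by a 'start-end' coordinate string, as used in the table."""
    start, end = map(int, coords.split("-"))
    return end - start


def longest_uncut(positions, seq_len, top=16):
    """Return the `top` longest site-free intervals as (length, start, end).

    Assumption: the two scaffold ends (0 and seq_len) also bound segments.
    """
    bounds = [0] + sorted(positions) + [seq_len]
    segs = [(b - a, a, b) for a, b in zip(bounds, bounds[1:]) if b > a]
    return heapq.nlargest(top, segs)
```

For example, `span("1398808-1886425")` reproduces the length listed for segment 1, and `longest_uncut` applied to the site positions of a scaffold returns its candidate segments in descending order of length.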


Repeats in longest uncut segments
#   Length  Chr    Scaffold      Coordinates        Total  Distinct  Most  Second  Third
1   487617  chr15  NT_037852.6   1398808-1886425    1      1         AT_rich (1)
2   401630  chr6   NT_167244.1   2359650-2761280    1      1         AluSp (1)
3   208218  chr6   NT_167244.1   4389949-4598167    2      2         AluSg/x (1)  AluJo (1)
4   182141  chr6   NT_167244.1   3788712-3970853    7      6         AT_rich (2)  MLT1H-int (1)  MIR (1)
5   175806  chr6   NT_167244.1   3179564-3355370    5      3         GC_rich (3)  (CCG)n (1)  AluSp (1)
6   172919  chr6   NT_167247.1   4421453-4594372    3      3         MIR (1)  MER11A (1)  AluSc (1)
7   165227  chr6   NT_167247.1   1562369-1727596    3      3         MIR (1)  A-rich (1)  AluSq (1)
8   165096  chr6   NT_167249.1   2138208-2303304    2      2         L1MB8 (1)  AT_rich (1)
9   159459  chr6   NT_167248.1   521832-681291      2      2         L1PREC2 (1)  HERVH-int (1)
10  151721  chr9   NT_008470.19  21692847-21844568  3      3         MIR3 (1)  L2 (1)  L1M5 (1)
11  143912  chr6   NT_167244.1   2894406-3038318    6      5         AluJo (2)  L1MC5 (1)  AluY (1)
12  118761  chr6   NT_167245.1   2606164-2724925    4      3         L2 (2)  MLT1E2 (1)  L2a (1)
13  115509  chr6   NT_167247.1   1177468-1292977    1      1         ERV3-16A3_I-int (1)
14  114275  chr6   NT_167246.1   3261120-3375395    3      2         MIRb (2)  AluSx (1)
15  108627  chr6   NT_167245.1   137647-246274      3      3         MLT1F (1)  MLT1E2 (1)  LTR12C (1)
16  107243  chr6   NT_167244.1   587710-694953      9      7         L1MA9 (3)  MER77 (1)  L1PB1 (1)
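
The Total and Distinct columns, together with the Most, Second and Third entries, amount to a frequency count of repeat names within each segment. The sketch below shows that tally for a list of repeat annotations. The function name and the input list are hypothetical; the example list is reconstructed from the counts shown for segment 5, GC_rich (3), (CCG)n (1) and AluSp (1), and is not taken from the underlying annotation file.

```python
from collections import Counter


def repeat_summary(names, top=3):
    """Total, distinct count, and the `top` most frequent repeat names."""
    counts = Counter(names)
    return {
        "total": sum(counts.values()),
        "distinct": len(counts),
        # Ties keep first-seen order, so equally frequent names are not re-sorted.
        "top": counts.most_common(top),
    }


# Hypothetical annotation list matching the counts shown for segment 5.
segment5 = ["GC_rich", "GC_rich", "GC_rich", "(CCG)n", "AluSp"]
summary = repeat_summary(segment5)
```

Here `summary["total"]` is 5 and `summary["distinct"]` is 3, which agrees with the 5 repeats reported for segment 5 in the list of longest uncut segments.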


Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   175806       chr6  NT_167244.1  3179564-3355370    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   172919       chr6  NT_167247.1  4421453-4594372    LOC100507722  hypothetical_protein_LOC100507722
7   165227       chr6  NT_167247.1  1562369-1727596    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011