Distribution of restriction sites in the human genome

Enzyme:  SfeI
Specificity:  CTRYAG
Number of sites:  3460333
Mean distance between sites:  826 base pairs
Standard deviation:  864 base pairs
Site density:  1209.3 per megabase
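
The summary figures above can be illustrated with a minimal sketch. In the specificity CTRYAG, R stands for A or G and Y for C or T (IUPAC codes), and the site is its own reverse complement, so scanning one strand is enough. The reported density also agrees with the mean spacing: 1,000,000 / 1209.3 is roughly 827 base pairs. The function names, the toy sequence, and the use of the population standard deviation are assumptions made for illustration; the page does not state how its statistics were computed.

```python
import re
from statistics import mean, pstdev

# IUPAC nucleotide codes needed for CTRYAG, written as regex fragments.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "R": "[AG]", "Y": "[CT]", "N": "[ACGT]"}


def site_regex(specificity):
    """Compile a degenerate recognition sequence into a regex.

    A lookahead is used so that overlapping matches would also be reported.
    """
    return re.compile("(?=" + "".join(IUPAC[b] for b in specificity) + ")")


def site_positions(seq, specificity="CTRYAG"):
    """Return 0-based start positions of every site in seq."""
    return [m.start() for m in site_regex(specificity).finditer(seq.upper())]


def spacing_stats(positions, seq_len):
    """Summarize site spacing (hypothetical helper, not the page's method)."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "sites": len(positions),
        "mean_gap": mean(gaps) if gaps else None,
        # Assumption: population standard deviation.
        "std_gap": pstdev(gaps) if gaps else None,
        "per_megabase": len(positions) * 1e6 / seq_len,
    }


# Toy sequence containing CTACAG, CTGTAG and CTATAG.
toy = "AACTACAGTTTTCTGTAGGGGGGCTATAGAA"
positions = site_positions(toy)
stats = spacing_stats(positions, len(toy))
```

For a genome-sized input the same scan would be run per scaffold, and gaps would not be counted across scaffold boundaries.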


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
#   Length (bp)  Chr    Scaffold      Coordinates        Repeat content         Gene content
1   486847       chr15  NT_037852.6   1398630-1885477    0.01 % in 1 repeat     0.00 % in 0 genes
2   404611       chr6   NT_167244.1   2359234-2763845    0.17 % in 3 repeats    0.00 % in 0 genes
3   208593       chr6   NT_167244.1   4389962-4598555    0.33 % in 4 repeats    0.00 % in 0 genes
4   181223       chr6   NT_167244.1   3790311-3971534    0.51 % in 5 repeats    0.00 % in 0 genes
5   176438       chr6   NT_167244.1   3180190-3356628    0.26 % in 5 repeats    0.27 % in 2 genes
6   172619       chr6   NT_167247.1   4422156-4594775    0.30 % in 2 repeats    100.00 % in 1 gene
7   166731       chr6   NT_167247.1   1561147-1727878    1.02 % in 7 repeats    1.08 % in 1 gene
8   165981       chr6   NT_167249.1   2138239-2304220    0.58 % in 7 repeats    0.00 % in 0 genes
9   160538       chr6   NT_167248.1   521326-681864      0.76 % in 2 repeats    0.00 % in 0 genes
10  155726       chr6   NT_167244.1   2009041-2164767    0.07 % in 2 repeats    0.00 % in 0 genes
11  151214       chr9   NT_008470.19  21692612-21843826  0.47 % in 3 repeats    0.00 % in 0 genes
12  143637       chr6   NT_167244.1   2894114-3037751    0.34 % in 5 repeats    0.00 % in 0 genes
13  120725       chr6   NT_167245.1   2603486-2724211    2.30 % in 8 repeats    0.00 % in 0 genes
14  116004       chr6   NT_167247.1   1177191-1293195    0.41 % in 1 repeat     0.00 % in 0 genes
15  109236       chr6   NT_167245.1   137791-247027      1.18 % in 4 repeats    0.00 % in 0 genes
16  108159       chr6   NT_167244.1   587107-695266      2.81 % in 10 repeats   0.00 % in 0 genes
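
In the table above, each length equals the difference between the two coordinates (for segment 1, 1885477 - 1398630 = 486847), which suggests that a segment is the stretch between two consecutive sites. A minimal sketch of such a ranking follows; the function name and the toy positions are hypothetical, and the page does not describe its actual procedure.

```python
import heapq


def longest_uncut(positions, top=16):
    """Return the `top` longest gaps between consecutive site positions.

    `positions` must be sorted and belong to a single scaffold.
    Each result is a (length, start, end) tuple, longest first.
    """
    segments = [(b - a, a, b) for a, b in zip(positions, positions[1:])]
    return heapq.nlargest(top, segments)


# Toy site positions on one scaffold (illustrative values only).
toy_positions = [10, 50, 55, 300, 320]
top_two = longest_uncut(toy_positions, top=2)
```

Ranking across a whole genome would apply the same function to each scaffold and merge the per-scaffold results.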


Repeats in longest uncut segments
#   Length (bp)  Chr    Scaffold      Coordinates        Total  Distinct  Most frequent        Second         Third
1   486847       chr15  NT_037852.6   1398630-1885477    1      1         AT_rich (1)
2   404611       chr6   NT_167244.1   2359234-2763845    3      3         L4 (1)               L1MEg (1)      AluSp (1)
3   208593       chr6   NT_167244.1   4389962-4598555    4      4         L1MC (1)             AluSx (1)      AluSg/x (1)
4   181223       chr6   NT_167244.1   3790311-3971534    5      5         MLT1H-int (1)        MER52D (1)     LTR19B (1)
5   176438       chr6   NT_167244.1   3180190-3356628    5      5         GC_rich (1)          Charlie4a (1)  (CCG)n (1)
6   172619       chr6   NT_167247.1   4422156-4594775    2      2         MER11A (1)           AluSc (1)
7   166731       chr6   NT_167247.1   1561147-1727878    7      6         MIR (2)              MIRc (1)       L1MC3 (1)
8   165981       chr6   NT_167249.1   2138239-2304220    7      3         L1MB8 (3)            AluSx (3)      AT_rich (1)
9   160538       chr6   NT_167248.1   521326-681864      2      2         L1PREC2 (1)          HERVH-int (1)
10  155726       chr6   NT_167244.1   2009041-2164767    2      2         MIRb (1)             MIR (1)
11  151214       chr9   NT_008470.19  21692612-21843826  3      3         MIR3 (1)             LTR67B (1)     L1M5 (1)
12  143637       chr6   NT_167244.1   2894114-3037751    5      5         L1MC5 (1)            AluY (1)       AluSg1 (1)
13  120725       chr6   NT_167245.1   2603486-2724211    8      7         MLT1N2 (2)           MLT1E2 (1)     MER5B (1)
14  116004       chr6   NT_167247.1   1177191-1293195    1      1         ERV3-16A3_I-int (1)
15  109236       chr6   NT_167245.1   137791-247027      4      4         MLT1F (1)            MLT1E2 (1)     LTR12C (1)
16  108159       chr6   NT_167244.1   587107-695266      10     8         L1MA9 (3)            MIR (1)        MER77 (1)


Genes in longest uncut segments
Segment  Length (bp)  Chr    Scaffold      Coordinates        Gene symbol   Gene function
5        176438       chr6   NT_167244.1   3180190-3356628    EHMT2         histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
                                                              TNXB          tenascin-X_isoform_1_precursor
6        172619       chr6   NT_167247.1   4422156-4594775    LOC100507722  hypothetical_protein_LOC100507722
7        166731       chr6   NT_167247.1   1561147-1727878    LOC100421582  tripartite_motif-containing_protein_26
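
The repeat-content and gene-content percentages reported for each segment can be read as the share of segment bases covered by annotated features. The sketch below shows one way to compute such a share. It assumes half-open (start, end) coordinates and merges overlapping features so no base is counted twice; the function name and the example intervals are hypothetical, and the page does not say how its percentages were derived.

```python
def percent_covered(segment, features):
    """Percentage of `segment` covered by the union of `features`.

    `segment` and each feature are (start, end) pairs, assumed half-open.
    Features are clipped to the segment and overlaps are merged.
    """
    seg_start, seg_end = segment
    clipped = sorted(
        (max(s, seg_start), min(e, seg_end))
        for s, e in features
        if s < seg_end and e > seg_start  # keep only overlapping features
    )
    covered = 0
    cur_start = cur_end = None
    for s, e in clipped:
        if cur_end is None or s > cur_end:
            # Close the previous merged interval and start a new one.
            if cur_end is not None:
                covered += cur_end - cur_start
            cur_start, cur_end = s, e
        else:
            # Extend the current merged interval.
            cur_end = max(cur_end, e)
    if cur_end is not None:
        covered += cur_end - cur_start
    return 100.0 * covered / (seg_end - seg_start)


# Illustrative intervals only, not taken from the report.
example = percent_covered((100, 200), [(90, 120), (110, 130), (150, 160)])
```

A feature that spans the whole segment gives 100 %, as reported for segment 6 in the first table.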



Posfai@neb.com
May 11, 2011