Distribution of restriction sites in the human genome

Enzyme:  BspNCI
Specificity:  CCAGA
Number of sites:  7151671
Mean distance between sites:  400 base pairs
Standard deviation:  446 base pairs
Site density:  2499.4 per megabase
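
The summary figures above are mutually consistent: a density of 2499.4 sites per megabase corresponds to 10^6 / 2499.4, or about 400 base pairs between sites. The following is a minimal sketch of how such statistics could be computed for a single sequence; it is not the method used to produce this page. It assumes that the asymmetric site CCAGA is counted on both strands (so its reverse complement TCTGG is also matched), that distances are measured between start positions of neighbouring sites, and that the standard deviation is the population form. The names find_sites and spacing_stats are hypothetical.

```python
import re
from statistics import mean, pstdev

SITE = "CCAGA"  # recognition sequence listed under "Specificity"


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(seq, site=SITE):
    """Sorted 0-based start positions of the site on either strand (assumption)."""
    seq = seq.upper()
    positions = set()
    for motif in {site, reverse_complement(site)}:
        # Zero-width lookahead so overlapping occurrences are all reported.
        for match in re.finditer(f"(?={motif})", seq):
            positions.add(match.start())
    return sorted(positions)


def spacing_stats(positions, seq_length):
    """Site count, mean and SD of inter-site distances, and sites per megabase."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "sites": len(positions),
        "mean_distance": mean(gaps) if gaps else None,
        "std_distance": pstdev(gaps) if gaps else None,
        "density_per_mb": len(positions) / seq_length * 1e6,
    }
```

For a whole genome, the same functions would be applied scaffold by scaffold and the gap lists pooled before taking the mean and standard deviation.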


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
#   Length (bp)  Chr    Scaffold     Coordinates      Repeat content       Gene content
1   487123       chr15  NT_037852.6  1398349-1885472  0.01 % in 1 repeat   0.00 % in 0 genes
2   401720       chr6   NT_167244.1  2359614-2761334  0.08 % in 1 repeat   0.00 % in 0 genes
3   209120       chr6   NT_167244.1  4388915-4598035  0.60 % in 6 repeats  0.00 % in 0 genes
4   180852       chr6   NT_167244.1  3790298-3971150  0.31 % in 3 repeats  0.00 % in 0 genes
5   175864       chr6   NT_167244.1  3179741-3355605  0.25 % in 6 repeats  0.28 % in 1 gene
6   172593       chr6   NT_167247.1  4421719-4594312  0.03 % in 2 repeats  100.00 % in 1 gene
7   165645       chr6   NT_167249.1  2138455-2304100  0.47 % in 6 repeats  0.00 % in 0 genes
8   159373       chr6   NT_167248.1  521839-681212    0.04 % in 2 repeats  0.00 % in 0 genes
9   143072       chr6   NT_167244.1  2894462-3037534  0.10 % in 2 repeats  0.00 % in 0 genes
10  117559       chr6   NT_167245.1  2606250-2723809  0.06 % in 1 repeat   0.00 % in 0 genes
11  114980       chr6   NT_167247.1  1177285-1292265  0.33 % in 1 repeat   0.00 % in 0 genes
12  113940       chr6   NT_167246.1  3261165-3375105  0.34 % in 2 repeats  0.00 % in 0 genes
13  108546       chr6   NT_167245.1  137579-246125    0.59 % in 2 repeats  0.00 % in 0 genes
14  105587       chr6   NT_167244.1  1833111-1938698  0.93 % in 5 repeats  0.00 % in 0 genes
15  105189       chr6   NT_167244.1  588021-693210    0.40 % in 3 repeats  0.00 % in 0 genes
16  104759       chr6   NT_167244.1  1451450-1556209  0.16 % in 3 repeats  0.00 % in 0 genes
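
In every row of this table the length equals the difference between the two coordinates (for segment 1, 1885472 - 1398349 = 487123), so each uncut segment is the gap between two neighbouring sites on one scaffold. A minimal sketch of how the longest gaps could be extracted from a sorted list of site positions follows. The function name longest_uncut_segments and the default of 16 rows (chosen only to match the table) are assumptions; the cutoff actually used by the page is not stated.

```python
import heapq


def longest_uncut_segments(positions, k=16):
    """Return the k largest gaps between consecutive sites.

    positions: sorted site coordinates on a single scaffold.
    Each result is (length, start, end), longest first.
    """
    gaps = ((b - a, a, b) for a, b in zip(positions, positions[1:]))
    return heapq.nlargest(k, gaps)
```

Gaps are computed per scaffold here because the coordinates in the table are scaffold-relative; a gap spanning two scaffolds would not be meaningful.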


Repeats in longest uncut segments
#   Length (bp)  Chr    Scaffold     Coordinates      Total  Distinct  Most                 Second         Third
1   487123       chr15  NT_037852.6  1398349-1885472  1      1         AT_rich (1)
2   401720       chr6   NT_167244.1  2359614-2761334  1      1         AluSp (1)
3   209120       chr6   NT_167244.1  4388915-4598035  6      5         MER57-int (2)        (TTCC)n (1)    AluY (1)
4   180852       chr6   NT_167244.1  3790298-3971150  3      3         MLT1H-int (1)        MER52D (1)     AluSc (1)
5   175864       chr6   NT_167244.1  3179741-3355605  6      4         GC_rich (3)          Charlie4a (1)  (CCG)n (1)
6   172593       chr6   NT_167247.1  4421719-4594312  2      2         MER11A (1)           AluSc (1)
7   165645       chr6   NT_167249.1  2138455-2304100  6      2         L1MB8 (3)            AluSx (3)
8   159373       chr6   NT_167248.1  521839-681212    2      2         L1PREC2 (1)          HERVH-int (1)
9   143072       chr6   NT_167244.1  2894462-3037534  2      2         AluY (1)             AluSg1 (1)
10  117559       chr6   NT_167245.1  2606250-2723809  1      1         L2 (1)
11  114980       chr6   NT_167247.1  1177285-1292265  1      1         ERV3-16A3_I-int (1)
12  113940       chr6   NT_167246.1  3261165-3375105  2      2         MIRb (1)             AluSx (1)
13  108546       chr6   NT_167245.1  137579-246125    2      2         MLT1E2 (1)           LTR12C (1)
14  105587       chr6   NT_167244.1  1833111-1938698  5      4         AluSx (2)            (TATG)n (1)    MIR (1)
15  105189       chr6   NT_167244.1  588021-693210    3      3         L1ME3D (1)           L1MA9 (1)      GA-rich (1)
16  104759       chr6   NT_167244.1  1451450-1556209  3      3         ERV3-16A3_I-int (1)  AluY (1)       AluSg1 (1)


Genes in longest uncut segments
#   Length (bp)  Chr   Scaffold     Coordinates      Gene symbol   Gene function
5   175864       chr6  NT_167244.1  3179741-3355605  EHMT2         histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
6   172593       chr6  NT_167247.1  4421719-4594312  LOC100507722  hypothetical protein LOC100507722
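
The gene content percentages reported for these two segments (0.28 % for segment 5, 100.00 % for segment 6) can be read as the share of the segment's length that lies inside annotated genes; under that reading, segment 6 lies entirely within LOC100507722, while EHMT2 overlaps only about 490 bp of segment 5. The sketch below shows one way such a percentage could be computed. It assumes half-open (start, end) intervals on the same scaffold; the function name percent_covered is hypothetical, and the gene coordinates themselves are not given on this page.

```python
def percent_covered(segment, features):
    """Percent of a segment covered by the union of feature intervals.

    segment:  (start, end) of the uncut segment.
    features: iterable of (start, end) intervals, e.g. genes or repeats.
    Overlapping features are counted once.
    """
    seg_start, seg_end = segment
    # Clip each feature to the segment and drop those that do not overlap it.
    clipped = sorted(
        (max(s, seg_start), min(e, seg_end))
        for s, e in features
        if min(e, seg_end) > max(s, seg_start)
    )
    covered = 0
    cur_end = seg_start
    for s, e in clipped:
        s = max(s, cur_end)  # skip the part already counted
        if e > s:
            covered += e - s
            cur_end = e
    return 100.0 * covered / (seg_end - seg_start)
```

The same calculation applies to the repeat content column of the first table, with repeat annotations in place of genes.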



Posfai@neb.com
May 11, 2011