Distribution of restriction sites in the human genome

Enzyme:  CjeI               Longest uncut segments
Specificity:  CCANNNNNNGT               Repeats in uncut segments
Number of sites:  5147848               Genes in uncut segments
Mean distance between sites:  555 base pairs
Standard deviation:  600 base pairs
Site density1799.1 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   488525  chr15  NT_037852.6  1397521-1886046    0.09 % in   4 repeats    0.00 % in 0 genes
2   401967  chr6  NT_167244.1  2359961-2761928    0.00 % in   1 repeats    0.00 % in 0 genes
3   208675  chr6  NT_167244.1  4389396-4598071    0.38 % in   6 repeats    0.00 % in 0 genes
4   181010  chr6  NT_167244.1  3790072-3971082    0.35 % in   4 repeats    0.00 % in 0 genes
5   177946  chr6  NT_167244.1  3179355-3357301    0.58 % in   9 repeats    1.12 % in 2 genes
6   172946  chr6  NT_167247.1  4422053-4594999    0.43 % in   2 repeats    100.00 % in 1 genes
7   165355  chr6  NT_167249.1  2138208-2303563    0.21 % in   3 repeats    0.00 % in 0 genes
8   165275  chr6  NT_167247.1  1562856-1728131    0.41 % in   4 repeats    0.06 % in 1 genes
9   159986  chr6  NT_167248.1  521589-681575    0.42 % in   2 repeats    0.00 % in 0 genes
10   150598  chr9  NT_008470.19  21692870-21843468    0.26 % in   1 repeats    0.00 % in 0 genes
11   143244  chr6  NT_167244.1  2894571-3037815    0.29 % in   4 repeats    0.00 % in 0 genes
12   117856  chr6  NT_167245.1  2605827-2723683    0.35 % in   1 repeats    0.00 % in 0 genes
13   114822  chr6  NT_167247.1  1177582-1292404    0.07 % in   1 repeats    0.00 % in 0 genes
14   114457  chr6  NT_167246.1  3260392-3374849    0.21 % in   3 repeats    0.00 % in 0 genes
15   108348  chr6  NT_167245.1  137952-246300    0.41 % in   3 repeats    0.00 % in 0 genes
16   106376  chr6  NT_167244.1  1451184-1557560    1.65 % in   6 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
488525  chr15  NT_037852.6  1397521-1886046    4       MIRc (1)  MIRb (1)  L1M3 (1) 
401967  chr6  NT_167244.1  2359961-2761928    1       AluSp (1) 
208675  chr6  NT_167244.1  4389396-4598071    5       MER57-int (2)  (TTCC)n (1)  AluY (1) 
181010  chr6  NT_167244.1  3790072-3971082    4       MLT1H-int (1)  MER52D (1)  AluSc (1) 
177946  chr6  NT_167244.1  3179355-3357301    6       GC_rich (3)  AluSp (2)  L2c (1) 
172946  chr6  NT_167247.1  4422053-4594999    2       MER11A (1)  AluSc (1) 
165355  chr6  NT_167249.1  2138208-2303563    3       L1MB8 (1)  AT_rich (1)  AluSx (1) 
165275  chr6  NT_167247.1  1562856-1728131    3       MIR (2)  (GGAA)n (1)  AluSq (1) 
159986  chr6  NT_167248.1  521589-681575    2       L1PREC2 (1)  HERVH-int (1) 
10  150598  chr9  NT_008470.19  21692870-21843468    1       L1M5 (1) 
11  143244  chr6  NT_167244.1  2894571-3037815    4       L1MC5 (1)  AluY (1)  AluSg1 (1) 
12  117856  chr6  NT_167245.1  2605827-2723683    1       L2a (1) 
13  114822  chr6  NT_167247.1  1177582-1292404    1       ERV3-16A3_I-int (1) 
14  114457  chr6  NT_167246.1  3260392-3374849    3       MIRb (1)  MIR3 (1)  AluSx (1) 
15  108348  chr6  NT_167245.1  137952-246300    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
16  106376  chr6  NT_167244.1  1451184-1557560    6       L1MA1 (1)  ERV3-16A3_I-int (1)  AT_rich (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   177946       chr6  NT_167244.1  3179355-3357301    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
6   172946       chr6  NT_167247.1  4422053-4594999    LOC100507722  hypothetical_protein_LOC100507722
8   165275       chr6  NT_167247.1  1562856-1728131    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011