Distribution of restriction sites in the human genome

Enzyme:  OgrI
Specificity:  CAACNAC
Number of sites:  1037948
Mean distance between sites:  2756 base pairs
Standard deviation:  3162 base pairs
Site density:  362.7 per megabase
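
The summary figures above can be related to each other with a short sketch. The code below is an illustration only, not the method used to produce this report: the function names are hypothetical, and scanning both strands is an assumption, since the page does not say how sites were counted. It shows how a specificity containing the ambiguity code N (any base) can be matched, and how a density in sites per megabase follows from a mean spacing.

```python
import re

# Regex fragments for the bases used here; N stands for any nucleotide.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "N": "[ACGT]"}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(pattern):
    """Return the reverse complement of a recognition sequence."""
    return pattern.translate(COMPLEMENT)[::-1]


def find_sites(seq, pattern="CAACNAC"):
    """Return sorted 0-based start positions of the pattern on either strand.

    Assumption: both strands are scanned. A lookahead is used so that
    overlapping occurrences are not skipped.
    """
    seq = seq.upper()
    hits = set()
    for p in (pattern, reverse_complement(pattern)):
        regex = re.compile("(?=" + "".join(IUPAC[b] for b in p) + ")")
        hits.update(m.start() for m in regex.finditer(seq))
    return sorted(hits)


def sites_per_megabase(mean_distance_bp):
    """Approximate site density implied by a mean distance between sites."""
    return 1_000_000 / mean_distance_bp


forward_hits = find_sites("TTCAACGACTT")   # CAACGAC on the given strand
reverse_hits = find_sites("AAGTAGTTGAA")   # GTAGTTG, i.e. the site on the other strand
density = sites_per_megabase(2756)
```

With the reported mean distance of 2756 base pairs, this simple reciprocal gives a value close to, but not exactly, the reported 362.7 sites per megabase; the small difference presumably reflects rounding or how sequence ends and gaps were handled.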


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
 #  Length (bp)  Chr    Scaffold      Coordinates        Repeat content          Gene content
 1  488952       chr15  NT_037852.6   1397753-1886705     0.05 % in   4 repeats    0.00 % in 0 genes
 2  407353       chr6   NT_167244.1   2358395-2765748     0.66 % in  12 repeats    0.00 % in 0 genes
 3  218018       chr6   NT_167244.1   4385531-4603549     3.81 % in  21 repeats    0.00 % in 0 genes
 4  189496       chr6   NT_167244.1   3786018-3975514     2.50 % in  23 repeats    0.78 % in 1 gene
 5  181667       chr6   NT_167244.1   3175583-3357250     1.99 % in  28 repeats    3.14 % in 2 genes
 6  179376       chr6   NT_167247.1   4421505-4600881     2.04 % in  17 repeats  100.00 % in 1 gene
 7  169200       chr6   NT_167247.1   1557996-1727196     1.71 % in  13 repeats    2.93 % in 1 gene
 8  167682       chr6   NT_167249.1   2137711-2305393     1.24 % in   9 repeats    0.00 % in 0 genes
 9  164311       chr6   NT_167248.1   520797-685108       3.04 % in   2 repeats    0.00 % in 0 genes
10  162141       chr6   NT_167244.1   2005853-2167994     2.08 % in  15 repeats    0.00 % in 0 genes
11  155869       chr9   NT_008470.19  21687989-21843858   2.71 % in  12 repeats    0.00 % in 0 genes
12  152191       chr6   NT_167244.1   2892311-3044502     2.88 % in  23 repeats    0.00 % in 0 genes
13  128581       chr1   NT_077389.3   261274-389855      97.69 % in  60 repeats    0.00 % in 0 genes
14  122971       chr6   NT_167247.1   1173094-1296065     3.56 % in   7 repeats    0.00 % in 0 genes
15  120184       chr6   NT_167245.1   2603466-2723650     1.95 % in   6 repeats    0.00 % in 0 genes
16  119088       chr10  NT_008705.16  38710957-38830045  27.57 % in 217 repeats    0.00 % in 0 genes
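
As a quick consistency check on the table above, each reported length can be compared with the span of its coordinates. The sketch below is illustrative: the helper names are hypothetical, only the first three rows are copied in, and the convention that length equals end minus start is inferred from the listed values rather than stated by the page.

```python
# (length in bp, chromosome, scaffold coordinates) for the three longest segments.
segments = [
    (488952, "chr15", "1397753-1886705"),
    (407353, "chr6", "2358395-2765748"),
    (218018, "chr6", "4385531-4603549"),
]


def span_length(coordinates):
    """Length implied by a 'start-end' coordinate string, as end minus start."""
    start, end = (int(part) for part in coordinates.split("-"))
    return end - start


def mismatches(rows):
    """Return the rows whose reported length differs from the coordinate span."""
    return [row for row in rows if span_length(row[2]) != row[0]]


inconsistent = mismatches(segments)  # empty when every row is self-consistent
```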


Repeats in longest uncut segments
 #  Length (bp)  Chr    Scaffold      Coordinates        Total  Distinct  Most frequent    Second               Third
 1  488952       chr15  NT_037852.6   1397753-1886705        4         4  MIRc (1)         MIRb (1)             L1M3 (1)
 2  407353       chr6   NT_167244.1   2358395-2765748       12         9  LTR84b (2)       AluY (2)             AluJb (2)
 3  218018       chr6   NT_167244.1   4385531-4603549       21        13  MER57-int (3)    HERVH-int (3)        AluSx (3)
 4  189496       chr6   NT_167244.1   3786018-3975514       23        19  MLT1H-int (2)    L2a (2)              AT_rich (2)
 5  181667       chr6   NT_167244.1   3175583-3357250       28        16  AluSx (5)        L1MB3 (4)            GC_rich (3)
 6  179376       chr6   NT_167247.1   4421505-4600881       17        13  AluSx (3)        MLT1J (2)            L1MC5 (2)
 7  169200       chr6   NT_167247.1   1557996-1727196       13        11  Tigger7 (2)      MSTD (2)             (TG)n (1)
 8  167682       chr6   NT_167249.1   2137711-2305393        9         5  L1MB8 (3)        AluSx (3)            L1MC4a (1)
 9  164311       chr6   NT_167248.1   520797-685108          2         2  L1PREC2 (1)      HERVH-int (1)
10  162141       chr6   NT_167244.1   2005853-2167994       15        10  AluSx (4)        MIR (2)              AluJb (2)
11  155869       chr9   NT_008470.19  21687989-21843858     12         9  MER5B (2)        LTR67B (2)           L1M4b (2)
12  152191       chr6   NT_167244.1   2892311-3044502       23        11  L1MC5 (6)        L2c (3)              AluY (3)
13  128581       chr1   NT_077389.3   261274-389855         60         8  ALR/Alpha (52)   MLT1J (2)            L2c (1)
14  122971       chr6   NT_167247.1   1173094-1296065        7         4  L2 (3)           ERV3-16A3_I-int (2)  MLT1E2 (1)
15  120184       chr6   NT_167245.1   2603466-2723650        6         5  MLT1N2 (2)       MER5B (1)            MER5A1 (1)
16  119088       chr10  NT_008705.16  38710957-38830045    217        29  GA-rich (24)     (GAATG)n (22)        (AAATG)n (22)


Genes in longest uncut segments
Segment  Length (bp)  Chr   Scaffold     Coordinates      Gene symbol   Gene function
4        189496       chr6  NT_167244.1  3786018-3975514  HLA-DRB3      major histocompatibility complex, class II, DR beta 3 precursor
5        181667       chr6  NT_167244.1  3175583-3357250  EHMT2         histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
                                                          TNXB          tenascin-X isoform 1 precursor
6        179376       chr6  NT_167247.1  4421505-4600881  LOC100507722  hypothetical protein LOC100507722
7        169200       chr6  NT_167247.1  1557996-1727196  LOC100421582  tripartite motif-containing protein 26



Posfai@neb.com
May 11, 2011