Distribution of restriction sites in the human genome

Enzyme:  PflMI
Specificity:  CCANNNNNTGG
Number of sites:  974536
Mean distance between sites:  2936 base pairs
Standard deviation:  3306 base pairs
Site density:  340.6 per megabase
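
The summary figures can be related to each other with a short sketch. It assumes that N in the specificity stands for any of A, C, G or T, and that the site density is simply one million divided by the mean distance between sites; how the page actually counted sites is not stated, so the helper names below (`site_regex`, `find_sites`, `density_per_megabase`) and the toy sequence are illustrative only. Because CCANNNNNTGG is its own reverse complement, scanning one strand is enough.

```python
import re

# PflMI recognition sequence from the summary; N is assumed to mean any base.
SITE = "CCANNNNNTGG"


def site_regex(site):
    """Translate a recognition sequence with N wildcards into a regex.

    The pattern is wrapped in a lookahead so overlapping sites are reported.
    """
    body = "".join("[ACGT]" if base == "N" else base for base in site)
    return re.compile("(?=" + body + ")")


def find_sites(sequence, site=SITE):
    """Return the 0-based start position of every occurrence of the site."""
    return [m.start() for m in site_regex(site).finditer(sequence.upper())]


def density_per_megabase(mean_distance_bp):
    """Sites per megabase implied by a mean site-to-site distance."""
    return 1_000_000 / mean_distance_bp


# Hypothetical toy sequence containing two PflMI sites.
toy = "TTCCAACGTATGGAACCATTTTTTGGCC"
print(find_sites(toy))                       # [2, 15]
print(round(density_per_megabase(2936), 1))  # 340.6
```

With the reported mean distance of 2936 base pairs, the sketch reproduces the reported density of 340.6 sites per megabase.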


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD (standard deviation) distance]


Longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   501540  chr15  NT_037852.6  1395444-1896984    1.05 % in   24 repeats    0.78 % in 1 genes
2   402276  chr6  NT_167244.1  2359404-2761680    0.08 % in   2 repeats    0.00 % in 0 genes
3   209458  chr6  NT_167244.1  4389949-4599407    0.63 % in   7 repeats    0.00 % in 0 genes
4   185791  chr6  NT_167244.1  3785656-3971447    1.08 % in   13 repeats    0.99 % in 1 genes
5   177736  chr6  NT_167247.1  4418772-4596508    1.47 % in   11 repeats    100.00 % in 1 genes
6   176116  chr6  NT_167244.1  3180007-3356123    0.24 % in   5 repeats    0.12 % in 1 genes
7   173957  chr6  NT_167249.1  2130540-2304497    3.73 % in   29 repeats    0.00 % in 0 genes
8   166578  chr6  NT_167247.1  1560669-1727247    0.67 % in   5 repeats    1.37 % in 1 genes
9   160416  chr6  NT_167248.1  521028-681444    0.69 % in   2 repeats    0.00 % in 0 genes
10   156770  chr6  NT_167244.1  2008324-2165094    0.52 % in   4 repeats    0.00 % in 0 genes
11   155457  chr9  NT_008470.19  21690613-21846070    1.75 % in   12 repeats    0.00 % in 0 genes
12   143813  chr6  NT_167244.1  2894162-3037975    0.47 % in   6 repeats    0.00 % in 0 genes
13   125291  chr1  NT_077389.3  264585-389876    99.44 % in   55 repeats    0.00 % in 0 genes
14   119583  chr6  NT_167245.1  2606069-2725652    1.59 % in   5 repeats    0.00 % in 0 genes
15   118756  chr6  NT_167245.1  135685-254441    5.77 % in   19 repeats    0.00 % in 0 genes
16   117986  chr6  NT_167244.1  586619-704605    5.76 % in   30 repeats    0.00 % in 0 genes
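
Each row of this table can be read mechanically, and the reported length can be checked against the coordinates. The following is a minimal sketch, assuming the column layout seen in the rows above (rank, length, chromosome, scaffold, start-end, repeat percentage and count, gene percentage and count); the regular expression and the `parse_row` helper are hypothetical and not part of the original page.

```python
import re

# Column layout assumed from the rows of the table above.
ROW = re.compile(
    r"(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>\S+)\s+(?P<scaffold>\S+)\s+"
    r"(?P<start>\d+)-(?P<end>\d+)\s+"
    r"(?P<repeat_pct>[\d.]+)\s*%\s+in\s+(?P<repeats>\d+)\s+repeats?\s+"
    r"(?P<gene_pct>[\d.]+)\s*%\s+in\s+(?P<genes>\d+)\s+genes?"
)


def parse_row(line):
    """Parse one row of the longest-uncut-segments table into a dict."""
    match = ROW.match(line.strip())
    if match is None:
        raise ValueError("unrecognized row: " + line)
    row = match.groupdict()
    for key in ("rank", "length", "start", "end", "repeats", "genes"):
        row[key] = int(row[key])
    for key in ("repeat_pct", "gene_pct"):
        row[key] = float(row[key])
    return row


line = ("1   501540  chr15  NT_037852.6  1395444-1896984"
        "    1.05 % in   24 repeats    0.78 % in 1 genes")
row = parse_row(line)

# The segment length equals the difference between the two coordinates.
print(row["end"] - row["start"] == row["length"])  # True
```

For the first row, 1896984 - 1395444 = 501540, which matches the reported length.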


Repeats in longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1   501540  chr15  NT_037852.6  1395444-1896984    24  19    L2a (3)  MER44C (2)  L1M5 (2)
2   402276  chr6  NT_167244.1  2359404-2761680    2  2    L4 (1)  AluSp (1)
3   209458  chr6  NT_167244.1  4389949-4599407    7  6    AluSx (2)  L1PA15 (1)  L1ME3D (1)
4   185791  chr6  NT_167244.1  3785656-3971447    13  11    AT_rich (2)  AluJb (2)  MLT1H-int (1)
5   177736  chr6  NT_167247.1  4418772-4596508    11  11    (TTAAA)n (1)  MLT1J (1)  MIRb (1)
6   176116  chr6  NT_167244.1  3180007-3356123    5  4    GC_rich (2)  Charlie4a (1)  (CCG)n (1)
7   173957  chr6  NT_167249.1  2130540-2304497    29  15    AluSx (5)  L1MB8 (3)  AluJo (3)
8   166578  chr6  NT_167247.1  1560669-1727247    5  5    MIRc (1)  MIR (1)  L2c (1)
9   160416  chr6  NT_167248.1  521028-681444    2  2    L1PREC2 (1)  HERVH-int (1)
10   156770  chr6  NT_167244.1  2008324-2165094    4  4    MIRb (1)  MIR (1)  AluY (1)
11   155457  chr9  NT_008470.19  21690613-21846070    12  9    LTR67B (2)  L2 (2)  L1M4b (2)
12   143813  chr6  NT_167244.1  2894162-3037975    6  6    L1MC5 (1)  AluY (1)  AluSp (1)
13   125291  chr1  NT_077389.3  264585-389876    55  4    ALR/Alpha (52)  MLT1J (1)  L1HS (1)
14   119583  chr6  NT_167245.1  2606069-2725652    5  4    L2 (2)  MLT1E2 (1)  L2a (1)
15   118756  chr6  NT_167245.1  135685-254441    19  18    L2c (2)  (TTTC)n (1)  tRNA-Phe-TTY (1)
16   117986  chr6  NT_167244.1  586619-704605    30  23    L2b (3)  L1MA9 (3)  L2c (2)


Genes in longest uncut segments
Segment   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
1   501540       chr15  NT_037852.6  1395444-1896984    LOC100418897
4   185791       chr6  NT_167244.1  3785656-3971447    HLA-DRB3  major histocompatibility complex, class II, DR beta 3 precursor
5   177736       chr6  NT_167247.1  4418772-4596508    LOC100507722  hypothetical protein LOC100507722
6   176116       chr6  NT_167244.1  3180007-3356123    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
8   166578       chr6  NT_167247.1  1560669-1727247    LOC100421582  tripartite motif-containing protein 26



Posfai@neb.com
May 11, 2011