Distribution of restriction sites in the human genome

Enzyme:  HpaI               Longest uncut segments
Specificity:  GTTAAC               Repeats in uncut segments
Number of sites:  387237               Genes in uncut segments
Mean distance between sites:  7389 base pairs
Standard deviation:  8047 base pairs
Site density 135.3 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   497839  chr15  NT_037852.6  1389903-1887742    1.35 % in   19 repeats    0.00 % in 0 genes
2   407660  chr6  NT_167244.1  2358436-2766096    0.69 % in   12 repeats    0.00 % in 0 genes
3   214899  chr6  NT_167247.1  4406256-4621155    5.05 % in   47 repeats    94.92 % in 3 genes
4   207798  chr6  NT_167244.1  3787288-3995086    7.06 % in   56 repeats    0.10 % in 1 genes
5   202880  chr6  NT_167249.1  2134745-2337625    11.46 % in   79 repeats    4.34 % in 1 genes
6   191141  chr6  NT_167244.1  3166620-3357761    4.16 % in   49 repeats    7.94 % in 2 genes
7   185619  chr6  NT_167248.1  502005-687624    11.14 % in   21 repeats    0.00 % in 0 genes
8   179894  chr9  NT_008470.19  21685548-21865442    6.66 % in   42 repeats    0.00 % in 0 genes
9   172096  chr6  NT_167244.1  1994734-2166830    3.39 % in   31 repeats    0.00 % in 0 genes
10   171312  chr1  NT_004350.19  2047618-2218930    12.15 % in   72 repeats    0.00 % in 0 genes
11   170549  chr6  NT_167247.1  1562036-1732585    1.58 % in   15 repeats    0.00 % in 0 genes
12   163783  chr6  NT_167244.1  1436632-1600415    18.20 % in   82 repeats    0.00 % in 0 genes
13   160772  chr6  NT_167244.1  2891570-3052342    4.09 % in   33 repeats    0.00 % in 0 genes
14   152138  chrX  NT_011669.17  98957-251095    99.45 % in   32 repeats    0.00 % in 0 genes
15   151125  chr6  NT_167246.1  3002600-3153725    15.40 % in   122 repeats    0.00 % in 0 genes
16   143844  chr12  NT_009714.17  27197687-27341531    85.09 % in   129 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
497839  chr15  NT_037852.6  1389903-1887742    19  14       L1MDa (6)  MLT1L (1)  MIRc (1) 
407660  chr6  NT_167244.1  2358436-2766096    12  9       LTR84b (2)  AluY (2)  AluJb (2) 
214899  chr6  NT_167247.1  4406256-4621155    47  31       L2b (3)  L1PB1 (3)  AluSx (3) 
207798  chr6  NT_167244.1  3787288-3995086    56  40       L2a (6)  L1P3 (3)  L1MEc (3) 
202880  chr6  NT_167249.1  2134745-2337625    79  38       AluSx (10)  Charlie2b (6)  L2a (4) 
191141  chr6  NT_167244.1  3166620-3357761    49  26       L1MC5 (6)  AluSx (6)  L1MB3 (4) 
185619  chr6  NT_167248.1  502005-687624    21  18       MER4D (2)  L1PA14 (2)  L1M5 (2) 
179894  chr9  NT_008470.19  21685548-21865442    42  30       L2 (3)  L1M5 (3)  Tigger1 (2) 
172096  chr6  NT_167244.1  1994734-2166830    31  18       AluSx (6)  L2c (3)  MIR (2) 
10  171312  chr1  NT_004350.19  2047618-2218930    72  44       MIR (5)  AluJb (5)  L1MEf (4) 
11  170549  chr6  NT_167247.1  1562036-1732585    15  12       MSTB (2)  MIR (2)  L1MEe (2) 
12  163783  chr6  NT_167244.1  1436632-1600415    82  45       L1MA1 (7)  AluSx (6)  AluY (5) 
13  160772  chr6  NT_167244.1  2891570-3052342    33  18       L1MC5 (6)  L2c (3)  AluY (3) 
14  152138  chrX  NT_011669.17  98957-251095    32  10       ALR/Alpha (17)  L1PA2 (4)  L1PA3 (3) 
15  151125  chr6  NT_167246.1  3002600-3153725    122  39       AluSx (20)  AluY (11)  L2c (10) 
16  143844  chr12  NT_009714.17  27197687-27341531    129  8       GSATII (116)  GSATX (4)  ALR/Alpha (3) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   214899       chr6  NT_167247.1  4406256-4621155    COL11A2P 
LOC100507722  hypothetical_protein_LOC100507722
COL11A2  collagen_alpha-2(XI)_chain_isoform_4_precursor
4   207798       chr6  NT_167244.1  3787288-3995086    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
5   202880       chr6  NT_167249.1  2134745-2337625    LOC100507679  hypothetical_protein_LOC100507679
6   191141       chr6  NT_167244.1  3166620-3357761    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor



Posfai@neb.com
May 11, 2011