Distribution of restriction sites in the human genome

Enzyme:  HindII
Specificity:  GTYRAC
Number of sites:  1097138
Mean distance between sites:  2608 base pairs
Standard deviation:  2748 base pairs
Site density:  383.4 per megabase
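
The specificity GTYRAC is written with IUPAC degenerate codes: Y stands for C or T and R for A or G, so the pattern covers four concrete hexamers. The following Python is a minimal sketch of how such a pattern could be expanded into a regular expression and matched against a sequence. It is not the method used to produce this report; the function names and the toy sequence are illustrative assumptions. Because GTYRAC is its own reverse complement, scanning one strand is enough. The last lines show that the reported site density is the reciprocal of the mean distance.

```python
import re

# Subset of the IUPAC nucleotide codes (the full table has more entries)
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "N": "[ACGT]",
}

def site_regex(pattern):
    """Translate a degenerate recognition sequence into a compiled regex."""
    return re.compile("".join(IUPAC[base] for base in pattern.upper()))

def find_sites(sequence, pattern):
    """Return 0-based start positions of every match, overlapping ones included."""
    # Wrapping the pattern in a lookahead lets overlapping matches be reported.
    lookahead = re.compile("(?=" + site_regex(pattern).pattern + ")")
    return [m.start() for m in lookahead.finditer(sequence.upper())]

# Toy sequence (hypothetical, not taken from the genome)
toy = "AAGTCGACTTGTTAACCCGTCAACGG"
positions = find_sites(toy, "GTYRAC")

# Density follows from the mean spacing reported above.
mean_distance_bp = 2608
density_per_mb = 1_000_000 / mean_distance_bp  # about 383.4 sites per megabase
```

On the toy sequence, `positions` holds the start of each of the three sites it contains (GTCGAC, GTTAAC, GTCAAC).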


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   490984  chr15  NT_037852.6  1396758-1887742    0.27 % in   8 repeats    0.00 % in 0 genes
2   405712  chr6  NT_167244.1  2358436-2764148    0.35 % in   6 repeats    0.00 % in 0 genes
3   219145  chr6  NT_167244.1  4385863-4605008    4.46 % in   21 repeats    0.00 % in 0 genes
4   188958  chr6  NT_167247.1  4410773-4599731    3.55 % in   28 repeats    97.10 % in 2 genes
5   181245  chr6  NT_167244.1  3789952-3971197    0.45 % in   4 repeats    0.00 % in 0 genes
6   178605  chr6  NT_167244.1  3178326-3356931    0.53 % in   9 repeats    1.48 % in 2 genes
7   169940  chr6  NT_167248.1  517684-687624    5.07 % in   3 repeats    0.00 % in 0 genes
8   169112  chr6  NT_167249.1  2134745-2303857    1.66 % in   14 repeats    0.00 % in 0 genes
9   167797  chr6  NT_167247.1  1562036-1729833    0.86 % in   9 repeats    0.00 % in 0 genes
10   157664  chr6  NT_167244.1  2009166-2166830    0.17 % in   3 repeats    0.00 % in 0 genes
11   156466  chr6  NT_167244.1  2892338-3048804    3.09 % in   25 repeats    0.00 % in 0 genes
12   153276  chr9  NT_008470.19  21690746-21844022    1.15 % in   8 repeats    0.00 % in 0 genes
13   140459  chr7  NT_023603.5  41865-182324    100.00 % in   1 repeat    0.00 % in 0 genes
14   124073  chr6  NT_167245.1  129247-253320    7.25 % in   26 repeats    0.00 % in 0 genes
15   122504  chr6  NT_167247.1  1175931-1298435    3.14 % in   7 repeats    0.00 % in 0 genes
16   122111  chr6  NT_167245.1  2603332-2725443    3.26 % in   11 repeats    0.00 % in 0 genes
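
An uncut segment is the stretch between two consecutive recognition sites, and in the table above each Length equals the difference between the two coordinates (for segment 1, 1887742 - 1396758 = 490984). As a rough sketch under stated assumptions, the ranking could be reproduced from a list of site positions on one scaffold as follows. The function name and the example positions are hypothetical, and scaffold ends are ignored.

```python
def longest_uncut_segments(site_positions, top=3):
    """Rank the gaps between consecutive sites, longest first.

    Returns (length, start, end) tuples with length = end - start,
    matching how Length relates to Coordinates in the table.
    """
    ordered = sorted(site_positions)
    gaps = [(end - start, start, end) for start, end in zip(ordered, ordered[1:])]
    return sorted(gaps, reverse=True)[:top]

# Hypothetical site positions on a single scaffold
sites = [100, 2700, 3100, 9800, 10050]
segments = longest_uncut_segments(sites, top=2)
```

Here `segments` lists the two widest gaps, the 6700 bp stretch from 3100 to 9800 followed by the 2600 bp stretch from 100 to 2700.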


Repeats in longest uncut segments
#   Length (bp)  Chr    Scaffold      Coordinates        Total  Distinct  Most  Second  Third
1   490984       chr15  NT_037852.6   1396758-1887742    8      8         MLT1L (1)  MIRc (1)  MIRb (1)
2   405712       chr6   NT_167244.1   2358436-2764148    6      5         AluJb (2)  L4 (1)  L1MEg (1)
3   219145       chr6   NT_167244.1   4385863-4605008    21     13        MER57-int (3)  HERVH-int (3)  AluSx (3)
4   188958       chr6   NT_167247.1   4410773-4599731    28     21        L2b (3)  AluSx (3)  MLT1J (2)
5   181245       chr6   NT_167244.1   3789952-3971197    4      4         MLT1H-int (1)  MER52D (1)  AluSc (1)
6   178605       chr6   NT_167244.1   3178326-3356931    9      7         GC_rich (3)  LTR23 (1)  L2a (1)
7   169940       chr6   NT_167248.1   517684-687624      3      3         L1PREC2 (1)  HERVH-int (1)  AT_rich (1)
8   169112       chr6   NT_167249.1   2134745-2303857    14     9         AluSx (3)  MLT1A (2)  L1MB8 (2)
9   167797       chr6   NT_167247.1   1562036-1729833    9      7         MIR (2)  L1MEe (2)  L1MC3 (1)
10  157664       chr6   NT_167244.1   2009166-2166830    3      3         MIRb (1)  MER5A1 (1)  L1MC4a (1)
11  156466       chr6   NT_167244.1   2892338-3048804    25     13        L1MC5 (6)  L2c (3)  AluY (3)
12  153276       chr9   NT_008470.19  21690746-21844022  8      6         LTR67B (2)  L1M4b (2)  MSTA (1)
13  140459       chr7   NT_023603.5   41865-182324       1      1         ALR/Alpha (1)
14  124073       chr6   NT_167245.1   129247-253320      26     23        L2c (2)  AT_rich (2)  AluSx (2)
15  122504       chr6   NT_167247.1   1175931-1298435    7      4         ERV3-16A3_I-int (4)  L3 (1)  L2a (1)
16  122111       chr6   NT_167245.1   2603332-2725443    11     9         MLT1N2 (2)  L2 (2)  MLT1E2 (1)
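
The Total and Distinct columns appear to count, respectively, all repeat annotations in a segment and the number of different repeat names among them. A minimal sketch of that summary, assuming the annotations are available as a plain list of repeat names: the function name is hypothetical, and the example list only mimics segment 2 (the names MIR and L2 are made-up fillers).

```python
from collections import Counter

def summarize_repeats(names, top=3):
    """Summarize repeat annotations: total count, distinct names, most frequent."""
    counts = Counter(names)
    return {
        "total": sum(counts.values()),
        "distinct": len(counts),
        "top": counts.most_common(top),  # ties keep first-seen order
    }

# Hypothetical annotation list shaped like segment 2 (6 repeats, 5 distinct)
annotations = ["AluJb", "L4", "AluJb", "L1MEg", "MIR", "L2"]
summary = summarize_repeats(annotations)
```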


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
4   188958       chr6  NT_167247.1  4410773-4599731    COL11A2P
4   188958       chr6  NT_167247.1  4410773-4599731    LOC100507722  hypothetical protein LOC100507722
6   178605       chr6  NT_167244.1  3178326-3356931    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
6   178605       chr6  NT_167244.1  3178326-3356931    TNXB  tenascin-X isoform 1 precursor



Posfai@neb.com
May 11, 2011