Distribution of restriction sites in the human genome

Enzyme:  KpnI               Longest uncut segments
Specificity:  GGTACC               Repeats in uncut segments
Number of sites:  286337               Genes in uncut segments
Mean distance between sites:  9992 base pairs
Standard deviation:  10869 base pairs
Site density 100.1 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   543594  chr15  NT_037852.6  1394625-1938219    3.47 % in   92 repeats    2.89 % in 2 genes
2   402911  chr6  NT_167244.1  2359609-2762520    0.08 % in   1 repeats    0.00 % in 0 genes
3   308050  chrY  NT_011875.12  8412710-8720760    82.79 % in   43 repeats    0.00 % in 0 genes
4   220547  chr6  NT_167244.1  4387334-4607881    5.20 % in   21 repeats    0.00 % in 0 genes
5   219797  chr6  NT_167247.1  4411383-4631180    4.59 % in   50 repeats    96.01 % in 3 genes
6   201212  chr6  NT_167244.1  3771832-3973044    3.80 % in   40 repeats    6.49 % in 1 genes
7   198835  chr6  NT_167244.1  3157261-3356096    4.71 % in   58 repeats    11.19 % in 2 genes
8   183005  chr6  NT_167248.1  520596-703601    7.39 % in   31 repeats    1.84 % in 2 genes
9   179826  chr6  NT_167247.1  1555564-1735390    5.09 % in   39 repeats    0.00 % in 0 genes
10   172726  chr9  NT_008470.19  21682672-21855398    6.42 % in   38 repeats    0.00 % in 0 genes
11   171764  chr7  NT_033968.6  3073842-3245606    52.13 % in   266 repeats    0.00 % in 0 genes
12   171694  chr6  NT_167249.1  2135780-2307474    3.13 % in   25 repeats    0.00 % in 0 genes
13   168216  chr6  NT_167244.1  2881928-3050144    7.73 % in   65 repeats    0.00 % in 0 genes
14   161917  chr6  NT_167244.1  2005224-2167141    1.84 % in   14 repeats    0.00 % in 0 genes
15   161216  chr7  NT_023603.5  32972-194188    100.00 % in   4 repeats    0.00 % in 0 genes
16   152601  chr8  NT_008046.16  35951832-36104433    48.64 % in   290 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
543594  chr15  NT_037852.6  1394625-1938219    92  50       AT_rich (11)  L2a (6)  L1MC4 (4) 
402911  chr6  NT_167244.1  2359609-2762520    1       AluSp (1) 
308050  chrY  NT_011875.12  8412710-8720760    43  16       LTR12B (17)  L1PA16 (6)  L1PA7 (3) 
220547  chr6  NT_167244.1  4387334-4607881    21  14       HERVH-int (4)  AluSx (3)  MER57-int (2) 
219797  chr6  NT_167247.1  4411383-4631180    50  32       AluSx (5)  MIRc (3)  L2b (3) 
201212  chr6  NT_167244.1  3771832-3973044    40  27       L2a (5)  AT_rich (4)  MIR (3) 
198835  chr6  NT_167244.1  3157261-3356096    58  32       L1MC5 (6)  AluSx (5)  L1MB3 (4) 
183005  chr6  NT_167248.1  520596-703601    31  20       AT_rich (6)  L2b (3)  L1MA9 (3) 
179826  chr6  NT_167247.1  1555564-1735390    39  30       L2c (3)  Tigger7 (2)  MSTD (2) 
10  172726  chr9  NT_008470.19  21682672-21855398    38  25       MIRb (3)  L1M5 (3)  AluSq (3) 
11  171764  chr7  NT_033968.6  3073842-3245606    266  120       AT_rich (33)  L2a (9)  L1MCb (8) 
12  171694  chr6  NT_167249.1  2135780-2307474    25  12       Charlie2b (6)  AluSx (5)  L1MB8 (3) 
13  168216  chr6  NT_167244.1  2881928-3050144    65  27       AluY (7)  L1MC5 (6)  AluSx (5) 
14  161917  chr6  NT_167244.1  2005224-2167141    14  9       AluSx (4)  FRAM (2)  AluJb (2) 
15  161216  chr7  NT_023603.5  32972-194188    2       L1PA2 (3)  ALR/Alpha (1) 
16  152601  chr8  NT_008046.16  35951832-36104433    290  97       MIRb (19)  AluSx (17)  MIR (16) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   543594       chr15  NT_037852.6  1394625-1938219    LOC100418897 
LOC646214  p21_protein_(Cdc42/Rac)-activated_kinase_2_pseudogene
5   219797       chr6  NT_167247.1  4411383-4631180    COL11A2P 
LOC100507722  hypothetical_protein_LOC100507722
COL11A2  collagen_alpha-2(XI)_chain_isoform_4_precursor
6   201212       chr6  NT_167244.1  3771832-3973044    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
7   198835       chr6  NT_167244.1  3157261-3356096    SLC44A4  choline_transporter-like_protein_4_isoform_3
EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
8   183005       chr6  NT_167248.1  520596-703601    OR12D1P 
OR11A1  olfactory_receptor_11A1



Posfai@neb.com
May 11, 2011