Distribution of restriction sites in the human genome

Enzyme:  AvrII
Specificity:  CCTAGG
Number of sites:  590888
Mean distance between sites:  4842 base pairs
Standard deviation:  5176 base pairs
Site density:  206.5 per megabase
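
The summary figures are linked by simple arithmetic: the site density is the reciprocal of the mean spacing, scaled to one megabase (1,000,000 / 4842 ≈ 206.5). The Python sketch below illustrates this on a toy sequence. It assumes sites are found by exact string matching on one strand (CCTAGG is palindromic, so both strands give the same positions) and uses the population standard deviation; the original analysis may differ on both points, and the function names and toy sequence are hypothetical.

```python
import statistics

SITE = "CCTAGG"  # AvrII recognition sequence (palindromic)

def find_sites(seq, site=SITE):
    """Return 0-based start positions of every occurrence of site in seq."""
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        start = seq.find(site, start + 1)
    return positions

def spacing_stats(positions):
    """Mean and population standard deviation of gaps between consecutive sites."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return statistics.mean(gaps), statistics.pstdev(gaps)

def density_per_megabase(mean_distance_bp):
    """Sites per megabase implied by a mean spacing given in base pairs."""
    return 1_000_000 / mean_distance_bp

# Toy sequence (hypothetical, not genomic data) containing three sites.
toy = "AACCTAGGTTTTCCTAGGACCTAGG"
toy_positions = find_sites(toy)
print(toy_positions)                           # [2, 12, 19]
print(round(density_per_megabase(4842), 1))    # 206.5
```

Applied to the reported mean distance of 4842 base pairs, `density_per_megabase` reproduces the reported density of 206.5 sites per megabase.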


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   494796  chr15  NT_037852.6  1392626-1887422    0.80 % in   14 repeats    0.00 % in 0 genes
2   405586  chr6  NT_167244.1  2355931-2761517    0.75 % in   15 repeats    0.00 % in 0 genes
3   260077  chr6  NT_167244.1  2009752-2269829    3.33 % in   40 repeats    3.92 % in 3 genes
4   213509  chr6  NT_167244.1  4385236-4598745    2.13 % in   14 repeats    0.00 % in 0 genes
5   201817  chr6  NT_167244.1  3771610-3973427    4.06 % in   42 repeats    6.47 % in 1 gene
6   188088  chr6  NT_167244.1  3168343-3356431    3.26 % in   41 repeats    6.45 % in 2 genes
7   174903  chr6  NT_167247.1  4421407-4596310    1.05 % in   7 repeats    100.00 % in 1 gene
8   172847  chr6  NT_167248.1  512797-685644    7.50 % in   7 repeats    0.00 % in 0 genes
9   168555  chr6  NT_167249.1  2135189-2303744    1.52 % in   13 repeats    0.00 % in 0 genes
10   167715  chr6  NT_167247.1  1561566-1729281    0.98 % in   8 repeats    0.00 % in 0 genes
11   167512  chrY  NT_011875.12  8551242-8718754    70.02 % in   20 repeats    0.00 % in 0 genes
12   161221  chr7  NT_023603.5  33331-194552    100.00 % in   3 repeats    0.00 % in 0 genes
13   159595  chr9  NT_008470.19  21686894-21846489    3.52 % in   17 repeats    0.00 % in 0 genes
14   147724  chr6  NT_167244.1  2894517-3042241    2.02 % in   18 repeats    0.00 % in 0 genes
15   134197  chr9  NT_078070.3  952067-1086264    59.23 % in   32 repeats    0.00 % in 0 genes
16   133711  chr6  NT_167245.1  130431-264142    10.27 % in   55 repeats    0.00 % in 0 genes
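
In each row above, the Length value equals the end coordinate minus the start coordinate. A short Python sketch that parses one row and checks this relationship; the `parse_row` helper and its field names are hypothetical and serve only as an illustration.

```python
def parse_row(line):
    """Split one table row into its leading fields; trailing columns are ignored."""
    fields = line.split()
    start, end = (int(x) for x in fields[4].split("-"))
    return {
        "rank": int(fields[0]),
        "length": int(fields[1]),
        "chr": fields[2],
        "scaffold": fields[3],
        "start": start,
        "end": end,
    }

row = parse_row(
    "1   494796  chr15  NT_037852.6  1392626-1887422    "
    "0.80 % in   14 repeats    0.00 % in 0 genes"
)
# The segment length is the difference between its end and start coordinates.
print(row["end"] - row["start"] == row["length"])  # True
```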


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1   494796  chr15  NT_037852.6  1392626-1887422    14  12       L1MDa (3)  MLT1L (1)  MIRc (1)
2   405586  chr6  NT_167244.1  2355931-2761517    15  10       AluJb (3)  MLT2D (2)  L4 (2)
3   260077  chr6  NT_167244.1  2009752-2269829    40  28       MIR (4)  L1MEe (3)  AluSx (3)
4   213509  chr6  NT_167244.1  4385236-4598745    14  9       MER57-int (3)  AluSx (3)  AluY (2)
5   201817  chr6  NT_167244.1  3771610-3973427    42  29       L2a (5)  AT_rich (4)  MIR (3)
6   188088  chr6  NT_167244.1  3168343-3356431    41  21       L1MC5 (6)  AluSx (5)  L1MB3 (4)
7   174903  chr6  NT_167247.1  4421407-4596310    7  7       (TTAAA)n (1)  MLT1J (1)  MIR (1)
8   172847  chr6  NT_167248.1  512797-685644    7  7       LTR7 (1)  L1PREC2 (1)  L1PA7 (1)
9   168555  chr6  NT_167249.1  2135189-2303744    13  8       AluSx (3)  MLT1A (2)  L1MB8 (2)
10  167715  chr6  NT_167247.1  1561566-1729281    8  6       MIR (2)  L1MEe (2)  L1MC3 (1)
11  167512  chrY  NT_011875.12  8551242-8718754    20  9       LTR12B (9)  L1PA16 (4)  (TATAA)n (1)
12  161221  chr7  NT_023603.5  33331-194552    3  2       L1PA2 (2)  ALR/Alpha (1)
13  159595  chr9  NT_008470.19  21686894-21846489    17  11       MER5B (2)  LTR67B (2)  L2 (2)
14  147724  chr6  NT_167244.1  2894517-3042241    18  9       L1MC5 (6)  L2c (2)  AluY (2)
15  134197  chr9  NT_078070.3  952067-1086264    32  14       ERVL-E-int (5)  (TAA)n (3)  L1PA3 (3)
16  133711  chr6  NT_167245.1  130431-264142    55  39       AluSx (6)  L1MC5 (4)  AluY (4)


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
3   260077  chr6  NT_167244.1  2009752-2269829    FLOT1  flotillin-1
3   260077  chr6  NT_167244.1  2009752-2269829    DDR1  epithelial discoidin domain-containing receptor 1 isoform DDR1c
3   260077  chr6  NT_167244.1  2009752-2269829    MUC21  mucin-21 precursor
5   201817  chr6  NT_167244.1  3771610-3973427    HLA-DRB3  major histocompatibility complex, class II, DR beta 3 precursor
6   188088  chr6  NT_167244.1  3168343-3356431    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
6   188088  chr6  NT_167244.1  3168343-3356431    TNXB  tenascin-X isoform 1 precursor
7   174903  chr6  NT_167247.1  4421407-4596310    LOC100507722  hypothetical protein LOC100507722



Posfai@neb.com
May 11, 2011