Distribution of restriction sites in the human genome

Enzyme:  BtgI
Specificity:  CCRYGG
Number of sites:  1158894
Mean distance between sites:  2469 base pairs
Standard deviation:  3051 base pairs
Site density:  405.0 per megabase
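
The summary figures above can be related to one another with a short sketch. The Python below is illustrative only and is not the tool that produced this report. It expands the degenerate bases in CCRYGG (R = A or G, Y = C or T) into a regular expression, lists site positions in a sequence, and computes spacing statistics. The function names, the toy sequence, and the choice of the population standard deviation are assumptions. Because CCRYGG is its own reverse complement, scanning one strand is enough.

```python
import re
from statistics import mean, pstdev

# IUPAC nucleotide codes needed for the recognition sequence CCRYGG.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}


def site_regex(pattern):
    """Translate a degenerate recognition sequence into a compiled regex."""
    return re.compile("".join(IUPAC[base] for base in pattern))


def site_positions(seq, pattern="CCRYGG"):
    """Return 0-based start positions of every match on the given strand."""
    return [m.start() for m in site_regex(pattern).finditer(seq.upper())]


def spacing_stats(positions):
    """Mean and population standard deviation of gaps between adjacent sites.

    Assumption: the report does not say which standard deviation it uses.
    """
    if len(positions) < 2:
        raise ValueError("need at least two sites to measure spacing")
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)


def density_per_megabase(mean_distance_bp):
    """Approximate site density implied by a mean inter-site distance."""
    return 1_000_000 / mean_distance_bp


if __name__ == "__main__":
    toy = "AACCACGGTTTTCCGTGGAACCATGG"  # hypothetical sequence, not genome data
    positions = site_positions(toy)
    print(positions)
    print(spacing_stats(positions))
    print(round(density_per_megabase(2469), 1))
```

With the reported mean distance of 2469 base pairs, density_per_megabase gives about 405.0 sites per megabase, which agrees with the site density listed above.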


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates    Repeat content    Gene content
1   491389  chr15  NT_037852.6  1397762-1889151    0.38 % in   9 repeats    0.00 % in 0 genes
2   405135  chr6  NT_167244.1  2357895-2763030    0.47 % in   9 repeats    0.00 % in 0 genes
3   246281  chr6  NT_167244.1  2009529-2255810    1.07 % in   11 repeats    1.72 % in 2 genes
4   223950  chr6  NT_167244.1  4382623-4606573    5.02 % in   26 repeats    0.46 % in 1 genes
5   188290  chr6  NT_167244.1  3785144-3973434    2.10 % in   20 repeats    1.25 % in 1 genes
6   176280  chr6  NT_167244.1  3179843-3356123    0.24 % in   5 repeats    0.22 % in 1 genes
7   176109  chr6  NT_167248.1  510529-686638    8.67 % in   8 repeats    0.00 % in 0 genes
8   174763  chr6  NT_167247.1  4421846-4596609    1.07 % in   6 repeats    100.00 % in 1 genes
9   167132  chr6  NT_167249.1  2138201-2305333    1.04 % in   8 repeats    0.00 % in 0 genes
10   166208  chr6  NT_167247.1  1561942-1728150    0.56 % in   6 repeats    0.00 % in 0 genes
11   164179  chr7  NT_023603.5  31957-196136    100.00 % in   5 repeats    0.00 % in 0 genes
12   152180  chr6  NT_167244.1  2889478-3041658    4.43 % in   35 repeats    0.00 % in 0 genes
13   151595  chr9  NT_008470.19  21693226-21844821    0.28 % in   4 repeats    0.00 % in 0 genes
14   132259  chr1  NT_077389.3  259047-391306    97.17 % in   65 repeats    0.00 % in 0 genes
15   119058  chr6  NT_167245.1  2604630-2723688    1.14 % in   3 repeats    0.00 % in 0 genes
16   118586  chr6  NT_167244.1  1437765-1556351    7.88 % in   35 repeats    0.00 % in 0 genes
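
Each row of the table above can be read mechanically, and the listed length equals the difference between the end and start coordinates. The following Python is a minimal sketch for parsing one row and checking that relation. The regular expression and field names are assumptions based on the layout shown here, not a documented file format.

```python
import re

# Assumed row layout: rank, length, chromosome, scaffold, start-end,
# repeat percentage and count, gene percentage and count.
ROW = re.compile(
    r"^\s*(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>chr\w+)\s+"
    r"(?P<scaffold>\S+)\s+(?P<start>\d+)-(?P<end>\d+)\s+"
    r"(?P<repeat_pct>[\d.]+)\s*%\s+in\s+(?P<repeats>\d+)\s+repeats\s+"
    r"(?P<gene_pct>[\d.]+)\s*%\s+in\s+(?P<genes>\d+)\s+genes\s*$"
)

INT_FIELDS = ("rank", "length", "start", "end", "repeats", "genes")
FLOAT_FIELDS = ("repeat_pct", "gene_pct")


def parse_segment(line):
    """Parse one 'longest uncut segments' row into a dictionary."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    record = match.groupdict()
    for key in INT_FIELDS:
        record[key] = int(record[key])
    for key in FLOAT_FIELDS:
        record[key] = float(record[key])
    return record


def length_matches_coordinates(record):
    """True when the listed length equals end minus start."""
    return record["end"] - record["start"] == record["length"]


if __name__ == "__main__":
    row = ("1   491389  chr15  NT_037852.6  1397762-1889151    "
           "0.38 % in   9 repeats    0.00 % in 0 genes")
    segment = parse_segment(row)
    print(segment["chrom"], segment["length"], length_matches_coordinates(segment))
```

The same check holds for all 16 rows listed above, for example 1889151 - 1397762 = 491389 for segment 1.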


Repeats in longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates    Total repeats  Distinct repeats    Most frequent  Second  Third
1   491389  chr15  NT_037852.6  1397762-1889151    9  7    L2a (3)  (TA)n (1)  MLT1L (1)
2   405135  chr6  NT_167244.1  2357895-2763030    9  7    L4 (2)  AluJb (2)  L1MEg (1)
3   246281  chr6  NT_167244.1  2009529-2255810    11  11    MIRb (1)  MIR (1)  MER5A1 (1)
4   223950  chr6  NT_167244.1  4382623-4606573    26  16    HERVH-int (4)  AluSx (4)  MER57-int (3)
5   188290  chr6  NT_167244.1  3785144-3973434    20  15    L2a (3)  MLT1H-int (2)  AT_rich (2)
6   176280  chr6  NT_167244.1  3179843-3356123    5  4    GC_rich (2)  Charlie4a (1)  (CCG)n (1)
7   176109  chr6  NT_167248.1  510529-686638    8  8    LTR7 (1)  L1PREC2 (1)  L1PA7 (1)
8   174763  chr6  NT_167247.1  4421846-4596609    6  6    (TTAAA)n (1)  MLT1J (1)  MER11A (1)
9   167132  chr6  NT_167249.1  2138201-2305333    8  4    L1MB8 (3)  AluSx (3)  Charlie2b (1)
10   166208  chr6  NT_167247.1  1561942-1728150    6  5    MIR (2)  L1MC3 (1)  (GGAA)n (1)
11   164179  chr7  NT_023603.5  31957-196136    5  2    L1PA2 (4)  ALR/Alpha (1)
12   152180  chr6  NT_167244.1  2889478-3041658    35  16    L1MC5 (6)  AluY (5)  AluSc (3)
13   151595  chr9  NT_008470.19  21693226-21844821    4  3    L2 (2)  MIR3 (1)  L1M5 (1)
14   132259  chr1  NT_077389.3  259047-391306    65  12    ALR/Alpha (52)  MLT1J (2)  L1MB1 (2)
15   119058  chr6  NT_167245.1  2604630-2723688    3  3    MER5B (1)  MER5A1 (1)  L2a (1)
16   118586  chr6  NT_167244.1  1437765-1556351    35  23    AluY (5)  L4 (3)  L1MC4a (3)


Genes in longest uncut segments
Segment   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
3   246281       chr6  NT_167244.1  2009529-2255810    FLOT1  flotillin-1
3   246281       chr6  NT_167244.1  2009529-2255810    DDR1  epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
4   223950       chr6  NT_167244.1  4382623-4606573    HLA-DPB2  major_histocompatibility_complex,_class_II,_DP_beta_2_(pseudogene)
5   188290       chr6  NT_167244.1  3785144-3973434    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
6   176280       chr6  NT_167244.1  3179843-3356123    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
8   174763       chr6  NT_167247.1  4421846-4596609    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011