Distribution of restriction sites in the human genome

Enzyme:  BsaWI               Longest uncut segments
Specificity:  WCCGGW               Repeats in uncut segments
Number of sites:  276115               Genes in uncut segments
Mean distance between sites:  10362 base pairs
Standard deviation:  12538 base pairs
Site density 96.5 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   495389  chr15  NT_037852.6  1396122-1891511    0.89 % in   20 repeats    0.00 % in 0 genes
2   415818  chr6  NT_167244.1  2348293-2764111    1.97 % in   37 repeats    0.00 % in 0 genes
3   232762  chr6  NT_167244.1  4371897-4604659    7.44 % in   36 repeats    5.05 % in 1 genes
4   230974  chr1  NT_032977.9  68057055-68288029    35.00 % in   362 repeats    100.00 % in 1 genes
5   212721  chr7  NT_023603.5  28789-241510    99.97 % in   13 repeats    0.00 % in 0 genes
6   211641  chrX  NT_011651.17  6699412-6911053    53.83 % in   333 repeats    38.64 % in 3 genes
7   192219  chr6  NT_167244.1  3165906-3358125    4.43 % in   50 repeats    8.46 % in 2 genes
8   189853  chr6  NT_167244.1  3787909-3977762    2.67 % in   24 repeats    0.00 % in 0 genes
9   173630  chr6  NT_167249.1  2129415-2303045    3.40 % in   25 repeats    0.00 % in 0 genes
10   172762  chr6  NT_167247.1  4421951-4594713    0.27 % in   2 repeats    0.00 % in 0 genes
11   171792  chr5  NW_003315917.1  1084023-1255815    22.80 % in   149 repeats    0.00 % in 0 genes
12   171051  chr9  NT_008470.19  21677012-21848063    5.78 % in   37 repeats    0.00 % in 0 genes
13   170227  chr6  NT_167248.1  518912-689139    4.52 % in   5 repeats    0.00 % in 0 genes
14   167978  chr6  NT_167244.1  2000378-2168356    2.78 % in   23 repeats    0.00 % in 0 genes
15   167506  chr6  NT_167247.1  1562918-1730424    1.02 % in   8 repeats    0.00 % in 0 genes
16   165751  chr4  NT_006316.16  391949-557700    5.33 % in   59 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
495389  chr15  NT_037852.6  1396122-1891511    20  17       L2a (3)  L1M5 (2)  U2 (1) 
415818  chr6  NT_167244.1  2348293-2764111    37  25       AluJb (4)  L1ME4a (3)  AluSx (3) 
232762  chr6  NT_167244.1  4371897-4604659    36  21       HUERS-P3-int (7)  MER57-int (3)  HERVH-int (3) 
230974  chr1  NT_032977.9  68057055-68288029    362  126       AT_rich (48)  L2a (21)  MIRb (18) 
212721  chr7  NT_023603.5  28789-241510    13  4       ALR/Alpha (7)  L1PA2 (4)  L1PA3 (1) 
211641  chrX  NT_011651.17  6699412-6911053    333  126       AT_rich (24)  L2c (16)  MIR3 (11) 
192219  chr6  NT_167244.1  3165906-3358125    50  27       L1MC5 (6)  AluSx (6)  L1MB3 (4) 
189853  chr6  NT_167244.1  3787909-3977762    24  20       L2a (3)  MLT1H-int (2)  AT_rich (2) 
173630  chr6  NT_167249.1  2129415-2303045    25  15       MamGypLTR1b (3)  AluJo (3)  MLT2B1 (2) 
10  172762  chr6  NT_167247.1  4421951-4594713    2       MER11A (1)  AluSc (1) 
11  171792  chr5  NW_003315917.1  1084023-1255815    149  67       AT_rich (12)  AluSx (9)  AluJo (9) 
12  171051  chr9  NT_008470.19  21677012-21848063    37  25       MIRb (3)  L1M5 (3)  AluSq (3) 
13  170227  chr6  NT_167248.1  518912-689139    4       AT_rich (2)  L1PREC2 (1)  HERVH-int (1) 
14  167978  chr6  NT_167244.1  2000378-2168356    23  17       AluSx (4)  MIR (2)  FRAM (2) 
15  167506  chr6  NT_167247.1  1562918-1730424    6       MIR (2)  L1MEe (2)  (GGAA)n (1) 
16  165751  chr4  NT_006316.16  391949-557700    59  7       (CA)n (46)  L1M4 (7)  L1PA10 (2) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   232762       chr6  NT_167244.1  4371897-4604659    HLA-DPB2  major_histocompatibility_complex,_class_II,_DP_beta_2_(pseudogene)
4   230974       chr1  NT_032977.9  68057055-68288029    LOC100419654  dihydropyrimidine_dehydrogenase_[NADP+]_isoform_2
6   211641       chrX  NT_011651.17  6699412-6911053    RPS6KA6  ribosomal_protein_S6_kinase_alpha-6
MIR548I4  microRNA:hsa-mir-548i-4
HDX  highly_divergent_homeobox_isoform_2
7   192219       chr6  NT_167244.1  3165906-3358125    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor



Posfai@neb.com
May 11, 2011