Distribution of restriction sites in the human genome

Enzyme:  Hpy99I
Specificity:  CGWCG
Number of sites:  154311
Mean distance between sites:  18542 base pairs
Standard deviation:  30957 base pairs
Site density:  53.9 per megabase
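
The summary figures above can be related to one another with a short sketch. The page does not state how its statistics were computed, so everything here is an assumption for illustration: the function names are hypothetical, overlapping matches are counted, and the population standard deviation is used. W is the IUPAC code for A or T, and because CGWCG reads the same on both strands, scanning one strand is enough to find every site.

```python
import re
from statistics import mean, pstdev

# IUPAC codes needed for this specificity; W stands for A or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "W": "[AT]"}


def site_positions(sequence, specificity="CGWCG"):
    """Return 0-based start positions of every match, overlaps included."""
    pattern = "".join(IUPAC[base] for base in specificity)
    # A lookahead reports overlapping occurrences such as CGACGACG.
    return [m.start() for m in re.finditer(f"(?={pattern})", sequence.upper())]


def spacing_summary(positions):
    """Mean and population standard deviation of gaps between adjacent sites.

    Assumption: the report may use a different estimator.
    """
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)


def density_per_megabase(mean_distance_bp):
    """Site density implied by a mean spacing given in base pairs."""
    return 1_000_000 / mean_distance_bp
```

With the reported mean distance of 18542 base pairs, `density_per_megabase(18542)` rounds to 53.9, which agrees with the site density listed above.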


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   612207  chr6  NT_025741.15  45320024-45932231    59.72 % in   1009 repeats    0.09 % in 1 genes
2   545598  chr15  NT_037852.6  1359917-1905515    5.95 % in   118 repeats    3.12 % in 2 genes
3   543864  chr2  NT_022135.16  6461464-7005328    64.37 % in   827 repeats    0.00 % in 0 genes
4   524935  chr4  NT_016297.16  593687-1118622    51.33 % in   849 repeats    0.00 % in 0 genes
5   520521  chr14  NT_026437.12  10336143-10856664    44.95 % in   719 repeats    0.59 % in 1 genes
6   509922  chr3  NT_005612.16  67848794-68358716    52.86 % in   798 repeats    0.00 % in 0 genes
7   484370  chr5  NT_034772.6  13346634-13831004    52.27 % in   821 repeats    0.00 % in 0 genes
8   484181  chr12  NT_029419.12  48993313-49477494    55.49 % in   773 repeats    79.04 % in 1 genes
9   458607  chr11  NT_009237.18  40975542-41434149    41.17 % in   788 repeats    0.00 % in 0 genes
10   455450  chr11  NT_009237.18  36987174-37442624    55.32 % in   763 repeats    0.00 % in 0 genes
11   454285  chr11  NT_033899.8  7854375-8308660    50.25 % in   824 repeats    0.00 % in 0 genes
12   446399  chr3  NT_022459.15  16972991-17419390    55.69 % in   701 repeats    0.00 % in 0 genes
13   441951  chr6  NT_167244.1  2326723-2768674    4.91 % in   92 repeats    0.00 % in 0 genes
14   435963  chr5  NT_006713.15  34593708-35029671    54.53 % in   694 repeats    0.00 % in 0 genes
15   429832  chr11  NT_033899.8  9651704-10081536    55.17 % in   756 repeats    0.00 % in 0 genes
16   416145  chrX  NT_011651.17  10759496-11175641    76.27 % in   724 repeats    0.00 % in 0 genes
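
In every row of this table the length equals the end coordinate minus the start coordinate. The following is a minimal sketch of how such a row could be parsed and checked. The row layout encoded in the regular expression and all names (`ROW_PATTERN`, `parse_segment_row`, `length_matches_coordinates`) are assumptions made for illustration, not part of the original report.

```python
import re

# Assumed layout: rank, length, chromosome, scaffold, start-end,
# "<pct> % in <n> repeats", "<pct> % in <n> genes", separated by whitespace.
ROW_PATTERN = re.compile(
    r"^\s*(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>chr\w+)\s+(?P<scaffold>\S+)\s+"
    r"(?P<start>\d+)-(?P<end>\d+)\s+"
    r"(?P<repeat_pct>[\d.]+)\s*%\s+in\s+(?P<repeat_count>\d+)\s+repeats\s+"
    r"(?P<gene_pct>[\d.]+)\s*%\s+in\s+(?P<gene_count>\d+)\s+genes\s*$"
)


def parse_segment_row(line):
    """Split one table row into typed fields; raise if the layout differs."""
    match = ROW_PATTERN.match(line)
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    fields = match.groupdict()
    for key in ("rank", "length", "start", "end", "repeat_count", "gene_count"):
        fields[key] = int(fields[key])
    for key in ("repeat_pct", "gene_pct"):
        fields[key] = float(fields[key])
    return fields


def length_matches_coordinates(row):
    """True when the listed length equals end minus start."""
    return row["end"] - row["start"] == row["length"]


# First row of the table, copied verbatim.
example = parse_segment_row(
    "1   612207  chr6  NT_025741.15  45320024-45932231"
    "    59.72 % in   1009 repeats    0.09 % in 1 genes"
)
```

For the first row, `length_matches_coordinates(example)` holds because 45932231 minus 45320024 is 612207.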


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1  612207  chr6  NT_025741.15  45320024-45932231    1009  265       AT_rich (103)  (TA)n (28)  L1ME1 (28) 
2  545598  chr15  NT_037852.6  1359917-1905515    118  56       L1MDa (10)  Tigger2 (6)  L2 (6) 
3  543864  chr2  NT_022135.16  6461464-7005328    827  251       AT_rich (68)  MIRb (30)  AluSx (26) 
4  524935  chr4  NT_016297.16  593687-1118622    849  253       AT_rich (101)  MIR (30)  L2a (26) 
5  520521  chr14  NT_026437.12  10336143-10856664    719  201       AT_rich (69)  L2c (30)  L2a (30) 
6  509922  chr3  NT_005612.16  67848794-68358716    798  242       AT_rich (83)  MIR (29)  MIRb (28) 
7  484370  chr5  NT_034772.6  13346634-13831004    821  241       AT_rich (104)  AluSx (29)  L1MEf (18) 
8  484181  chr12  NT_029419.12  48993313-49477494    773  231       AT_rich (100)  MIR (25)  L2a (23) 
9  458607  chr11  NT_009237.18  40975542-41434149    788  199       MIRb (60)  AT_rich (56)  MIR (52) 
10  455450  chr11  NT_009237.18  36987174-37442624    763  227       AT_rich (62)  L2a (35)  MIR (29) 
11  454285  chr11  NT_033899.8  7854375-8308660    824  237       AT_rich (71)  MIR (32)  MIRb (23) 
12  446399  chr3  NT_022459.15  16972991-17419390    701  210       AT_rich (95)  AluY (21)  (TA)n (18) 
13  441951  chr6  NT_167244.1  2326723-2768674    92  45       AluSx (8)  L1MC4a (7)  AluJb (7) 
14  435963  chr5  NT_006713.15  34593708-35029671    694  218       AT_rich (95)  AluSx (20)  (TA)n (16) 
15  429832  chr11  NT_033899.8  9651704-10081536    756  214       AT_rich (50)  L2c (47)  MIRb (43) 
16  416145  chrX  NT_011651.17  10759496-11175641    724  223       AT_rich (49)  AluSx (25)  AluY (21) 


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   612207       chr6  NT_025741.15  45320024-45932231    RPS18P10 
2   545598       chr15  NT_037852.6  1359917-1905515    LOC727914 
2   545598       chr15  NT_037852.6  1359917-1905515    LOC100418897 
5   520521       chr14  NT_026437.12  10336143-10856664    LOC100506004  hypothetical_LOC100506004
8   484181       chr12  NT_029419.12  48993313-49477494    MGAT4C  alpha-1,3-mannosyl-glycoprotein_4-beta-N-acetylglucosaminyltransferase_C



Posfai@neb.com
May 11, 2011