Distribution of restriction sites in the human genome

Enzyme:  XhoI               Longest uncut segments
Specificity:  CTCGAG               Repeats in uncut segments
Number of sites:  119999               Genes in uncut segments
Mean distance between sites:  23844 base pairs
Standard deviation:  28573 base pairs
Site density 41.9 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   665272  chr15  NT_037852.6  1248407-1913679    15.15 % in   411 repeats    2.89 % in 3 genes
2   430883  chr3  NT_022459.15  19927575-20358458    49.98 % in   651 repeats    0.00 % in 0 genes
3   423692  chr6  NT_167244.1  2348991-2772683    2.86 % in   53 repeats    0.00 % in 0 genes
4   403942  chr1  NT_167186.1  9643992-10047934    43.32 % in   585 repeats    100.00 % in 1 genes
5   370506  chr2  NT_005403.17  13582230-13952736    39.39 % in   506 repeats    87.02 % in 1 genes
6   370408  chr13  NT_024524.14  36631407-37001815    53.04 % in   587 repeats    0.00 % in 0 genes
7   369145  chr6  NT_167244.1  3628287-3997432    20.13 % in   240 repeats    13.87 % in 3 genes
8   356911  chr12  NT_009714.17  21035206-21392117    54.36 % in   601 repeats    62.36 % in 2 genes
9   350739  chrX  NT_011681.16  317101-667840    73.14 % in   621 repeats    0.00 % in 0 genes
10   348888  chr1  NT_004487.19  43240477-43589365    55.14 % in   576 repeats    0.00 % in 0 genes
11   333779  chr8  NT_167187.1  31283284-31617063    99.14 % in   94 repeats    0.00 % in 0 genes
12   332202  chr13  NT_009952.14  15242616-15574818    39.04 % in   495 repeats    0.00 % in 0 genes
13   329082  chr5  NT_006713.15  13571982-13901064    49.26 % in   527 repeats    0.00 % in 0 genes
14   326096  chr5  NT_034772.6  33576136-33902232    56.94 % in   472 repeats    0.00 % in 0 genes
15   325546  chr5  NT_006576.16  21991759-22317305    41.07 % in   523 repeats    0.00 % in 0 genes
16   323356  chr7  NT_007819.17  9705967-10029323    52.51 % in   478 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
665272  chr15  NT_037852.6  1248407-1913679    411  130       AluSx (24)  AluJb (16)  MIRb (14) 
430883  chr3  NT_022459.15  19927575-20358458    651  201       AT_rich (71)  (TA)n (24)  L2a (22) 
423692  chr6  NT_167244.1  2348991-2772683    53  33       AluJb (4)  L4 (3)  L1ME4a (3) 
403942  chr1  NT_167186.1  9643992-10047934    585  175       AT_rich (55)  MIRb (26)  L2c (25) 
370506  chr2  NT_005403.17  13582230-13952736    506  170       AT_rich (54)  MIRb (26)  MIR (25) 
370408  chr13  NT_024524.14  36631407-37001815    587  194       AT_rich (66)  L2a (17)  (TA)n (15) 
369145  chr6  NT_167244.1  3628287-3997432    240  107       L2a (18)  AT_rich (17)  AluY (9) 
356911  chr12  NT_009714.17  21035206-21392117    601  212       AT_rich (46)  L2c (24)  L2a (18) 
350739  chrX  NT_011681.16  317101-667840    621  210       L2a (22)  AT_rich (22)  MIR (21) 
10  348888  chr1  NT_004487.19  43240477-43589365    576  202       AT_rich (66)  L2a (31)  MIRb (15) 
11  333779  chr8  NT_167187.1  31283284-31617063    94  26       ALR/Alpha (49)  AluY (7)  LTR14C (5) 
12  332202  chr13  NT_009952.14  15242616-15574818    495  157       AT_rich (54)  MIRb (18)  MIR (18) 
13  329082  chr5  NT_006713.15  13571982-13901064    527  185       AT_rich (41)  L2a (21)  MIR (17) 
14  326096  chr5  NT_034772.6  33576136-33902232    472  179       AT_rich (39)  L2a (16)  L1M5 (13) 
15  325546  chr5  NT_006576.16  21991759-22317305    523  166       AT_rich (58)  L2a (34)  AluSx (24) 
16  323356  chr7  NT_007819.17  9705967-10029323    478  175       AT_rich (50)  (TA)n (17)  MIR (14) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   665272       chr15  NT_037852.6  1248407-1913679    LOC283804 
LOC727914 
LOC100418897 
4   403942       chr1  NT_167186.1  9643992-10047934    MRPS18BP1  Usherin_isoform_A
5   370506       chr2  NT_005403.17  13582230-13952736    KCNH7  potassium_voltage-gated_channel_subfamily_H_member_7_isoform_2
7   369145       chr6  NT_167244.1  3628287-3997432    HNRNPA1P2  chromosome_6_open_reading_frame_10
BTNL2  butyrophilin-like_protein_2
HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
8   356911       chr12  NT_009714.17  21035206-21392117    LOC100129646 
CCDC91  coiled-coil_domain-containing_protein_91



Posfai@neb.com
May 11, 2011