Distribution of restriction sites in the human genome

Enzyme:  Kpn2I
Specificity:  TCCGGA
Number of sites:  94538
Mean distance between sites:  30266 base pairs
Standard deviation:  37714 base pairs
Site density:  33.0 per megabase
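
The summary figures above are related by simple arithmetic: a mean spacing of 30266 base pairs corresponds to roughly 1,000,000 / 30266 = 33.0 sites per megabase. The Python below is a minimal sketch of how such statistics could be computed from a sequence. It is not the method used to produce this page. The function names site_positions and spacing_stats are hypothetical, and the use of the population standard deviation is an assumption, since the page does not say which form it reports.

```python
import re
from statistics import mean, pstdev

SITE = "TCCGGA"  # Kpn2I recognition sequence, as listed above


def site_positions(seq, site=SITE):
    """Return the 0-based start position of every occurrence of `site`."""
    return [m.start() for m in re.finditer(re.escape(site), seq.upper())]


def spacing_stats(positions):
    """Summarize the distances between consecutive sites (needs >= 2 sites)."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    mean_gap = mean(gaps)
    return {
        "sites": len(positions),
        "mean_distance": mean_gap,
        # Assumption: population standard deviation; the page does not say which.
        "std_distance": pstdev(gaps),
        # Sites per megabase, approximated as 1e6 / mean spacing.
        "density_per_mb": 1e6 / mean_gap,
    }


# Toy sequence for illustration only, not genomic data.
toy = "AAA" + SITE + "AAAAAA" + SITE + "TT" + SITE
print(site_positions(toy))    # [3, 15, 23]
print(spacing_stats(site_positions(toy)))
print(round(1e6 / 30266, 1))  # 33.0, matching the density reported above
```

Approximating density as the reciprocal of the mean spacing ignores the sequence before the first site and after the last one, which is negligible at genome scale.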


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
 #  Length  Chr    Scaffold      Coordinates        Repeat content           Gene content
 1  501692  chr11  NT_167190.1   24494515-24996207  51.17 % in 1012 repeats  0.00 % in 0 genes
 2  500482  chr1   NT_032977.9   42838043-43338525  53.23 % in 761 repeats   0.00 % in 0 genes
 3  498858  chr15  NT_037852.6   1392653-1891511    1.35 % in 25 repeats     0.00 % in 0 genes
 4  468985  chr4   NT_016354.19  75828280-76297265  47.28 % in 836 repeats   100.00 % in 1 genes
 5  453984  chrX   NT_011651.17  31272241-31726225  82.00 % in 596 repeats   0.81 % in 1 genes
 6  440260  chr4   NT_016354.19  16827573-17267833  60.62 % in 624 repeats   55.31 % in 1 genes
 7  435976  chr6   NT_167244.1   2337021-2772997    3.98 % in 76 repeats     1.23 % in 1 genes
 8  432501  chr2   NT_022171.15  2377082-2809583    46.61 % in 648 repeats   47.15 % in 9 genes
 9  425762  chrX   NT_011786.16  158796-584558      73.69 % in 592 repeats   0.00 % in 0 genes
10  414935  chrX   NT_011651.17  3361922-3776857    78.58 % in 585 repeats   0.00 % in 0 genes
11  410516  chr6   NT_007299.13  1-410517           71.02 % in 307 repeats   0.00 % in 0 genes
12  407908  chr22  NT_011520.12  8131962-8539870    60.53 % in 910 repeats   0.00 % in 0 genes
13  405991  chr2   NT_005403.17  32326174-32732165  39.31 % in 560 repeats   0.00 % in 0 genes
14  395793  chr14  NT_026437.12  24121925-24517718  58.47 % in 643 repeats   0.00 % in 0 genes
15  385089  chr8   NT_008046.16  27752203-28137292  57.22 % in 564 repeats   0.00 % in 0 genes
16  382081  chr6   NT_007592.15  18714486-19096567  54.83 % in 629 repeats   0.00 % in 0 genes
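
Each Length value in the table above equals the end coordinate minus the start coordinate (for segment 1, 24996207 - 24494515 = 501692). The sketch below shows one way such a ranking could be derived from a list of site positions. The helper longest_uncut_segments is hypothetical and is not part of the tool that generated this page; it assumes the listed coordinates mark the two sites flanking each segment.

```python
def longest_uncut_segments(positions, top=16):
    """Rank the gaps between consecutive site positions, longest first.

    Returns (length, start, end) tuples, where length = end - start.
    """
    ordered = sorted(positions)
    segments = [(end - start, start, end)
                for start, end in zip(ordered, ordered[1:])]
    return sorted(segments, reverse=True)[:top]


# Toy positions for illustration only, not real genome coordinates.
print(longest_uncut_segments([3, 15, 23, 100], top=2))
# [(77, 23, 100), (12, 3, 15)]

# Consistency check against segment 1 of the table above.
print(24996207 - 24494515)  # 501692
```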


Repeats in longest uncut segments
                                                    Repeats
 #  Length  Chr    Scaffold      Coordinates        Total  Distinct  Most frequent   Second        Third
 1  501692  chr11  NT_167190.1   24494515-24996207   1012       223  MIRb (157)      MIR (70)      L2c (67)
 2  500482  chr1   NT_032977.9   42838043-43338525    761       225  AT_rich (78)    MIRb (41)     L2a (35)
 3  498858  chr15  NT_037852.6   1392653-1891511       25        20  L2a (3)         L1MDa (3)     L1M5 (2)
 4  468985  chr4   NT_016354.19  75828280-76297265    836       187  AT_rich (81)    AluSx (45)    AluJo (43)
 5  453984  chrX   NT_011651.17  31272241-31726225    596       202  AT_rich (20)    L1MA3 (16)    L1MEg (15)
 6  440260  chr4   NT_016354.19  16827573-17267833    624       191  AT_rich (65)    MIR (29)      L2a (18)
 7  435976  chr6   NT_167244.1   2337021-2772997       76        42  AluSx (6)       AluJo (5)     AluJb (5)
 8  432501  chr2   NT_022171.15  2377082-2809583      648       152  L1PA12 (69)     AT_rich (51)  MER5A1 (25)
 9  425762  chrX   NT_011786.16  158796-584558        592       186  AT_rich (28)    MIRb (17)     MIR (16)
10  414935  chrX   NT_011651.17  3361922-3776857      585       167  AT_rich (40)    AluSx (19)    L1MB8 (15)
11  410516  chr6   NT_007299.13  1-410517             307       126  ALR/Alpha (25)  AT_rich (19)  L1PA3 (9)
12  407908  chr22  NT_011520.12  8131962-8539870      910       198  AluSx (78)      AluJb (53)    AT_rich (51)
13  405991  chr2   NT_005403.17  32326174-32732165    560       168  AT_rich (45)    MIR (35)      L2a (26)
14  395793  chr14  NT_026437.12  24121925-24517718    643       212  AT_rich (81)    (TA)n (21)    AluSx (16)
15  385089  chr8   NT_008046.16  27752203-28137292    564       194  AT_rich (70)    L2a (22)      MIRb (12)
16  382081  chr6   NT_007592.15  18714486-19096567    629       204  AT_rich (25)    L2a (23)      MIRb (19)


Genes in longest uncut segments
Segment  Length (bp)  Chr    Scaffold      Coordinates        Gene symbol   Gene function
4        468985       chr4   NT_016354.19  75828280-76297265  LOC729566     protein_archease-like
5        453984       chrX   NT_011651.17  31272241-31726225  IRS4          insulin_receptor_substrate_4
6        440260       chr4   NT_016354.19  16827573-17267833  TMSL3         thymosin_beta-4-like_protein_3
7        435976       chr6   NT_167244.1   2337021-2772997    HCG22         HLA_complex_group_22
8        432501       chr2   NT_022171.15  2377082-2809583    LOC653924
                                                              FAHD2B        fumarylacetoacetate_hydrolase_domain-containing_protein_2B
                                                              ANKRD36       ankyrin_repeat_domain-containing_protein_36A
                                                              LOC100506076  hypothetical_LOC100506076,_transcript_variant_1
                                                              IGKV2OR2-10
                                                              IGKV2OR2-7
                                                              UBE3AP1
                                                              LOC100506123  hypothetical_LOC100506123,_transcript_variant_2
                                                              ANKRD36B      ankyrin_repeat_domain-containing_protein_36B



Posfai@neb.com
May 11, 2011