Distribution of restriction sites in the human genome

Enzyme:  AseI
Specificity:  ATTAAT
Number of sites:  1455335
Mean distance between sites:  1966 base pairs
Standard deviation:  2711 base pairs
Site density:  508.6 per megabase
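
The summary figures above can be reproduced in outline on any sequence. The following is a minimal sketch, not the method used to generate this report: the function names `site_positions` and `spacing_stats` are hypothetical, distances are measured between the start positions of consecutive sites, and the population standard deviation is an assumption (the report does not say which estimator it uses). The reported density is consistent with the reciprocal of the mean distance: 1,000,000 / 1966 is approximately 508.6.

```python
import re
from statistics import mean, pstdev

SITE = "ATTAAT"  # AseI recognition sequence; palindromic, so scanning one strand is enough


def site_positions(seq, site=SITE):
    """Return 0-based start positions of every occurrence of `site`.

    A zero-width lookahead is used so that overlapping occurrences
    (e.g. ATTAATTAAT contains two sites) are all reported.
    """
    pattern = re.compile("(?=" + re.escape(site) + ")")
    return [m.start() for m in pattern.finditer(seq.upper())]


def spacing_stats(positions, seq_length):
    """Summarise the spacing between consecutive sites (hypothetical helper)."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {
        "sites": len(positions),
        "mean_distance": mean(gaps) if gaps else None,
        # Assumption: population standard deviation; the report may use another estimator.
        "std_distance": pstdev(gaps) if gaps else None,
        "density_per_mb": len(positions) * 1_000_000 / seq_length,
    }


if __name__ == "__main__":
    # Toy sequence with three sites, not real genome data.
    toy = "ATTAAT" + "C" * 4 + "ATTAAT" + "G" * 14 + "ATTAAT"
    print(spacing_stats(site_positions(toy), len(toy)))
```

For a whole genome the same calculation would be run per scaffold so that distances are never measured across scaffold boundaries.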


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]

Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   487370  chr15  NT_037852.6  1398325-1885695    0.01 % in   1 repeats    0.00 % in 0 genes
2   402118  chr6  NT_167244.1  2359284-2761402    0.11 % in   2 repeats    0.00 % in 0 genes
3   246259  chr6  NT_167244.1  2009328-2255587    0.99 % in   12 repeats    1.80 % in 2 genes
4   213090  chr6  NT_167244.1  4389381-4602471    1.94 % in   15 repeats    0.00 % in 0 genes
5   186692  chr6  NT_167244.1  3177602-3364294    2.59 % in   26 repeats    5.75 % in 2 genes
6   185437  chr6  NT_167244.1  3789809-3975246    2.11 % in   16 repeats    0.00 % in 0 genes
7   182298  chr6  NT_167247.1  4412589-4594887    1.95 % in   13 repeats    97.52 % in 1 genes
8   172253  chr6  NT_167247.1  1561459-1733712    2.43 % in   18 repeats    0.87 % in 1 genes
9   169999  chr6  NT_167249.1  2135559-2305558    2.34 % in   16 repeats    0.00 % in 0 genes
10   168607  chr4  NT_006316.16  391056-559663    5.29 % in   61 repeats    0.00 % in 0 genes
11   162179  chr6  NT_167248.1  520031-682210    1.77 % in   2 repeats    0.00 % in 0 genes
12   150718  chr9  NT_008470.19  21692744-21843462    0.34 % in   2 repeats    0.00 % in 0 genes
13   146385  chr6  NT_167244.1  2892152-3038537    1.71 % in   12 repeats    0.00 % in 0 genes
14   138349  chr12  NT_009714.17  27190245-27328594    83.94 % in   134 repeats    0.00 % in 0 genes
15   135523  chr1  NT_004350.19  2833548-2969071    18.04 % in   131 repeats    0.00 % in 0 genes
16   128533  chr6  NT_167245.1  2602423-2730956    5.32 % in   21 repeats    0.00 % in 0 genes
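
In the table above, each Length equals the difference between the two coordinates (for example, 1885695 - 1398325 = 487370). A minimal sketch of how such a ranking and the percentage columns could be derived from a list of site positions is shown below. The helper names `longest_uncut_segments` and `percent_covered` are hypothetical, and treating annotations as half-open intervals whose overlaps are counted once is an assumption, not a description of how this report was produced.

```python
def longest_uncut_segments(positions, top=16):
    """Return (length, start, end) tuples for the largest gaps between consecutive sites."""
    ordered = sorted(positions)
    gaps = [(b - a, a, b) for a, b in zip(ordered, ordered[1:])]
    return sorted(gaps, reverse=True)[:top]


def percent_covered(start, end, intervals):
    """Percent of the segment [start, end) covered by the union of `intervals`.

    `intervals` is an iterable of (start, end) pairs, e.g. repeat or gene
    annotations. Overlapping annotations are counted only once (assumption).
    """
    clipped = sorted(
        (max(s, start), min(e, end))
        for s, e in intervals
        if s < end and e > start
    )
    covered = 0
    cursor = start  # everything left of the cursor has already been counted
    for s, e in clipped:
        s = max(s, cursor)
        if e > s:
            covered += e - s
            cursor = e
    return 100.0 * covered / (end - start)


if __name__ == "__main__":
    # Toy positions and annotations, not real genome data.
    print(longest_uncut_segments([0, 10, 30, 35], top=2))
    print(percent_covered(10, 30, [(5, 12), (11, 15), (28, 40)]))
```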


Repeats in longest uncut segments
#  Length  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1  487370  chr15  NT_037852.6  1398325-1885695    1       AT_rich (1) 
2  402118  chr6  NT_167244.1  2359284-2761402    2       L4 (1)  AluSp (1) 
3  246259  chr6  NT_167244.1  2009328-2255587    12  11       MIRb (2)  MIR (1)  MER5A1 (1) 
4  213090  chr6  NT_167244.1  4389381-4602471    15  11       MER57-int (2)  HERVH-int (2)  AluSx (2) 
5  186692  chr6  NT_167244.1  3177602-3364294    26  15       AluSx (4)  MIR (3)  GC_rich (3) 
6  185437  chr6  NT_167244.1  3789809-3975246    16  14       MLT1H-int (2)  L2a (2)  THE1B (1) 
7  182298  chr6  NT_167247.1  4412589-4594887    13  11       L2b (3)  MIRc (1)  MIRb (1) 
8  172253  chr6  NT_167247.1  1561459-1733712    18  14       MSTB (2)  MIR (2)  L1MEf (2) 
9  169999  chr6  NT_167249.1  2135559-2305558    16  9       AluSx (4)  L1MB8 (3)  MLT1A (2) 
10  168607  chr4  NT_006316.16  391056-559663    61  8       (CA)n (47)  L1M4 (7)  L1PA10 (2) 
11  162179  chr6  NT_167248.1  520031-682210    2       L1PREC2 (1)  HERVH-int (1) 
12  150718  chr9  NT_008470.19  21692744-21843462    2       LTR67B (1)  L1M5 (1) 
13  146385  chr6  NT_167244.1  2892152-3038537    12  9       L1MC5 (2)  AluY (2)  AluJo (2) 
14  138349  chr12  NT_009714.17  27190245-27328594    134  12       GSATII (116)  GSATX (4)  ALR/Alpha (4) 
15  135523  chr1  NT_004350.19  2833548-2969071    131  52       MIR3 (18)  MIR (8)  L1MEf (8) 
16  128533  chr6  NT_167245.1  2602423-2730956    21  18       Tigger1 (2)  MLT1N2 (2)  L2 (2) 


Genes in longest uncut segments
Segment   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   246259       chr6  NT_167244.1  2009328-2255587    FLOT1  flotillin-1
3   246259       chr6  NT_167244.1  2009328-2255587    DDR1  epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
5   186692       chr6  NT_167244.1  3177602-3364294    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   186692       chr6  NT_167244.1  3177602-3364294    TNXB  tenascin-X_isoform_1_precursor
7   182298       chr6  NT_167247.1  4412589-4594887    LOC100507722  hypothetical_protein_LOC100507722
8   172253       chr6  NT_167247.1  1561459-1733712    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011