Distribution of restriction sites in the human genome

Enzyme:  EciI               Longest uncut segments
Specificity:  GGCGGA               Repeats in uncut segments
Number of sites:  566922               Genes in uncut segments
Mean distance between sites:  5047 base pairs
Standard deviation:  8085 base pairs
Site density 198.1 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   494303  chr15  NT_037852.6  1397194-1891497    0.79 % in   18 repeats    0.00 % in 0 genes
2   404492  chr6  NT_167244.1  2359914-2764406    0.12 % in   3 repeats    0.00 % in 0 genes
3   332453  chrY  NT_011875.12  8386841-8719294    82.26 % in   69 repeats    0.00 % in 0 genes
4   273968  chr11  NT_009237.18  48567064-48841032    93.17 % in   99 repeats    0.70 % in 2 genes
5   251792  chr10  NT_033985.7  1-251793    69.85 % in   207 repeats    6.68 % in 29 genes
6   235627  chr19  NT_011109.16  158653-394280    99.78 % in   58 repeats    0.00 % in 0 genes
7   208532  chr11  NT_009237.18  50412129-50620661    99.81 % in   38 repeats    0.00 % in 0 genes
8   208486  chr6  NT_167244.1  4389908-4598394    0.28 % in   3 repeats    0.00 % in 0 genes
9   201543  chrY  NT_011896.9  5171811-5373354    80.45 % in   309 repeats    0.00 % in 0 genes
10   199085  chr6  NT_167248.1  488877-687962    12.83 % in   40 repeats    0.00 % in 0 genes
11   196094  chr6  NT_167244.1  3788520-3984614    4.51 % in   36 repeats    0.00 % in 0 genes
12   195450  chr8  NT_167187.1  31440279-31635729    99.84 % in   44 repeats    0.00 % in 0 genes
13   185068  chr7  NT_007933.15  125964-311032    99.65 % in   36 repeats    0.00 % in 0 genes
14   184369  chr7  NT_007933.15  311032-495401    98.34 % in   48 repeats    0.00 % in 0 genes
15   183985  chrX  NT_011669.17  149628-333613    98.99 % in   44 repeats    0.00 % in 0 genes
16   177675  chr6  NT_007299.13  222853-400528    57.24 % in   104 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
494303  chr15  NT_037852.6  1397194-1891497    18  15       L2a (3)  L1M5 (2)  U2 (1) 
404492  chr6  NT_167244.1  2359914-2764406    3       L1MEg (1)  AluY (1)  AluSp (1) 
332453  chrY  NT_011875.12  8386841-8719294    69  32       LTR12B (17)  AT_rich (5)  AluY (5) 
273968  chr11  NT_009237.18  48567064-48841032    99  37       ALR/Alpha (28)  L1PA4 (9)  MLT1H1-int (5) 
251792  chr10  NT_033985.7  1-251793    207  43       ALR/Alpha (21)  HSATII (20)  (GAATG)n (17) 
235627  chr19  NT_011109.16  158653-394280    58  13       ALR/Alpha (35)  L1PA4 (7)  L1PA3 (5) 
208532  chr11  NT_009237.18  50412129-50620661    38  11       ALR/Alpha (20)  L1PA4 (4)  L1PA3 (3) 
208486  chr6  NT_167244.1  4389908-4598394    3       AluSx (1)  AluSg/x (1)  AluJo (1) 
201543  chrY  NT_011896.9  5171811-5373354    309  66       BSR/Beta (163)  AT_rich (11)  AluSc (9) 
10  199085  chr6  NT_167248.1  488877-687962    40  33       AT_rich (4)  MER4D (2)  L1PA14 (2) 
11  196094  chr6  NT_167244.1  3788520-3984614    36  28       L2a (6)  MLT1H-int (2)  L1M5 (2) 
12  195450  chr8  NT_167187.1  31440279-31635729    44  12       ALR/Alpha (25)  L1PA2 (3)  MER11C (2) 
13  185068  chr7  NT_007933.15  125964-311032    36  9       ALR/Alpha (20)  L1PA4 (6)  AluY (3) 
14  184369  chr7  NT_007933.15  311032-495401    48  14       ALR/Alpha (26)  AluY (4)  L1PA4 (3) 
15  183985  chrX  NT_011669.17  149628-333613    44  17       ALR/Alpha (22)  MER9B (2)  MER11D (2) 
16  177675  chr6  NT_007299.13  222853-400528    104  57       ALR/Alpha (16)  AT_rich (7)  MIRb (4) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   273968       chr11  NT_009237.18  48567064-48841032    OR4A42P 
OR4A44P 
5   251792       chr10  NT_033985.7  1-251793    LOC100506968 
LOC100506743 
LOC100506987 
LOC100507020  hypothetical_protein_LOC100507020
LOC100507045 
LOC100507078 
LOC100507104 
LOC100507129 
LOC100507154  hypothetical_protein_LOC100507154
LOC100507189 
LOC100507216 
LOC100507234 
LOC100507262 
LOC100507287 
LOC100507320 
LOC100507339  hypothetical_protein_LOC100507339
LOC100507366 
LOC100507385 
LOC100507409 
LOC100507432 
LOC100507451 
LOC100507471 
LOC100507491  hypothetical_protein_LOC100507491
LOC100507517 
LOC100507542  hypothetical_protein_LOC100507542
LOC100507565 
LOC100507597 
LOC100506770 
LOC100507622 



Posfai@neb.com
May 11, 2011