Distribution of restriction sites in the human genome

Enzyme:  Tsp45I               Longest uncut segments
Specificity:  GTSAC               Repeats in uncut segments
Number of sites:  3939603               Genes in uncut segments
Mean distance between sites:  726 base pairs
Standard deviation:  804 base pairs
Site density1376.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   486963  chr15  NT_037852.6  1398505-1885468    0.01 % in   1 repeats    0.00 % in 0 genes
2   403062  chr6  NT_167244.1  2358315-2761377    0.33 % in   6 repeats    0.00 % in 0 genes
3   209523  chr6  NT_167244.1  4389076-4598599    0.77 % in   8 repeats    0.00 % in 0 genes
4   180945  chr6  NT_167244.1  3789844-3970789    0.25 % in   3 repeats    0.00 % in 0 genes
5   175579  chr6  NT_167244.1  3180258-3355837    0.19 % in   3 repeats    0.00 % in 0 genes
6   173035  chr6  NT_167247.1  4421399-4594434    0.14 % in   3 repeats    100.00 % in 1 genes
7   167309  chr6  NT_167247.1  1561587-1728896    0.87 % in   7 repeats    0.82 % in 1 genes
8   165401  chr6  NT_167249.1  2138289-2303690    0.28 % in   5 repeats    0.00 % in 0 genes
9   163363  chr6  NT_167248.1  518057-681420    2.48 % in   2 repeats    0.00 % in 0 genes
10   153156  chr9  NT_008470.19  21691903-21845059    0.88 % in   7 repeats    0.00 % in 0 genes
11   147969  chr6  NT_167244.1  2889743-3037712    2.58 % in   20 repeats    0.00 % in 0 genes
12   117902  chr6  NT_167245.1  2606003-2723905    0.32 % in   3 repeats    0.00 % in 0 genes
13   115072  chr6  NT_167247.1  1177623-1292695    0.04 % in   1 repeats    0.00 % in 0 genes
14   114564  chr6  NT_167246.1  3260282-3374846    0.27 % in   3 repeats    0.00 % in 0 genes
15   109853  chr6  NT_167245.1  136572-246425    1.77 % in   4 repeats    0.00 % in 0 genes
16   106152  chr6  NT_167244.1  587472-693624    1.11 % in   6 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
486963  chr15  NT_037852.6  1398505-1885468    1       AT_rich (1) 
403062  chr6  NT_167244.1  2358315-2761377    5       AluJb (2)  L4 (1)  L1ME4a (1) 
209523  chr6  NT_167244.1  4389076-4598599    7       MER57-int (2)  (TTCC)n (1)  L1MC (1) 
180945  chr6  NT_167244.1  3789844-3970789    3       MLT1H-int (1)  MER52D (1)  AluJb (1) 
175579  chr6  NT_167244.1  3180258-3355837    3       GC_rich (1)  Charlie4a (1)  AluSp (1) 
173035  chr6  NT_167247.1  4421399-4594434    3       MIR (1)  MER11A (1)  AluSc (1) 
167309  chr6  NT_167247.1  1561587-1728896    6       MIR (2)  L1MEe (1)  L1MC3 (1) 
165401  chr6  NT_167249.1  2138289-2303690    3       L1MB8 (2)  AluSx (2)  AT_rich (1) 
163363  chr6  NT_167248.1  518057-681420    2       L1PREC2 (1)  HERVH-int (1) 
10  153156  chr9  NT_008470.19  21691903-21845059    5       LTR67B (2)  L2 (2)  MSTA (1) 
11  147969  chr6  NT_167244.1  2889743-3037712    20  14       AluY (4)  LTR48 (2)  AluJo (2) 
12  117902  chr6  NT_167245.1  2606003-2723905    3       MLT1E2 (1)  L2a (1)  L2 (1) 
13  115072  chr6  NT_167247.1  1177623-1292695    1       ERV3-16A3_I-int (1) 
14  114564  chr6  NT_167246.1  3260282-3374846    3       MIRb (1)  MIR3 (1)  AluSx (1) 
15  109853  chr6  NT_167245.1  136572-246425    4       MLT1F (1)  MLT1E2 (1)  MER6 (1) 
16  106152  chr6  NT_167244.1  587472-693624    5       L1MA9 (2)  MER77 (1)  L1PB1 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
6   173035       chr6  NT_167247.1  4421399-4594434    LOC100507722  hypothetical_protein_LOC100507722
7   167309       chr6  NT_167247.1  1561587-1728896    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011