Distribution of restriction sites in the human genome

Enzyme:  Cac8I               Longest uncut segments
Specificity:  GCNNGC               Repeats in uncut segments
Number of sites:  5401837               Genes in uncut segments
Mean distance between sites:  529 base pairs
Standard deviation:  719 base pairs
Site density1887.9 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   488114  chr15  NT_037852.6  1397191-1885305    0.14 % in   5 repeats    0.00 % in 0 genes
2   401879  chr6  NT_167244.1  2359824-2761703    0.04 % in   1 repeats    0.00 % in 0 genes
3   209319  chr6  NT_167244.1  4389476-4598795    0.67 % in   8 repeats    0.00 % in 0 genes
4   181068  chr6  NT_167244.1  3790326-3971394    0.42 % in   5 repeats    0.00 % in 0 genes
5   175497  chr6  NT_167244.1  3180241-3355738    0.19 % in   3 repeats    0.00 % in 0 genes
6   173291  chr6  NT_167247.1  4422134-4595425    0.68 % in   3 repeats    100.00 % in 1 genes
7   167148  chr6  NT_167247.1  1562797-1729945    0.83 % in   7 repeats    0.09 % in 1 genes
8   165141  chr6  NT_167249.1  2137913-2303054    0.08 % in   2 repeats    0.00 % in 0 genes
9   159537  chr6  NT_167248.1  521816-681353    0.14 % in   2 repeats    0.00 % in 0 genes
10   150886  chr9  NT_008470.19  21693057-21843943    0.18 % in   2 repeats    0.00 % in 0 genes
11   143312  chr6  NT_167244.1  2894544-3037856    0.32 % in   5 repeats    0.00 % in 0 genes
12   119259  chr6  NT_167245.1  2604812-2724071    1.37 % in   5 repeats    0.00 % in 0 genes
13   115185  chr6  NT_167247.1  1177155-1292340    0.44 % in   1 repeats    0.00 % in 0 genes
14   114727  chr6  NT_167246.1  3260724-3375451    0.41 % in   3 repeats    0.00 % in 0 genes
15   110025  chr6  NT_167244.1  588173-698198    3.41 % in   14 repeats    0.00 % in 0 genes
16   108051  chr6  NT_167245.1  138036-246087    0.14 % in   2 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
488114  chr15  NT_037852.6  1397191-1885305    5       MIRc (1)  MIRb (1)  L1M3 (1) 
401879  chr6  NT_167244.1  2359824-2761703    1       AluSp (1) 
209319  chr6  NT_167244.1  4389476-4598795    7       AluSx (2)  (TTCC)n (1)  MER57-int (1) 
181068  chr6  NT_167244.1  3790326-3971394    5       MLT1H-int (1)  MER52D (1)  LTR19B (1) 
175497  chr6  NT_167244.1  3180241-3355738    3       GC_rich (1)  Charlie4a (1)  AluSp (1) 
173291  chr6  NT_167247.1  4422134-4595425    3       MER11A (1)  AluSg/x (1)  AluSc (1) 
167148  chr6  NT_167247.1  1562797-1729945    5       MIR (2)  L1MEe (2)  (GGAA)n (1) 
165141  chr6  NT_167249.1  2137913-2303054    2       L1MC4a (1)  AT_rich (1) 
159537  chr6  NT_167248.1  521816-681353    2       L1PREC2 (1)  HERVH-int (1) 
10  150886  chr9  NT_008470.19  21693057-21843943    2       MIR3 (1)  L1M5 (1) 
11  143312  chr6  NT_167244.1  2894544-3037856    5       L1MC5 (1)  AluY (1)  AluSp (1) 
12  119259  chr6  NT_167245.1  2604812-2724071    5       MLT1E2 (1)  MER5B (1)  MER5A1 (1) 
13  115185  chr6  NT_167247.1  1177155-1292340    1       ERV3-16A3_I-int (1) 
14  114727  chr6  NT_167246.1  3260724-3375451    2       MIRb (2)  AluSx (1) 
15  110025  chr6  NT_167244.1  588173-698198    14  11       L1MA9 (3)  L1MC5 (2)  THE1D (1) 
16  108051  chr6  NT_167245.1  138036-246087    2       MLT1E2 (1)  LTR12C (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
6   173291       chr6  NT_167247.1  4422134-4595425    LOC100507722  hypothetical_protein_LOC100507722
7   167148       chr6  NT_167247.1  1562797-1729945    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011