Distribution of restriction sites in the human genome

Enzyme:  PspPRI               Longest uncut segments
Specificity:  CCYCAG               Repeats in uncut segments
Number of sites:  5215637               Genes in uncut segments
Mean distance between sites:  548 base pairs
Standard deviation:  719 base pairs
Site density1822.8 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   493477  chr15  NT_037852.6  1398012-1891489    0.66 % in   14 repeats    0.00 % in 0 genes
2   401735  chr6  NT_167244.1  2359904-2761639    0.02 % in   1 repeats    0.00 % in 0 genes
3   208420  chr6  NT_167244.1  4389938-4598358    0.25 % in   3 repeats    0.00 % in 0 genes
4   181281  chr6  NT_167244.1  3789376-3970657    0.19 % in   4 repeats    0.00 % in 0 genes
5   175160  chr6  NT_167244.1  3180226-3355386    0.07 % in   2 repeats    0.00 % in 0 genes
6   172386  chr6  NT_167247.1  4421927-4594313    0.03 % in   2 repeats    100.00 % in 1 genes
7   165482  chr6  NT_167249.1  2138068-2303550    0.20 % in   3 repeats    0.00 % in 0 genes
8   165351  chr6  NT_167247.1  1562060-1727411    0.21 % in   4 repeats    0.54 % in 1 genes
9   159409  chr6  NT_167248.1  521867-681276    0.06 % in   2 repeats    0.00 % in 0 genes
10   150740  chr9  NT_008470.19  21693308-21844048    0.04 % in   1 repeats    0.00 % in 0 genes
11   148335  chr7  NT_023603.5  45811-194146    100.00 % in   2 repeats    0.00 % in 0 genes
12   143651  chr6  NT_167244.1  2894138-3037789    0.35 % in   5 repeats    0.00 % in 0 genes
13   117765  chr6  NT_167245.1  2606221-2723986    0.20 % in   3 repeats    0.00 % in 0 genes
14   109929  chr6  NT_167244.1  588274-698203    3.41 % in   14 repeats    0.00 % in 0 genes
15   108379  chr6  NT_167245.1  138028-246407    0.44 % in   3 repeats    0.00 % in 0 genes
16   104862  chr7  NT_007933.15  68186529-68291391    3.81 % in   7 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
493477  chr15  NT_037852.6  1398012-1891489    14  11       L2a (3)  L1M5 (2)  U2 (1) 
401735  chr6  NT_167244.1  2359904-2761639    1       AluSp (1) 
208420  chr6  NT_167244.1  4389938-4598358    3       AluSx (1)  AluSg/x (1)  AluJo (1) 
181281  chr6  NT_167244.1  3789376-3970657    4       MLT1H-int (1)  MER52D (1)  AT_rich (1) 
175160  chr6  NT_167244.1  3180226-3355386    2       GC_rich (1)  AluSp (1) 
172386  chr6  NT_167247.1  4421927-4594313    2       MER11A (1)  AluSc (1) 
165482  chr6  NT_167249.1  2138068-2303550    3       L1MB8 (1)  AT_rich (1)  AluSx (1) 
165351  chr6  NT_167247.1  1562060-1727411    4       MIR (1)  L1MC3 (1)  A-rich (1) 
159409  chr6  NT_167248.1  521867-681276    2       L1PREC2 (1)  HERVH-int (1) 
10  150740  chr9  NT_008470.19  21693308-21844048    1       MIR3 (1) 
11  148335  chr7  NT_023603.5  45811-194146    2       L1PA2 (1)  ALR/Alpha (1) 
12  143651  chr6  NT_167244.1  2894138-3037789    5       L1MC5 (1)  AluY (1)  AluSg1 (1) 
13  117765  chr6  NT_167245.1  2606221-2723986    3       MLT1E2 (1)  L2a (1)  L2 (1) 
14  109929  chr6  NT_167244.1  588274-698203    14  11       L1MA9 (3)  L1MC5 (2)  THE1D (1) 
15  108379  chr6  NT_167245.1  138028-246407    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
16  104862  chr7  NT_007933.15  68186529-68291391    5       L1PB1 (2)  L1MB7 (2)  (TC)n (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
6   172386       chr6  NT_167247.1  4421927-4594313    LOC100507722  hypothetical_protein_LOC100507722
8   165351       chr6  NT_167247.1  1562060-1727411    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011