Distribution of restriction sites in the human genome

Enzyme:  MthZI               Longest uncut segments
Specificity:  CTAG               Repeats in uncut segments
Number of sites:  7771789               Genes in uncut segments
Mean distance between sites:  368 base pairs
Standard deviation:  389 base pairs
Site density2716.1 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   488694  chr15  NT_037852.6  1398729-1887423    0.10 % in   3 repeats    0.00 % in 0 genes
2   402628  chr6  NT_167244.1  2358890-2761518    0.19 % in   3 repeats    0.00 % in 0 genes
3   208757  chr6  NT_167244.1  4389676-4598433    0.40 % in   5 repeats    0.00 % in 0 genes
4   180505  chr6  NT_167244.1  3790225-3970730    0.11 % in   2 repeats    0.00 % in 0 genes
5   176944  chr6  NT_167244.1  3179054-3355998    0.26 % in   6 repeats    0.66 % in 1 genes
6   173085  chr6  NT_167247.1  4421672-4594757    0.29 % in   2 repeats    100.00 % in 1 genes
7   165722  chr6  NT_167249.1  2137316-2303038    0.40 % in   3 repeats    0.00 % in 0 genes
8   160865  chr6  NT_167248.1  520559-681424    0.97 % in   2 repeats    0.00 % in 0 genes
9   150233  chr9  NT_008470.19  21693103-21843336    0.11 % in   1 repeats    0.00 % in 0 genes
10   143817  chr6  NT_167244.1  2894518-3038335    0.64 % in   6 repeats    0.00 % in 0 genes
11   117520  chr6  NT_167245.1  2606109-2723629    0.11 % in   1 repeats    0.00 % in 0 genes
12   115819  chr6  NT_167247.1  1177477-1293296    0.16 % in   1 repeats    0.00 % in 0 genes
13   115142  chr6  NT_167246.1  3260236-3375378    0.57 % in   4 repeats    0.00 % in 0 genes
14   108837  chr6  NT_167245.1  137510-246347    0.86 % in   3 repeats    0.00 % in 0 genes
15   105535  chr6  NT_167244.1  1451123-1556658    0.83 % in   3 repeats    0.00 % in 0 genes
16   104987  chr6  NT_167244.1  588712-693699    0.69 % in   4 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
488694  chr15  NT_037852.6  1398729-1887423    3       MLT1L (1)  LTR33 (1)  AT_rich (1) 
402628  chr6  NT_167244.1  2358890-2761518    3       L4 (1)  AluSp (1)  AluJb (1) 
208757  chr6  NT_167244.1  4389676-4598433    5       (TTCC)n (1)  MER57-int (1)  AluSx (1) 
180505  chr6  NT_167244.1  3790225-3970730    2       MLT1H-int (1)  MER52D (1) 
176944  chr6  NT_167244.1  3179054-3355998    4       GC_rich (3)  Charlie4a (1)  (CCG)n (1) 
173085  chr6  NT_167247.1  4421672-4594757    2       MER11A (1)  AluSc (1) 
165722  chr6  NT_167249.1  2137316-2303038    3       L1MC4a (1)  AT_rich (1)  AluJo (1) 
160865  chr6  NT_167248.1  520559-681424    2       L1PREC2 (1)  HERVH-int (1) 
150233  chr9  NT_008470.19  21693103-21843336    1       L1M5 (1) 
10  143817  chr6  NT_167244.1  2894518-3038335    5       AluJo (2)  L1MC5 (1)  AluY (1) 
11  117520  chr6  NT_167245.1  2606109-2723629    1       L2a (1) 
12  115819  chr6  NT_167247.1  1177477-1293296    1       ERV3-16A3_I-int (1) 
13  115142  chr6  NT_167246.1  3260236-3375378    3       MIRb (2)  MIR3 (1)  AluSx (1) 
14  108837  chr6  NT_167245.1  137510-246347    3       MLT1F (1)  MLT1E2 (1)  LTR12C (1) 
15  105535  chr6  NT_167244.1  1451123-1556658    3       ERV3-16A3_I-int (1)  AluY (1)  AluSg1 (1) 
16  104987  chr6  NT_167244.1  588712-693699    2       L1MA9 (3)  L1PB1 (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
5   176944       chr6  NT_167244.1  3179054-3355998    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
6   173085       chr6  NT_167247.1  4421672-4594757    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011