Distribution of restriction sites in the human genome

Enzyme:  MspA1I
Specificity:  CMGCKG
Number of sites:  1437339
Mean distance between sites:  1990 base pairs
Standard deviation:  2663 base pairs
Site density:  502.3 per megabase
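
The specificity CMGCKG uses IUPAC degenerate codes (M = A or C, K = G or T), and the site is its own reverse complement, so scanning one strand is enough. The following is a minimal sketch of how such sites could be located and how the spacing statistics above could be computed for a sequence. The function names (`find_sites`, `spacing_stats`, `density_per_megabase`) and the toy sequence are illustrative assumptions, not the code used to produce this report.

```python
import re
from statistics import mean, pstdev

# IUPAC nucleotide codes mapped to regular-expression character classes
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "M": "[AC]", "K": "[GT]", "R": "[AG]", "Y": "[CT]",
    "S": "[CG]", "W": "[AT]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

def find_sites(seq, specificity):
    """Return 0-based start positions of every match, including overlapping ones."""
    body = "".join(IUPAC[base] for base in specificity.upper())
    pattern = re.compile("(?=(" + body + "))")  # lookahead allows overlaps
    return [m.start() for m in pattern.finditer(seq.upper())]

def spacing_stats(positions):
    """Mean and population standard deviation of distances between adjacent sites."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)

def density_per_megabase(n_sites, sequence_length):
    """Number of sites per 1,000,000 base pairs."""
    return n_sites * 1e6 / sequence_length

# Toy sequence (hypothetical) containing CAGCTG and CCGCGG
toy = "TTCAGCTGAAAACCGCGGTT"
print(find_sites(toy, "CMGCKG"))  # [2, 12]
```

Whether the report uses the population or the sample standard deviation is not stated; `pstdev` is an assumption here.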


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
#   Length (bp)  Chr    Scaffold      Coordinates         Repeat content            Gene content
1   496981       chr15  NT_037852.6   1394367-1891348       1.00 % in  22 repeats     0.00 % in 0 genes
2   404799       chr6   NT_167244.1   2358973-2763772       0.22 % in   4 repeats     0.00 % in 0 genes
3   210622       chr6   NT_167244.1   4388009-4598631       1.28 % in   8 repeats     0.00 % in 0 genes
4   186168       chr6   NT_167244.1   3785363-3971531       1.28 % in  13 repeats     1.14 % in 1 gene
5   175637       chr6   NT_167244.1   3180187-3355824       0.22 % in   4 repeats     0.02 % in 1 gene
6   174538       chrY   NT_011875.12  8473974-8648512      71.28 % in   9 repeats     0.00 % in 0 genes
7   173760       chr7   NT_023603.5   23397-197157         99.45 % in  11 repeats     0.00 % in 0 genes
8   172370       chr6   NT_167247.1   4422047-4594417       0.10 % in   2 repeats   100.00 % in 1 gene
9   171711       chr6   NT_167247.1   1557481-1729192       2.44 % in  21 repeats     0.00 % in 0 genes
10  171002       chr6   NT_167249.1   2136314-2307316       2.83 % in  22 repeats     0.00 % in 0 genes
11  161594       chr6   NT_167248.1   520235-681829         1.41 % in   2 repeats     0.00 % in 0 genes
12  151519       chr9   NT_008470.19  21692304-21843823     0.51 % in   4 repeats     0.00 % in 0 genes
13  147462       chr6   NT_167244.1   2893759-3041221       2.24 % in  20 repeats     0.00 % in 0 genes
14  132842       chr9   NT_113916.2   28585-161427         99.97 % in   3 repeats     0.00 % in 0 genes
15  120835       chr6   NT_167245.1   2603992-2724827       2.38 % in   8 repeats     0.00 % in 0 genes
16  119548       chr10  NT_008705.16  38711078-38830626    27.48 % in 218 repeats     0.00 % in 0 genes
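
In every row above, the segment length equals the difference between the end and start coordinates (for segment 1, 1891348 - 1394367 = 496981). A small sketch of that consistency check, assuming rows are whitespace-separated with the length in the second field and the coordinates in the fifth; the helper name `length_matches_coordinates` is hypothetical.

```python
def length_matches_coordinates(row):
    """Check that the Length field equals end - start from the Coordinates field."""
    fields = row.split()
    length = int(fields[1])                       # second column: segment length
    start, end = (int(x) for x in fields[4].split("-"))  # fifth column: start-end
    return end - start == length

row = "1   496981  chr15  NT_037852.6  1394367-1891348    1.00 % in   22 repeats    0.00 % in 0 genes"
print(length_matches_coordinates(row))  # True
```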


Repeats in longest uncut segments
                                                                  Repeats
#   Length (bp)  Chr    Scaffold      Coordinates         Total  Distinct  Most           Second         Third
1   496981       chr15  NT_037852.6   1394367-1891348     22     18        L2a (3)        L1MDa (2)      L1M5 (2)
2   404799       chr6   NT_167244.1   2358973-2763772     4      4         L4 (1)         L1MEg (1)      AluSp (1)
3   210622       chr6   NT_167244.1   4388009-4598631     8      7         MER57-int (2)  (TTCC)n (1)    L1MC (1)
4   186168       chr6   NT_167244.1   3785363-3971531     13     11        AT_rich (2)    AluJb (2)      MLT1H-int (1)
5   175637       chr6   NT_167244.1   3180187-3355824     4      4         GC_rich (1)    Charlie4a (1)  (CCG)n (1)
6   174538       chrY   NT_011875.12  8473974-8648512     9      2         LTR12B (8)     LTR12D (1)
7   173760       chr7   NT_023603.5   23397-197157        11     5         L1PA2 (4)      ALR/Alpha (3)  AT_rich (2)
8   172370       chr6   NT_167247.1   4422047-4594417     2      2         MER11A (1)     AluSc (1)
9   171711       chr6   NT_167247.1   1557481-1729192     21     16        Tigger7 (2)    MSTD (2)       MIR (2)
10  171002       chr6   NT_167249.1   2136314-2307316     22     12        Charlie2b (6)  AluSx (4)      L1MB8 (3)
11  161594       chr6   NT_167248.1   520235-681829       2      2         L1PREC2 (1)    HERVH-int (1)
12  151519       chr9   NT_008470.19  21692304-21843823   4      3         LTR67B (2)     MIR3 (1)       L1M5 (1)
13  147462       chr6   NT_167244.1   2893759-3041221     20     10        L1MC5 (6)      AluSc (3)      L2c (2)
14  132842       chr9   NT_113916.2   28585-161427        3      2         ALR/Alpha (2)  SUBTEL_sa (1)
15  120835       chr6   NT_167245.1   2603992-2724827     8      7         L2 (2)         MLT1N2 (1)     MLT1E2 (1)
16  119548       chr10  NT_008705.16  38711078-38830626   218    30        GA-rich (24)   (GAATG)n (22)  (AAATG)n (22)


Genes in longest uncut segments
Segment  Length (bp)  Chr   Scaffold     Coordinates      Gene symbol   Gene function
4        186168       chr6  NT_167244.1  3785363-3971531  HLA-DRB3      major histocompatibility complex, class II, DR beta 3 precursor
5        175637       chr6  NT_167244.1  3180187-3355824  EHMT2         histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
8        172370       chr6  NT_167247.1  4422047-4594417  LOC100507722  hypothetical protein LOC100507722



Posfai@neb.com
May 11, 2011