Distribution of restriction sites in the human genome

Enzyme:  XmnI               Longest uncut segments
Specificity:  GAANNNNTTC               Repeats in uncut segments
Number of sites:  991866               Genes in uncut segments
Mean distance between sites:  2884 base pairs
Standard deviation:  3093 base pairs
Site density 346.6 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   493209  chr15  NT_037852.6  1396782-1889991    0.67 % in   15 repeats    0.00 % in 0 genes
2   406574  chr6  NT_167244.1  2359559-2766133    0.48 % in   8 repeats    0.00 % in 0 genes
3   241256  chr6  NT_167244.1  3179375-3420631    3.43 % in   36 repeats    5.52 % in 2 genes
4   213363  chr6  NT_167244.1  4386752-4600115    2.16 % in   16 repeats    0.00 % in 0 genes
5   192621  chr6  NT_167244.1  3780489-3973110    2.91 % in   27 repeats    3.64 % in 1 genes
6   184051  chr6  NT_167247.1  4419165-4603216    2.90 % in   28 repeats    98.76 % in 1 genes
7   166078  chr6  NT_167248.1  516627-682705    4.07 % in   4 repeats    0.00 % in 0 genes
8   165899  chr6  NT_167247.1  1561043-1726942    0.62 % in   3 repeats    1.15 % in 1 genes
9   165094  chr6  NT_167249.1  2138167-2303261    0.03 % in   1 repeats    0.00 % in 0 genes
10   158165  chr6  NT_167244.1  2007226-2165391    0.79 % in   5 repeats    0.00 % in 0 genes
11   150354  chr9  NT_008470.19  21693139-21843493    0.08 % in   1 repeats    0.00 % in 0 genes
12   144386  chr6  NT_167244.1  2893292-3037678    0.42 % in   5 repeats    0.00 % in 0 genes
13   124513  chr6  NT_167245.1  2602392-2726905    4.32 % in   13 repeats    0.00 % in 0 genes
14   124329  chrX  NT_011786.16  4272338-4396667    12.83 % in   73 repeats    0.00 % in 0 genes
15   124175  chr6  NT_167247.1  1170925-1295100    4.19 % in   11 repeats    0.00 % in 0 genes
16   120496  chr6  NT_167244.1  1829807-1950303    6.72 % in   32 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
493209  chr15  NT_037852.6  1396782-1889991    15  12       L2a (3)  L1M5 (2)  U2 (1) 
406574  chr6  NT_167244.1  2359559-2766133    6       LTR84b (2)  AluY (2)  MLT1B (1) 
241256  chr6  NT_167244.1  3179375-3420631    36  18       AluSx (5)  MIR (4)  GC_rich (4) 
213363  chr6  NT_167244.1  4386752-4600115    16  12       MER57-int (3)  AluSx (3)  (TTTTA)n (1) 
192621  chr6  NT_167244.1  3780489-3973110    27  19       L2a (5)  MLT1H-int (2)  AT_rich (2) 
184051  chr6  NT_167247.1  4419165-4603216    28  22       AluSx (3)  MLT1J (2)  L1MC5 (2) 
166078  chr6  NT_167248.1  516627-682705    4       LTR7 (1)  L1PREC2 (1)  L1P4 (1) 
165899  chr6  NT_167247.1  1561043-1726942    3       MIRc (1)  L1MC3 (1)  A-rich (1) 
165094  chr6  NT_167249.1  2138167-2303261    1       AT_rich (1) 
10  158165  chr6  NT_167244.1  2007226-2165391    4       AluSx (2)  MIRb (1)  MIR (1) 
11  150354  chr9  NT_008470.19  21693139-21843493    1       L1M5 (1) 
12  144386  chr6  NT_167244.1  2893292-3037678    5       (TCC)n (1)  L1MC5 (1)  AluY (1) 
13  124513  chr6  NT_167245.1  2602392-2726905    13  10       Tigger1 (2)  MLT1N2 (2)  L2 (2) 
14  124329  chrX  NT_011786.16  4272338-4396667    73  21       MER33 (14)  AluSx (14)  AluSc (13) 
15  124175  chr6  NT_167247.1  1170925-1295100    11  7       L2 (3)  MIRb (2)  ERV3-16A3_I-int (2) 
16  120496  chr6  NT_167244.1  1829807-1950303    32  19       AluSx (5)  L1MC4 (4)  AluJo (3) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   241256       chr6  NT_167244.1  3179375-3420631    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
5   192621       chr6  NT_167244.1  3780489-3973110    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
6   184051       chr6  NT_167247.1  4419165-4603216    LOC100507722  hypothetical_protein_LOC100507722
8   165899       chr6  NT_167247.1  1561043-1726942    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011