Distribution of restriction sites in the human genome

Enzyme:  EsaSSI               Longest uncut segments
Specificity:  GACCAC               Repeats in uncut segments
Number of sites:  885065               Genes in uncut segments
Mean distance between sites:  3232 base pairs
Standard deviation:  3570 base pairs
Site density 309.3 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   492422  chr15  NT_037852.6  1394848-1887270    0.34 % in   10 repeats    0.00 % in 0 genes
2   404767  chr6  NT_167244.1  2356570-2761337    0.59 % in   11 repeats    0.00 % in 0 genes
3   211338  chr6  NT_167244.1  4389289-4600627    1.12 % in   14 repeats    0.00 % in 0 genes
4   188788  chr6  NT_167244.1  3784593-3973381    2.36 % in   23 repeats    1.54 % in 1 genes
5   179839  chr6  NT_167244.1  3176354-3356193    1.36 % in   22 repeats    2.16 % in 2 genes
6   178797  chr6  NT_167247.1  4421412-4600209    2.08 % in   18 repeats    100.00 % in 1 genes
7   177835  chr6  NT_167247.1  1561751-1739586    5.35 % in   30 repeats    0.68 % in 1 genes
8   174083  chr6  NT_167249.1  2130768-2304851    3.74 % in   29 repeats    0.00 % in 0 genes
9   161670  chr6  NT_167248.1  519803-681473    1.46 % in   2 repeats    0.00 % in 0 genes
10   161477  chr7  NT_023603.5  39940-201417    100.00 % in   4 repeats    0.00 % in 0 genes
11   158251  chr9  NT_008470.19  21689652-21847903    2.71 % in   16 repeats    0.00 % in 0 genes
12   156858  chr6  NT_167244.1  2009643-2166501    0.08 % in   1 repeats    0.00 % in 0 genes
13   149491  chr6  NT_167244.1  2888320-3037811    3.49 % in   27 repeats    0.00 % in 0 genes
14   125294  chr6  NT_167244.1  568599-693893    9.60 % in   38 repeats    0.00 % in 0 genes
15   120042  chr6  NT_167245.1  133350-253392    6.24 % in   22 repeats    0.00 % in 0 genes
16   119577  chr6  NT_167245.1  2605140-2724717    1.71 % in   4 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
492422  chr15  NT_037852.6  1394848-1887270    10  10       MLT1L (1)  MIRc (1)  MIRb (1) 
404767  chr6  NT_167244.1  2356570-2761337    11  8       AluJb (3)  L4 (2)  MER8 (1) 
211338  chr6  NT_167244.1  4389289-4600627    14  11       MER57-int (2)  AluSx (2)  AluSg/x (2) 
188788  chr6  NT_167244.1  3784593-3973381    23  16       L2a (4)  MLT1H-int (2)  AT_rich (2) 
179839  chr6  NT_167244.1  3176354-3356193    22  14       AluSx (4)  GC_rich (3)  MER44B (2) 
178797  chr6  NT_167247.1  4421412-4600209    18  14       AluSx (3)  MLT1J (2)  L1MC5 (2) 
177835  chr6  NT_167247.1  1561751-1739586    30  19       L1PB2 (4)  L1MEf (3)  MSTB (2) 
174083  chr6  NT_167249.1  2130768-2304851    29  16       AluSx (5)  L1MB8 (3)  AluJo (3) 
161670  chr6  NT_167248.1  519803-681473    2       L1PREC2 (1)  HERVH-int (1) 
10  161477  chr7  NT_023603.5  39940-201417    2       L1PA2 (2)  ALR/Alpha (2) 
11  158251  chr9  NT_008470.19  21689652-21847903    16  12       MIRb (2)  LTR67B (2)  L2 (2) 
12  156858  chr6  NT_167244.1  2009643-2166501    1       MER5A1 (1) 
13  149491  chr6  NT_167244.1  2888320-3037811    27  17       AluY (4)  AluJb (3)  LTR48 (2) 
14  125294  chr6  NT_167244.1  568599-693893    38  28       L1MA9 (3)  L1M5 (3)  AT_rich (3) 
15  120042  chr6  NT_167245.1  133350-253392    22  20       L2c (2)  AluSx (2)  (TTTC)n (1) 
16  119577  chr6  NT_167245.1  2605140-2724717    3       L2 (2)  MLT1E2 (1)  L2a (1) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   188788       chr6  NT_167244.1  3784593-3973381    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
5   179839       chr6  NT_167244.1  3176354-3356193    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
6   178797       chr6  NT_167247.1  4421412-4600209    LOC100507722  hypothetical_protein_LOC100507722
7   177835       chr6  NT_167247.1  1561751-1739586    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011