Distribution of restriction sites in the human genome

Enzyme:  AhdI               Longest uncut segments
Specificity:  GACNNNNNGTC               Repeats in uncut segments
Number of sites:  271412               Genes in uncut segments
Mean distance between sites:  10542 base pairs
Standard deviation:  11087 base pairs
Site density 94.9 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   509008  chr15  NT_037852.6  1379583-1888591    3.08 % in   51 repeats    0.00 % in 0 genes
2   409016  chr6  NT_167244.1  2354695-2763711    0.94 % in   18 repeats    0.00 % in 0 genes
3   316333  chrY  NT_011875.12  8413928-8730261    82.59 % in   66 repeats    0.25 % in 1 genes
4   216496  chr6  NT_167244.1  4385866-4602362    3.29 % in   20 repeats    0.00 % in 0 genes
5   206740  chr6  NT_167244.1  3780132-3986872    6.26 % in   46 repeats    3.56 % in 1 genes
6   195184  chr6  NT_167248.1  517395-712579    10.31 % in   41 repeats    2.20 % in 3 genes
7   189424  chr6  NT_167247.1  4410526-4599950    3.54 % in   28 repeats    97.11 % in 2 genes
8   188895  chr6  NT_167244.1  3175357-3364252    3.53 % in   39 repeats    6.85 % in 2 genes
9   177193  chr6  NT_167247.1  1556910-1734103    4.15 % in   34 repeats    0.00 % in 0 genes
10   175657  chr6  NT_167249.1  2131892-2307549    4.60 % in   35 repeats    0.00 % in 0 genes
11   170333  chr6  NT_167244.1  2883745-3054078    8.15 % in   72 repeats    0.00 % in 0 genes
12   169197  chr8  NT_167187.1  31411024-31580221    99.81 % in   43 repeats    0.00 % in 0 genes
13   169020  chr6  NT_167244.1  1997092-2166112    2.68 % in   23 repeats    0.00 % in 0 genes
14   162771  chr7  NT_023603.5  32659-195430    100.00 % in   5 repeats    0.00 % in 0 genes
15   156815  chr9  NT_008470.19  21686855-21843670    3.05 % in   13 repeats    0.00 % in 0 genes
16   150190  chrX  NT_011786.16  4261398-4411588    21.23 % in   117 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
509008  chr15  NT_037852.6  1379583-1888591    51  29       Tigger2 (6)  L1MDa (6)  Charlie5 (3) 
409016  chr6  NT_167244.1  2354695-2763711    18  11       L1ME4a (3)  AluJb (3)  MLT2D (2) 
316333  chrY  NT_011875.12  8413928-8730261    66  27       LTR12B (17)  L1PA16 (7)  L1ME3A (6) 
216496  chr6  NT_167244.1  4385866-4602362    20  13       MER57-int (3)  AluSx (3)  HERVH-int (2) 
206740  chr6  NT_167244.1  3780132-3986872    46  33       L2a (9)  MLT1H-int (2)  L1M5 (2) 
195184  chr6  NT_167248.1  517395-712579    41  27       AT_rich (7)  L2c (3)  L2b (3) 
189424  chr6  NT_167247.1  4410526-4599950    28  21       L2b (3)  AluSx (3)  MLT1J (2) 
188895  chr6  NT_167244.1  3175357-3364252    39  20       AluSx (8)  L1MB3 (4)  MIR (3) 
177193  chr6  NT_167247.1  1556910-1734103    34  26       L2c (3)  Tigger7 (2)  MSTD (2) 
10  175657  chr6  NT_167249.1  2131892-2307549    35  17       Charlie2b (6)  AluSx (6)  L1MB8 (3) 
11  170333  chr6  NT_167244.1  2883745-3054078    72  32       AluY (7)  L1MC5 (6)  AluSc (5) 
12  169197  chr8  NT_167187.1  31411024-31580221    43  14       ALR/Alpha (25)  LTR14C (2)  L1PA5 (2) 
13  169020  chr6  NT_167244.1  1997092-2166112    23  16       AluSx (4)  FRAM (2)  AluY (2) 
14  162771  chr7  NT_023603.5  32659-195430    2       L1PA2 (4)  ALR/Alpha (1) 
15  156815  chr9  NT_008470.19  21686855-21843670    13  9       MER5B (2)  LTR67B (2)  L1M5 (2) 
16  150190  chrX  NT_011786.16  4261398-4411588    117  43       AluSx (16)  MER33 (14)  AluSc (13) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   316333       chrY  NT_011875.12  8413928-8730261    ZNF884P 
5   206740       chr6  NT_167244.1  3780132-3986872    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
6   195184       chr6  NT_167248.1  517395-712579    OR12D1P 
OR11A1  olfactory_receptor_11A1
OR10C1  olfactory_receptor_10C1
7   189424       chr6  NT_167247.1  4410526-4599950    COL11A2P 
LOC100507722  hypothetical_protein_LOC100507722
8   188895       chr6  NT_167244.1  3175357-3364252    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor



Posfai@neb.com
May 11, 2011