Distribution of restriction sites in the human genome

Enzyme:  ApaLI               Longest uncut segments
Specificity:  GTGCAC               Repeats in uncut segments
Number of sites:  490478               Genes in uncut segments
Mean distance between sites:  5833 base pairs
Standard deviation:  6348 base pairs
Site density 171.4 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   506895  chr15  NT_037852.6  1391725-1898620    1.77 % in   35 repeats    1.09 % in 1 genes
2   418253  chr6  NT_167244.1  2354114-2772367    2.01 % in   37 repeats    0.00 % in 0 genes
3   265935  chrY  NT_011875.12  8413146-8679081    80.15 % in   25 repeats    0.00 % in 0 genes
4   264239  chr6  NT_167244.1  2005363-2269602    4.20 % in   51 repeats    5.01 % in 4 genes
5   245332  chr6  NT_167244.1  3172337-3417669    4.43 % in   58 repeats    8.29 % in 2 genes
6   211309  chr6  NT_167244.1  4388443-4599752    1.38 % in   12 repeats    0.00 % in 0 genes
7   188467  chr6  NT_167248.1  504755-693222    12.39 % in   27 repeats    0.60 % in 1 genes
8   186114  chr6  NT_167244.1  3785464-3971578    1.25 % in   13 repeats    1.09 % in 1 genes
9   179789  chr6  NT_167249.1  2131494-2311283    5.47 % in   46 repeats    0.00 % in 0 genes
10   178097  chr6  NT_167247.1  4416556-4594653    0.64 % in   7 repeats    0.00 % in 0 genes
11   173009  chr6  NT_167247.1  1562903-1735912    3.26 % in   19 repeats    0.00 % in 0 genes
12   167468  chr9  NT_008470.19  21691286-21858754    3.83 % in   24 repeats    0.00 % in 0 genes
13   158168  chr6  NT_167244.1  2882674-3040842    7.49 % in   59 repeats    0.00 % in 0 genes
14   139245  chr6  NT_167246.1  3238292-3377537    8.37 % in   60 repeats    0.00 % in 0 genes
15   127581  chr1  NT_077389.3  263997-391578    99.17 % in   57 repeats    0.00 % in 0 genes
16   126463  chrX  NT_011786.16  4272661-4399124    13.10 % in   75 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
506895  chr15  NT_037852.6  1391725-1898620    35  23       L1MDa (5)  L2a (3)  MER44C (2) 
418253  chr6  NT_167244.1  2354114-2772367    37  22       L4 (3)  L1ME4a (3)  AluY (3) 
265935  chrY  NT_011875.12  8413146-8679081    25  7       LTR12B (15)  L1PA7 (3)  LTR12D (2) 
264239  chr6  NT_167244.1  2005363-2269602    51  29       AluSx (7)  MIR (5)  MIRb (3) 
245332  chr6  NT_167244.1  3172337-3417669    58  26       AluSx (10)  L1MC5 (4)  L1MB3 (4) 
211309  chr6  NT_167244.1  4388443-4599752    12  10       MER57-int (2)  AluSx (2)  (TTCC)n (1) 
188467  chr6  NT_167248.1  504755-693222    27  21       AT_rich (4)  MER4D (2)  L1PA14 (2) 
186114  chr6  NT_167244.1  3785464-3971578    13  11       AT_rich (2)  AluJb (2)  MLT1H-int (1) 
179789  chr6  NT_167249.1  2131494-2311283    46  23       Charlie2b (6)  AluSx (6)  L1MB8 (3) 
10  178097  chr6  NT_167247.1  4416556-4594653    7       MIRb (1)  MIR (1)  MER11A (1) 
11  173009  chr6  NT_167247.1  1562903-1735912    19  14       MSTB (2)  MIR (2)  L1PB2 (2) 
12  167468  chr9  NT_008470.19  21691286-21858754    24  19       MIRb (2)  LTR67B (2)  L2 (2) 
13  158168  chr6  NT_167244.1  2882674-3040842    59  25       AluY (7)  L1MC5 (6)  AluSx (5) 
14  139245  chr6  NT_167246.1  3238292-3377537    60  32       AluSx (13)  MIRb (3)  MIR (3) 
15  127581  chr1  NT_077389.3  263997-391578    57  5       ALR/Alpha (52)  MLT1J (2)  L2 (1) 
16  126463  chrX  NT_011786.16  4272661-4399124    75  23       AluSx (15)  MER33 (14)  AluSc (13) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   506895       chr15  NT_037852.6  1391725-1898620    LOC100418897 
4   264239       chr6  NT_167244.1  2005363-2269602    LOC100294090  hypothetical_LOC100294090,_transcript_variant_1
FLOT1  flotillin-1
DDR1  epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
MUC21  mucin-21_precursor
5   245332       chr6  NT_167244.1  3172337-3417669    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
7   188467       chr6  NT_167248.1  504755-693222    OR12D1P 
8   186114       chr6  NT_167244.1  3785464-3971578    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor



Posfai@neb.com
May 11, 2011