Distribution of restriction sites in the human genome

Enzyme:  BciVI
Specificity:  GTATCC
Number of sites:  684929
Mean distance between sites:  4177 base pairs
Standard deviation:  4407 base pairs
Site density:  239.4 per megabase
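
The summary figures above are related by simple arithmetic: the site density per megabase is 1,000,000 divided by the mean distance between sites. The sketch below shows one way such numbers could be computed for an arbitrary sequence. It is not the method used to produce this report. It assumes that sites are counted on both strands (GTATCC is not palindromic, so its reverse complement GGATAC is searched as well) and that the standard deviation is the population form; the function names are hypothetical.

```python
from statistics import mean, pstdev

SITE = "GTATCC"  # BciVI recognition sequence, as listed above


def reverse_complement(seq):
    """Reverse complement of an uppercase A/C/G/T string."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq))


def find_sites(sequence, site=SITE):
    """Sorted 0-based start positions of `site` on either strand (assumption)."""
    sequence = sequence.upper()
    positions = []
    # A set avoids double counting when a site is its own reverse complement.
    for pattern in {site, reverse_complement(site)}:
        start = sequence.find(pattern)
        while start != -1:
            positions.append(start)
            start = sequence.find(pattern, start + 1)
    return sorted(positions)


def spacing_summary(positions):
    """Site count plus mean and population std of distances between neighbours."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    if not gaps:  # fewer than two sites: no distances to summarise
        return {"sites": len(positions), "mean": None, "std": None}
    return {"sites": len(positions), "mean": mean(gaps), "std": pstdev(gaps)}


def density_per_megabase(mean_distance_bp):
    """Sites per 1,000,000 bp implied by a mean inter-site distance."""
    return 1_000_000 / mean_distance_bp
```

For example, `density_per_megabase(4177)` rounds to 239.4, which matches the density reported above.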


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   494760  chr15  NT_037852.6  1393524-1888284    0.71 % in   16 repeats    0.00 % in 0 genes
2   412769  chr6  NT_167244.1  2353233-2766002    1.56 % in   27 repeats    0.00 % in 0 genes
3   212802  chr6  NT_167244.1  4388578-4601380    1.80 % in   14 repeats    0.00 % in 0 genes
4   193967  chr9  NT_008470.19  21676393-21870360    8.76 % in   66 repeats    0.00 % in 0 genes
5   192421  chr6  NT_167244.1  3165185-3357606    4.15 % in   50 repeats    8.56 % in 2 genes
6   185515  chr6  NT_167244.1  3786806-3972321    1.23 % in   12 repeats    0.37 % in 1 genes
7   178793  chr6  NT_167247.1  4420866-4599659    2.14 % in   19 repeats    100.00 % in 1 genes
8   167393  chr6  NT_167249.1  2137593-2304986    1.07 % in   9 repeats    0.00 % in 0 genes
9   166916  chr6  NT_167244.1  1998694-2165610    2.10 % in   19 repeats    0.00 % in 0 genes
10   165408  chr6  NT_167247.1  1562688-1728096    0.41 % in   4 repeats    0.00 % in 0 genes
11   165150  chr6  NT_167244.1  2876379-3041529    10.11 % in   83 repeats    0.00 % in 0 genes
12   161140  chr6  NT_167248.1  521234-682374    1.14 % in   2 repeats    0.00 % in 0 genes
13   154432  chr6  NT_167247.1  1150044-1304476    14.94 % in   62 repeats    0.00 % in 0 genes
14   139783  chr1  NT_167185.1  1492000-1631783    6.02 % in   42 repeats    0.00 % in 0 genes
15   126697  chr6  NT_167245.1  126020-252717    9.29 % in   35 repeats    0.00 % in 0 genes
16   125612  chr6  NT_167245.1  2600557-2726169    5.01 % in   16 repeats    0.00 % in 0 genes
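
In every row of the table above, the segment length equals the difference between the end and start coordinates (for segment 1, 1888284 - 1393524 = 494760). The following sketch parses one row and makes that check possible. The regular expression and field names are assumptions based on the row layout shown here, not a documented file format.

```python
import re

# Pattern for one data row of the "Longest uncut segments" table.
ROW = re.compile(
    r"^\s*(\d+)\s+(\d+)\s+(chr\w+)\s+(\S+)\s+(\d+)-(\d+)\s+"
    r"([\d.]+) % in\s+(\d+) repeats\s+([\d.]+) % in\s+(\d+) genes\s*$"
)


def parse_segment(line):
    """Parse one table row into a dict; raise ValueError if it does not match."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"unrecognised row: {line!r}")
    rank, length, chrom, scaffold, start, end, rep_pct, rep_n, gene_pct, gene_n = (
        match.groups()
    )
    return {
        "rank": int(rank),
        "length": int(length),
        "chr": chrom,
        "scaffold": scaffold,
        "start": int(start),
        "end": int(end),
        "repeat_pct": float(rep_pct),
        "repeats": int(rep_n),
        "gene_pct": float(gene_pct),
        "genes": int(gene_n),
    }


row = (
    "1   494760  chr15  NT_037852.6  1393524-1888284    "
    "0.71 % in   16 repeats    0.00 % in 0 genes"
)
segment = parse_segment(row)
# The reported length should equal end minus start.
length_matches = segment["end"] - segment["start"] == segment["length"]
```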


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1   494760  chr15  NT_037852.6  1393524-1888284    16  13       L1MDa (3)  L2a (2)  (TA)n (1)
2   412769  chr6  NT_167244.1  2353233-2766002    27  18       L1ME4a (3)  AluJb (3)  MLT2D (2)
3   212802  chr6  NT_167244.1  4388578-4601380    14  11       MER57-int (2)  AluSx (2)  AluSg/x (2)
4   193967  chr9  NT_008470.19  21676393-21870360    66  44       MLT1G1 (3)  MIRb (3)  L2 (3)
5   192421  chr6  NT_167244.1  3165185-3357606    50  27       L1MC5 (6)  AluSx (6)  L1MB3 (4)
6   185515  chr6  NT_167244.1  3786806-3972321    12  10       MLT1H-int (2)  AT_rich (2)  MLT1H (1)
7   178793  chr6  NT_167247.1  4420866-4599659    19  15       AluSx (3)  MLT1J (2)  L1MC5 (2)
8   167393  chr6  NT_167249.1  2137593-2304986    9  5       L1MB8 (3)  AluSx (3)  L1MC4a (1)
9   166916  chr6  NT_167244.1  1998694-2165610    19  14       AluSx (4)  FRAM (2)  AluJb (2)
10  165408  chr6  NT_167247.1  1562688-1728096    4  3       MIR (2)  (GGAA)n (1)  AluSq (1)
11  165150  chr6  NT_167244.1  2876379-3041529    83  27       AluJo (11)  AluY (8)  AluSx (8)
12  161140  chr6  NT_167248.1  521234-682374    2  2       L1PREC2 (1)  HERVH-int (1)
13  154432  chr6  NT_167247.1  1150044-1304476    62  40       ERV3-16A3_I-int (6)  Charlie9 (5)  L2 (4)
14  139783  chr1  NT_167185.1  1492000-1631783    42  17       GA-rich (5)  MIRc (4)  (GA)n (4)
15  126697  chr6  NT_167245.1  126020-252717    35  27       L1MEg (3)  AluSx (3)  (TTTC)n (2)
16  125612  chr6  NT_167245.1  2600557-2726169    16  13       MLT1N2 (2)  MER21C (2)  L2 (2)


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
5   192421       chr6  NT_167244.1  3165185-3357606    EHMT2  histone-lysine N-methyltransferase, H3 lysine-9 specific 3 isoform b
5   192421       chr6  NT_167244.1  3165185-3357606    TNXB  tenascin-X isoform 1 precursor
6   185515       chr6  NT_167244.1  3786806-3972321    HLA-DRB3  major histocompatibility complex, class II, DR beta 3 precursor
7   178793       chr6  NT_167247.1  4420866-4599659    LOC100507722  hypothetical protein LOC100507722



Posfai@neb.com
May 11, 2011