Distribution of restriction sites in the human genome

Enzyme:  BscGI               Longest uncut segments
Specificity:  CCCGT               Repeats in uncut segments
Number of sites:  1071903               Genes in uncut segments
Mean distance between sites:  2669 base pairs
Standard deviation:  3657 base pairs
Site density 374.6 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   491874  chr15  NT_037852.6  1397113-1888987    0.44 % in   12 repeats    0.00 % in 0 genes
2   404494  chr6  NT_167244.1  2359861-2764355    0.12 % in   3 repeats    0.00 % in 0 genes
3   213526  chr6  NT_167244.1  4388626-4602152    2.14 % in   15 repeats    0.00 % in 0 genes
4   191889  chr6  NT_167244.1  3781918-3973807    3.08 % in   26 repeats    2.90 % in 1 genes
5   179851  chr6  NT_167249.1  2129788-2309639    5.83 % in   49 repeats    0.00 % in 0 genes
6   178205  chr6  NT_167247.1  1562816-1741021    5.24 % in   32 repeats    0.08 % in 1 genes
7   176367  chr6  NT_167247.1  4421613-4597980    1.50 % in   9 repeats    100.00 % in 1 genes
8   175255  chr6  NT_167244.1  3180106-3355361    0.08 % in   3 repeats    0.07 % in 1 genes
9   163063  chr6  NT_167248.1  519842-682905    2.30 % in   2 repeats    0.00 % in 0 genes
10   156720  chr6  NT_167244.1  2008385-2165105    0.48 % in   4 repeats    0.00 % in 0 genes
11   152769  chr9  NT_008470.19  21690929-21843698    1.04 % in   7 repeats    0.00 % in 0 genes
12   146562  chr6  NT_167244.1  2891349-3037911    1.85 % in   13 repeats    0.00 % in 0 genes
13   124127  chr6  NT_167245.1  2604008-2728135    3.73 % in   14 repeats    0.00 % in 0 genes
14   116502  chr6  NT_167247.1  1175834-1292336    1.57 % in   2 repeats    0.00 % in 0 genes
15   114081  chr6  NT_167246.1  3261159-3375240    0.41 % in   3 repeats    0.00 % in 0 genes
16   111415  chr6  NT_167247.1  2704981-2816396    8.91 % in   30 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
491874  chr15  NT_037852.6  1397113-1888987    12  10       L2a (3)  (TA)n (1)  MLT1L (1) 
404494  chr6  NT_167244.1  2359861-2764355    3       L1MEg (1)  AluY (1)  AluSp (1) 
213526  chr6  NT_167244.1  4388626-4602152    15  11       MER57-int (2)  HERVH-int (2)  AluSx (2) 
191889  chr6  NT_167244.1  3781918-3973807    26  18       L2a (5)  MLT1H-int (2)  AT_rich (2) 
179851  chr6  NT_167249.1  2129788-2309639    49  24       Charlie2b (6)  AluSx (6)  MamGypLTR1b (3) 
178205  chr6  NT_167247.1  1562816-1741021    32  20       L1PB2 (4)  L1MEf (3)  MSTB (2) 
176367  chr6  NT_167247.1  4421613-4597980    7       MLT1J (2)  AluSx (2)  (TTAAA)n (1) 
175255  chr6  NT_167244.1  3180106-3355361    3       GC_rich (1)  (CCG)n (1)  AluSp (1) 
163063  chr6  NT_167248.1  519842-682905    2       L1PREC2 (1)  HERVH-int (1) 
10  156720  chr6  NT_167244.1  2008385-2165105    4       MIRb (1)  MIR (1)  AluY (1) 
11  152769  chr9  NT_008470.19  21690929-21843698    6       LTR67B (2)  MSTA (1)  MIR3 (1) 
12  146562  chr6  NT_167244.1  2891349-3037911    13  10       AluY (3)  AluJo (2)  (TG)n (1) 
13  124127  chr6  NT_167245.1  2604008-2728135    14  12       Tigger1 (2)  L2 (2)  MLT2A2 (1) 
14  116502  chr6  NT_167247.1  1175834-1292336    1       ERV3-16A3_I-int (2) 
15  114081  chr6  NT_167246.1  3261159-3375240    2       MIRb (2)  AluSx (1) 
16  111415  chr6  NT_167247.1  2704981-2816396    30  20       L1MEf (5)  ERV3-16A3_I-int (4)  HAL1 (3) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   191889       chr6  NT_167244.1  3781918-3973807    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
6   178205       chr6  NT_167247.1  1562816-1741021    LOC100421582  tripartite_motif-containing_protein_26
7   176367       chr6  NT_167247.1  4421613-4597980    LOC100507722  hypothetical_protein_LOC100507722
8   175255       chr6  NT_167244.1  3180106-3355361    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b



Posfai@neb.com
May 11, 2011