Distribution of restriction sites in the human genome

Enzyme:  BspHI               Longest uncut segments
Specificity:  TCATGA               Repeats in uncut segments
Number of sites:  968443               Genes in uncut segments
Mean distance between sites:  2954 base pairs
Standard deviation:  3161 base pairs
Site density 338.5 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   494987  chr15  NT_037852.6  1393184-1888171    0.75 % in   16 repeats    0.00 % in 0 genes
2   404213  chr6  NT_167244.1  2358104-2762317    0.37 % in   7 repeats    0.00 % in 0 genes
3   254652  chr6  NT_167244.1  2002968-2257620    2.56 % in   29 repeats    3.71 % in 3 genes
4   208932  chr6  NT_167244.1  4388976-4597908    0.51 % in   6 repeats    0.00 % in 0 genes
5   187630  chr6  NT_167244.1  3783374-3971004    1.92 % in   15 repeats    2.19 % in 1 genes
6   184628  chr6  NT_167244.1  3174160-3358788    2.71 % in   32 repeats    4.70 % in 2 genes
7   180091  chr6  NT_167247.1  4416822-4596913    1.45 % in   11 repeats    99.84 % in 1 genes
8   167141  chr6  NT_167249.1  2138115-2305256    0.99 % in   8 repeats    0.00 % in 0 genes
9   165141  chr6  NT_167247.1  1561950-1727091    0.16 % in   2 repeats    0.00 % in 0 genes
10   163490  chr6  NT_167248.1  520727-684217    2.56 % in   2 repeats    0.00 % in 0 genes
11   161313  chr7  NT_023603.5  33285-194598    100.00 % in   3 repeats    0.00 % in 0 genes
12   154391  chr9  NT_008470.19  21691331-21845722    1.27 % in   8 repeats    0.00 % in 0 genes
13   148228  chr6  NT_167244.1  2894629-3042857    1.97 % in   18 repeats    0.00 % in 0 genes
14   134475  chr6  NT_167245.1  127203-261678    10.20 % in   54 repeats    0.00 % in 0 genes
15   127961  chr6  NT_167245.1  2597361-2725322    6.96 % in   24 repeats    0.00 % in 0 genes
16   126314  chr1  NT_004350.19  2049154-2175468    3.62 % in   16 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
494987  chr15  NT_037852.6  1393184-1888171    16  13       L1MDa (3)  L2a (2)  (TA)n (1) 
404213  chr6  NT_167244.1  2358104-2762317    5       L4 (2)  AluJb (2)  L1ME4a (1) 
254652  chr6  NT_167244.1  2002968-2257620    29  20       AluSx (5)  AluJb (3)  MIRb (2) 
208932  chr6  NT_167244.1  4388976-4597908    5       MER57-int (2)  (TTCC)n (1)  AluY (1) 
187630  chr6  NT_167244.1  3783374-3971004    15  10       L2a (3)  AT_rich (2)  AluSc (2) 
184628  chr6  NT_167244.1  3174160-3358788    32  18       AluSx (6)  L1MB3 (4)  GC_rich (3) 
180091  chr6  NT_167247.1  4416822-4596913    11  11       (TTAAA)n (1)  MLT1J (1)  MIRb (1) 
167141  chr6  NT_167249.1  2138115-2305256    4       L1MB8 (3)  AluSx (3)  Charlie2b (1) 
165141  chr6  NT_167247.1  1561950-1727091    2       L1MC3 (1)  A-rich (1) 
10  163490  chr6  NT_167248.1  520727-684217    2       L1PREC2 (1)  HERVH-int (1) 
11  161313  chr7  NT_023603.5  33285-194598    2       L1PA2 (2)  ALR/Alpha (1) 
12  154391  chr9  NT_008470.19  21691331-21845722    6       LTR67B (2)  L2 (2)  MSTA (1) 
13  148228  chr6  NT_167244.1  2894629-3042857    18  9       L1MC5 (6)  L2c (2)  AluY (2) 
14  134475  chr6  NT_167245.1  127203-261678    54  37       AluY (5)  AluSx (5)  L2c (3) 
15  127961  chr6  NT_167245.1  2597361-2725322    24  19       MER21-int (3)  MLT1N2 (2)  MER21C (2) 
16  126314  chr1  NT_004350.19  2049154-2175468    16  10       L1MEf (4)  L1MB3 (3)  AluSg (2) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
3   254652       chr6  NT_167244.1  2002968-2257620    LOC100294090  hypothetical_LOC100294090,_transcript_variant_1
FLOT1  flotillin-1
DDR1  epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
5   187630       chr6  NT_167244.1  3783374-3971004    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
6   184628       chr6  NT_167244.1  3174160-3358788    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
7   180091       chr6  NT_167247.1  4416822-4596913    LOC100507722  hypothetical_protein_LOC100507722



Posfai@neb.com
May 11, 2011