Distribution of restriction sites in the human genome

Enzyme:  NmeBI
Specificity:  GACGC
Number of sites:  560542
Mean distance between sites:  5104 base pairs
Standard deviation:  7510 base pairs
Site density:  195.9 per megabase
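
The summary figures above are related by simple arithmetic, which the following minimal Python sketch illustrates. It is not the code that produced this report. The function names and the toy sequence are hypothetical, and two points are assumptions: that a site is counted on either strand (the recognition sequence or its reverse complement), and that the standard deviation is the population form.

```python
from statistics import mean, pstdev

SITE = "GACGC"  # recognition sequence listed in the summary


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def find_sites(seq, site=SITE):
    """Return 0-based start positions where the site occurs on either strand.

    Assumption: a match to the reverse complement on the given strand
    counts as a site on the opposite strand.
    """
    seq = seq.upper()
    patterns = {site, reverse_complement(site)}
    k = len(site)
    return [i for i in range(len(seq) - k + 1) if seq[i:i + k] in patterns]


def spacing_summary(positions):
    """Mean and population standard deviation of distances between neighbours."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return {"n_sites": len(positions), "mean": mean(gaps), "std": pstdev(gaps)}


def density_per_megabase(mean_distance_bp):
    """Site density implied by a mean spacing given in base pairs."""
    return 1_000_000 / mean_distance_bp


# Toy sequence (hypothetical), not genomic data.
toy = "AAGACGCTTTGCGTCAAGACGCAA"
positions = find_sites(toy)
print(positions, spacing_summary(positions))

# Consistency check on the reported figures: a 5104 bp mean spacing
# corresponds to about 195.9 sites per megabase.
print(round(density_per_megabase(5104), 1))
```

The last line reproduces the reported site density from the reported mean distance, so the two figures in the summary agree with each other.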


Figure: Distribution of closely spaced sites

Figure: Distribution of sites within a distance of 7 standard deviations (STD)


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   490469  chr15  NT_037852.6  1396654-1887123    0.20 % in   8 repeats    0.00 % in 0 genes
2   409436  chr6  NT_167244.1  2351817-2761253    1.41 % in   28 repeats    0.00 % in 0 genes
3   260907  chr6  NT_167244.1  2001442-2262349    3.79 % in   45 repeats    4.20 % in 3 genes
4   217458  chr6  NT_167244.1  4385100-4602558    3.37 % in   20 repeats    0.00 % in 0 genes
5   216295  chr3  NT_022517.18  35207366-35423661    52.80 % in   362 repeats    2.20 % in 1 gene
6   215083  chr6  NT_167248.1  503586-718669    15.93 % in   68 repeats    2.00 % in 3 genes
7   192442  chr6  NT_167244.1  3171648-3364090    4.53 % in   49 repeats    8.57 % in 2 genes
8   191933  chr6  NT_167247.1  1562770-1754703    9.12 % in   59 repeats    4.32 % in 2 genes
9   186621  chr6  NT_167244.1  3788050-3974671    2.25 % in   20 repeats    0.00 % in 0 genes
10   186202  chr6  NT_167247.1  4410771-4596973    2.68 % in   18 repeats    0.00 % in 0 genes
11   179741  chr6  NT_167249.1  2130063-2309804    5.85 % in   48 repeats    0.00 % in 0 genes
12   166672  chr9  NT_008470.19  21692378-21859050    3.47 % in   22 repeats    0.00 % in 0 genes
13   152399  chr6  NT_167244.1  2889921-3042320    4.13 % in   33 repeats    0.00 % in 0 genes
14   150743  chr6  NT_167244.1  568119-718862    15.11 % in   77 repeats    0.00 % in 0 genes
15   137475  chr6  NT_167244.1  3482361-3619836    10.51 % in   54 repeats    0.00 % in 0 genes
16   134029  chr6  NT_007592.15  55214620-55348649    37.92 % in   220 repeats    0.00 % in 0 genes
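
In every row of this table the reported length equals the difference between the end and start coordinates. The sketch below checks that for three rows copied from the table. The regular expression and the `parse_segment` helper are hypothetical, and they assume the whitespace-separated row layout shown here.

```python
import re

# Assumed row layout: rank, length, chromosome, scaffold, start-end, free text.
ROW = re.compile(r"^\s*(\d+)\s+(\d+)\s+(chr\w+)\s+(\S+)\s+(\d+)-(\d+)")


def parse_segment(line):
    """Parse the leading fields of one 'longest uncut segments' row."""
    match = ROW.match(line)
    if match is None:
        raise ValueError(f"unrecognized row: {line!r}")
    rank, length, chrom, scaffold, start, end = match.groups()
    return {
        "rank": int(rank),
        "length": int(length),
        "chrom": chrom,
        "scaffold": scaffold,
        "start": int(start),
        "end": int(end),
    }


# Rows 1, 2 and 16, copied from the table.
rows = [
    "1   490469  chr15  NT_037852.6  1396654-1887123    0.20 % in   8 repeats    0.00 % in 0 genes",
    "2   409436  chr6  NT_167244.1  2351817-2761253    1.41 % in   28 repeats    0.00 % in 0 genes",
    "16   134029  chr6  NT_007592.15  55214620-55348649    37.92 % in   220 repeats    0.00 % in 0 genes",
]

segments = [parse_segment(row) for row in rows]
for seg in segments:
    # True when the reported length matches end - start.
    print(seg["rank"], seg["end"] - seg["start"] == seg["length"])
```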


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most  Second  Third
1  490469  chr15  NT_037852.6  1396654-1887123    8       MLT1L (1)  MIRc (1)  MIRb (1)
2  409436  chr6  NT_167244.1  2351817-2761253    28  19       AluJb (4)  L1ME4a (3)  MLT2D (2)
3  260907  chr6  NT_167244.1  2001442-2262349    45  28       AluSx (5)  MIRb (3)  L1MEe (3)
4  217458  chr6  NT_167244.1  4385100-4602558    20  13       MER57-int (3)  AluSx (3)  HERVH-int (2)
5  216295  chr3  NT_022517.18  35207366-35423661    362  129       AT_rich (33)  MIRb (16)  MIRc (13)
6  215083  chr6  NT_167248.1  503586-718669    68  50       AT_rich (7)  L2c (3)  L2b (3)
7  192442  chr6  NT_167244.1  3171648-3364090    49  24       AluSx (9)  L1MC5 (4)  L1MB3 (4)
8  191933  chr6  NT_167247.1  1562770-1754703    59  35       L1MEf (6)  MER21-int (4)  L1PB2 (4)
9  186621  chr6  NT_167244.1  3788050-3974671    20  17       MLT1H-int (2)  L2a (2)  AT_rich (2)
10  186202  chr6  NT_167247.1  4410771-4596973    18  15       L2b (3)  MIRb (2)  (TTAAA)n (1) 
11  179741  chr6  NT_167249.1  2130063-2309804    48  23       Charlie2b (6)  AluSx (6)  MamGypLTR1b (3) 
12  166672  chr9  NT_008470.19  21692378-21859050    22  18       MIRb (2)  L2 (2)  HAL1 (2) 
13  152399  chr6  NT_167244.1  2889921-3042320    33  16       L1MC5 (6)  AluY (5)  AluSc (3) 
14  150743  chr6  NT_167244.1  568119-718862    77  50       AT_rich (6)  L2b (5)  MIRc (3) 
15  137475  chr6  NT_167244.1  3482361-3619836    54  26       AluSx (8)  AluSg (6)  L1M2 (4) 
16  134029  chr6  NT_007592.15  55214620-55348649    220  88       AT_rich (28)  L2c (11)  L1M5 (11) 


Genes in longest uncut segments
Segment  Length (bp)  Chr   Scaffold      Coordinates        Gene symbol   Gene function
3        260907       chr6  NT_167244.1   2001442-2262349    LOC100294090  hypothetical_LOC100294090,_transcript_variant_1
                                                             FLOT1         flotillin-1
                                                             DDR1          epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
5        216295       chr3  NT_022517.18  35207366-35423661  KRT8P18
6        215083       chr6  NT_167248.1   503586-718669      OR12D1P
                                                             OR11A1        olfactory_receptor_11A1
                                                             OR10C1        olfactory_receptor_10C1
7        192442       chr6  NT_167244.1   3171648-3364090    EHMT2         histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
                                                             TNXB          tenascin-X_isoform_1_precursor
8        191933       chr6  NT_167247.1   1562770-1754703    LOC100421582  tripartite_motif-containing_protein_26
                                                             LOC100507720



Posfai@neb.com
May 11, 2011