Distribution of restriction sites in the human genome

Enzyme:  AmaCSI
Specificity:  GCTCCA
Number of sites:  1348275
Mean distance between sites:  2122 base pairs
Standard deviation:  2546 base pairs
Site density:  471.2 per megabase
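
The summary figures can be cross-checked against each other. The sketch below is illustrative only and assumes the site density is simply the reciprocal of the mean inter-site distance, scaled to one megabase; the function names are hypothetical and not part of the original report. With the rounded mean of 2122 bp, the implied density agrees with the reported 471.2 per megabase to within rounding, and sites times mean distance gives a rough figure for the total sequence scanned.

```python
# Values copied from the summary block of the report.
NUM_SITES = 1_348_275
MEAN_DISTANCE_BP = 2122
REPORTED_DENSITY_PER_MB = 471.2


def density_per_megabase(mean_distance_bp: float) -> float:
    """Sites per 1,000,000 bp implied by a mean inter-site distance (assumed relation)."""
    return 1_000_000 / mean_distance_bp


def approx_span_bp(num_sites: int, mean_distance_bp: int) -> int:
    """Rough total sequence length, treating the number of intervals as the number of sites."""
    return num_sites * mean_distance_bp


if __name__ == "__main__":
    implied = density_per_megabase(MEAN_DISTANCE_BP)
    print(f"implied density: {implied:.2f} per Mb (reported {REPORTED_DENSITY_PER_MB})")
    print(f"approximate span: {approx_span_bp(NUM_SITES, MEAN_DISTANCE_BP) / 1e9:.2f} Gb")
```

The small difference between the implied and reported density is expected, since the mean distance is itself rounded to a whole base pair.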


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   491187  chr15  NT_037852.6  1395482-1886669    0.23 % in   7 repeats    0.00 % in 0 genes
2   404865  chr6  NT_167244.1  2358096-2762961    0.42 % in   8 repeats    0.00 % in 0 genes
3   214244  chr6  NT_167244.1  4389555-4603799    2.46 % in   15 repeats    0.00 % in 0 genes
4   184409  chr6  NT_167244.1  3787420-3971829    1.10 % in   10 repeats    0.04 % in 1 genes
5   176974  chr6  NT_167244.1  3179290-3356264    0.26 % in   6 repeats    0.57 % in 2 genes
6   173584  chr6  NT_167247.1  4421787-4595371    0.64 % in   2 repeats    100.00 % in 1 genes
7   167382  chr6  NT_167249.1  2137883-2305265    1.06 % in   9 repeats    0.00 % in 0 genes
8   166451  chr6  NT_167247.1  1562611-1729062    0.54 % in   6 repeats    0.20 % in 1 genes
9   161259  chr6  NT_167248.1  520069-681328    1.21 % in   2 repeats    0.00 % in 0 genes
10   159555  chr6  NT_167244.1  2009420-2168975    0.79 % in   6 repeats    0.00 % in 0 genes
11   151831  chr6  NT_167244.1  2893726-3045557    2.26 % in   21 repeats    0.00 % in 0 genes
12   150903  chr9  NT_008470.19  21692461-21843364    0.44 % in   2 repeats    0.00 % in 0 genes
13   118812  chr6  NT_167245.1  2605936-2724748    1.07 % in   4 repeats    0.00 % in 0 genes
14   116919  chr6  NT_167246.1  3258580-3375499    0.56 % in   4 repeats    0.00 % in 0 genes
15   115925  chr6  NT_167247.1  1176628-1292553    0.89 % in   1 repeats    0.00 % in 0 genes
16   113934  chr7  NT_007933.15  68177579-68291513    4.58 % in   16 repeats    0.00 % in 0 genes
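
For readers who want to process this table programmatically, here is a minimal parsing sketch. It assumes the row layout shown above (rank, length, chromosome, scaffold, start-end, repeat content, gene content); the `Segment` type, the `ROW` pattern and `parse_segment` are hypothetical names introduced for illustration. In the sample rows, the Length column equals the end coordinate minus the start coordinate, which the sketch checks.

```python
import re
from typing import NamedTuple


class Segment(NamedTuple):
    rank: int
    length: int
    chrom: str
    scaffold: str
    start: int
    end: int
    repeat_pct: float
    n_repeats: int
    gene_pct: float
    n_genes: int


# Pattern for one data row of the "Longest uncut segments" table (assumed layout).
ROW = re.compile(
    r"^\s*(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>chr\w+)\s+(?P<scaffold>\S+)\s+"
    r"(?P<start>\d+)-(?P<end>\d+)\s+"
    r"(?P<repeat_pct>[\d.]+) % in\s+(?P<n_repeats>\d+) repeats\s+"
    r"(?P<gene_pct>[\d.]+) % in\s+(?P<n_genes>\d+) genes"
)


def parse_segment(line: str) -> Segment:
    """Parse one table row into a Segment; raise ValueError if the row does not match."""
    m = ROW.match(line)
    if m is None:
        raise ValueError(f"unrecognized row: {line!r}")
    g = m.groupdict()
    return Segment(
        int(g["rank"]), int(g["length"]), g["chrom"], g["scaffold"],
        int(g["start"]), int(g["end"]),
        float(g["repeat_pct"]), int(g["n_repeats"]),
        float(g["gene_pct"]), int(g["n_genes"]),
    )


# Three rows copied verbatim from the table.
SAMPLE_ROWS = [
    "1   491187  chr15  NT_037852.6  1395482-1886669    0.23 % in   7 repeats    0.00 % in 0 genes",
    "2   404865  chr6  NT_167244.1  2358096-2762961    0.42 % in   8 repeats    0.00 % in 0 genes",
    "12   150903  chr9  NT_008470.19  21692461-21843364    0.44 % in   2 repeats    0.00 % in 0 genes",
]

segments = [parse_segment(row) for row in SAMPLE_ROWS]

if __name__ == "__main__":
    for seg in segments:
        # Length is the difference of the scaffold coordinates in these rows.
        print(seg.rank, seg.chrom, seg.length, seg.end - seg.start == seg.length)
```

Named groups keep the pattern readable, and the same function can be applied to every row of the table without change.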


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1  491187  chr15  NT_037852.6  1395482-1886669  7  7  MIRc (1)  MIRb (1)  L1M3 (1)
2  404865  chr6  NT_167244.1  2358096-2762961  8  6  L4 (2)  AluJb (2)  L1MEg (1)
3  214244  chr6  NT_167244.1  4389555-4603799  15  11  HERVH-int (3)  AluSx (2)  AluSg/x (2)
4  184409  chr6  NT_167244.1  3787420-3971829  10  9  AT_rich (2)  MLT1H-int (1)  MIR (1)
5  176974  chr6  NT_167244.1  3179290-3356264  6  4  GC_rich (3)  Charlie4a (1)  (CCG)n (1)
6  173584  chr6  NT_167247.1  4421787-4595371  2  2  MER11A (1)  AluSc (1)
7  167382  chr6  NT_167249.1  2137883-2305265  9  5  L1MB8 (3)  AluSx (3)  L1MC4a (1)
8  166451  chr6  NT_167247.1  1562611-1729062  6  4  MIR (2)  L1MEe (2)  (GGAA)n (1)
9  161259  chr6  NT_167248.1  520069-681328  2  2  L1PREC2 (1)  HERVH-int (1)
10  159555  chr6  NT_167244.1  2009420-2168975  6  6  MIR (1)  MER5A1 (1)  L2 (1)
11  151831  chr6  NT_167244.1  2893726-3045557  21  10  L1MC5 (6)  L2c (3)  AluSc (3)
12  150903  chr9  NT_008470.19  21692461-21843364  2  2  LTR67B (1)  L1M5 (1)
13  118812  chr6  NT_167245.1  2605936-2724748  4  3  L2 (2)  MLT1E2 (1)  L2a (1)
14  116919  chr6  NT_167246.1  3258580-3375499  4  3  MIRb (2)  MIR3 (1)  AluSx (1)
15  115925  chr6  NT_167247.1  1176628-1292553  1  1  ERV3-16A3_I-int (1)
16  113934  chr7  NT_007933.15  68177579-68291513  16  12  L1PB1 (3)  L1MB7 (2)  AluSx (2)


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
4   184409       chr6  NT_167244.1  3787420-3971829    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor
5   176974       chr6  NT_167244.1  3179290-3356264    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
5   176974       chr6  NT_167244.1  3179290-3356264    TNXB  tenascin-X_isoform_1_precursor
6   173584       chr6  NT_167247.1  4421787-4595371    LOC100507722  hypothetical_protein_LOC100507722
8   166451       chr6  NT_167247.1  1562611-1729062    LOC100421582  tripartite_motif-containing_protein_26



Posfai@neb.com
May 11, 2011