Distribution of restriction sites in the human genome

Enzyme:  ScaI               Longest uncut segments
Specificity:  AGTACT               Repeats in uncut segments
Number of sites:  539427               Genes in uncut segments
Mean distance between sites:  5304 base pairs
Standard deviation:  5918 base pairs
Site density 188.5 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   495420  chr15  NT_037852.6  1389808-1885228    1.27 % in   17 repeats    0.00 % in 0 genes
2   412745  chr6  NT_167244.1  2353828-2766573    1.51 % in   27 repeats    0.00 % in 0 genes
3   214845  chr6  NT_167244.1  4388503-4603348    2.74 % in   15 repeats    0.00 % in 0 genes
4   184792  chr6  NT_167247.1  4421706-4606498    2.59 % in   24 repeats    98.21 % in 2 genes
5   184744  chr6  NT_167244.1  3176319-3361063    2.87 % in   33 repeats    4.76 % in 2 genes
6   182923  chr6  NT_167244.1  3787704-3970627    0.47 % in   7 repeats    0.00 % in 0 genes
7   171958  chr6  NT_167249.1  4720644-4892602    15.31 % in   142 repeats    29.25 % in 6 genes
8   171767  chr6  NT_167249.1  2136520-2308287    2.85 % in   22 repeats    0.00 % in 0 genes
9   169761  chr6  NT_167247.1  1561965-1731726    1.20 % in   11 repeats    0.00 % in 0 genes
10   164216  chr6  NT_167248.1  518625-682841    2.99 % in   2 repeats    0.00 % in 0 genes
11   163742  chr9  NT_008470.19  21680526-21844268    4.78 % in   27 repeats    0.00 % in 0 genes
12   162782  chr4  NT_006316.16  391166-553948    3.45 % in   56 repeats    0.00 % in 0 genes
13   159184  chr6  NT_167244.1  2009501-2168685    0.75 % in   5 repeats    0.00 % in 0 genes
14   147067  chr6  NT_167244.1  2893991-3041058    2.20 % in   19 repeats    0.00 % in 0 genes
15   145620  chr8  NT_008046.16  55594235-55739855    25.22 % in   218 repeats    0.00 % in 0 genes
16   145264  chr6  NT_167245.1  115007-260271    17.19 % in   67 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
495420  chr15  NT_037852.6  1389808-1885228    17  12       L1MDa (6)  MIRc (1)  MIRb (1) 
412745  chr6  NT_167244.1  2353828-2766573    27  18       L1ME4a (3)  AluJb (3)  MLT2D (2) 
214845  chr6  NT_167244.1  4388503-4603348    15  11       MER57-int (2)  HERVH-int (2)  AluSx (2) 
184792  chr6  NT_167247.1  4421706-4606498    24  18       AluSx (3)  MLT1J (2)  L1MC5 (2) 
184744  chr6  NT_167244.1  3176319-3361063    33  18       AluSx (8)  GC_rich (3)  MIRb (2) 
182923  chr6  NT_167244.1  3787704-3970627    6       AT_rich (2)  MLT1H-int (1)  MIR (1) 
171958  chr6  NT_167249.1  4720644-4892602    142  51       AluSx (16)  AluY (12)  L2a (9) 
171767  chr6  NT_167249.1  2136520-2308287    22  11       Charlie2b (6)  AluSx (4)  L1MB8 (3) 
169761  chr6  NT_167247.1  1561965-1731726    11  9       MIR (2)  L1MEe (2)  L1MEf (1) 
10  164216  chr6  NT_167248.1  518625-682841    2       L1PREC2 (1)  HERVH-int (1) 
11  163742  chr9  NT_008470.19  21680526-21844268    27  19       L1M5 (3)  MLT1G1 (2)  MER5B (2) 
12  162782  chr4  NT_006316.16  391166-553948    56  4       (CA)n (47)  L1M4 (7)  MER5B (1) 
13  159184  chr6  NT_167244.1  2009501-2168685    5       MIR (1)  MER5A1 (1)  L1ME3C (1) 
14  147067  chr6  NT_167244.1  2893991-3041058    19  9       L1MC5 (6)  AluSc (3)  L2c (2) 
15  145620  chr8  NT_008046.16  55594235-55739855    218  61       MIRb (34)  MIR (27)  L2b (16) 
16  145264  chr6  NT_167245.1  115007-260271    67  42       AluY (6)  AluSx (5)  L1MB2 (4) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
4   184792       chr6  NT_167247.1  4421706-4606498    LOC100507722  hypothetical_protein_LOC100507722
COL11A2  collagen_alpha-2(XI)_chain_isoform_4_precursor
5   184744       chr6  NT_167244.1  3176319-3361063    EHMT2  histone-lysine_N-methyltransferase,_H3_lysine-9_specific_3_isoform_b
TNXB  tenascin-X_isoform_1_precursor
7   171958       chr6  NT_167249.1  4720644-4892602    RPS18  40S_ribosomal_protein_S18
RPL35AP4 
RPL12P1  kinesin-like_protein_KIFC1
PHF1  PHD_finger_protein_1_isoform_a
CUTA  protein_CutA_isoform_2
SYNGAP1  ras_GTPase-activating_protein_SynGAP



Posfai@neb.com
May 11, 2011