Distribution of restriction sites in the human genome

Enzyme:  ApyPI
Specificity:  ATCGAC
Number of sites:  95027
Mean distance between sites:  30110 base pairs
Standard deviation:  31599 base pairs
Site density:  33.2 per megabase
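
The summary figures above are related: the reported site density matches the reciprocal of the mean spacing (1,000,000 / 30110 ≈ 33.2 sites per megabase). The following Python sketch shows one way such figures could be computed from a sequence. It is an illustration under stated assumptions, not the procedure used to generate this page: the helper names (`revcomp`, `find_sites`, `spacing_summary`) are hypothetical, both strands are searched because ATCGAC is not palindromic, and the population standard deviation is used.

```python
import re
import statistics

def revcomp(seq):
    """Reverse complement of an uppercase DNA string (A, C, G, T only)."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

def find_sites(sequence, motif):
    """Sorted 0-based start positions of the motif on either strand.

    Assumption: a site is counted wherever the motif or its reverse
    complement occurs on the given (top) strand.
    """
    sequence = sequence.upper()
    motif = motif.upper()
    positions = set()
    for m in {motif, revcomp(motif)}:
        # A lookahead lets overlapping occurrences be reported too.
        for match in re.finditer("(?=" + m + ")", sequence):
            positions.add(match.start())
    return sorted(positions)

def spacing_summary(positions):
    """Count, mean spacing, spacing STD and density for sorted positions."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    mean_gap = statistics.mean(gaps)
    return {
        "sites": len(positions),
        "mean": mean_gap,
        "std": statistics.pstdev(gaps),    # population STD (assumption)
        "density_per_mb": 1e6 / mean_gap,  # reciprocal of the mean spacing
    }
```

For a real genome the positions would have to be collected per scaffold, since a distance between sites on different scaffolds is not meaningful.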


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD (standard deviation) distance]


Longest uncut segments
#   Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   511948  chr15  NT_037852.6  1392123-1904071    1.89 % in   45 repeats    1.95 % in 1 gene
2   459915  chr6  NT_167244.1  2328814-2788729    6.70 % in   124 repeats    3.01 % in 2 genes
3   417721  chrY  NT_011875.12  8386531-8804252    82.21 % in   197 repeats    0.34 % in 2 genes
4   400458  chr12  NT_029419.12  8134560-8535018    49.16 % in   824 repeats    62.90 % in 3 genes
5   397052  chr6  NT_167244.1  4204488-4601540    20.09 % in   204 repeats    11.30 % in 6 genes
6   378953  chr6  NT_167247.1  4254355-4633308    21.96 % in   267 repeats    67.64 % in 9 genes
7   361165  chr6  NT_007592.15  56455822-56816987    48.13 % in   691 repeats    93.09 % in 3 genes
8   339995  chr3  NT_005612.16  6785740-7125735    39.68 % in   447 repeats    86.37 % in 4 genes
9   338895  chr15  NT_010194.17  14556648-14895543    40.84 % in   647 repeats    0.00 % in 0 genes
10   329839  chr1  NT_032977.9  56018867-56348706    54.07 % in   595 repeats    0.00 % in 0 genes
11   325622  chr4  NT_016354.19  24146883-24472505    60.99 % in   756 repeats    0.00 % in 0 genes
12   319770  chr3  NT_005612.16  85005975-85325745    62.21 % in   556 repeats    0.00 % in 0 genes
13   313468  chr6  NT_007299.13  24229244-24542712    49.92 % in   592 repeats    0.00 % in 0 genes
14   312945  chr14  NT_026437.12  16267292-16580237    57.45 % in   818 repeats    0.00 % in 0 genes
15   311587  chr3  NT_022517.18  12061899-12373486    45.17 % in   616 repeats    0.00 % in 0 genes
16   311375  chr5  NT_006713.15  39357945-39669320    48.99 % in   450 repeats    0.00 % in 0 genes
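
In the table above, the Length column equals the difference between the two scaffold coordinates (row 1: 1904071 - 1392123 = 511948). The Python sketch below parses one row and checks that relation. The `Segment` type, the `ROW` pattern, and `parse_segment` are hypothetical names introduced for illustration, and the pattern assumes the whitespace-separated layout shown here.

```python
import re
from typing import NamedTuple, Optional

class Segment(NamedTuple):
    rank: int
    length: int
    chrom: str
    scaffold: str
    start: int
    end: int
    repeat_pct: float
    n_repeats: int
    gene_pct: float
    n_genes: int

# Assumed layout: rank, length, chromosome, scaffold, start-end,
# "<pct> % in <n> repeats", "<pct> % in <n> genes".
ROW = re.compile(
    r"^\s*(\d+)\s+(\d+)\s+(chr\w+)\s+(\S+)\s+(\d+)-(\d+)\s+"
    r"([\d.]+)\s*%\s+in\s+(\d+)\s+repeats?\s+"
    r"([\d.]+)\s*%\s+in\s+(\d+)\s+genes?\s*$"
)

def parse_segment(line: str) -> Optional[Segment]:
    """Parse one table row; return None if the line is not a data row."""
    m = ROW.match(line)
    if m is None:
        return None
    rank, length, chrom, scaffold, start, end, rpct, nrep, gpct, ngen = m.groups()
    return Segment(int(rank), int(length), chrom, scaffold, int(start),
                   int(end), float(rpct), int(nrep), float(gpct), int(ngen))

row = ("1   511948  chr15  NT_037852.6  1392123-1904071"
       "    1.89 % in   45 repeats    1.95 % in 1 genes")
seg = parse_segment(row)
# The segment length should match the coordinate span.
print(seg.end - seg.start == seg.length)
```

Header lines and blank lines do not match the pattern, so `parse_segment` returns `None` for them and they can be skipped when reading the whole table.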


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1  511948  chr15  NT_037852.6  1392123-1904071    45  29       AT_rich (5)  L1MDa (4)  (TA)n (3)
2  459915  chr6  NT_167244.1  2328814-2788729    124  57       AluJo (8)  L1MC4a (7)  AluY (7)
3  417721  chrY  NT_011875.12  8386531-8804252    197  75       LTR12B (17)  AT_rich (13)  AluY (11)
4  400458  chr12  NT_029419.12  8134560-8535018    824  194       AT_rich (60)  AluSx (46)  AluJb (46)
5  397052  chr6  NT_167244.1  4204488-4601540    204  95       L2c (10)  L1PB1 (10)  MIR (9)
6  378953  chr6  NT_167247.1  4254355-4633308    267  113       MIR (12)  L2c (12)  AluSx (11)
7  361165  chr6  NT_007592.15  56455822-56816987    691  186       AluSx (50)  AT_rich (45)  MIRb (35)
8  339995  chr3  NT_005612.16  6785740-7125735    447  154       AT_rich (37)  AluSx (21)  MIRb (18)
9  338895  chr15  NT_010194.17  14556648-14895543    647  107       AluSx (84)  AluJo (45)  AluJb (43)
10  329839  chr1  NT_032977.9  56018867-56348706    595  163       AT_rich (39)  AluSx (39)  L2a (34) 
11  325622  chr4  NT_016354.19  24146883-24472505    756  200       AluSx (52)  AT_rich (40)  AluJo (37) 
12  319770  chr3  NT_005612.16  85005975-85325745    556  185       AT_rich (31)  MIRb (22)  L2a (18) 
13  313468  chr6  NT_007299.13  24229244-24542712    592  161       AluSx (67)  AT_rich (34)  AluJb (21) 
14  312945  chr14  NT_026437.12  16267292-16580237    818  145       AluSx (120)  AluJb (55)  AluY (53) 
15  311587  chr3  NT_022517.18  12061899-12373486    616  146       MIRb (32)  MIR3 (32)  L2c (30) 
16  311375  chr5  NT_006713.15  39357945-39669320    450  149       AT_rich (52)  MIR (25)  MIRb (21) 
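
The repeat columns above (total, distinct, and the three most frequent repeat names with their counts) can be derived from a list of repeat annotations for a segment. The following Python sketch assumes such a list is already available as plain names; the function name `summarize_repeats` is hypothetical and the example list is toy data, not taken from this page.

```python
from collections import Counter

def summarize_repeats(names, top_n=3):
    """Total, distinct count, and most frequent repeat names in a segment."""
    counts = Counter(names)
    return {
        "total": len(names),                 # every annotated repeat
        "distinct": len(counts),             # number of different names
        "top": counts.most_common(top_n),    # [(name, count), ...]
    }

# Toy input for illustration only.
toy = ["AluSx", "AT_rich", "AluSx", "MIRb", "AluSx", "AT_rich"]
summary = summarize_repeats(toy)
```

When two names have the same count, `Counter.most_common` keeps them in the order first encountered, so the ranking of ties depends on the input order.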


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
1   511948       chr15  NT_037852.6  1392123-1904071    LOC100418897
2   459915       chr6  NT_167244.1  2328814-2788729    HCG22  HLA_complex_group_22
        MICB  MHC_class_I_polypeptide-related_sequence_B_precursor
3   417721       chrY  NT_011875.12  8386531-8804252    ZNF884P
        ZNF885P
4   400458       chr12  NT_029419.12  8134560-8535018    LOC400027  hypothetical_LOC400027
        LOC100420898  AT-rich_interactive_domain-containing_protein_2
        SRSF2IP  SRSF2-interacting_protein
5   397052       chr6  NT_167244.1  4204488-4601540    HLA-DMA  HLA_class_II_histocompatibility_antigen,_DM_alpha_chain_precursor
        HLA-DPA1  HLA_class_II_histocompatibility_antigen,_DP_alpha_1_chain_precursor
        RPL32P1  HLA_class_II_histocompatibility_antigen,_DP_beta_1_chain_precursor
        HLA-DPA2
        COL11A2P
        HLA-DPB2  major_histocompatibility_complex,_class_II,_DP_beta_2_(pseudogene)
6   378953       chr6  NT_167247.1  4254355-4633308    HLA-DMA  HLA_class_II_histocompatibility_antigen,_DM_alpha_chain_precursor
        BRD2  bromodomain-containing_protein_2
        HLA-DOA  HLA_class_II_histocompatibility_antigen,_DO_alpha_chain_precursor
        HLA-DPA1  HLA_class_II_histocompatibility_antigen,_DP_alpha_1_chain_precursor
        RPL32P1  HLA_class_II_histocompatibility_antigen,_DP_beta_1_chain_precursor
        HLA-DPA2
        COL11A2P
        LOC100507722  hypothetical_protein_LOC100507722
        COL11A2  collagen_alpha-2(XI)_chain_isoform_4_precursor
7   361165       chr6  NT_007592.15  56455822-56816987    LOC100506118  bullous_pemphigoid_antigen_1-like
        LOC100506868
        FTH1P15  BEN_domain-containing_protein_6
8   339995       chr3  NT_005612.16  6785740-7125735    TMEM45A  transmembrane_protein_45A
        GMFBP1  probable_G-protein_coupled_receptor_128_precursor
        TFG  protein_TFG_isoform_1
        LOC100421580  target_of_Nesh-SH3_precursor



Posfai@neb.com
May 11, 2011