Distribution of restriction sites in the human genome

Enzyme:  BsiEI
Specificity:  CGRYCG
Number of sites:  134912
Mean distance between sites:  21208 base pairs
Standard deviation:  41607 base pairs
Site density:  47.1 per megabase
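
The specificity CGRYCG uses IUPAC degenerate codes (R = A or G, Y = C or T), and the site density is roughly the reciprocal of the mean distance (1,000,000 / 21208 is about 47.15 per megabase, close to the reported 47.1). The following is a minimal sketch of how such figures could be reproduced for an arbitrary sequence. It is not the code behind this page: the function names `find_sites`, `spacing_stats` and `density_per_megabase` are hypothetical, the input is assumed to be a plain DNA string, and the use of a population standard deviation is an assumption.

```python
import re
from statistics import mean, pstdev

# IUPAC codes needed for CGRYCG: R = A or G, Y = C or T.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}

def site_regex(specificity):
    # A lookahead is used so that overlapping occurrences are all reported.
    return re.compile("(?=" + "".join(IUPAC[b] for b in specificity) + ")")

def find_sites(seq, specificity="CGRYCG"):
    """Return 0-based start positions of every occurrence in seq.

    CGRYCG is its own reverse complement, so scanning one strand is enough.
    """
    return [m.start() for m in site_regex(specificity).finditer(seq.upper())]

def spacing_stats(positions):
    """Mean and (population) standard deviation of distances between sites."""
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)

def density_per_megabase(mean_distance_bp):
    """Approximate site density implied by a mean site-to-site distance."""
    return 1_000_000 / mean_distance_bp
```

For example, `find_sites("CGACCGACCG")` reports two overlapping sites, and `density_per_megabase(21208)` lands within rounding distance of the density quoted above.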


[Figure: Distribution of closely spaced sites]

[Figure: Distribution of sites within 7 STD distance]


Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   1263361  chrX  NT_011651.17  4000073-5263434    81.69 % in   1846 repeats    0.13 % in 2 genes
2   893871  chrY  NT_011875.12  7905471-8799342    79.18 % in   823 repeats    8.40 % in 7 genes
3   810606  chr6  NT_007299.13  6552652-7363258    59.25 % in   1261 repeats    9.54 % in 1 gene
4   767861  chr1  NT_032977.9  74687079-75454940    62.71 % in   1166 repeats    0.00 % in 0 genes
5   637955  chr1  NT_004487.19  47606000-48243955    48.85 % in   995 repeats    76.74 % in 3 genes
6   636026  chr10  NT_030059.13  60015393-60651419    51.53 % in   1047 repeats    0.24 % in 1 gene
7   634772  chr4  NT_016354.19  58848761-59483533    54.12 % in   888 repeats    0.00 % in 0 genes
8   600898  chr15  NT_037852.6  1303171-1904069    10.36 % in   267 repeats    2.83 % in 2 genes
9   594402  chr6  NT_007299.13  19352857-19947259    57.89 % in   853 repeats    0.00 % in 0 genes
10   584965  chr10  NT_030059.13  9171461-9756426    57.01 % in   922 repeats    0.00 % in 0 genes
11   582231  chr14  NT_026437.12  23985226-24567457    56.88 % in   917 repeats    0.00 % in 0 genes
12   574053  chr5  NT_034772.6  28559276-29133329    52.95 % in   879 repeats    0.00 % in 0 genes
13   570097  chr8  NT_008046.16  17787410-18357507    61.00 % in   885 repeats    0.00 % in 0 genes
14   568426  chr3  NT_022517.18  34093988-34662414    56.25 % in   919 repeats    0.00 % in 0 genes
15   568241  chr5  NT_006576.16  42069152-42637393    52.99 % in   802 repeats    0.00 % in 0 genes
16   564792  chr2  NT_022135.16  13464111-14028903    59.46 % in   943 repeats    0.00 % in 0 genes
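
In every row of this table the Length column equals the difference between the two scaffold coordinates (for segment 1, 5263434 - 4000073 = 1263361). A minimal sketch of parsing a row and checking that relationship is shown here. The regular expression and the `parse_row` helper are hypothetical and assume the whitespace-separated layout used above.

```python
import re

# One row: rank, length, chromosome, scaffold, start-end, repeat and gene content.
ROW = re.compile(
    r"^\s*(?P<rank>\d+)\s+(?P<length>\d+)\s+(?P<chrom>chr\w+)\s+"
    r"(?P<scaffold>\S+)\s+(?P<start>\d+)-(?P<end>\d+)\s+"
    r"(?P<repeat_pct>[\d.]+)\s*% in\s+(?P<repeats>\d+) repeats\s+"
    r"(?P<gene_pct>[\d.]+)\s*% in\s+(?P<genes>\d+) genes?\s*$"
)

INT_FIELDS = ("rank", "length", "start", "end", "repeats", "genes")
FLOAT_FIELDS = ("repeat_pct", "gene_pct")

def parse_row(line):
    """Parse one table row into a dict; raise ValueError if it does not match."""
    m = ROW.match(line)
    if m is None:
        raise ValueError("unrecognised row: " + line)
    row = m.groupdict()
    for key in INT_FIELDS:
        row[key] = int(row[key])
    for key in FLOAT_FIELDS:
        row[key] = float(row[key])
    return row

def length_matches_coordinates(row):
    """True when the Length column equals end minus start."""
    return row["end"] - row["start"] == row["length"]
```

Running `length_matches_coordinates(parse_row(line))` over each row is a quick consistency check after copying the table out of the page.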


Repeats in longest uncut segments
#  Length  Chr  Scaffold  Coordinates  Total repeats  Distinct repeats  Most frequent  Second  Third
1  1263361  chrX  NT_011651.17  4000073-5263434    1846  339       AT_rich (100)  (TA)n (44)  AluSx (43)
2  893871  chrY  NT_011875.12  7905471-8799342    823  221       AT_rich (58)  AluY (33)  L1M1 (20)
3  810606  chr6  NT_007299.13  6552652-7363258    1261  299       AT_rich (152)  MIRb (33)  (TA)n (30)
4  767861  chr1  NT_032977.9  74687079-75454940    1166  277       AT_rich (161)  L2a (32)  (TA)n (30)
5  637955  chr1  NT_004487.19  47606000-48243955    995  264       AT_rich (122)  L2a (33)  MIRb (28)
6  636026  chr10  NT_030059.13  60015393-60651419    1047  261       AT_rich (62)  MIRb (60)  MIR (45)
7  634772  chr4  NT_016354.19  58848761-59483533    888  256       AT_rich (123)  L2a (29)  MIRb (19)
8  600898  chr15  NT_037852.6  1303171-1904069    267  96       AluSx (17)  AT_rich (12)  (GCTG)n (11)
9  594402  chr6  NT_007299.13  19352857-19947259    853  240       AT_rich (85)  MIRb (43)  MIR (25)
10  584965  chr10  NT_030059.13  9171461-9756426    922  278       AT_rich (82)  MIR (37)  MIRb (31) 
11  582231  chr14  NT_026437.12  23985226-24567457    917  263       AT_rich (115)  (TA)n (28)  AluSx (24) 
12  574053  chr5  NT_034772.6  28559276-29133329    879  245       AT_rich (98)  MIRb (21)  L2a (21) 
13  570097  chr8  NT_008046.16  17787410-18357507    885  224       AT_rich (91)  AluSx (34)  L2c (26) 
14  568426  chr3  NT_022517.18  34093988-34662414    919  249       MIRb (55)  AT_rich (43)  MIR (35) 
15  568241  chr5  NT_006576.16  42069152-42637393    802  234       AT_rich (48)  MIRb (30)  MIR (30) 
16  564792  chr2  NT_022135.16  13464111-14028903    943  251       AT_rich (76)  MIRb (31)  MIR (31) 


Genes in longest uncut segments
Segment   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function
1   1263361       chrX  NT_011651.17  4000073-5263434    LOC100129843
    RPL22P22
2   893871       chrY  NT_011875.12  7905471-8799342    CYorf15A  hypothetical_protein_LOC246126
    CYorf15B  lipopolysaccharide-specific_response_5-like_protein
    KDM5D  lysine-specific_demethylase_5D_isoform_3
    RCC2P2
    ZNF886P
    ZNF884P
    ZNF885P
3   810606       chr6  NT_007299.13  6552652-7363258    LOC100128293  hypothetical_LOC100507123
5   637955       chr1  NT_004487.19  47606000-48243955    KCNT2  potassium_channel_subfamily_T_member_2
    CFH  complement_factor_H_isoform_b_precursor
    CFHR3  complement_factor_H-related_protein_3_isoform_2_precursor
6   636026       chr10  NT_030059.13  60015393-60651419    LOC100128304  hypothetical_protein_LOC100128304
8   600898       chr15  NT_037852.6  1303171-1904069    LOC727914
    LOC100418897



Posfai@neb.com
May 11, 2011