Distribution of restriction sites in the human genome

Enzyme:  SfiI               Longest uncut segments
Specificity:  GGCCNNNNNGGCC               Repeats in uncut segments
Number of sites:  46571               Genes in uncut segments
Mean distance between sites:  61440 base pairs
Standard deviation:  99839 base pairs
Site density 16.3 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   1534704  chr12  NT_029419.12  43944331-45479035    51.04 % in   2528 repeats    47.93 % in 6 genes
2   1476973  chr4  NT_016354.19  42257564-43734537    53.83 % in   2215 repeats    15.55 % in 5 genes
3   1399993  chr4  NT_016354.19  37546338-38946331    44.23 % in   2537 repeats    68.52 % in 17 genes
4   1323009  chr20  NT_011387.8  6730149-8053158    48.70 % in   2129 repeats    7.47 % in 4 genes
5   1315452  chr14  NT_026437.12  43623365-44938817    53.73 % in   2044 repeats    44.62 % in 6 genes
6   1286944  chr2  NT_005403.17  35272984-36559928    52.30 % in   1983 repeats    26.56 % in 2 genes
7   1242806  chr13  NT_024524.14  38230435-39473241    48.70 % in   1844 repeats    9.22 % in 9 genes
8   1232976  chr10  NT_030059.13  8652871-9885847    54.43 % in   1955 repeats    0.31 % in 1 genes
9   1189084  chr3  NT_005612.16  10768163-11957247    46.84 % in   1879 repeats    0.00 % in 0 genes
10   1182277  chr2  NT_022184.15  34720671-35902948    49.46 % in   1776 repeats    0.00 % in 0 genes
11   1177650  chr12  NT_029419.12  22939058-24116708    60.51 % in   1880 repeats    0.00 % in 0 genes
12   1134488  chr4  NT_016297.16  3415699-4550187    54.81 % in   1858 repeats    0.00 % in 0 genes
13   1122040  chr13  NT_024524.14  49262169-50384209    50.13 % in   1808 repeats    0.00 % in 0 genes
14   1116638  chrX  NT_167197.1  18478304-19594942    57.13 % in   1689 repeats    0.00 % in 0 genes
15   1110244  chr2  NT_022184.15  29133498-30243742    30.34 % in   1603 repeats    0.00 % in 0 genes
16   1100518  chr11  NT_009237.18  37185477-38285995    55.28 % in   1854 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
1534704  chr12  NT_029419.12  43944331-45479035    2528  415       AT_rich (231)  L2a (115)  MIR (101) 
1476973  chr4  NT_016354.19  42257564-43734537    2215  396       AT_rich (249)  L2a (80)  (TA)n (56) 
1399993  chr4  NT_016354.19  37546338-38946331    2537  357       AT_rich (199)  AluSx (184)  MIRb (104) 
1323009  chr20  NT_011387.8  6730149-8053158    2129  362       AT_rich (159)  L2a (75)  MIRb (71) 
1315452  chr14  NT_026437.12  43623365-44938817    2044  358       AT_rich (114)  MIRb (87)  L2a (79) 
1286944  chr2  NT_005403.17  35272984-36559928    1983  375       AT_rich (273)  L2a (52)  (TA)n (51) 
1242806  chr13  NT_024524.14  38230435-39473241    1844  371       AT_rich (245)  AluSx (57)  (TA)n (54) 
1232976  chr10  NT_030059.13  8652871-9885847    1955  386       AT_rich (209)  MIR (68)  MIRb (63) 
1189084  chr3  NT_005612.16  10768163-11957247    1879  352       AT_rich (203)  MIR (71)  L2a (67) 
10  1182277  chr2  NT_022184.15  34720671-35902948    1776  301       AT_rich (137)  L2a (84)  L2c (79) 
11  1177650  chr12  NT_029419.12  22939058-24116708    1880  359       AT_rich (197)  L2a (66)  MIR (54) 
12  1134488  chr4  NT_016297.16  3415699-4550187    1858  375       AT_rich (127)  MIR (93)  MIRb (87) 
13  1122040  chr13  NT_024524.14  49262169-50384209    1808  379       AT_rich (233)  L2a (60)  MIR (52) 
14  1116638  chrX  NT_167197.1  18478304-19594942    1689  330       AT_rich (122)  MIRb (64)  L2a (51) 
15  1110244  chr2  NT_022184.15  29133498-30243742    1603  291       AT_rich (195)  MIRb (93)  MIR (91) 
16  1100518  chr11  NT_009237.18  37185477-38285995    1854  361       AT_rich (172)  MIRb (65)  L2a (65) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   1534704       chr12  NT_029419.12  43944331-45479035    PPFIA2  liprin-alpha-2
VN1R57P 
CCDC59  thyroid_transcription_factor_1-associated_protein_26
LOC100421061  hypothetical_protein_LOC84190
LOC100418732 
TMTC2  transmembrane_and_TPR_repeat-containing_protein_2
2   1476973       chr4  NT_016354.19  42257564-43734537    LOC100421002 
TRAM1L1  translocating_chain-associated_membrane_protein_1-like_1
RPSAP35 
NT5C3P1 
LOC100132656  bifunctional_heparan_sulfate_N-deacetylase/N-sulfotransferase_3
3   1399993       chr4  NT_016354.19  37546338-38946331    LOC132719 
RPS12P8 
RPL36AP19  chromosome_4_open_reading_frame_32
AP1AR  AP-1_complex-associated_regulatory_protein_isoform_b
TIFA  TRAF-interacting_protein_with_FHA_domain-containing_protein_A
ALPK1  alpha-protein_kinase_1
LOC285412 
NEUROG2  neurogenin-2
LOC728914  prematurely_terminated_mRNA_decay_factor-like
MIR302B  microRNA:hsa-mir-302b
LOC645264 
LOC256085 
RPL32P13 
RPL7AP30 
LOC100131158 
RPS26P25  microRNA:hsa-mir-1243
LOC100507209  hypothetical_protein_LOC100507209
4   1323009       chr20  NT_011387.8  6730149-8053158    SRSF10P2 
HAO1  hydroxyacid_oxidase_1
TMX4  thioredoxin-related_transmembrane_protein_4_precursor
PHKBP1 
5   1315452       chr14  NT_026437.12  43623365-44938817    LOC100129782 
KCNH5  potassium_voltage-gated_channel_subfamily_H_member_5_isoform_2
PARP1P2 
RHOJ  rho-related_GTP-binding_protein_RhoJ_precursor
GPHB5  glycoprotein_hormone_beta-5
PPP2R5E  serine/threonine-protein_phosphatase_2A_56_kDa_regulatory_subunit_epsilon_isoform
6   1286944       chr2  NT_005403.17  35272984-36559928    RPL23AP33  zinc_finger_protein_804A
LOC100506923  hypothetical_LOC100506923
7   1242806       chr13  NT_024524.14  38230435-39473241    PRR20A  proline-rich_protein_20A
PRR20B  proline-rich_protein_20B
PRR20C  proline-rich_protein_20C
PRR20D  proline-rich_protein_20D
PRR20E  proline-rich_protein_20E
SLC25A5P4 
RPL31P53 
LOC100129744  hypothetical_protein_LOC100129744
TRNAE39P 
8   1232976       chr10  NT_030059.13  8652871-9885847    ZWINT  ZW10_interactor_isoform_b



Posfai@neb.com
May 11, 2011