Distribution of restriction sites in the human genome

Enzyme:  RceI               Longest uncut segments
Specificity:  CATCGAC               Repeats in uncut segments
Number of sites:  29094               Genes in uncut segments
Mean distance between sites:  98348 base pairs
Standard deviation:  106383 base pairs
Site density 10.2 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   1507960  chr19  NT_011295.11  11418555-12926515    65.38 % in   2908 repeats    45.43 % in 35 genes
2   1288165  chr14  NT_026437.12  27012405-28300570    59.64 % in   1949 repeats    0.95 % in 2 genes
3   1134000  chr15  NT_010194.17  25106033-26240033    56.78 % in   1688 repeats    53.38 % in 1 genes
4   1118643  chr1  NT_004610.19  14747060-15865703    61.75 % in   3352 repeats    67.40 % in 26 genes
5   1116269  chrX  NT_011786.16  20433652-21549921    58.71 % in   2138 repeats    0.77 % in 3 genes
6   1041990  chr4  NT_022778.16  5550527-6592517    50.34 % in   1581 repeats    27.86 % in 4 genes
7   959804  chr9  NT_008470.19  27669027-28628831    51.54 % in   2032 repeats    56.54 % in 12 genes
8   928849  chr7  NT_007933.15  16063870-16992719    43.33 % in   1412 repeats    100.00 % in 1 genes
9   917071  chr1  NT_032977.9  48598329-49515400    52.84 % in   1435 repeats    0.00 % in 0 genes
10   898820  chr3  NT_005612.16  3539210-4438030    44.74 % in   1221 repeats    0.00 % in 0 genes
11   893067  chr1  NT_032977.9  21120879-22013946    64.26 % in   2139 repeats    0.00 % in 0 genes
12   884569  chr4  NT_022778.16  8359228-9243797    56.83 % in   1587 repeats    0.00 % in 0 genes
13   878732  chr7  NT_007933.15  14757075-15635807    53.38 % in   1785 repeats    0.00 % in 0 genes
14   871905  chr1  NT_032977.9  40252450-41124355    49.41 % in   1560 repeats    0.00 % in 0 genes
15   868982  chr1  NT_032977.9  51976237-52845219    37.64 % in   1367 repeats    0.00 % in 0 genes
16   868183  chr6  NT_025741.15  34506250-35374433    45.90 % in   1352 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
1507960  chr19  NT_011295.11  11418555-12926515    2908  232       AluSx (274)  AluSg (188)  L1MB2 (156) 
1288165  chr14  NT_026437.12  27012405-28300570    1949  383       AT_rich (198)  L2a (59)  MIR (56) 
1134000  chr15  NT_010194.17  25106033-26240033    1688  306       AT_rich (112)  L2a (68)  MIR (61) 
1118643  chr1  NT_004610.19  14747060-15865703    3352  285       AluSx (459)  AluY (210)  AluJb (195) 
1116269  chrX  NT_011786.16  20433652-21549921    2138  342       MIRb (111)  MIR (106)  AluSx (98) 
1041990  chr4  NT_022778.16  5550527-6592517    1581  345       AT_rich (230)  L2a (51)  (TA)n (50) 
959804  chr9  NT_008470.19  27669027-28628831    2032  314       AluSx (160)  AluY (92)  AT_rich (87) 
928849  chr7  NT_007933.15  16063870-16992719    1412  275       AT_rich (125)  MIRb (77)  MIR (61) 
917071  chr1  NT_032977.9  48598329-49515400    1435  308       AT_rich (133)  MIRb (69)  MIR (61) 
10  898820  chr3  NT_005612.16  3539210-4438030    1221  287       AT_rich (123)  MIR (48)  MIRb (47) 
11  893067  chr1  NT_032977.9  21120879-22013946    2139  314       AluSx (241)  MIRb (104)  AT_rich (89) 
12  884569  chr4  NT_022778.16  8359228-9243797    1587  329       AT_rich (109)  AluSx (72)  L2a (70) 
13  878732  chr7  NT_007933.15  14757075-15635807    1785  304       AluSx (189)  AT_rich (86)  AluJo (84) 
14  871905  chr1  NT_032977.9  40252450-41124355    1560  305       AT_rich (146)  AluSx (111)  MIRb (65) 
15  868982  chr1  NT_032977.9  51976237-52845219    1367  266       AT_rich (145)  MIRb (84)  AluSx (83) 
16  868183  chr6  NT_025741.15  34506250-35374433    1352  285       AT_rich (88)  MIRb (74)  AluSx (73) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   1507960       chr19  NT_011295.11  11418555-12926515    LOC100421696  zinc_finger_protein_90
RPS16P10 
ZNF486  zinc_finger_protein_486
LOC100419841 
LOC100421704 
LOC284441 
LOC100129265 
LOC100421705 
LOC100131306 
LOC100288623  microRNA_1270-2
LOC100421706  microRNA:hsa-mir-1270
LOC100421697 
LOC100131603  zinc_finger_protein_626_isoform_2
VN1R79P 
LOC100421698 
LOC100418986 
LOC100418987 
LOC100418988 
LOC100418989 
LOC100418990 
LOC100418991 
LOC100418992 
LOC100418993 
ZNF85  zinc_finger_protein_85
KRT18P40 
ZNF430  zinc_finger_protein_430_isoform_2
VN1R80P 
VN1R81P  zinc_finger_protein_714
RPL7AP10  zinc_finger_protein_431
VN1R82P 
RPL36AP51 
VN1R83P 
LOC100421707  zinc_finger_protein_708
LOC100422298  zinc_finger_protein_738
ZNF493  zinc_finger_protein_493_isoform_1
2   1288165       chr14  NT_026437.12  27012405-28300570    LOC100506412  hypothetical_LOC100506412
RPL10L  60S_ribosomal_protein_L10-like
3   1134000       chr15  NT_010194.17  25106033-26240033    LOC100421430  protein_unc-13_homolog_C
4   1118643       chr1  NT_004610.19  14747060-15865703    LOC100420555  hypothetical_protein_LOC199870_isoform_5
STX12  syntaxin-12
PPP1R8  nuclear_inhibitor_of_protein_phosphatase_1_isoform_gamma
C1orf38  protein_THEMIS2_isoform_2
RPA2  replication_protein_A_32_kDa_subunit
SMPDL3B  acid_sphingomyelinase-like_phosphodiesterase_3b_isoform_2
XKR8  XK-related_protein_8
EYA3  eyes_absent_homolog_3
LOC653566  signal_peptidase_complex_subunit_2_homolog_pseudogene
LOC100131223 
RNU7-29P 
PTAFR  platelet-activating_factor_receptor
DNAJC8  dnaJ_homolog_subfamily_C_member_8
ATPIF1  ATPase_inhibitor,_mitochondrial_isoform_3_precursor
SESN2  sestrin-2
MED18  mediator_of_RNA_polymerase_II_transcription_subunit_18
PHACTR4  phosphatase_and_actin_regulator_4_isoform_2
RCC1  regulator_of_chromosome_condensation_isoform_c
TRNAU1AP  tRNA_selenocysteine_1-associated_protein_1
SNORA16A  small_nucleolar_RNA,_H/ACA_box_16A
RAB42  putative_Ras-related_protein_Rab-42_isoform_2
TAF12  transcription_initiation_factor_TFIID_subunit_12
RNU11  RNA,_U11_small_nuclear
GMEB1  glucocorticoid_modulatory_element-binding_protein_1_isoform_2
YTHDF2  YTH_domain_family_protein_2_isoform_2
OPRD1  delta-type_opioid_receptor
5   1116269       chrX  NT_011786.16  20433652-21549921    RAC1P4 
ZIC3  zinc_finger_protein_ZIC_3
ZFYVE9P1 
6   1041990       chr4  NT_022778.16  5550527-6592517    RPS6P5 
LOC401134  hypothetical_LOC401134
LOC100422019 
LOC100507063  hypothetical_LOC100507063
7   959804       chr9  NT_008470.19  27669027-28628831    LOC100506667  hypothetical_protein_LOC100506667
C9orf130  chromosome_9_open_reading_frame_130,_transcript_variant_2
LOC100422232  RAD26L_hypothetical_protein
NCRNA00092  non-protein_coding_RNA_92
LOC158435  hypothetical_LOC158435
EIF4BP3 
HSD17B3  testosterone_17-beta-dehydrogenase_3
SLC35D2  UDP-N-acetylglucosamine/UDP-glucose/GDP-mannose_transporter
LOC100421693  zinc_finger_protein_367
HABP4  intracellular_hyaluronan-binding_protein_4
LOC100507364  hypothetical_LOC100507364
C9orf21  hypothetical_protein_LOC195827
8   928849       chr7  NT_007933.15  16063870-16992719    LOC100421387  ribosomal_protein_L13a_pseudogene_17



Posfai@neb.com
May 11, 2011