Distribution of restriction sites in the human genome

Enzyme:  GauT27I               Longest uncut segments
Specificity:  CGCGCAGG               Repeats in uncut segments
Number of sites:  8460               Genes in uncut segments
Mean distance between sites:  338220 base pairs
Standard deviation:  587467 base pairs
Site density 3.0 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   11320508  chr1  NT_004487.19  37338706-48659214    51.45 % in   18205 repeats    18.79 % in 45 genes
2   7111528  chr8  NT_008046.16  800141-7911669    51.19 % in   11141 repeats    34.86 % in 34 genes
3   6925241  chrX  NT_011651.17  7408186-14333427    70.52 % in   10243 repeats    18.80 % in 30 genes
4   6356831  chr13  NT_024524.14  48256106-54612937    44.86 % in   10114 repeats    27.05 % in 23 genes
5   5672532  chr6  NT_007592.15  47194168-52866700    47.62 % in   9077 repeats    34.12 % in 69 genes
6   5644896  chr8  NT_008046.16  47857870-53502766    50.07 % in   10628 repeats    26.43 % in 14 genes
7   5245445  chr1  NT_032977.9  42723989-47969434    52.74 % in   8602 repeats    47.10 % in 22 genes
8   5224163  chr12  NT_029419.12  46665394-51889557    51.13 % in   8557 repeats    29.65 % in 24 genes
9   5063617  chr12  NT_029419.12  20147232-25210849    53.18 % in   8188 repeats    0.00 % in 0 genes
10   5055977  chr8  NT_008046.16  22747948-27803925    49.54 % in   8223 repeats    0.00 % in 0 genes
11   5034721  chr9  NT_008413.18  26838485-31873206    49.93 % in   8002 repeats    0.00 % in 0 genes
12   4900952  chr4  NT_006316.16  8798035-13698987    49.58 % in   8675 repeats    0.00 % in 0 genes
13   4654775  chr2  NT_022184.15  59346971-64001746    51.95 % in   7408 repeats    0.00 % in 0 genes
14   4625430  chr5  NT_034772.6  10404917-15030347    47.70 % in   7126 repeats    0.00 % in 0 genes
15   4538242  chr18  NT_025028.14  11209263-15747505    45.55 % in   7440 repeats    0.00 % in 0 genes
16   4537461  chrX  NT_011681.16  1-4537462    60.94 % in   7629 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
11320508  chr1  NT_004487.19  37338706-48659214    18205  807       AT_rich (2126)  L2a (675)  AluSx (543) 
7111528  chr8  NT_008046.16  800141-7911669    11141  717       AT_rich (932)  MIRb (471)  L2a (443) 
6925241  chrX  NT_011651.17  7408186-14333427    10243  692       AT_rich (813)  AluSx (259)  (TA)n (211) 
6356831  chr13  NT_024524.14  48256106-54612937    10114  700       AT_rich (1254)  MIR (340)  AluSx (326) 
5672532  chr6  NT_007592.15  47194168-52866700    9077  671       AT_rich (715)  MIRb (412)  MIR (370) 
5644896  chr8  NT_008046.16  47857870-53502766    10628  671       MIRb (1029)  MIR (574)  L2a (475) 
5245445  chr1  NT_032977.9  42723989-47969434    8602  654       AT_rich (676)  MIRb (451)  L2a (378) 
5224163  chr12  NT_029419.12  46665394-51889557    8557  671       AT_rich (900)  L2a (369)  MIRb (331) 
5063617  chr12  NT_029419.12  20147232-25210849    8188  659       AT_rich (694)  MIRb (330)  L2a (303) 
10  5055977  chr8  NT_008046.16  22747948-27803925    8223  680       AT_rich (911)  L2a (325)  MIRb (310) 
11  5034721  chr9  NT_008413.18  26838485-31873206    8002  653       AT_rich (756)  MIRb (378)  MIR (269) 
12  4900952  chr4  NT_006316.16  8798035-13698987    8675  610       AT_rich (583)  MIRb (556)  MIR (466) 
13  4654775  chr2  NT_022184.15  59346971-64001746    7408  650       AT_rich (553)  MIRb (264)  MIR (233) 
14  4625430  chr5  NT_034772.6  10404917-15030347    7126  639       AT_rich (858)  L2a (246)  MIR (221) 
15  4538242  chr18  NT_025028.14  11209263-15747505    7440  612       AT_rich (847)  AluSx (239)  L2a (201) 
16  4537461  chrX  NT_011681.16  1-4537462    7629  630       AT_rich (349)  MIRb (306)  L2a (275) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   11320508       chr1  NT_004487.19  37338706-48659214    HMCN1  hemicentin-1_precursor
PRG4  proteoglycan_4_isoform_D
TPR  nucleoprotein_TPR
OCLM  oculomedin
PDC  phosducin_isoform_b
LOC100131939 
PTGS2  prostaglandin_G/H_synthase_2_precursor
PLA2G4A  cytosolic_phospholipase_A2
FDPSP1 
LOC100422527 
LOC100129274 
LOC100421343 
RPS3AP9 
LOC647132 
FAM5C  family_with_sequence_similarity_5,_member_C_precursor
LOC440704  hypothetical_LOC440704
LOC100421399 
LOC100506420  hypothetical_LOC100506420
RGS18  regulator_of_G-protein_signaling_18
LOC647150 
RGS21  regulator_of_G-protein_signaling_21
RGS1  regulator_of_G-protein_signaling_1
RGS13  regulator_of_G-protein_signaling_13
RPS27AP5 
LOC100130137 
RGS2  regulator_of_G-protein_signaling_2
LOC730190 
LOC100506438  hypothetical_LOC100506438
UCHL5  ubiquitin_carboxyl-terminal_hydrolase_isozyme_L5
TROVE2  60_kDa_SS-A/Ro_ribonucleoprotein_isoform_2
GLRX2  glutaredoxin-2,_mitochondrial_isoform_2
B3GALT2  beta-1,3-galactosyltransferase_2
RPL23AP22 
EEF1A1P14 
KCNT2  potassium_channel_subfamily_T_member_2
CFH  complement_factor_H_isoform_b_precursor
CFHR3  complement_factor_H-related_protein_3_isoform_2_precursor
CFHR1  complement_factor_H-related_protein_1_precursor
LOC100289145 
CFHR4  complement_factor_H-related_protein_4_precursor
CFHR2  complement_factor_H-related_protein_2_precursor
CFHR5  complement_factor_H-related_protein_5_precursor
F13B  coagulation_factor_XIII_B_chain_precursor
ASPM  abnormal_spindle-like_microcephaly-associated_protein
ZBTB41  zinc_finger_and_BTB_domain-containing_protein_41
2   7111528       chr8  NT_008046.16  800141-7911669    CPNE3  copine-3
LOC100506313  cyclic_nucleotide-gated_cation_channel_beta-3
CNBD1  cyclic_nucleotide-binding_domain-containing_protein_1
LOC100128412 
LOC642461 
SOX5P 
LOC100419762 
DCAF4L2  DDB1-_and_CUL4-associated_factor_4-like_protein_2
MMP16  matrix_metalloproteinase-16_isoform_2_preproprotein
LOC100129100 
LOC100506342  hypothetical_LOC100506342
RIPK2  receptor-interacting_serine/threonine-protein_kinase_2
COX6B1P6 
OSGIN2  oxidative_stress-induced_growth_inhibitor_2_isoform_2
NBN  nibrin
DECR1  2,4-dienoyl-CoA_reductase,_mitochondrial_precursor
CALB1  calbindin
TMEM64  transmembrane_protein_64_isoform_2
LOC100506351  hypothetical_LOC100506351
NECAB1  N-terminal_EF-hand_calcium-binding_protein_1
LOC100127983  hypothetical_protein_LOC100127983
TMEM55A  transmembrane_protein_55A
LOC100506365  hypothetical_LOC100506365,_transcript_variant_1
OTUD6B  OTU_domain-containing_protein_6B
CPP  leucine-rich_repeat-containing_protein_69
SLC26A7  anion_exchange_transporter_isoform_b
LOC100289644 
MRPS16P1 
RUNX1T1  protein_CBFA2T1_isoform_MTG8c
RPS26P10 
C8orf83  protein_TRIQK
LOC100420406 
LOC389676  hypothetical_LOC389676
LOC100288659  hypothetical_LOC642924
3   6925241       chrX  NT_011651.17  7408186-14333427    UBE2DNL  ubiquitin-conjugating_enzyme_E2D_N-terminal_like_(pseudogene)
APOOL  apolipoprotein_O-like_precursor
SATL1  spermidine/spermine_N(1)-acetyltransferase-like_protein_1
LOC100421745 
ZNF711  zinc_finger_protein_711
POF1B  protein_POF1B
MIR1321  microRNA_1321
LOC730792 
CHM  rab_proteins_geranylgeranyltransferase_component_A_1_isoform_b
LOC441505 
LOC100129298 
BA345E19.2  dachshund_homolog_2_isoform_c
KLHL4  kelch-like_protein_4_isoform_1
RPSAP15 
MRPS22P1 
LOC100129133 
CAPZA1P 
CPXCR1  CPX_chromosomal_region_candidate_gene_1_protein
LOC100421038 
SRIP2 
TGIF2LX  homeobox_protein_TGIF2LX
LOC100130134 
USP12PX 
RNF19BPX 
LOC100419789 
LOC100131981 
LOC100287033 
PABPC5  polyadenylate-binding_protein_5
LOC100132591 
KRT18P11  protocadherin-11_X-linked_isoform_a_precursor
4   6356831       chr13  NT_024524.14  48256106-54612937    PCDH9  protocadherin-9_isoform_2_precursor
LOC730236  hypothetical_protein_LOC730236
RPSAP53 
LOC390411 
OR7E111P 
OR7E33P 
RPL37P21 
RPL12P34 
LOC730239 
LOC100421079 
LOC100128625 
LOC100420198  kelch-like_protein_1
ATXN8OS  ATXN8_opposite_strand_(non-protein_coding)
LOC100288130  uncharacterized_protein_KIAA0802-like
LOC100421226 
RPL35AP31  dachshund_homolog_1_isoform_c
RPS10P21 
RPL21P110 
MZT1  mitotic-spindle_organizing_protein_1
C13orf34  protein_aurora_borealis
DIS3  exosome_complex_exonuclease_RRP44_isoform_b
PIBF1  progesterone-induced-blocking_factor_1
PSMD10P3 
5   5672532       chr6  NT_007592.15  47194168-52866700    TNFRSF21  tumor_necrosis_factor_receptor_superfamily_member_21_precursor
LOC100421517  CD2-associated_protein
GPR111  probable_G-protein_coupled_receptor_111
GPR115  probable_G-protein_coupled_receptor_115_precursor
RPL27AP7 
OPN5  opsin_5,_transcript_variant_2
LOC100505931  hypothetical_LOC100505931
LOC389395 
RBMXP1 
LOC100506698 
LOC100418956 
LOC100287991 
LOC100287991 
RNU7-65P 
LOC442215 
LOC100505950  hypothetical_LOC100505950
MUT  methylmalonyl-CoA_mutase,_mitochondrial_precursor
CENPQ  centromere_protein_Q
GLYATL3  glycine_N-acyltransferase-like_protein_3
C6orf141  hypothetical_protein_LOC135398
RHAG  ammonium_transporter_Rh_type_A
CRISP2  cysteine-rich_secretory_protein_2_precursor
CRISP3  cysteine-rich_secretory_protein_3_isoform_2_precursor
PGK2  phosphoglycerate_kinase_2
CRISP1  cysteine-rich_secretory_protein_1_isoform_2_precursor
DEFB133  beta-defensin_133
DEFB114  beta-defensin_114_precursor
DEFB113  beta-defensin_113_precursor
DEFB110  beta-defensin_110_isoform_a
DEFB112  beta-defensin_112_precursor
LOC100505985  hypothetical_LOC100505985
TFAP2D  transcription_factor_AP-2-delta
TFAP2B  transcription_factor_AP-2-beta
RPS17P5 
LOC100418898 
FTH1P5 
LOC100421020 
LOC100422449 
LOC646517 
RPS15AP20 
PKHD1  fibrocystin_isoform_2
MIR206  microRNA:hsa-mir-206
MIR133B  microRNA:hsa-mir-133b
IL17A  interleukin-17A_precursor
IL17F  interleukin-17F_precursor
SLC25A20P1 
MCM3  DNA_replication_licensing_factor_MCM3
LOC647163  hypothetical_protein_LOC647163
PAQR8  membrane_progestin_receptor_beta
EFHC1  EF-hand_domain-containing_protein_1_isoform_2
TRAM2  translocating_chain-associated_membrane_protein_2
FLJ37798  hypothetical_LOC401264
LOC724104 
LOC730101  hypothetical_LOC730101,_transcript_variant_2
TMEM14A  transmembrane_protein_14A
LOC100420627 
GSTA7P  glutathione_S-transferase_alpha_7,_pseudogene
GSTA2  glutathione_S-transferase_A2
LOC647169 
GSTA1  glutathione_S-transferase_A1
GSTA6P 
GSTA5  glutathione_S-transferase_A5
LOC647175 
LOC647177 
GSTA3  glutathione_S-transferase_A3
GSTA4P 
GSTA4  glutathione_S-transferase_A4
RN7SK  RNA,_7SK_small_nuclear
ICK  serine/threonine-protein_kinase_ICK
6   5644896       chr8  NT_008046.16  47857870-53502766    LOC100129104  hypothetical_LOC100129104
LOC100507162  hypothetical_LOC100507162
LOC100419617 
ZFATAS  ZFAT_antisense_RNA_(non-protein_coding)
MIR30B  microRNA:hsa-mir-30b
MIR30D  microRNA:hsa-mir-30d
RPL23AP56 
LOC286094  hypothetical_LOC286094
MAPRE1P1  KH_domain-containing,_RNA-binding,_signal_transduction-associated_protein_3
LOC100507185  hypothetical_LOC100507185
LOC100129367 
FLJ45872  FLJ45872_protein
FAM135B  hypothetical_protein_LOC51059
COL22A1  collagen,_type_XXII,_alpha_1
7   5245445       chr1  NT_032977.9  42723989-47969434    RPL31P12 
KRT8P21 
LRRIQ3  leucine-rich_repeat_and_IQ_domain-containing_protein_3
FPGT  fucose-1-phosphate_guanylyltransferase
TNNI3K  serine/threonine-protein_kinase_TNNI3K_isoform_b
C1orf173  hypothetical_protein_LOC127254
CRYZ  quinone_oxidoreductase_isoform_c
TYW3  tRNA_wybutosine-synthesizing_protein_3_homolog_isoform_2
LHX8  LIM/homeobox_protein_Lhx8
RPL29P5  choline_transporter-like_protein_5_isoform_A
LOC100421536 
DLSTP1  medium-chain_specific_acyl-CoA_dehydrogenase,_mitochondrial_isoform_b_precursor
SNORD45B  small_nucleolar_RNA,_C/D_box_45B
MSH4  mutS_protein_homolog_4
ASB17  ankyrin_repeat_and_SOCS_box_protein_17
LOC100505563  hypothetical_LOC100505563
LOC100418965  alpha-N-acetylgalactosaminide_alpha-2,6-sialyltransferase_3_isoform_2
TPI1P1 
ST6GALNAC5  alpha-N-acetylgalactosaminide_alpha-2,6-sialyltransferase_5
LOC256483  hypothetical_protein_LOC256483
LOC100421400  GPI-anchor_transamidase_precursor
AK5  adenylate_kinase_isoenzyme_5_isoform_2
8   5224163       chr12  NT_029419.12  46665394-51889557    LOC100128335 
SLC6A15  orphan_sodium-_and_chloride-dependent_neurotransmitter_transporter_NTT73_isoform_2
TSPAN19  putative_tetraspanin-19
LRRIQ1  leucine-rich_repeat_and_IQ_domain-containing_protein_1_isoform_2
ALX1  ALX_homeobox_protein_1
LOC441643 
LOC100129589 
RASSF9  ras_association_domain-containing_protein_9
NTS  neurotensin/neuromedin_N_preproprotein_preproprotein
MGAT4C  alpha-1,3-mannosyl-glycoprotein_4-beta-N-acetylglucosaminyltransferase_C
RPL23AP68 
LOC100507559  hypothetical_LOC100507559
CYCSP30 
LOC100420357  makorin_ring_finger_protein_9,_pseudogene
RPS4XP15 
C12orf50  hypothetical_protein_LOC160419
C12orf29  hypothetical_protein_LOC91298
LOC100420011  centrosomal_protein_of_290_kDa
TMTC3  transmembrane_and_TPR_repeat-containing_protein_3
KITLG  kit_ligand_isoform_b_precursor
LOC728084  hypothetical_LOC728084
LOC100287355 
MRPS6P4 
DUSP6  dual_specificity_protein_phosphatase_6_isoform_b



Posfai@neb.com
May 11, 2011