Distribution of restriction sites in the human genome

Enzyme:  AscI               Longest uncut segments
Specificity:  GGCGCGCC               Repeats in uncut segments
Number of sites:  4466               Genes in uncut segments
Mean distance between sites:  640695 base pairs
Standard deviation:  1072205 base pairs
Site density 1.6 per megabase               Help


Distribution of closely spaced sites

Distribution of sites within 7 STD distance


Help
Longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeat content Gene content
1   12998278  chrX  NT_167197.1  19558386-32556664    51.41 % in   22963 repeats    32.41 % in 78 genes
2   12157066  chr21  NT_011512.11  2763960-14921026    49.33 % in   20172 repeats    20.26 % in 56 genes
3   12048987  chr5  NT_034772.6  3270457-15319444    50.16 % in   18494 repeats    14.94 % in 58 genes
4   11502543  chr3  NT_022459.15  10100261-21602804    45.23 % in   17499 repeats    30.56 % in 19 genes
5   10837886  chr4  NT_022778.16  1-10837887    53.53 % in   16728 repeats    19.76 % in 76 genes
6   10459953  chr18  NT_010966.14  15366768-25826721    43.44 % in   16607 repeats    34.13 % in 30 genes
7   10276467  chrX  NT_011651.17  8688180-18964647    69.49 % in   15440 repeats    17.25 % in 39 genes
8   10140829  chr1  NT_032977.9  46512427-56653256    47.57 % in   16959 repeats    38.14 % in 63 genes
9   9853041  chr13  NT_024524.14  34755553-44608594    50.42 % in   15523 repeats    0.00 % in 0 genes
10   9061834  chr1  NT_032977.9  64978973-74040807    49.81 % in   15332 repeats    0.00 % in 0 genes
11   8890233  chr12  NT_029419.12  42227097-51117330    51.05 % in   14317 repeats    0.00 % in 0 genes
12   8286468  chr16  NT_010498.15  11674352-19960820    48.87 % in   15290 repeats    0.00 % in 0 genes
13   8283608  chr6  NT_007299.13  14580276-22863884    55.54 % in   12847 repeats    0.00 % in 0 genes
14   7960010  chr11  NT_009237.18  35580774-43540784    52.76 % in   13876 repeats    0.00 % in 0 genes
15   7705301  chrX  NT_011651.17  21834165-29539466    59.05 % in   12697 repeats    0.00 % in 0 genes
16   7546992  chr5  NT_023133.13  6274904-13821896    45.94 % in   13127 repeats    0.00 % in 0 genes


Help
Repeats in longest uncut segments
# Length  Chr  Scaffold  Coordinates  Repeats
Total  Distinct    Most  Second  Third 
12998278  chrX  NT_167197.1  19558386-32556664    22963  824       AT_rich (1494)  AluSx (1226)  MIRb (742) 
12157066  chr21  NT_011512.11  2763960-14921026    20172  832       AT_rich (2110)  AluSx (754)  AluY (649) 
12048987  chr5  NT_034772.6  3270457-15319444    18494  827       AT_rich (1953)  AluSx (593)  L2a (592) 
11502543  chr3  NT_022459.15  10100261-21602804    17499  822       AT_rich (2113)  L2a (588)  MIR (556) 
10837886  chr4  NT_022778.16  1-10837887    16728  836       AT_rich (1948)  L2a (597)  MIR (518) 
10459953  chr18  NT_010966.14  15366768-25826721    16607  788       AT_rich (1088)  MIRb (760)  MIR (704) 
10276467  chrX  NT_011651.17  8688180-18964647    15440  780       AT_rich (1220)  AluSx (355)  MIR (344) 
10140829  chr1  NT_032977.9  46512427-56653256    16959  792       AT_rich (1331)  MIRb (853)  MIR (755) 
9853041  chr13  NT_024524.14  34755553-44608594    15523  820       AT_rich (1731)  L2a (538)  AluSx (478) 
10  9061834  chr1  NT_032977.9  64978973-74040807    15332  766       AT_rich (1247)  L2a (686)  MIRb (629) 
11  8890233  chr12  NT_029419.12  42227097-51117330    14317  769       AT_rich (1492)  L2a (659)  MIRb (538) 
12  8286468  chr16  NT_010498.15  11674352-19960820    15290  768       MIRb (1111)  AT_rich (921)  MIR (716) 
13  8283608  chr6  NT_007299.13  14580276-22863884    12847  741       AT_rich (1053)  MIRb (489)  AluSx (463) 
14  7960010  chr11  NT_009237.18  35580774-43540784    13876  741       AT_rich (941)  MIRb (866)  MIR (658) 
15  7705301  chrX  NT_011651.17  21834165-29539466    12697  704       AT_rich (572)  AluSx (536)  MIRb (532) 
16  7546992  chr5  NT_023133.13  6274904-13821896    13127  722       MIRb (876)  AT_rich (841)  MIR (680) 


Help
Genes in longest uncut segments
Sgmnt   Length (bp)  Chr  Scaffold  Coordinates  Gene symbol  Gene function 
1   12998278       chrX  NT_167197.1  19558386-32556664    SMPX  small_muscular_protein
YY2  transcription_factor_YY2
LOC100418871 
SMS  spermine_synthase
MIR1308  microRNA_1308
ZNF645  zinc_finger_protein_645
LOC729650 
DDX53  DEAD_box_protein_53
FAM3C2 
LOC653707 
LOC100217367  patched_domain-containing_protein_1
PRDX4  peroxiredoxin-4
ACOT9  acyl-coenzyme_A_thioesterase_9,_mitochondrial_isoform_b_precursor
SAT1  diamine_acetyltransferase_1
APOO  apolipoprotein_O_precursor
CXorf58  putative_uncharacterized_protein_CXorf58_isoform_2
KLHL15  kelch-like_protein_15
EIF2S3  eukaryotic_translation_initiation_factor_2_subunit_3
ZFX  zinc_finger_X-chromosomal_protein_isoform_2
FAM48B2  putative_protein_FAM48B2
FAM48B1  protein_FAM48B1
RPS26P58 
PDK3  [Pyruvate_dehydrogenase_[lipoamide]]_kinase_isozyme_3,_mitochondrial_isoform_2_precursor
PCYT1B  choline-phosphate_cytidylyltransferase_B_isoform_1
LOC100288233 
EEF1B4  small_Cajal_body-specific_RNA_23
ARX  homeobox_protein_ARX
LOC139957 
LOC441487 
LOC389842 
MAGEB18  melanoma-associated_antigen_B18
MAGEB6B 
LOC100420245 
MAGEB6  melanoma-associated_antigen_B6
MAGEB5  melanoma-associated_antigen_B5
LOC100130052 
LOC100422209 
LOC100418721 
LOC100421110 
VENTXP1  VENT_homeobox_(Xenopus_laevis)_pseudogene_1
RPL7P58 
HMGA1P1 
LOC100128929 
LOC100289380 
SMEK3P  SMEK_homolog_3,_suppressor_of_mek1_(Dictyostelium)_pseudogene
RDXP2 
LOC100132076 
DCAF8L2  DDB1-_and_CUL4-associated_factor_8-like_protein_2
MAGEB10  melanoma-associated_antigen_B10
LOC392435 
LOC392436 
VKORC1P1 
LOC340569 
DCAF8L1  DDB1-_and_CUL4-associated_factor_8-like_protein_1
IL1RAPL1  interleukin-1_receptor_accessory_protein-like_1_precursor
LOC100129049 
LOC100420324 
MAGEB2  melanoma-associated_antigen_B2
MAGEB3  melanoma-associated_antigen_B3
MAGEB4  melanoma-associated_antigen_B4
MAGEB1  melanoma-associated_antigen_B1
LOC100420075 
NR0B1  nuclear_receptor_subfamily_0_group_B_member_1
CXorf21  hypothetical_protein_LOC80231
CKS1BP6 
FTLP2 
GK  glycerol_kinase_isoform_c
LOC100418759 
TAB3  TGF-beta-activated_kinase_1_and_MAP3K7-binding_protein_3
DMD  dystrophin,_transcript_variant_Dp427l
LOC100130233  dystrophin_Dp40_isoform
MIR548F5  microRNA_548f-5
TBCAP1 
LOC646506 
FAM47A  hypothetical_protein_LOC158724
FTH1P14 
LOC392439 
TMEM47  transmembrane_protein_47
2   12157066       chr21  NT_011512.11  2763960-14921026    USP25  ubiquitin_carboxyl-terminal_hydrolase_25
MIR125B2  microRNA:hsa-mir-125b-2
LOC100421718 
LOC100505929  hypothetical_LOC100505929
tRNA-Gly
RPL39P40 
LOC100505945  transcription_factor_BTF3_homolog_4-like
BTG3  protein_BTG3_isoform_b
C21orf91  protein_EURL_homolog_isoform_3
NCRNA00157  non-protein_coding_RNA_157
RPL37P3 
CHODL  chondrolectin_precursor
TMPRSS15  enteropeptidase_precursor
PPIAL3 
LOC100505973  hypothetical_LOC100505973
SLC6A6P1 
RPL37P4 
LOC100128057 
LOC100421082 
LOC100288151 
C1QBPP 
LOC100422584 
FDPSP6 
KRT18P2 
RPS3AP1 
LOC100421083 
C21orf131  chromosome_21_open_reading_frame_131
LOC100288185 
NCAM2  neural_cell_adhesion_molecule_2_precursor
LOC100420035 
MAPK6PS2 
ZNF299P 
LOC100130310 
EEF1A1P1 
TUBAP 
VN2R20P 
LOC100419737 
RPL13AP7 
NCRNA00158  non-protein_coding_RNA_158
MIR155  microRNA:hsa-mir-155
C21orf71  chromosome_21_open_reading_frame_71
MRPL39  39S_ribosomal_protein_L39,_mitochondrial_isoform_b
FDX1P2  junctional_adhesion_molecule_B_precursor
ATP5J  ATP_synthase-coupling_factor_6,_mitochondrial_isoform_a_precursor
LOC100506106  GA-binding_protein_alpha_chain
APP  amyloid_beta_A4_protein_isoform_f_precursor
LOC100289065  hypothetical_LOC100289065
MARCKSP1 
LOC100506140  hypothetical_LOC100506140
CYYR1  cysteine_and_tyrosine-rich_protein_1_precursor
ADAMTS1  A_disintegrin_and_metalloproteinase_with_thrombospondin_motifs_1_preproprotein_preproprotein
ADAMTS5  A_disintegrin_and_metalloproteinase_with_thrombospondin_motifs_5_preproprotein_preproprotein
GPX1P2 
EIF4A1P1 
LOC100288252 
NCRNA00113  non-protein_coding_RNA_113
3   12048987       chr5  NT_034772.6  3270457-15319444    GPR150  probable_G-protein_coupled_receptor_150
RFESD  Rieske_domain-containing_protein_isoform_2
SPATA9  spermatogenesis-associated_protein_9
HSPD1P11  rho-related_BTB_domain-containing_protein_3
GLRX  glutaredoxin-1
C5orf27  chromosome_5_open_reading_frame_27
ELL2  RNA_polymerase_II_elongation_factor_ELL2
LOC100288964 
MIR583  microRNA:hsa-mir-583
PCSK1  neuroendocrine_convertase_1_isoform_3
CAST  calpastatin,_transcript_variant_12
ERAP1  endoplasmic_reticulum_aminopeptidase_1_isoform_b
ERAP2  endoplasmic_reticulum_aminopeptidase_2
LNPEP  leucyl-cystinyl_aminopeptidase_isoform_2
LOC642737 
LIX1  protein_limb_expression_1_homolog
RIOK2  serine/threonine-protein_kinase_RIO2_isoform_2
YTHDF1P1 
LOC100289037 
LOC391813 
LOC100420130 
PSME2P1 
LOC402221 
LOC100289133 
MRPS35P2 
LOC642909 
DDX18P4 
RGMB  RGM_domain_family_member_B
CHD1  chromodomain-helicase-DNA-binding_protein_1
LOC100289230  hypothetical_LOC100289230
RPS9P3 
LOC728093  putative_POM121-like_protein_1-like
LOC441066 
LOC285706 
LOC100506353 
LOC643031 
LOC100287745 
LOC100289404 
LOC100132456 
LOC100133050  glucuronidase,_beta_pseudogene
LOC441098 
FAM174A  membrane_protein_FAM174A_precursor
ST8SIA4  CMP-N-acetylneuraminate-poly-alpha-2,_8-sialyltransferase_isoform_b
LOC100420593 
OR7H2P 
SLCO4C1  solute_carrier_organic_anion_transporter_family_member_4C1
SLCO6A1  solute_carrier_organic_anion_transporter_family_member_6A1
PAM  peptidyl-glycine_alpha-amidating_monooxygenase_isoform_e_preproprotein
LOC134505 
GIN1  gypsy_retrotransposon_integrase-like_protein_1
PPIP5K2  inositol_hexakisphosphate_and_diphosphoinositol-pentakisphosphate_kinase_2
C5orf30  hypothetical_protein_LOC90355
LOC100129962 
NUDT12  peroxisomal_NADH_pyrophosphatase_NUDT12
RAB9BP1  RAB9B,_member_RAS_oncogene_family_pseudogene_1
LOC345571 
LOC100289569 
LOC100129233  hypothetical_LOC100129233
4   11502543       chr3  NT_022459.15  10100261-21602804    LOC401076 
VDAC1P7  roundabout_homolog_2_isoform_ROBO2b
MRPS17P3 
RPS12P6  roundabout_homolog_1_isoform_d
LOC100130821 
LOC100129557 
LOC100419178  1,4-alpha-glucan-branching_enzyme
RPL7AP23 
CYP51P1 
SRRM1P2 
LOC100130326  similar_to_MUF1_protein
LOC100422711  cell_adhesion_molecule_2_isoform_3
VGLL3  transcription_cofactor_vestigial-like_protein_3
LOC100289640 
CHMP2B  charged_multivesicular_body_protein_2b
POU1F1  pituitary-specific_positive_transcription_factor_1_isoform_beta
KRT8P25 
LOC100129005 
LOC643766 
5   10837886       chr4  NT_022778.16  1-10837887    LOC100507160  hypothetical_LOC100507160
LOC100421808 
RPL17P19 
RPS12P9  latrophilin-3_precursor
RPS15AP17 
RPL21P47 
LOC100289193 
LOC100131441 
LOC644534 
LOC644548 
LOC644578  hypothetical_protein_LOC644578
TECRL  trans-2,3-enoyl-CoA_reductase-like
LOC391657 
RPS6P5 
LOC401134  hypothetical_LOC401134
LOC100422019 
LOC100507063  hypothetical_LOC100507063
LOC100144602  hypothetical_LOC100144602
LOC728048 
MIR1269  microRNA:hsa-mir-1269
RPS23P3 
CENPC1  centromere_protein_C_1
STAP1  signal-transducing_adaptor_protein_1
UBA6  ubiquitin-like_modifier-activating_enzyme_6
LOC100419862  hypothetical_LOC550112
LOC100419046 
GNRHR  gonadotropin-releasing_hormone_receptor_isoform_2
TMPRSS11CP 
TMPRSS11D  transmembrane_protease_serine_11D
TMPRSS11A  transmembrane_protease_serine_11A_isoform_2
LOC644759  serine_protease_Desc4_pseudogene
GRINL1B 
LOC100420693  synaptotagmin_XIV-like
LOC100420694  ferritin,_light_polypeptide_pseudogene_10
TMPRSS11BNL  transmembrane_protease_serine_11B-like_protein
TMPRSS11B  transmembrane_protease_serine_11B
LOC100422188 
LOC100128725 
YTHDC1  YTH_domain-containing_protein_1_isoform_2
MT2P1 
TMPRSS11E  transmembrane_protease_serine_11E
UGT2B29P 
UGT2B17  UDP-glucuronosyltransferase_2B17_precursor
LOC100422402 
LOC100132651 
LOC728807 
UGT2B15  UDP-glucuronosyltransferase_2B15_precursor
LOC728811 
LOC100422026 
LOC100422189 
UGT2B10  UDP-glucuronosyltransferase_2B10_isoform_2
LOC100174950 
LOC100421000  UDP-glucuronosyltransferase_2A3_precursor
LOC100289568 
LOC642381 
UGT2B27P 
LOC100422020 
UGT2B26P 
UGT2B7  UDP-glucuronosyltransferase_2B7_precursor
LOC100127903 
LOC642474 
UGT2B11  UDP-glucuronosyltransferase_2B11_precursor
LOC100422021 
UGT2B28  UDP-glucuronosyltransferase_2B28_precursor
LOC100422022 
LOC642496 
LOC100422190 
UGT2B25P 
LOC100422191 
LOC100422029 
UGT2B24P 
UGT2B4  UDP-glucuronosyltransferase_2B4_precursor
LOC100422023 
LOC100422024 
UGT2A2  UDP-glucuronosyltransferase_2A2
SULT1B1  sulfotransferase_family_cytosolic_1B_member_1
6   10459953       chr18  NT_010966.14  15366768-25826721    FHOD3  FH1/FH2_domain-containing_protein_3
C18orf10  tubulin_polyglutamylase_complex_subunit_2
KIAA1328  hinderin
LOC100506821  hypothetical_LOC100506821
CELF4  CUGBP_Elav-like_family_member_4_isoform_4
LOC100506837  hypothetical_LOC100506837
MIR4318  microRNA_4318
RPL12P40 
LOC100506854  hypothetical_LOC100506854
RPL17P45 
KC6  keratoconus_gene_6
NPM1P1 
LOC100301521 
PIK3C3  phosphatidylinositol_3-kinase_catalytic_subunit_type_3
RIT2  GTP-binding_protein_Rit2
SYT4  synaptotagmin-4
IBTKP1 
KRT8P5 
MIR4319  microRNA_4319
SLC14A2  urea_transporter_2
SLC14A1  urea_transporter_1_isoform_2
SIGLEC15  sialic_acid-binding_Ig-like_lectin_15_precursor
KIAA1632  hypothetical_protein_LOC57724
LOC100422497  proline-serine-threonine_phosphatase-interacting_protein_2
ATP5A1  tRNA-Lys
HAUS1  HAUS_augmin-like_complex_subunit_1
C18orf25  hypothetical_protein_LOC147339_isoform_b
C18orf23  chromosome_18_open_reading_frame_23
LOXHD1  lipoxygenase_homology_domain-containing_protein_1_isoform_3
RPS21P6  alpha-2,8-sialyltransferase_8E
7   10276467       chrX  NT_011651.17  8688180-18964647    BA345E19.2  dachshund_homolog_2_isoform_c
KLHL4  kelch-like_protein_4_isoform_1
RPSAP15 
MRPS22P1 
LOC100129133 
CAPZA1P 
CPXCR1  CPX_chromosomal_region_candidate_gene_1_protein
LOC100421038 
SRIP2 
TGIF2LX  homeobox_protein_TGIF2LX
LOC100130134 
USP12PX 
RNF19BPX 
LOC100419789 
LOC100131981 
LOC100287033 
PABPC5  polyadenylate-binding_protein_5
LOC100132591 
KRT18P11  protocadherin-11_X-linked_isoform_a_precursor
RPL26P36 
LOC401602 
LOC100422576 
ST13P18 
LOC100131340 
RPL7P55 
NAP1L3  nucleosome_assembly_protein_1-like_3
FAM133A  hypothetical_protein_LOC286499
LOC643371 
PAICSP7 
CCNB1IP1P3 
MIR548M  microRNA_548m
CALM1P1 
LOC100129001 
LOC100128595 
RPS7P13 
LOC100420872 
LOC648927 
RPS29P28 
LOC643486  bromodomain,_testis-specific_pseudogene
8   10140829       chr1  NT_032977.9  46512427-56653256    LOC100418965  alpha-N-acetylgalactosaminide_alpha-2,6-sialyltransferase_3_isoform_2
TPI1P1 
ST6GALNAC5  alpha-N-acetylgalactosaminide_alpha-2,6-sialyltransferase_5
LOC256483  hypothetical_protein_LOC256483
LOC100421400  GPI-anchor_transamidase_precursor
AK5  adenylate_kinase_isoenzyme_5_isoform_2
ZZZ3  ZZ-type_zinc_finger-containing_protein_3
USP33  ubiquitin_carboxyl-terminal_hydrolase_33_isoform_3
LOC100131291 
CCDC55P1  axin_interactor,_dorsalization-associated_protein-like
C1orf118  chromosome_1_open_reading_frame_118
NEXN  nexilin_isoform_2
FUBP1  far_upstream_element-binding_protein_1
DNAJB4  dnaJ_homolog_subfamily_B_member_4
LOC100131495 
LOC100422537  PDZ_domain-containing_protein_GIPC2
LOC100132264 
MGC27382  hypothetical_MGC27382
PTGFR  prostaglandin_F2-alpha_receptor_isoform_b_precursor
IFI44L  interferon-induced_protein_44-like
IFI44  interferon-induced_protein_44
RPL23P3 
LOC652549 
ELTD1  EGF,_latrophilin_and_seven_transmembrane_domain-containing_protein_1_precursor
LOC729779 
ADH5P2 
LOC553139 
HMGB1P18 
LOC100129325 
LOC100129325 
RPL7P10 
RPL10AP4 
LOC729817 
LOC646555 
LOC646556 
RPS20P7 
ST13P20 
LPHN2  latrophilin-2_precursor
ARF4P5 
TTLL7  tubulin_polyglutamylase_TTLL7
PRKACB  cAMP-dependent_protein_kinase_catalytic_subunit_beta_isoform_1
LOC100128775 
SAMD13  sterile_alpha_motif_domain-containing_protein_13_isoform_2
UOX  urate_oxidase,_pseudogene
DNASE2B  deoxyribonuclease-2-beta_isoform_2
RPF1  ribosome_production_factor_1
GNG5  guanine_nucleotide-binding_protein_G(I)/G(S)/G(O)_subunit_gamma-5_precursor
LOC100505741  spermatogenesis-associated_protein_1-like
CTBS  di-N-acetylchitobiase_precursor
C1orf180  chromosome_1_open_reading_frame_180
SSX2IP  afadin-_and_alpha-actinin-binding_protein_isoform_2
LPAR3  lysophosphatidic_acid_receptor_3
MCOLN2  mucolipin-2
MCOLN3  mucolipin-3
WDR63  WD_repeat-containing_protein_63
SYDE2  rho_GTPase-activating_protein_SYDE2
C1orf52  hypothetical_protein_LOC148423
BCL10  B-cell_lymphoma/leukemia_10
LOC646626  hypothetical_protein_LOC646626
DDAH1  N(G),N(G)-dimethylarginine_dimethylaminohydrolase_1_isoform_2
CYR61  protein_CYR61_precursor
ZNHIT6  box_C/D_snoRNA_protein_1_isoform_2
COL24A1  collagen_alpha-1(XXIV)_chain_precursor



Posfai@neb.com
May 11, 2011