Distribution of restriction sites in the human genome

Enzyme:  SpeI
Specificity:  ACTAGT
Number of sites:  393191
Mean distance between sites:  7277 base pairs
Standard deviation:  8275 base pairs
Site density:  137.4 per megabase
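
The summary figures above are linked by simple arithmetic: the site density per megabase is 1,000,000 divided by the mean distance between sites (1,000,000 / 7277 ≈ 137.4). The sketch below is not the program that produced this report. It is a minimal illustration, on a made-up toy sequence, of how such statistics could be computed for the SpeI site ACTAGT. The helper names (`find_sites`, `spacing_stats`, `density_per_megabase`) are hypothetical, and the use of the population standard deviation is an assumption, since the report does not say which estimator it uses. Because ACTAGT is its own reverse complement, scanning one strand is enough.

```python
from statistics import mean, pstdev

SITE = "ACTAGT"  # SpeI recognition sequence, as listed in the report


def find_sites(seq, site=SITE):
    """Return 0-based start positions of every occurrence of `site` in `seq`."""
    seq = seq.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start)
        # Step forward by one so overlapping matches would also be found.
        start = seq.find(site, start + 1)
    return positions


def spacing_stats(positions):
    """Mean and population standard deviation of the gaps between neighbouring sites.

    Assumption: population (not sample) standard deviation.
    """
    gaps = [b - a for a, b in zip(positions, positions[1:])]
    return mean(gaps), pstdev(gaps)


def density_per_megabase(mean_distance_bp):
    """Sites per 1,000,000 bp implied by a mean site-to-site distance."""
    return 1_000_000 / mean_distance_bp


# Toy sequence (invented for illustration): sites start at 0, 10 and 30.
toy = "ACTAGT" + "G" * 4 + "ACTAGT" + "C" * 14 + "ACTAGT"
sites = find_sites(toy)
mean_gap, sd_gap = spacing_stats(sites)
print(sites, mean_gap, sd_gap)

# Consistency check on the figures reported above.
print(round(density_per_megabase(7277), 1))
```

On a real genome the same scan would be run per chromosome or scaffold, with gaps measured only between sites on the same sequence.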


Distribution of closely spaced sites

Distribution of sites within a distance of 7 standard deviations (STD)


Longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeat content  Gene content
1   503029  chr15  NT_037852.6  1385566-1888595    2.25 % in   31 repeats    0.00 % in 0 genes
2   414587  chr6  NT_167244.1  2353278-2767865    1.89 % in   31 repeats    0.00 % in 0 genes
3   311477  chr16  NT_010393.16  476195-787672    21.65 % in   355 repeats    79.92 % in 31 genes
4   255828  chr6  NT_167244.1  2009129-2264957    2.99 % in   35 repeats    2.53 % in 3 genes
5   223409  chr6  NT_167244.1  4381338-4604747    4.49 % in   23 repeats    1.03 % in 1 gene
6   221770  chrY  NT_011875.12  8429882-8651652    77.38 % in   13 repeats    0.00 % in 0 genes
7   213243  chr6  NT_167249.1  2092240-2305483    9.71 % in   90 repeats    0.00 % in 0 genes
8   207652  chr6  NT_167244.1  3772075-3979727    5.45 % in   51 repeats    6.29 % in 1 gene
9   203654  chr19  NT_011255.14  469641-673295    40.42 % in   426 repeats    0.00 % in 0 genes
10   193721  chr1  NT_004350.19  1983631-2177352    18.63 % in   120 repeats    0.00 % in 0 genes
11   193133  chr14  NT_026437.12  86092808-86285941    29.21 % in   265 repeats    0.00 % in 0 genes
12   189943  chr9  NT_008470.19  61474505-61664448    46.49 % in   463 repeats    0.00 % in 0 genes
13   188065  chr7  NT_007819.17  1501464-1689529    39.85 % in   365 repeats    0.00 % in 0 genes
14   186130  chr6  NT_167247.1  4415150-4601280    3.03 % in   27 repeats    0.00 % in 0 genes
15   183595  chr6  NT_167247.1  1544626-1728221    5.19 % in   49 repeats    0.00 % in 0 genes
16   181116  chr1  NT_004350.19  1802515-1983631    12.80 % in   148 repeats    0.00 % in 0 genes
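
In every row of the table above, the segment length equals the difference between the two coordinates (for row 1, 1888595 - 1385566 = 503029). The following is a minimal sketch of parsing one row and checking that relation. The `Segment` record and `parse_row` helper are hypothetical names introduced for illustration, and the regular expression assumes the whitespace-separated layout shown here rather than any official export format.

```python
import re
from dataclasses import dataclass


@dataclass
class Segment:
    rank: int
    length: int
    chrom: str
    scaffold: str
    start: int
    end: int
    repeat_pct: float
    n_repeats: int
    gene_pct: float
    n_genes: int


# One table row: rank, length, chromosome, scaffold, start-end,
# "<pct> % in <n> repeats", "<pct> % in <n> gene(s)".
ROW = re.compile(
    r"(\d+)\s+(\d+)\s+(\S+)\s+(\S+)\s+(\d+)-(\d+)\s+"
    r"([\d.]+)\s*%\s+in\s+(\d+)\s+repeats\s+"
    r"([\d.]+)\s*%\s+in\s+(\d+)\s+genes?"
)


def parse_row(line):
    """Parse one row of the 'Longest uncut segments' table into a Segment."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    rank, length, chrom, scaffold, start, end, rpct, nrep, gpct, ngenes = m.groups()
    return Segment(
        int(rank), int(length), chrom, scaffold, int(start), int(end),
        float(rpct), int(nrep), float(gpct), int(ngenes),
    )


# Rows 1 and 3 copied from the table above.
row1 = "1   503029  chr15  NT_037852.6  1385566-1888595    2.25 % in   31 repeats    0.00 % in 0 genes"
row3 = "3   311477  chr16  NT_010393.16  476195-787672    21.65 % in   355 repeats    79.92 % in 31 genes"

seg1 = parse_row(row1)
seg3 = parse_row(row3)

# The reported length should equal end - start.
for seg in (seg1, seg3):
    print(seg.rank, seg.chrom, seg.length == seg.end - seg.start)
```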


Repeats in longest uncut segments
#  Length (bp)  Chr  Scaffold  Coordinates  Repeats: Total  Distinct  Most frequent  Second  Third
1  503029  chr15  NT_037852.6  1385566-1888595    31  22       L1MDa (6)  Tigger2 (3)  L2a (2)
2  414587  chr6  NT_167244.1  2353278-2767865    31  21       L1ME4a (3)  AluY (3)  AluJb (3)
3  311477  chr16  NT_010393.16  476195-787672    355  89       AluSx (39)  AluY (31)  GC_rich (27)
4  255828  chr6  NT_167244.1  2009129-2264957    35  24       MIRb (3)  L1MEe (3)  AluSx (3)
5  223409  chr6  NT_167244.1  4381338-4604747    23  15       MER57-int (3)  HERVH-int (3)  AluSx (3)
6  221770  chrY  NT_011875.12  8429882-8651652    13  2       LTR12B (12)  LTR12D (1)
7  213243  chr6  NT_167249.1  2092240-2305483    90  44       AluSx (10)  AluJb (9)  AluJo (5)
8  207652  chr6  NT_167244.1  3772075-3979727    51  33       L2a (8)  AT_rich (4)  MIR (3)
9  203654  chr19  NT_011255.14  469641-673295    426  81       AluSx (66)  AluY (33)  AluSg (22)
10  193721  chr1  NT_004350.19  1983631-2177352    120  59       AluSx (9)  L1MEg (8)  L1MEf (7) 
11  193133  chr14  NT_026437.12  86092808-86285941    265  98       AluSx (27)  GC_rich (11)  AluY (9) 
12  189943  chr9  NT_008470.19  61474505-61664448    463  94       AluSx (63)  MIRb (44)  MIR (35) 
13  188065  chr7  NT_007819.17  1501464-1689529    365  88       AluSx (41)  AluJo (36)  AluJb (20) 
14  186130  chr6  NT_167247.1  4415150-4601280    27  22       AluSx (3)  MLT1J (2)  MIRc (2) 
15  183595  chr6  NT_167247.1  1544626-1728221    49  29       MIR3 (5)  MIRc (4)  L2c (4) 
16  181116  chr1  NT_004350.19  1802515-1983631    148  53       MIRb (14)  MIR (11)  AluSx (9) 


Genes in longest uncut segments
Segment  Length (bp)  Chr  Scaffold  Coordinates
    Gene symbol  Gene function
3   311477       chr16  NT_010393.16  476195-787672
    RAB11FIP3  rab11_family-interacting_protein_3_isoform_2
    NCRNA00235  non-protein_coding_RNA_235
    SOLH  calpain-15
    C16orf11  hypothetical_protein_LOC146325
    NHLRC4  NHL-repeat-containing_protein_4
    PIGQ  phosphatidylinositol_N-acetylglucosaminyltransferase_subunit_Q_isoform_2
    LOC100507068  ras-related_protein_Rab-40C_isoform_b
    WFIKKN1  WAP,_kazal,_immunoglobulin,_kunitz_and_NTR_domain-containing_protein_1_precursor
    C16orf13  hypothetical_protein_LOC84326_isoform_d
    tRNA-Gly
    LOC100287175  hypothetical_LOC100287175
    LOC100130285  hypothetical_LOC100130285
    FAM195A  hypothetical_protein_LOC84331
    LOC100507146  hypothetical_LOC100507146
    LOC100507167  hypothetical_LOC100507167
    RHOT2  mitochondrial_Rho_GTPase_2
    RHBDL1  rhomboid-related_protein_1
    STUB1  E3_ubiquitin-protein_ligase_CHIP
    JMJD8  jmjC_domain-containing_protein_8
    WDR24  WD_repeat-containing_protein_24
    FBXL16  F-box/LRR-repeat_protein_16
    LOC100287301  hypothetical_LOC100287301
    METRN  meteorin_precursor
    FAM173A  hypothetical_protein_LOC65990
    CCDC78  coiled-coil_domain-containing_protein_78
    HAGHL  hydroxyacylglutathione_hydrolase-like_protein_isoform_1
    NARFL  cytosolic_Fe-S_cluster_assembly_factor_NARFL
    MSLN  mesothelin_isoform_2_preproprotein
    MIR662  microRNA:hsa-mir-662
    RPUSD1  RNA_pseudouridylate_synthase_domain-containing_protein_1
    CHTF18  chromosome_transmission_fidelity_protein_18_homolog
4   255828       chr6  NT_167244.1  2009129-2264957
    FLOT1  flotillin-1
    DDR1  epithelial_discoidin_domain-containing_receptor_1_isoform_DDR1c
    MUC21  mucin-21_precursor
5   223409       chr6  NT_167244.1  4381338-4604747
    HLA-DPB2  major_histocompatibility_complex,_class_II,_DP_beta_2_(pseudogene)
8   207652       chr6  NT_167244.1  3772075-3979727
    HLA-DRB3  major_histocompatibility_complex,_class_II,_DR_beta_3_precursor



Posfai@neb.com
May 11, 2011