【摘 要】
:
Measures relating word frequencies and expectations have been constantly of interest in Bioinfor-matics studies. With sequence data becoming massively available
【机 构】
:
Dipartimento di Ingegneria dell' Informazione,Department of Computer Sciences,Celera Genomics,Depar
【基金项目】
:
the Italian Ministry of University and Re-search, the Research Program of the University of Padova and Bourns College of Engineering, University of California,Riverside;Italian Ministry of University
论文部分内容阅读
Measures relating word frequencies and expectations have been constantly of interest in Bioinfor-matics studies. With sequence data becoming massively available, exhaustive enumeration of such measures have become conceivable, and yet pose significant computational burden even when limited to words of bounded max-imum length. In addition, the display of the huge tables possibly resulting from these counts poses practical problems of visualization and inference. VERBUMCULUS is a suite of software tools for the efficient and fast detection of over- or under-represented words in nucleotide sequences. The inner core of VERBUMCULUS rests on subtly interwoven properties of statistics,pattern matching and combinatorics on words, that enable one to limit drastically and a priori the set of over-or under-represented candidate words of all lengths in a given sequence, thereby rendering it more feasible both to detect and visualize such words in a fast and practically useful way. This paper is devoted to the description of the facility at the outset and to report experimental results, ranging from simulations on synthetic data to the discovery of regulatory elements on the upstream regions of a set of genes of the yeast.The software VERBUMCULUS is accessible at http://www. cs. ucr. edu/ stelo/Verbumculus/or http://wwwdbl.dei. unipd. it/Verbumculus/
其他文献
A simple route for the preparation of lipo-alkaloid is presented. When aconitine or one of its analogues is heated with a fatty acid for 20 min at 100 ℃ in wat
Metalloporphyrin compounds have been extensively studied in many functional chemistry fields, such as photo-physics and liquid crystal quality studies[1-5]. But
From the ethanol extract of the whole plant of Boschniakia himalaica Hook. f. et. Thoms, a new and two known lignans have been isolated and identified as 7-meth
The crystal structure of the title compound, 2-isobutyl-6-(2',4'-dichlorophenyl)- imidazo[2,1-b]-1,3,4-thiadiazole (C14H13Cl2N3S, Mr = 326.23), has been synthes
The effects of Mo, Mn and Zr transitional metals on the catalytic performance of Ru/sepiolite for CO2 methanation were investigated. The results indicated that
We extend the approach of solving master equations for density matrices by projecting it onto the thermal entangled state representation (Hong-Yi Fan and Jun-Hu
The reaction of MoO2(acac)2 with 2-amino-6-methyl-pyridine (amp) in the mixed solvent of DMF (N,N-dimethylformamide) and water affords the title complex [H-amp]
The phase diagram of the quaternary system of sodium dodecyl trioxyethylene sulfate(SDES)/n-butanol/n-octane/water was obtained at (30.0±0.1) ℃. There exists
The ultrathin multilayer films of rare-earth-containing polyoxometalate cluster K17[Eu(P2Mo17O61)2](EuPMo) and poly(allylamine hydrochloride)(PAH) have been pre
In the present work, fine barium ferrite powder has been synthesized through a one-step hydrothermal process in an autoclave at [OH^-]/[Cl^-] ratio of 2:1 in th