Verbumculus and the Discovery of Unusual Words

来源 :计算机科学技术学报 | 被引量 : 0次 | 上传用户:zhangfegnlin
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Measures relating word frequencies and expectations have been constantly of interest in Bioinfor-matics studies. With sequence data becoming massively available, exhaustive enumeration of such measures have become conceivable, and yet pose significant computational burden even when limited to words of bounded max-imum length. In addition, the display of the huge tables possibly resulting from these counts poses practical problems of visualization and inference. VERBUMCULUS is a suite of software tools for the efficient and fast detection of over- or under-represented words in nucleotide sequences. The inner core of VERBUMCULUS rests on subtly interwoven properties of statistics,pattern matching and combinatorics on words, that enable one to limit drastically and a priori the set of over-or under-represented candidate words of all lengths in a given sequence, thereby rendering it more feasible both to detect and visualize such words in a fast and practically useful way. This paper is devoted to the description of the facility at the outset and to report experimental results, ranging from simulations on synthetic data to the discovery of regulatory elements on the upstream regions of a set of genes of the yeast.The software VERBUMCULUS is accessible at http://www. cs. ucr. edu/ stelo/Verbumculus/or http://wwwdbl.dei. unipd. it/Verbumculus/
其他文献
A simple route for the preparation of lipo-alkaloid is presented. When aconitine or one of its analogues is heated with a fatty acid for 20 min at 100 ℃ in wat
Metalloporphyrin compounds have been extensively studied in many functional chemistry fields, such as photo-physics and liquid crystal quality studies[1-5]. But
From the ethanol extract of the whole plant of Boschniakia himalaica Hook. f. et. Thoms, a new and two known lignans have been isolated and identified as 7-meth
The crystal structure of the title compound, 2-isobutyl-6-(2',4'-dichlorophenyl)- imidazo[2,1-b]-1,3,4-thiadiazole (C14H13Cl2N3S, Mr = 326.23), has been synthes
The effects of Mo, Mn and Zr transitional metals on the catalytic performance of Ru/sepiolite for CO2 methanation were investigated. The results indicated that
We extend the approach of solving master equations for density matrices by projecting it onto the thermal entangled state representation (Hong-Yi Fan and Jun-Hu
The reaction of MoO2(acac)2 with 2-amino-6-methyl-pyridine (amp) in the mixed solvent of DMF (N,N-dimethylformamide) and water affords the title complex [H-amp]
The phase diagram of the quaternary system of sodium dodecyl trioxyethylene sulfate(SDES)/n-butanol/n-octane/water was obtained at (30.0±0.1) ℃. There exists
The ultrathin multilayer films of rare-earth-containing polyoxometalate cluster K17[Eu(P2Mo17O61)2](EuPMo) and poly(allylamine hydrochloride)(PAH) have been pre
In the present work, fine barium ferrite powder has been synthesized through a one-step hydrothermal process in an autoclave at [OH^-]/[Cl^-] ratio of 2:1 in th