A Method to Build a Super Small but Practically Accurate Language Model for Handheld Devices

来源 :计算机科学技术学报 | 被引量 : 0次 | 上传用户:zye284818093
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
In this paper, an important question, whether a small language model can be practically accurate enough, is raised. Afterwards, the purpose of a language model, the problems that a language model faces, and the factors that affect the performance of a language model,are analyzed. Finally, a novel method for language model compression is proposed, which makes the large language model usable for applications in handheld devices, such as mobiles, smart phones, personal digital assistants (PDAs), and handheld personal computers (HPCs). In the proposed language model compression method, three aspects are included. First, the language model parameters are analyzed and a criterion based on the importance measure of n-grams is used to determine which n-grams should be kept and which removed. Second, a piecewise linear warping method is proposed to be used to compress the uni-gram count values in the full language model. And third, a rank-based quantization method is adopted to quantize the bi-gram probability values. Experiments show that by using this compression method the language model can be reduced dramatically to only about 1M bytes while the performance almost does not decrease. This provides good evidence that a language model compressed by means of a well-designed compression technique is practically accurate enough, and it makes the language model usable in handheld devices.
其他文献
The microstructure of 40Cr steel sample and its surface is ultra-fined through saltbath cyclic quenching and high frequency hardening, then the superplasticity
激素替代疗法可明显缓解卵巢功能衰退所致的一系列症状,但其同时作用于女性生殖系统,与妇科恶性肿瘤发生相关.国内外文献就此报道的结果差异很大,本文就激素替代疗法与妇科恶
Based on the characteristic equation for power-law fluid and the Prandtl boundary layer equation, using the similarity method similar to that of Newtonian fluid
The Ti-48Al alloy was pack siliconized with 15%Si+85%Al2 O3. The microstructure of the siliconized coating on the TiAl-based alloy was analyzed and its effect o
A novel method, named critical-network-based (CNB), for timing optimization in global routing is presented in this paper. The essence of this method is differen
1992年 3月至 2 0 0 0年 3月我们共收治有严重骨壁破坏的上颌窦出血性坏死性息肉 2 8例 ,现报告如下。1 资料与方法1.1 临床资料  2 8例中男 17例 ,女 11例 ,2 1~ 74岁 ,平
Plant structure, representing the physical link among different organs, includes many similar substructures. In this paper, a new method is presented to constru
A review is given of grafting-from methods using living polymerizations, with emphasis on grafting from polymer microspheres using ATRP.
The nondendritic semi-solid slurry preparation of 60Si2Mn spring steel is studied in this paper. The experiments show that when stirred for 2 minutes under the
Power is an important design constraint in embedded computing systems. To meet the power constraint, microarchitecture and hardware designed to achieve high per