基于Jaccard的移动终端自动识别并行算法及其MapReduce实现(英文)

来源 :中国通信 | 被引量 : 0次 | 上传用户:xiuxiumumu
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
The ability of accurate and scalable mobile device recognition is critically important for mobile network operators and ISPs to understand their customers’ behaviours and enhance their user experience.In this paper,we propose a novel method for mobile device model recognition by using statistical information derived from large amounts of mobile network traffic data.Specifically,we create a Jaccardbased coefficient measure method to identify a proper keyword representing each mobile device model from massive unstructured textual HTTP access logs.To handle the large amount of traffic data generated from large mobile networks,this method is designed as a set of parallel algorithms,and is implemented through the MapReduce framework which is a distributed parallel programming model with proven low-cost and high-efficiency features.Evaluations using real data sets show that our method can accurately recognise mobile client models while meeting the scalability and producer-independency requirements of large mobile network operators.Results show that a 91.5% accuracy rate is achieved for recognising mobile client models from 2 billion records,which is dramatically higher than existing solutions. The ability of accurate and scalable mobile device recognition is critically important for mobile network operators and ISPs to understand their customers’ behaviours and enhance their user experience. In this paper, we propose a novel method for mobile device model recognition by using statistical information derived from large amounts of mobile network traffic data. Specifically, we create a Jaccardbased coefficient measure method to identify a proper keyword representing each mobile device model from massive unstructured textual HTTP access logs. To handle the large amount of traffic data data from large mobile networks, this method is designed as a set of parallel algorithms, and is implemented through the MapReduce framework which is a distributed parallel programming model with proven low-cost and high-efficiency features. Evaluation using real data sets show that our method can be accurate recognizer mobile client models while meeting the scalability and producer-independency requir ements of large mobile network operators. Results show that a 91.5% accuracy rate is achieved for recognizing mobile client models from 2 billion records, which is substantially higher than existing solutions.
其他文献
美国物理化学研究公司报导的PCRC—55型变换器,是根据阻抗变化来感应相对湿度的装置。这个微小的传感器(1/4″×1/2″×1/16″)把导电的表面层和非导电基板形成一体,其湿度
当前判断激光管的工作寿命要靠实际点燃来实验,这对管子的检验、改革、研究极端不利。因此,探索一种较可靠的快速推算其寿命的方法,是极待解决的课题。激光管的失效原因是多
1.1990年11月11日,我第一次踏上中国的土地。宛如一见钟情,从此以后,这片土地就一直深深地吸引着我,激动着我,使我受益匪浅。在此期间,我重新找到了许多在欧洲早已沦丧的价值
快速调节器是根据时间最佳控制原理做成的一种新型工业控制仪表。由于快速控制本身是一种非线性控制规律,因此它的特性不能用传递函数的方法来讨论,但可以在偏差及其变化速
构。一、问题的提出两相交流伺服电机在自动控制系统中广泛地作为执行元件或用它去控制其它的执行机图1是采用伺服电机的模拟调节系统的方框图。 Structure. First, the pr
This paper investigates the uplink throughput of Cognitive Radio Cellular Networks(CRCNs).As oppose to traditional performance evaluation schemes which mainly a
1.概述 首钢初轧厂均热炉共有24个炉坑,其中有12个炉坑使用仪表自动调节系统,其余12个圹坑目前还 1. Overview Shougang first rolling mill soaking furnace a total of 2
目的探讨模拟失重大鼠心室肌自噬相关基因Beclin-1和微管相关蛋白轻链3(LC3)的表达变化。方法雄性SD大鼠12只,随机分为对照组和模拟失重组。模拟失重4周后取心室肌组织,采用R
本文讨论了 Z—80微型计算机并行接口(PIO)与多字节模/数转换器输出之间通道的设计问题,并给出了实现的线路和用 Z—80汇编语言编写的数据输入程序。这种线路适合在以 Z—80
 2000年,我国银行间同业拆借市场和债券市场继续稳步发展,交易量迅猛增加,市场基础建设和管理工作得到进一步加强,为货币政策的有效实施创造了良好的条件。   一、 2000年货币