Keyword Extraction from Scientific Research Projects Based on SRP-TF-IDF

来源 :电子学报(英文版) | 被引量 : 0次 | 上传用户:ktcargo147
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Keyword extraction by Term frequency-Inverse document frequency (TF-IDF) is used for text information retrieval and mining in many domains,such as news text,social contact text,and medical text.However,keyword extraction in special domains still needs to be improved and optimized,particularly in the scientific research field.The traditional TF-IDF algorithm considers only the word frequency in documents,but not the domain characteristics.Therefore,we propose the Scientific research project TF-IDF (SRP-TF-IDF) model,which combines TF-IDF with a weight balance algorithm designed to recalculate candidate keywords.We have implemented the SRP-TF-IDF model and verified that our method has better precision,recall,and F1 score than the traditional TF-IDF and TextRank methods.In addition,we investigated the parameter of our weight balance algorithm to find an optimal value for keyword extraction from scientific research projects.
其他文献
针对轴承套圈小尺寸密封槽存在无法使用精密仪器直接测量的问题,提出使用复制胶泥结合影像测量仪对密封槽尺寸进行测量的方法。根据实际使用及测量数据表明该方法无需考虑密封槽尺寸及结构的影响,能够准确测量密封槽尺寸,提高加工质量及效率。
为了明确稠油在火烧油层高温氧化过程中的内在机理,应用热重、微商热重、差示扫描量热方法获取热分析基础数据,并应用Kissinger-Akahira-Sunose(KAS)等转化率法、Kissinger微分法和Coats-Redfern积分法进行高温氧化动力学参数分析。实验结果表明,应用KAS等转化率法、Kissinger微分法和Coats-Redfern积分法求取的稠油高温氧化反应动力学参数均在正常范围内,线性相关系数均在0.95以上。在高温反应阶段,油焦与氧气充分接触后燃烧以三维扩散为主。氧气浓度梯度是影
Increasing pulses Coherent processing interval (CPI) can effectively improve the location parameters estimation performance in passive localization.However,for
Digit information has been used in many areas and has been widely spread in the Internet era because of its convenience.However,many ill-disposed attackers,such
针对两列滚动体尺寸及接触角都不同的双列向心球面滚子轴承(简称:非对称双列向心球面滚子轴承)的基本额定动载荷的理论计算方法,分析了单列及多列线接触向心滚子轴承额定动载荷的计算原理,并以240/600为例,给出了非对称双列向心球面滚子轴承基本额定动载荷的计算求解过程。
The three-party authenticated key agree-ment protocol is a significant cryptographic mechanism for secure communication,which encourages two entities to authent
In the field of robust audio watermarking,how to seek a good trade-off between robustness and imperceptibility is challenging.The existing studies use the same
为研究最优的、低成本圆弧型的汽车圆锥滚子轴承的滚子母线修形方案,本文应用MASTA作为计算分析工具,计算出不同圆弧、不同圆弧-直线的母线修形方案下的轴承中各粒滚子的应力;并借助正交试验法找出纯圆弧修形方案和圆弧-直线修形方案中各个几何参数对于轴承滚子最大应力的影响机制;最后,根据正交试验的结果找出圆弧型滚子母线和圆弧-直线型滚子母线的最优几何参数组合。
随着移动通信系统的高速建设和发展,射频信号的包络信号带宽越来越宽,功率峰均比越来越大,恒定电压供电的基站功放效率越来越低。针对该问题,论文设计了一种电源调制电路。该电路输出的电压根据峰值检测后的电压值对功率放大器的供电端进行实时控制,保持功率放大器工作状态一直在最高效率点上,从而提高功率放大器的效率。实验表明:在原电路负载不变的情况下,设计的电源调制电路的工作效率提高了30%以上,为使用包络跟踪技术提高功率放大器的效率提供了理论支持与实验支撑。
Density peak clustering (DPC) can identify cluster centers quickly,without any prior knowledge.It is supposed that the cluster centers have a high density and l