论文部分内容阅读
网页本身的复杂性和不确定性在一定程度上限制了系统自动主题信息查找的功能。而主要信息查找提取可以帮助我们解决这一难题。通过从网页中删除多余信息,找到用户要的主题信息数据,这样可以在很大程度上提高用户查询信息的准确性和查询的效率,也为网页数据信息的提取奠定了坚实的基础。网页主题信息的查询提取在实际生活和工作中有很大的研究价值。
The complexity and uncertainty of the web page itself to a certain extent, limited the system automatically thematic information search function. The main information extraction can help us solve this problem. By removing redundant information from the web page and finding the subject information data that the user wants, the accuracy of the user inquiry information and the query efficiency can be greatly improved, and the solid foundation for the web data information extraction can also be established. Query extraction of thematic information on the web has great research value in real life and work.