Web crawler method and system based on improved PageRank
A web crawler and webpage technology applied in the Internet field. It addresses problems such as low crawler efficiency and achieves fast data collection, improved crawler efficiency, and strong pertinence.
Embodiment Construction
[0037] To make the technical means, inventive features, objectives, and effects of the present invention easy to understand, the invention is further described below with reference to the specific drawings.
[0038] In the prior art, although a web crawler can automatically extract webpage information, some pages repeat keywords to inflate their search ranking. To address this, the technical concept of the present invention is to use the PageRank algorithm in the web crawler: a relationship matrix is generated from the access relationships between crawled webpages, an initial probability matrix is generated according to the number of webpages, the webpage weights are then calculated iteratively, and the converged results are output in descending order. This method solves the problem of pages in the web crawler repeating keywords to improve their search ranking.
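The steps in paragraph [0038] (relationship matrix, initial probability matrix, iterative weight calculation, descending output) can be illustrated with a minimal Python sketch of standard PageRank. This is not the patent's improved variant, whose details are not given in this text. The function name `pagerank`, the damping factor of 0.85, the convergence tolerance, the uniform handling of pages without outgoing links, and the reading of "access relationship" as a hyperlink from one crawled page to another are all assumptions made for illustration.

```python
def pagerank(links, damping=0.85, tol=1e-8, max_iter=200):
    """Rank pages from a dict {page: [pages it links to]} (hypothetical input shape)."""
    # Collect every page that appears as a source or as a link target.
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    index = {page: i for i, page in enumerate(pages)}

    # Relationship matrix: m[j][i] is the probability of moving from page i to page j.
    m = [[0.0] * n for _ in range(n)]
    for src in pages:
        i = index[src]
        targets = set(links.get(src, ())) - {src}  # ignore self-links
        if targets:
            for t in targets:
                m[index[t]][i] = 1.0 / len(targets)
        else:
            # Assumption: a page with no outgoing links spreads its weight evenly.
            for j in range(n):
                m[j][i] = 1.0 / n

    # Initial probabilities: every page starts with weight 1/n.
    rank = [1.0 / n] * n

    # Iterate until the weights stop changing (or max_iter is reached).
    for _ in range(max_iter):
        new_rank = [
            (1.0 - damping) / n
            + damping * sum(m[j][i] * rank[i] for i in range(n))
            for j in range(n)
        ]
        delta = sum(abs(a - b) for a, b in zip(new_rank, rank))
        rank = new_rank
        if delta < tol:
            break

    # Output the converged weights in descending order.
    return sorted(zip(pages, rank), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Hypothetical crawl result: four pages and the links between them.
    example_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
    for page, weight in pagerank(example_links):
        print(page, round(weight, 4))
```

Because the weight of a page depends on which pages link to it rather than on the page's own text, repeating keywords on a page does not raise its weight in this scheme.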
[0039] Specifi