Title :
|
Document structure similarity methods: a review
|
Author :
|
Deepika, Ashutosh Dixit
|
Conference :
|
National Conference on Science in Media SIM 2012 (December 3-4, 2012) Organized by YMCA University, Faridabad (India)
|
Keywords :
|
C Search Engines, Web Crawler, Document
Structure
|
Abstract :
|
The primary goal of Search Engines is to provide
user information relevant to its query. For this purpose a web
crawler is used which is a part of search engine and responsible
for fetching data. The crawler traverses the web and provides
pages to the search engines. Generally crawling is based on
content but it is observed that structure of a page plays an
important role in getting more relevant data .This paper reviews
some methods given by various researchers in which crawling is
based on structure of a page rather than content.
|
Download Paper :
|
|