IJIRST (International Journal for Innovative Research in Science & Technology)ISSN (online) : 2349-6010

 International Journal for Innovative Research in Science & Technology

A Survey on Semantic Focused Web Crawler for Information Discovery Using Data Mining Technique


Print Email Cite
International Journal for Innovative Research in Science & Technology
Volume 1 Issue - 7
Year of Publication : 2014
Authors : Ruchika Patel ; Pooja Bhatt

BibTeX:

@article{IJIRSTV1I7061,
     title={A Survey on Semantic Focused Web Crawler for Information Discovery Using Data Mining Technique},
     author={Ruchika Patel and Pooja Bhatt},
     journal={International Journal for Innovative Research in Science & Technology},
     volume={1},
     number={7},
     pages={168--170},
     year={},
     url={http://www.ijirst.org/articles/IJIRSTV1I7061.pdf},
     publisher={IJIRST (International Journal for Innovative Research in Science & Technology)},
}



Abstract:

Data mining is the process of extraction of hidden predictive information from the huge databases. It is a new technology with great latent to help companies focus on the most important information in their data warehouses. Web mining is a data mining techniques which automatically discover information from web documents. The amount of data and its dynamicity makes it impossible to crawl the World Wide Web (WWW) completely. It's a challenge in front of crawlers to crawl only the relevant pages from this information explosion. Thus a focused crawler solves this issue of relevancy by focusing on web pages for some given topic or a set of topics. Nowadays finding meaningful information among the billions of information resources on the World Wide Web is a difficult task due to growing popularity of the Internet. This paper basically focuses on study of the various techniques of data mining for finding the relevant information from World Wide Web using web crawler.


Keywords:

Web Mining, Web Crawler, Focused Crawler, World Wide Web (WWW)


Download Article