A survey in semantic web technologies-inspired focused crawlers

Dong, H, Hussain, F and Chang, E 2008, 'A survey in semantic web technologies-inspired focused crawlers', in C.Shonirugen (ed.) Proceedings of the Third International Conference on Digital Information Management (ICDIM 2008), London, UK., 13-16 November 2008, pp. 934-936.


Document type: Conference Paper
Collection: Conference Papers

Title A survey in semantic web technologies-inspired focused crawlers
Author(s) Dong, H
Hussain, F
Chang, E
Year 2008
Conference name Third International Conference on Digital Information Management (ICDIM 2008)
Conference location London, UK.
Conference dates 13-16 November 2008
Proceedings title Proceedings of the Third International Conference on Digital Information Management (ICDIM 2008)
Editor(s) C.Shonirugen
Publisher IEEE Xplore
Place of publication Piscataway, N.J.
Start page 934
End page 936
Total pages 3
Abstract Crawlers are software which can traverse the internet and retrieve webpages by hyperlinks. In the face of the inundant spam websites, traditional web crawlers cannot function well to solve this problem. Semantic focused crawlers utilize semantic web technologies to analyze the semantics of hyperlinks and web documents. This paper briefly reviews the recent studies on one category of semantic focused crawlers - ontology-based focused crawlers, which are a series of crawlers that utilize ontologies to link the fetched web documents with the ontological concepts (topics). The purpose of this is to organize and categorize web documents, or filtering irrelevant webpages with regards to the topics. A brief comparison are made among these crawlers, from six perspectives - domain, working environment, special functions, technologies utilized, evaluation metrics and evaluation results. The conclusion with respect to this comparison is made in the final section.
Subjects Information Retrieval and Web Search
Keyword(s) Clustering algorithms Crawlers Ecosystems Information filtering Information filters Joining processes Ontologies Search engines Semantic Web Uniform resource locators
DOI - identifier 10.1109/ICDIM.2008.4746736
Copyright notice © 2008 IEEE
ISBN 9781424429172
Versions
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 0 times in Thomson Reuters Web of Science Article
Scopus Citation Count Cited 12 times in Scopus Article | Citations
Altmetric details:
Access Statistics: 150 Abstract Views  -  Detailed Statistics
Created: Wed, 30 Apr 2014, 09:52:00 EST by Catalyst Administrator
© 2014 RMIT Research Repository • Powered by Fez SoftwareContact us