Neuerscheinungen 2011Stand: 2020-01-07 |
Schnellsuche
ISBN/Stichwort/Autor
|
Herderstraße 10 10625 Berlin Tel.: 030 315 714 16 Fax 030 315 714 14 info@buchspektrum.de |
Ritu Shandilya
A Domain Specific Information Retrieval
A Domain Specific Indexing Technique for Hidden Web Documents
2011. 112 S.
Verlag/Jahr: VDM VERLAG DR. MÜLLER 2011
ISBN: 3-639-37356-1 (3639373561)
Neue ISBN: 978-3-639-37356-1 (9783639373561)
Preis und Lieferzeit: Bitte klicken
The web creates new challenges for information retrieval as the amount of information on the web is growing rapidly. One of the challenges is to crawl the information hidden behind a search form, as a tremendous amount of high quality content is hidden behind the search forms. This high quality information can be retrieved by hidden web crawler using a Web query front-end to the database with standard HTML form elements. The documents retrieved by a hidden web crawler are more relevant, as these documents are accessible only through dynamically generated pages, delivered in response to a query. To index these documents efficiently, the search engine requires new indexing technique that optimizes speed and performance for finding relevant documents for a search query. In this paper, a new technique to index hidden web crawled documents is being proposed that not only indexes the documents more efficiently but also gives a classification of documents. In the technique, attributes of a query interface and their value sets are employed to index the documents.
She received the M.Tech in Computer Engineering from Shobhit University, Meerut, India and MCA from U.P.Technical University, Lucknow, India.Presently, she is working as Assistant Professor in School of Computer Engineering and Information Technology in Shobhit University. Her areas of interests are Search Engines, Crawlers and Data Mining.