New Releases 2012 (as of 2020-01-07)
Herderstraße 10, 10625 Berlin, Tel.: 030 315 714 16, Fax: 030 315 714 14, info@buchspektrum.de
André Reckhemke
The Construction of String Similarity Predicates
Application-specific and Index-supported String Similarity Predicates - Fundamentals and Design of Similarity Queries
2012 edition. 88 pp. 220 mm
Publisher/Year: AV AKADEMIKERVERLAG 2012
ISBN: 3-639-43969-4 (3639439694) / 3-8364-6638-4 (3836466384)
New ISBN: 978-3-639-43969-4 (9783639439694) / 978-3-8364-6638-7 (9783836466387)
Revision with unchanged content. In an age of globalisation, access to useful information is becoming increasingly important. The expansion of the Internet produces volumes of data similar to those in genetic engineering, frequently stored in text files. One of the most relevant points of intersection between the two fields is the use of approximate string matching on large text data. The Internet faces the challenge of not only keeping request times short but also finding more context-relevant information. Further work in this field therefore has to take into account that documents can contain spelling mistakes or abbreviated words. Other terms are replaced by their acronyms, or are less important and can be ignored. All of these tasks fall within the field of computational linguistics. This master's thesis shows step by step the tokenising of real text, the homogenisation of words, and their storage in a specific index structure for subsequent approximate string matching, taking secondary storage into account. A prototype programmed in Java completes the work.
André Reckhemke, born 30 April 1973 in Braunschweig. Education: Dipl. Informatiker (FH) and Master of Science.