buchspektrum Internet Bookshop

New Releases 2010

As of: 2020-01-07
Herderstraße 10
10625 Berlin
Tel.: 030 315 714 16
Fax 030 315 714 14
info@buchspektrum.de

Zdenek Ceska

Automatic Plagiarism Detection Based on Latent Semantic Analysis


Theory and Practice
2010. 128 pp. 220 mm
Publisher/Year: VDM Verlag Dr. Müller 2010
ISBN: 3-639-28207-8 (3639282078)
New ISBN: 978-3-639-28207-8 (9783639282078)



Plagiarism is a widespread problem that currently attracts considerable attention. The main objective of this work is to apply the Latent Semantic Analysis (LSA) framework to plagiarism detection in written text. This field faces various issues, which are discussed thoroughly. To infer the latent semantics of a given text, Singular Value Decomposition (SVD) is employed to carry out the large statistical computations involved. To cope with the large number of N-grams extracted from the text, feature selection and, subsequently, random indexing are applied. Moreover, this thesis examines the influence of text pre-processing on the accuracy of plagiarism detection, and it explores aspects of the multilingual environment. Various approaches in common use are discussed and compared with the newly proposed method.
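The description above mentions word N-grams and random indexing only in passing. As a rough illustration, and not the method presented in the book, the following Python sketch shows one common way random indexing can shrink an N-gram feature space before two documents are compared. All function names, the trigram size, the vector dimension of 1024, and the eight nonzero entries per index vector are assumptions chosen for this example. The feature selection and SVD steps of LSA are left out entirely.

```python
import hashlib
import math
import random


def word_ngrams(text, n=3):
    """Return the set of word N-grams of a lower-cased, whitespace-split text."""
    tokens = text.lower().split()
    return {" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}


def index_vector(feature, dim=1024, nonzeros=8):
    """Sparse ternary index vector for one feature, as {position: +1 or -1}.

    The generator is seeded from a hash of the feature, so the same N-gram
    always maps to the same index vector (an assumption of this sketch).
    """
    digest = hashlib.sha256(feature.encode("utf-8")).digest()
    rng = random.Random(int.from_bytes(digest[:8], "big"))
    positions = rng.sample(range(dim), nonzeros)
    return {pos: rng.choice((-1, 1)) for pos in positions}


def document_vector(text, n=3, dim=1024):
    """Sum the index vectors of all distinct N-grams found in the text."""
    vec = [0.0] * dim
    for gram in word_ngrams(text, n):
        for pos, sign in index_vector(gram, dim).items():
            vec[pos] += sign
    return vec


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


if __name__ == "__main__":
    source = "the quick brown fox jumps over the lazy dog near the river bank"
    suspect = "yesterday the quick brown fox jumps over the lazy dog again"
    unrelated = "singular value decomposition factorizes a matrix into three parts"

    src = document_vector(source)
    print("source vs. suspect:  ", cosine(src, document_vector(suspect)))
    print("source vs. unrelated:", cosine(src, document_vector(unrelated)))
```

In this sketch, documents that share many trigrams receive a clearly higher similarity than documents that share none, while every document is represented by a fixed-length vector regardless of how many distinct N-grams the collection contains.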
Zdeněk Češka, Ph.D.: Studied Computer Science and Engineering at the Faculty of Applied Sciences, University of West Bohemia, Czech Republic. He has held various positions in software engineering (analysis, architecture, and development), including research and teaching at the University of West Bohemia.