For speech recognition, OCR, and similar applications, determining the structural properties of a natural language is essential. These properties can be analyzed in two ways: morphologically and statistically. Statistical analysis requires a corpus that is a representative sample of the language. The word n-gram frequencies of that corpus can be computed with suitable algorithms, and missing n-grams can be estimated with smoothing techniques. In this study, a corpus named TurCo was created in order to apply and compare smoothing techniques for Turkish. Different algorithms were tested for computing word n-grams, and the characteristics of the resulting n-gram word lists were analyzed. To generalize the results, Zipf's Law was applied; to improve on its accuracy, Mandelbrot's Law was applied after finding appropriate values for its constants. Because the corpus could not be large enough to represent the entire language, smoothing techniques were used to estimate the unseen word n-grams. This study can help professionals working on speech recognition, cryptanalysis, and author recognition in Turkish.
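As a rough illustration of the ideas in the synopsis, the sketch below counts word n-grams and applies add-one (Laplace) smoothing so that an unseen bigram still receives a nonzero probability. The synopsis does not say which algorithms or smoothing techniques the book uses, so add-one smoothing is chosen here only as the simplest example. The function names and the toy sentence are hypothetical and are not taken from TurCo.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count word n-grams (stored as tuples) in a list of tokens."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def laplace_bigram_prob(w1, w2, unigrams, bigrams, vocab_size):
    """Add-one estimate of P(w2 | w1); unseen bigrams get a nonzero value."""
    return (bigrams[(w1, w2)] + 1) / (unigrams[(w1,)] + vocab_size)


# Toy sample (an assumption for illustration; not drawn from TurCo).
tokens = "bir gün bir adam bir gün geldi".split()
unigrams = ngram_counts(tokens, 1)
bigrams = ngram_counts(tokens, 2)
vocab_size = len(unigrams)

# "bir gün" occurs in the sample; "bir geldi" does not.
seen = laplace_bigram_prob("bir", "gün", unigrams, bigrams, vocab_size)
unseen = laplace_bigram_prob("bir", "geldi", unigrams, bigrams, vocab_size)
print(seen, unseen)
```

The unseen bigram gets a smaller but nonzero estimate than the observed one, which is the basic effect any smoothing technique aims for. More refined methods redistribute probability mass less crudely.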
The information provided in the "Synopsis" section may refer to another edition of this title.
Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany
Paperback. Condition: New. This item is printed on demand and takes 3-4 days longer. New stock. 140 pp. English. Seller ref. no. 9783838351582
Quantity available: 2
Seller: moluna, Greven, Germany
Condition: New. Seller ref. no. 5415572
Quantity available: more than 20
Seller: buchversandmimpf2000, Emtmannsberg, BAYE, Germany
Paperback. Condition: New. This item is printed on demand (print-on-demand title). New stock. VDM Verlag, Dudweiler Landstraße 99, 66123 Saarbrücken. 140 pp. English. Seller ref. no. 9783838351582
Quantity available: 1
Seller: AHA-BUCH GmbH, Einbeck, Germany
Paperback. Condition: New. Printed after ordering. New stock. Seller ref. no. 9783838351582
Quantity available: 1
Seller: preigu, Osnabrück, Germany
Paperback. Condition: New. Statistical Properties of Turkish Words | Contemporary Printed Turkish Word Characteristics and Smoothing Techniques | Gökhan Dalkiliç | Paperback | 140 pp. | English | 2010 | LAP LAMBERT Academic Publishing | EAN 9783838351582 | Responsible person for the EU: preigu GmbH & Co. KG, Lengericher Landstr. 19, 49078 Osnabrück, mail[at]preigu[dot]de | Supplier: preigu. Seller ref. no. 101208603
Quantity available: 5
Seller: Mispah books, Redhill, SURRE, United Kingdom
Paperback. Condition: Like New. Ships from multiple locations. Seller ref. no. ERICA79038383515846
Quantity available: 1