Analyzing the Performance Differences Between Pattern Matching and Compressed Pattern Matching on Texts

Yükleniyor...
Küçük Resim

Tarih

2013

Dergi Başlığı

Dergi ISSN

Cilt Başlığı

Yayıncı

IEEE

Erişim Hakkı

info:eu-repo/semantics/closedAccess

Özet

In this study the statistics of pattern matching on text data and the statistics of compressed pattern matching on compressed form of the same text data are compared. A new application has been developed to count the character matching numbers in compressed and uncompressed texts individually. Also a new text compression algorithm that allows compressed pattern matching by using classical pattern matching algorithms without any change is presented in this paper. In this paper while the presented compression algorithm based on digram and trigram substitution has been giving about 30-35% compression factor, the duration of compressed pattern matching on compressed text is calculated less than the duration of pattern matching on uncompressed text. Also it is confirmed that the number of character comparison on compressed texts while doing a compressed pattern matching is less than the number of character comparison on uncompressed texts. Thus the aim of the developed compression algorithm is to point out the difference in text processing between compressed and uncompressed text and to form opinions for another applications.

Açıklama

10th International Conference on Electronics, Computer and Computation (ICECCO) -- NOV 07-09, 2013 -- Turgut Ozal Univ, Ankara, TURKEY

Anahtar Kelimeler

Data compression, Pattern matching, Compressed Pattern Matching, Pattern Substitution

Kaynak

2013 International Conference on Electronics, Computer and Computation (Icecco)

WoS Q Değeri

N/A

Scopus Q Değeri

Cilt

Sayı

Künye