Analyzing the Performance Differences Between Pattern Matching and Compressed Pattern Matching on Texts
Özet
In this study the statistics of pattern matching on text data and the statistics of compressed pattern matching on compressed form of the same text data are compared. A new application has been developed to count the character matching numbers in compressed and uncompressed texts individually. Also a new text compression algorithm that allows compressed pattern matching by using classical pattern matching algorithms without any change is presented in this paper. In this paper while the presented compression algorithm based on digram and trigram substitution has been giving about 30-35% compression factor, the duration of compressed pattern matching on compressed text is calculated less than the duration of pattern matching on uncompressed text. Also it is confirmed that the number of character comparison on compressed texts while doing a compressed pattern matching is less than the number of character comparison on uncompressed texts. Thus the aim of the developed compression algorithm is to point out the difference in text processing between compressed and uncompressed text and to form opinions for another applications.
Koleksiyonlar
İlgili Öğeler
Başlık, yazar, küratör ve konuya göre gösterilen ilgili öğeler.
-
Assessment of salusin alpha and salusin beta levels in patients with newly diagnosed dipper and non-dipper hypertension
Alpsoy, Şeref; Doğan, Burçin; Özkaramanlı Gür, Demet; Akyüz, Aydın; Fidan, Çiğdem; Güzel, Savaş; Özkoyuncu, Berna (Taylor & Francis Inc, 2021)Objective The pathophysiology of non-dipper hypertension has not been clarified. The relationship between salusins with atherosclerosis and hypertension has gained attention in recent years. The aim of this paper is to ... -
Modeling land use/land cover change and mapping morphological fragmentation of agricultural lands in Thrace Region/Turkey
Altürk, Bahadır; Konukçu, Fatih (Springer, 2020)Human factors such as development of new roads and urban and industrial areas have caused reduction and fragmentation of agricultural lands in Thrace Region as in many parts of the world over the past few decades. To ... -
A new word-based compression model allowing compressed pattern matching
Buluş, Halil Nusret; Carus, Aydın; Mesut, Altan (Tubitak Scientific & Technical Research Council Turkey, 2017)In this study a new semistatic data compression model that has a fast coding process and that allows compressed pattern matching is introduced. The name of the proposed model is chosen as tagged word-based compression ...