Yazar "Diri, Banu" seçeneğine göre listele
Listeleniyor 1 - 3 / 3
Sayfa Başına Sonuç
Sıralama seçenekleri
Öğe Analyzing the Performance Differences Between Pattern Matching and Compressed Pattern Matching on Texts(IEEE, 2013) Erdoğan, Cihat; Buluş, Halil Nusret; Diri, BanuIn this study the statistics of pattern matching on text data and the statistics of compressed pattern matching on compressed form of the same text data are compared. A new application has been developed to count the character matching numbers in compressed and uncompressed texts individually. Also a new text compression algorithm that allows compressed pattern matching by using classical pattern matching algorithms without any change is presented in this paper. In this paper while the presented compression algorithm based on digram and trigram substitution has been giving about 30-35% compression factor, the duration of compressed pattern matching on compressed text is calculated less than the duration of pattern matching on uncompressed text. Also it is confirmed that the number of character comparison on compressed texts while doing a compressed pattern matching is less than the number of character comparison on uncompressed texts. Thus the aim of the developed compression algorithm is to point out the difference in text processing between compressed and uncompressed text and to form opinions for another applications.Öğe Estimation of the proteomic cancer co-expression sub networks by using association estimators(Public Library Science, 2017) Erdoğan, Cihat; Kurt, Zeyneb; Diri, BanuIn this study, the association estimators, which have significant influences on the gene network inference methods and used for determining the molecular interactions, were examined within the co-expression network inference concept. By using the proteomic data from five different cancer types, the hub genes/proteins within the disease-associated gene-gene/protein-protein interaction sub networks were identified. Proteomic data from various cancer types is collected from The Cancer Proteome Atlas (TCPA). Correlation and mutual information (MI) based nine association estimators that are commonly used in the literature, were compared in this study. As the gold standard to measure the association estimators' performance, a multi-layer data integration platform on gene-disease associations (DisGeNET) and the Molecular Signatures Database (MSigDB) was used. Fisher's exact test was used to evaluate the performance of the association estimators by comparing the created co-expression networks with the disease-associated pathways. It was observed that the MI based estimators provided more successful results than the Pearson and Spearman correlation approaches, which are used in the estimation of biological networks in the weighted correlation network analysis (WGCNA) package. In correlation-based methods, the best average success rate for five cancer types was 60%, while in MI-based methods the average success ratio was 71% for James-Stein Shrinkage (Shrink) and 64% for Schurmann-Grassberger (SG) association estimator, respectively. Moreover, the hub genes and the inferred sub networks are presented for the consideration of researchers and experimentalists.Öğe Investigation of Association Estimators in Network Inference Algorithms on Breast Cancer Proteomic Data(IEEE, 2017) Erdoğan, Cihat; Kurt, Zeyneb; Diri, BanuIn this study, association estimators applied in the network inference methods used to determine disease-related molecular interactions using breast cancer, which is the most common type of cancer in women, proteomic data were examined and hub genes in the gene-gene interaction network related to the disease were identified. Proteomic data of 901 breast cancer patients were generated using reverse phase protein array provided by The Cancer Proteome Atlas (TCPA) as a data set. Correlations and mutual information (MI) based estimators used in the literature were compared in the study, and WGCNA and minet R packages were used. As a result, it is seen that the MI based shrink estimator method has more successful results than the correlation-based adjacency function used in the estimation of biological networks in the WGCNA package. Achievement rates have ranged from 0.67 to 1.00 in the shrink estimation, with adjacency functions ranging from 0.33 to 0.86 for different module counts. In addition, hub genes and inferenced networks of successful results arc presented for the review of biologists.