Examining the Impact of Feature Selection on Classification of User Reviews in Web Pages
dc.authorid | 0000-0002-3971-2676 | |
dc.authorid | 0000-0003-4351-2244 | |
dc.authorscopusid | 54783608800 | |
dc.authorscopusid | 57194265151 | |
dc.authorwosid | OZHAN, Erkan/N-8743-2016 | |
dc.authorwosid | Uzun, Erdinç/AAG-5529-2019 | |
dc.contributor.author | Uzun, Erdinç | |
dc.contributor.author | Özhan, Erkan | |
dc.date.accessioned | 2022-05-11T14:15:53Z | |
dc.date.available | 2022-05-11T14:15:53Z | |
dc.date.issued | 2018 | |
dc.department | Fakülteler, Çorlu Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | |
dc.description | International Conference on Artificial Intelligence and Data Processing (IDAP) -- SEP 28-30, 2018 -- Inonu Univ, Malatya, TURKEY | |
dc.description.abstract | The user reviews in web pages can provide useful information about the content of the web page for text processing applications. Automatically extracting data from a web page is a crucial process for these applications. One of the used methods in this process is to construct a learning model with an appropriate classification method using features that are derived from data. However, some features can be either redundant or irrelevant for this model. In this study, an imbalanced dataset including 47 shallow text features obtained from web pages is utilized for extracting of the user reviews. Then, various well-known feature selection techniques are applied to reduce the number of these features. The effects of this reduction on the classification methods are also examined. The experimental results indicate that approximately half of the features are sufficient for the classification task. Additionally, the AdaBoost classifier gives the best results concerning precision of about 0.930 for the review layout prediction. | |
dc.description.sponsorship | Inonu Univ, Comp Sci Dept, IEEE Turkey Sect, Anatolian Sci | |
dc.description.sponsorship | Namik Kemal University Research FundNamik Kemal University | |
dc.description.sponsorship | The authors acknowledge the support received from the Namik Kemal University Research Fund. | |
dc.identifier.isbn | 978-1-5386-6878-8 | |
dc.identifier.scopus | 2-s2.0-85062568576 | |
dc.identifier.uri | https://hdl.handle.net/20.500.11776/6107 | |
dc.identifier.wos | WOS:000458717400054 | |
dc.identifier.wosquality | N/A | |
dc.indekslendigikaynak | Web of Science | |
dc.indekslendigikaynak | Scopus | |
dc.institutionauthor | Uzun, Erdinç | |
dc.institutionauthor | Özhan, Erkan | |
dc.language.iso | en | |
dc.publisher | IEEE | |
dc.relation.ispartof | 2018 International Conference on Artificial Intelligence and Data Processing (Idap) | |
dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | |
dc.subject | web data extraction | |
dc.subject | feature selection | |
dc.subject | classification methods | |
dc.subject | review layout detection | |
dc.subject | imbalanced dataset | |
dc.subject | Hybrid Approach | |
dc.title | Examining the Impact of Feature Selection on Classification of User Reviews in Web Pages | |
dc.type | Conference Object |
Dosyalar
Orijinal paket
1 - 1 / 1
Küçük Resim Yok
- İsim:
- 6107.pdf
- Boyut:
- 450.83 KB
- Biçim:
- Adobe Portable Document Format
- Açıklama:
- Tam Metin / Full Text