Comparison of machine learning models for sentiment analysis of big Turkish web-based data

dc.contributor.authorOzmen, Cemile Gokce
dc.contributor.authorGunduz, Selim
dc.date.accessioned2025-03-24T12:10:28Z
dc.date.available2025-03-24T12:10:28Z
dc.date.issued2025en_US
dc.departmentHKÜ, İktisadi, İdari ve Sosyal Bilimler Fakültesi, İşletme Bölümüen_US
dc.description.abstractE-commerce sites have generated large amounts of unstructured data as they allow millions of users to generate product reviews. Thus, although there have been significant improvements in the characteristics of big data, such as speed and volume, developing various analysis techniques to monitor, understand, and extract useful information from this web-based data has become challenging. This study aims to analyze cosmetic products on a Turkish-based e-commerce website with sentiment analysis and to create a new domain-specific Turkish sentiment dictionary model with manual labeling. In the study, a Turkish sentiment dictionary consisting of 65,378 words was created by manually labeling 875,455 product reviews for 24 cosmetic brands sold on the Turkey-based trendyol e-commerce site, and sentiment analysis was performed using this dictionary. The dataset, divided into seven product groups, was analyzed using K-NN, SVM, DT, RF, and LR algorithms to address three classification problems. The algorithms were evaluated with comparative analysis using accuracy, precision, recall, and f-1 score metrics. SVM gave the highest performance result with over 93% accuracy, 92% precision, 93% recall, and a 91% f-1 score in all product groups. The dictionary model created for the cosmetics industry in the study helps businesses and researchers to use their resources more efficiently and save time by performing fast and low-cost analyses on large datasets of product reviews. Moreover, by analyzing customer feedback, brands can offer long-lasting and environmentally friendly products that align with customers' feelings. Thus, businesses have the opportunity to develop or improve products.en_US
dc.identifier.citationÖzmen, CG. & Gündüz, S. (2025). Comparison of machine learning models for sentiment analysis of big Turkish web-based data. Applıed Scıences-Basel. ( 15, 5.). https://doi.org/10.3390/app15052297.en_US
dc.identifier.doi10.3390/app15052297
dc.identifier.issn2076-3417
dc.identifier.issue5en_US
dc.identifier.orcid0000-0003-4983-915Xen_US
dc.identifier.scopus2-s2.0-86000495295
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.3390/app15052297
dc.identifier.uriWOS:001442397000001
dc.identifier.urihttps://hdl.handle.net/20.500.11782/4810
dc.identifier.volume15en_US
dc.identifier.wosN/A
dc.identifier.wosqualityN/A
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherMdpien_US
dc.relation.ispartofApplıed Scıences-Basel
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectmachine learningen_US
dc.subjectnatural language processingen_US
dc.subjectsentiment analysisen_US
dc.titleComparison of machine learning models for sentiment analysis of big Turkish web-based data
dc.typeArticle

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
001442397000001.pdf
Boyut:
376.42 KB
Biçim:
Adobe Portable Document Format
Açıklama:
Makale Dosyası

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: