CNN vs. LSTM for Turkish text classification

dc.contributor.authorYayla, Melih
dc.contributor.authorDiyar Demirkol, Mustafa
dc.contributor.authorAlqaraleh, Saed
dc.contributor.institutionauthorYayla, Melih
dc.contributor.institutionauthorDiyar Demirkol, Mustafa
dc.contributor.institutionauthorAlqaraleh, Saed
dc.date.accessioned2023-03-07T11:42:04Z
dc.date.available2023-03-07T11:42:04Z
dc.date.issued2021en_US
dc.departmentHKÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.description.abstractIn this paper, the efficiency of two states of the art text classification techniques, i.e., Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) for supporting the Turkish text classification has been investigated. In addition, the effect of the main preprocessing steps such as Tokenization, Stop Word Elimination, Stemming, etc. has also been studied. Several experiments using "TTC-3600"dataset were performed, and it has been observed that both CNN and LSTM can efficiently support the Turkish language and can achieve quite good performance. Related to data preprocessing, results indicated that such a process improves the performance, however, for the Turkish language, it is preferred to exclude stemming. Also, by comparing the performance of feature extraction techniques for processing Turkish language, Word2Vec outperforms TF-IDF. © 2021 IEEE.en_US
dc.identifier.citationYayla, M., Diyar Demirkol, M., Alqaraleh, S. (2021). CNN vs. LSTM for Turkish text classification. 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings: Code 172175.en_US
dc.identifier.doi10.1109/INISTA52262.2021.9548407
dc.identifier.isbn978-166543603-8
dc.identifier.orcid0000-0003-1373-5375en_US
dc.identifier.orcid0000-0001-6373-6849en_US
dc.identifier.orcid0000-0002-7146-3905en_US
dc.identifier.scopus2-s2.0-85116666678
dc.identifier.scopusqualityN/A
dc.identifier.urihttps://hdl.handle.net/20.500.11782/3107
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.ispartof2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectConvolutional Neural Networksen_US
dc.subjectLong Short-Term Memoryen_US
dc.subjectNatural Language Processingen_US
dc.subjectText Classificationen_US
dc.subjectTurkish Languageen_US
dc.titleCNN vs. LSTM for Turkish text classification
dc.typeConference Object

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
makale - yayıncı sürümü59.pdf
Boyut:
1.03 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Makale Dosyası

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: