CNN vs. LSTM for Turkish text classification
| dc.contributor.author | Yayla, Melih | |
| dc.contributor.author | Diyar Demirkol, Mustafa | |
| dc.contributor.author | Alqaraleh, Saed | |
| dc.contributor.institutionauthor | Yayla, Melih | |
| dc.contributor.institutionauthor | Diyar Demirkol, Mustafa | |
| dc.contributor.institutionauthor | Alqaraleh, Saed | |
| dc.date.accessioned | 2023-03-07T11:42:04Z | |
| dc.date.available | 2023-03-07T11:42:04Z | |
| dc.date.issued | 2021 | en_US |
| dc.department | HKÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü | en_US |
| dc.description.abstract | In this paper, the efficiency of two states of the art text classification techniques, i.e., Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) for supporting the Turkish text classification has been investigated. In addition, the effect of the main preprocessing steps such as Tokenization, Stop Word Elimination, Stemming, etc. has also been studied. Several experiments using "TTC-3600"dataset were performed, and it has been observed that both CNN and LSTM can efficiently support the Turkish language and can achieve quite good performance. Related to data preprocessing, results indicated that such a process improves the performance, however, for the Turkish language, it is preferred to exclude stemming. Also, by comparing the performance of feature extraction techniques for processing Turkish language, Word2Vec outperforms TF-IDF. © 2021 IEEE. | en_US |
| dc.identifier.citation | Yayla, M., Diyar Demirkol, M., Alqaraleh, S. (2021). CNN vs. LSTM for Turkish text classification. 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings: Code 172175. | en_US |
| dc.identifier.doi | 10.1109/INISTA52262.2021.9548407 | |
| dc.identifier.isbn | 978-166543603-8 | |
| dc.identifier.orcid | 0000-0003-1373-5375 | en_US |
| dc.identifier.orcid | 0000-0001-6373-6849 | en_US |
| dc.identifier.orcid | 0000-0002-7146-3905 | en_US |
| dc.identifier.scopus | 2-s2.0-85116666678 | |
| dc.identifier.scopusquality | N/A | |
| dc.identifier.uri | https://hdl.handle.net/20.500.11782/3107 | |
| dc.indekslendigikaynak | Scopus | |
| dc.language.iso | en | |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | 2021 International Conference on INnovations in Intelligent SysTems and Applications, INISTA 2021 - Proceedings | |
| dc.relation.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Convolutional Neural Networks | en_US |
| dc.subject | Long Short-Term Memory | en_US |
| dc.subject | Natural Language Processing | en_US |
| dc.subject | Text Classification | en_US |
| dc.subject | Turkish Language | en_US |
| dc.title | CNN vs. LSTM for Turkish text classification | |
| dc.type | Conference Object |










