Performance Comparison of Turkish Web Pages Classification

dc.contributor.authorAlqaraleh, Saed
dc.contributor.authorNergiz Sirin, Hatice Meltem
dc.contributor.authorOzkan, Furkan
dc.contributor.institutionauthorAlqaraleh, Saed
dc.contributor.institutionauthorNergiz Sirin, Hatice Meltem
dc.contributor.institutionauthorOzkan, Furkan
dc.date.accessioned2023-03-13T06:03:43Z
dc.date.available2023-03-13T06:03:43Z
dc.date.issued2021en_US
dc.departmentHKÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.description.abstractNowadays., web page classification is essential for efficient and fast search engines. There is an ever-increasing need for automatic classification techniques with higher classification accuracy. In this article., a performance comparison of existing Turkish language CNN models for web pages classification systems is performed. In more detail., the content of web pages is extracted first., then preprocessing steps that aim to detect the important parts and eliminate useless contents are used. Next., Bert word embedding is integrated to represent the texts by efficient numerical vectors. Finally., three state-of-the-art CNN models that fully support the Turkish language are investigated to find the best classifier. Overall., the three studied models obtained an acceptable performance while classifying the Turkish webpages., however., the third model was able to achieve slightly better than the other two models. © 2021 IEEE.en_US
dc.identifier.citationAlqaraleh, S., Nergiz Sirin, H. M., Ozkan, F. (2021). Performance Comparison of Turkish Web Pages Classification. Proceedings - 2021 Innovations in Intelligent Systems and Applications Conference, ASYU 2021: Code 174400.en_US
dc.identifier.doi10.1109/ASYU52992.2021.9599069
dc.identifier.isbn978-166543405-8
dc.identifier.orcid0000-0002-7146-3905en_US
dc.identifier.orcid0000-0002-7354-1364en_US
dc.identifier.orcid0000-0002-3724-4856en_US
dc.identifier.scopus2-s2.0-85123187492
dc.identifier.scopusqualityN/A
dc.identifier.urihttps://hdl.handle.net/20.500.11782/3120
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherInstitute of Electrical and Electronics Engineers Inc.en_US
dc.relation.ispartofProceedings - 2021 Innovations in Intelligent Systems and Applications Conference, ASYU 2021
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectconvolutional neural networksen_US
dc.subjectmulti-label techniqueen_US
dc.subjecttextual contenten_US
dc.subjectweb page classificationen_US
dc.titlePerformance Comparison of Turkish Web Pages Classification
dc.typeConference Object

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
makale - yayıncı sürümü71.pdf
Boyut:
2.95 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Makale Dosyası

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: