Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study

dc.contributor.authorEligüzel, Nazmiye
dc.contributor.authorÇetinkaya, Cihan
dc.contributor.authorDereli, T.
dc.date.accessioned2020-12-03T11:09:20Z
dc.date.available2020-12-03T11:09:20Z
dc.date.issuedOctober 2020en_US
dc.departmentHKÜ, Mühendislik Fakültesi, Elektrik Elektronik Mühendisliği Bölümüen_US
dc.description.abstractIn emergencies, Twitter is an important platform to get situational awareness simultaneously. Therefore, information about Twitter users’ location is a fundamental aspect to understand the disaster effects. But location extraction is a challenging task. Most of the Twitter users do not share their locations in their tweets. In that respect, there are different methods proposed for location extraction which cover different fields such as statistics, machine learning, etc. This study is a sample study that utilizes geo-tagged tweets to demonstrate the importance of the location in disaster management by taking three cases into consideration. In our study, tweets are obtained by utilizing the “earthquake” keyword to determine the location of Twitter users. Tweets are evaluated by utilizing the Latent Dirichlet Allocation (LDA) topic model and sentiment analysis through machine learning classification algorithms including the Multinomial and Gaussian Naïve Bayes, Support Vector Machine (SVM), Decision Tree, Random Forest, Extra Trees, Neural Network, k Nearest Neighbor (kNN), Stochastic Gradient Descent (SGD), and Adaptive Boosting (AdaBoost) classifications. Therefore, 10 different machine learning algorithms are applied in our study by utilizing sentiment analysis based on location-specific disaster-related tweets by aiming fast and correct response in a disaster situation. In addition, the effectiveness of each algorithm is evaluated in order to gather the right machine learning algorithm. Moreover, topic extraction via LDA is provided to comprehend the situation after a disaster. The gathered results from the application of three cases indicate that Multinomial Naïve Bayes and Extra Trees machine learning algorithms give the best results with an F-measure value over 80%. The study aims to provide a quick response to earthquakes by applying the aforementioned techniques. © 2020 Elsevier Ltden_US
dc.identifier.citationEligüzel, N., Çetinkaya, C., & Dereli, T. (October 01, 2020). Comparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study. Advanced Engineering Informatics, 46, 101151.en_US
dc.identifier.doi10.1016/j.aei.2020.101151
dc.identifier.issn14740346
dc.identifier.orcid0000-0002-2130-5503en_US
dc.identifier.scopus2-s2.0-85090404146
dc.identifier.scopusqualityQ1
dc.identifier.urihttps://doi.org/10.1016/j.aei.2020.101151
dc.identifier.urihttps://hdl.handle.net/20.500.11782/2148
dc.identifier.volume46en_US
dc.identifier.wosWOS:000607575400007
dc.identifier.wosqualityQ1
dc.indekslendigikaynakWeb of Science
dc.indekslendigikaynakScopus
dc.language.isoen
dc.publisherElsevier Ltden_US
dc.relation.ispartofAdvanced Engineering Informatics
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectGeo-taggeden_US
dc.subjectLDAen_US
dc.subjectLocation extractionen_US
dc.subjectMachine learningen_US
dc.subjectSentimenten_US
dc.subjectTweeten_US
dc.titleComparison of different machine learning techniques on location extraction by utilizing geo-tagged tweets: A case study
dc.typeArticle

Dosyalar

Orijinal paket

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
0.1016j.aei.2020.101151.pdf
Boyut:
1.71 MB
Biçim:
Adobe Portable Document Format
Açıklama:
Makale Dosyası

Lisans paketi

Listeleniyor 1 - 1 / 1
Yükleniyor...
Küçük Resim
İsim:
license.txt
Boyut:
1.44 KB
Biçim:
Item-specific license agreed upon to submission
Açıklama: