skip to main content
10.1145/3658271.3658340acmotherconferencesArticle/Chapter ViewAbstractPublication PagessbsiConference Proceedingsconference-collections
research-article

Persona and issue analysis on tourism social media: a case study of Pirenópolis, Goiás, Brazil

Published: 23 May 2024 Publication History

Abstract

Context: In 2022, tourism earned R$208 billion, 28% more than in 2021, indicating a growing interest on the part of travelers in practical experiences during their trips, as well as creating content such as opinions, ratings, and recommendations. This has led both the public and private sectors to direct their efforts to improve service quality. Problem: Due to the large data volumes, discovering the tourist profile manually becomes impractical, making it imperative to adopt knowledge discovery techniques that automate this process. Solution: We present a solution for identifying the tourist profiles through persona analysis, using hotels from Pirenópolis in Goiás, Brazil, as a case study. IS Theory: The work was developed using the Social Media Engagement Theory about content generated by users on TripAdvisor and Booking.com platforms due to users experiencing a feeling of community, aligning with the central idea of theory highlighting the importance of active engagement. Method: In this study, we adopted the Cross-Industry Standard Process for Data Mining method, a consolidated approach in data mining. Using this method, we conducted exploratory analysis, profile analysis, topic modeling, and the grouping technique, enabling us to prepare a report based on the analyzed data. Summary of Results: The analysis of the topics resulted in identifying ten topics that reveal the preferences and needs of travelers. The application of the grouping technique allowed the creation of personas based on this data, enriching the understanding of the user profile. This results in a purposeful report on products and services best adapted to travelers’ needs. Contributions and Impact in IS area: Facing challenges in systems in the era of innovation based on connected open data, this work contributes to this discussion as it dynamically fits, providing an innovative and practical approach to dealing with expanding information, automating data analysis processes, and increasing the operational efficiency of tourism companies in forming strategies to enhance decision-making.

References

[1]
Marlisson Jean Amaral Aleixo, Gabriele de Sousa Araujo, Fernando Almeida do Carmo, Antonio Fernando Lavareda Jacob Junior, and Fábio Manoel França Lobato. 2023. Análise Comparativa de Plataformas de Mídias Sociais Turísticas: um Estudo de Caso de Booking e TripAdvisor para o Município de Santarém, Pará. Anais do Computer on the Beach 14 (2023), 070–077.
[2]
Gabriele de S Araújo, Jéssica Brenda P Leite, Marcelino S da Silva, Antonio FL Jacob Junior, and Fábio MF Lobato. 2023. Natural Language Processing and Social Media: a systematic mapping on Brazilian leading events. In Anais do XX Encontro Nacional de Inteligência Artificial e Computacional. SBC, 741–755.
[3]
Francisco Jose Arenas-Márquez, Rocio Martinez-Torres, and Sergio Toral. 2021. Convolutional neural encoding of online reviews for the identification of travel group type topics on TripAdvisor. Information Processing & Management 58, 5 (2021), 102645.
[4]
Clodis Boscarioli, Renata Mendes de Araujo, Rita Suzana Maciel, Valdemar Vicente Graciano Neto, Flavio Oquendo, Elisa Yumi Nakagawa, Flavia Cristina Berrnardini, José Viterbo, Dalessandro Vianna, Carlos Bazilio Martins, 2017. I GranDSI-BR: Grand Research Challenges in Information Systems in Brazil 2016-2026. (2017).
[5]
Matthias Carnein and Heike Trautmann. 2019. Optimizing data stream representation: An extensive survey on stream clustering algorithms. Business & Information Systems Engineering 61 (2019), 277–297.
[6]
Douglas Cirqueira, Fernando Almeida, Gültekin Cakir, Antonio Jacob, Fabio Lobato, Marija Bezbradica, and Markus Helfert. 2020. Explainable sentiment analysis application for social media crisis management in retail. (2020).
[7]
Senika Dewnarain, Haywantee Ramkissoon, and Felix Mavondo. 2019. Social customer relationship management in the hospitality industry. Journal of Hospitality 1, 1 (2019), 1–14.
[8]
Paul M Di Gangi and Molly M Wasko. 2016. Social media engagement theory: Exploring the influence of user engagement on social media usage. Journal of Organizational and End User Computing (JOEUC) 28, 2 (2016), 53–73.
[9]
Roman Egger and Joanne Yu. 2022. A topic modeling comparison between lda, nmf, top2vec, and bertopic to demystify twitter posts. Frontiers in sociology 7 (2022).
[10]
Diana Foris, Natalia Crihalmean, and Tiberiu Foris. 2020. Exploring the environmental practices in hospitality through booking websites and online tourist reviews. Sustainability 12, 24 (2020), 10282.
[11]
Galati Francesco and Galati Roberta. 2019. Cross-country analysis of perception and emphasis of hotel attributes. Tourism Management 74 (2019), 24–42.
[12]
Ruan dos Santos Gondim, Valeria Maria da Silva Pinheiro, Romena Karissa Octavio Oliveira, Alex Sandro Gomes, Francisco Milton Mendes Neto, and Bruno de Sousa Monteiro. 2022. Designing future scenarios in e-tourism. In Proceedings of the 11th Euro American Conference on Telematics and Information Systems. 1–9.
[13]
Maarten Grootendorst. 2022. BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794 (2022).
[14]
Jocelyn Siah Chee Hao and Chris Ong Siew Har. 2014. A study of preferences of business female travelers on the selection of accommodation. Procedia-Social and Behavioral Sciences 144 (2014), 176–186.
[15]
Zhiping Hou, Fasheng Cui, Yongheng Meng, Tonghui Lian, and Caihua Yu. 2019. Opinion mining from online travel reviews: A comparative analysis of Chinese major OTAs using semantic association analysis. Tourism Management 74 (2019).
[16]
Tzu-Kuang Hsu, Yi-Fan Tsai, and Herg-Huey Wu. 2009. The preference analysis for tourist choice of destination: A case study of Taiwan. Tourism management 30, 2 (2009), 288–297.
[17]
Mostafa Kamalpour, Atae Rezaei Aghdam, Shuxiang Xu, Ehsan Ghasem Khani, and Aryan Baghi. 2017. Uncovering hotel guests preferences through data mining techniques. IJCSNS 17, 8 (2017), 1.
[18]
Rui Kang. 2020. Using logistic regression for persona segmentation in tourism: A case study. Social Behavior and Personality: an international journal 48, 4 (2020).
[19]
Qin Li, Shaobo Li, Sen Zhang, Jie Hu, and Jianjun Hu. 2019. A review of text corpus-based tourism big data mining. Applied Sciences 9, 16 (2019), 3300.
[20]
Yue Li, Xiaoquan Chu, Dong Tian, Jianying Feng, and Weisong Mu. 2021. Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm. Applied Soft Computing 113 (2021), 107924.
[21]
Gaojun Liu and Xingyu Wu. 2019. Using collaborative filtering algorithms combined with Doc2Vec for movie recommendation. In 2019 IEEE 3rd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). IEEE, 1461–1464.
[22]
Fábio Lobato, Márcia Pinheiro, Antonio Jacob, Olaf Reinhold, and Ádamo Santana. 2017. Social crm: Biggest challenges to make it work in the real world. In Business Information Systems Workshops: BIS 2016 International Workshops, Leipzig, Germany, July 6-8, 2016, Revised Papers 19. Springer, 221–232.
[23]
Christoph Lutz and Gemma Newlands. 2018. Consumer segmentation within the sharing economy: The case of Airbnb. Journal of Business Research 88 (2018).
[24]
Ariane Belcavelo Silveira Marques and Wescley Silva Xavier. [n. d.]. Determinações Espaciais a partir do Desenvolvimento do Turismo em Pirenópolis-GO. ([n. d.]).
[25]
Sukavanan Nanjundan, Shreeviknesh Sankaran, CR Arjun, and G Paavai Anand. 2019. Identifying the number of clusters for K-Means: A hypersphere density based approach. arXiv preprint arXiv:1912.00643 (2019).
[26]
Helen Noble and Gary Mitchell. 2016. What is grounded theory?Evidence-based nursing 19, 2 (2016), 34–35.
[27]
Hong-gang Peng, Hong-yu Zhang, and Jian-qiang Wang. 2018. Cloud decision support model for selecting hotels on TripAdvisor. com with probabilistic linguistic information. International Journal of Hospitality Management 68 (2018), 124–138.
[28]
Prefeitura Municipal de Pirenópolis. 2023. Calendário das Festividades Tradicionais 2023. https://pirenopolis.go.gov.br/calendario-das-festividades-tradicionais-2023/ Acesso em: 25 de novembro de 2023.
[29]
IG Karang Komala Putra, Gede Indrawan, and I Made Candiasa. 2019. Data Mining Technique To Get Characteristics Customers of Bendesa Hotel With K-MEANS Algorithm. In Proceedings of the 2nd International Conference on Software Engineering and Information Management. 134–137.
[30]
Wilson Abrahão Rabahy. 2020. Análise e perspectivas do turismo no Brasil. Revista Brasileira de Pesquisa em Turismo 14 (2020), 1–13.
[31]
Nadia Nahrin Rahman and Monira Begum. 2020. Impact of Social Media on Youths to Enhance the Tourism in Bangladesh. International Journal of Research and Analytical Reviews 7, 4 (2020), 937–953.
[32]
Paul Ralph and Sebastian Baltes. 2022. Paving the way for mature secondary research: the seven types of literature review. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 1632–1636.
[33]
Rosana CB Rego and Verônica ML Silva. 2021. Predicting Gender of Brazilian Names Using Deep Learning. arXiv preprint arXiv:2106.10156 (2021).
[34]
Hosung Timothy Rhee and Sung-Byung Yang. 2015. How does hotel attribute importance vary among different travelers? An exploratory case study based on a conjoint analysis. Electronic markets 25 (2015), 211–226.
[35]
Johannes Schneider, Stefan Seidel, Marcus Basalla, and Jan vom Brocke. 2023. Reuse, Reduce, Support: Design Principles for Green Data Mining. Business & Information Systems Engineering 65, 1 (2023), 65–83.
[36]
Ling-Ling Tao and Tian-Hui You. 2022. A multi-criteria decision-making model for hotel selection by online reviews: Considering the traveller types and the interdependencies among criteria. Applied Intelligence 52, 11 (2022), 12436–12456.
[37]
Anastasios-Ioannis Theocharidis, Maria Argyropoulou, George Karavasilis, Vasiliki Vrana, and Evangelos Kehris. 2020. An approach towards investigating factors affecting intention to book a hotel room through social media. Sustainability 12, 21 (2020), 8973.
[38]
Ian Van Buskirk, Aaron Clauset, and Daniel B Larremore. 2023. An open-source cultural consensus approach to name-based gender classification. In Proceedings of the International AAAI Conference on Web and Social Media, Vol. 17. 866–877.
[39]
Le Wang, Xiao-kang Wang, Juan-juan Peng, and Jian-qiang Wang. 2020. The differences in hotel selection among various types of travellers: A comparative analysis with a useful bounded rationality behavioural decision support model. Tourism management 76 (2020), 103961.
[40]
Yali Wang and Lu Feng. 2021. Research on User Profile of Characteristic Towns with Cultural Tourism Based on K-means Clustering. In 2021 2nd International Conference on Artificial Intelligence and Information Systems. 1–4.
[41]
Rüdiger Wirth and Jochen Hipp. 2000. CRISP-DM: Towards a standard process model for data mining. In Proceedings of the 4th international conference on the practical applications of knowledge discovery and data mining, Vol. 1. Manchester.
[42]
Mustafa Eren Yildirim, Murat Kaya, and Ibrahim FurkanInce. 2022. A case study: unsupervised approach for tourist profile analysis by k-means clustering in turkey. Journal of Internet Computing and Services 23, 1 (2022), 11–17.

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
SBSI '24: Proceedings of the 20th Brazilian Symposium on Information Systems
May 2024
708 pages
Publication rights licensed to ACM. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of a national government. As such, the Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 May 2024

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Booking.com
  2. Text Mining
  3. Topic modeling
  4. Tourism
  5. TripAdvisor
  6. User profile

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • Fundação Amazônia de Amparo a Estudos e Pesquisas - FAPESPA
  • Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)

Conference

SBSI '24
SBSI '24: XX Brazilian Symposium on Information Systems
May 20 - 23, 2024
Juiz de Fora, Brazil

Acceptance Rates

Overall Acceptance Rate 181 of 557 submissions, 32%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 24
    Total Downloads
  • Downloads (Last 12 months)24
  • Downloads (Last 6 weeks)4
Reflects downloads up to 25 Dec 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media