From Post to Personality: Harnessing LLMs for MBTI Prediction in Social Media

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Personality prediction from social media posts is a critical task that implies diverse applications in psychology and sociology. The Myers-Briggs Type Indicator (MBTI), a popular personality inventory, has been traditionally predicted by machine learning (ML) and deep learning (DL) techniques. Recently, the success of Large Language Models (LLMs) has revealed their huge potential in understanding and inferring personality traits from social media content. However, directly exploiting LLMs for MBTI prediction faces two key challenges: the hallucination problem inherent in LLMs and the naturally imbalanced distribution of MBTI types in the population. In this paper, we propose PostToPersonality (P2P), a novel LLM- based framework for MBTI prediction from social media posts of individuals. Specifically, P2P leverages Retrieval-Augmented Generation with in-context learning to mitigate hallucination in LLMs. Furthermore, we fine-tune a pre-trained LLM to improve model specification in MBTI understanding with synthetic minority oversampling, which balances the class imbalance by generating synthetic samples. Experiments conducted on a real-world social media dataset demonstrate that P2P achieves state-of-the-art performance compared with 10 ML/DL baselines.

Original languageEnglish
Title of host publicationCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management
PublisherAssociation for Computing Machinery, Inc
Pages5011-5015
Number of pages5
ISBN (Electronic)9798400720406
DOIs
Publication statusPublished - 10 Nov 2025
Event34th ACM International Conference on Information and Knowledge Management, CIKM 2025 - Seoul, Korea, Republic of
Duration: 10 Nov 202514 Nov 2025

Publication series

NameCIKM 2025 - Proceedings of the 34th ACM International Conference on Information and Knowledge Management

Conference

Conference34th ACM International Conference on Information and Knowledge Management, CIKM 2025
Country/TerritoryKorea, Republic of
CitySeoul
Period10/11/2514/11/25

Keywords

  • personality prediction
  • social media analysis

Fingerprint

Dive into the research topics of 'From Post to Personality: Harnessing LLMs for MBTI Prediction in Social Media'. Together they form a unique fingerprint.

Cite this