Augmented Language Dataset for Enhanced Personality Profiling

Authors

  • Mohmad Azhar Teli Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India
  • Manzoor Ahmad Chachoo Department of Computer Science; University of Kashmir, Hazratbal Srinagar, Srinagar 190006, India

DOI:

https://doi.org/10.32985/ijeces.16.1.7

Keywords:

Personality, Social Signal Processing, Natural Language Processing

Abstract

The lexical hypothesis asserts that language encompasses all meaningful individual differences in personality. Language is a vital tool for communication and self-expression, making it essential for understanding and assessing human personality. This paper investigates personality recognition from language use, emphasizing the significance of language in capturing and analyzing personality traits. A comprehensive literature review examines various approaches and techniques in personality recognition. We investigate the effectiveness of language use in predicting personality traits, employing multiple feature extraction and data augmentation techniques to enhance the accuracy and robustness of the personality recognition models. Our approach involves training a generative model, PersonaG, on the Essays dataset, subsequently using it to generate augmented data (AUG-Essays). We compare the performance of machine learning classifiers using LIWC, TF-IDF, Glove, and Word-Vec features on both Essays and AUG-Essays datasets. Our findings demonstrate significant improvements in predictive performance, offering valuable insights for applications in human resources, marketing, and beyond.

Downloads

Published

2025-01-02

How to Cite

[1]
M. Azhar Teli and M. A. Chachoo, “Augmented Language Dataset for Enhanced Personality Profiling”, IJECES, vol. 16, no. 1, pp. 65-74, Jan. 2025.

Issue

Section

Original Scientific Papers