Augmented Language Dataset for Enhanced Personality Profiling

Mohmad Azhar Teli; Manzoor Ahmad Chachoo

doi:10.32985/ijeces.16.1.7

Authors

Mohmad Azhar Teli, Department of Computer Science, University of Kashmir, Hazratbal, Srinagar 190006, India
Manzoor Ahmad Chachoo, Department of Computer Science, University of Kashmir, Hazratbal, Srinagar 190006, India

DOI:

https://doi.org/10.32985/ijeces.16.1.7

Keywords:

Personality, Social Signal Processing, Natural Language Processing

Abstract

The lexical hypothesis asserts that language encompasses all meaningful individual differences in personality. Language is a vital tool for communication and self-expression, making it essential for understanding and assessing human personality. This paper investigates personality recognition from language use, emphasizing the significance of language in capturing and analyzing personality traits. A comprehensive literature review examines various approaches and techniques in personality recognition. We evaluate how well language use predicts personality traits, employing multiple feature extraction and data augmentation techniques to improve the accuracy and robustness of personality recognition models. Our approach trains a generative model, PersonaG, on the Essays dataset and then uses it to generate an augmented dataset (AUG-Essays). We compare the performance of machine learning classifiers using LIWC, TF-IDF, GloVe, and Word2Vec features on both the Essays and AUG-Essays datasets. Our findings demonstrate significant improvements in predictive performance, offering valuable insights for applications in human resources, marketing, and beyond.
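
As an illustration of one of the feature types named in the abstract, the following is a minimal sketch of TF-IDF feature extraction in plain Python. It is not the authors' implementation: the function name `tfidf_vectors`, the whitespace tokenizer, the smoothed IDF formula, and the toy texts are all assumptions made for this example, and the texts are not drawn from the Essays dataset.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, TF-IDF vectors) for whitespace-tokenized documents.

    Hypothetical helper for illustration only; the paper does not specify
    its tokenization or IDF weighting.
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    # Smoothed IDF (an assumption), so terms found in every document
    # still receive a nonzero weight.
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for toks in tokenized:
        counts = Counter(toks)
        total = len(toks) or 1  # guard against empty documents
        vectors.append([counts[t] / total * idf[t] for t in vocab])
    return vocab, vectors


# Hypothetical toy texts standing in for essay samples.
toy_docs = [
    "i love meeting new people at parties",
    "i prefer quiet evenings alone with a book",
]
vocab, vectors = tfidf_vectors(toy_docs)
```

Each resulting vector has one entry per vocabulary term and could be passed to any standard classifier; running the same extraction on original and augmented texts would give comparable feature matrices for the two datasets.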
