Author profiling

Author profiling is a method of analyzing a number of texts and texts of the author (eg age and gender) based on stylistic and content-based features.


Automatic Authorship Identification (AAI) has existed for almost 120 years. Thomas Corwin Mendenhall was the first to examine works by Francis Bacon , William Shakespeare , and Christopher Marlowe . [1] Since then, things have changed rapidly due to the development of technology.

There are three major fields in AAI: authorship attribution, author identification, and author profiling. In the first two, the goal is to recognize the author of a set of authors, while in author profiling, the goal is to find specific features of the author, based on stylistic- or content-based features. [2] [3]

The author profiling task is a problem, due to their difficulty. It has been studied by many researchers and, while some show, it still has many unexplored areas and room for improvement. Through the organizational efforts of the PAN, clarification needed ] many teams around the globe try every year to find the characteristics of authors. [4] [5] [6]

Characteristics vary between approaches, but age and gender are usually among them. [7] [8] [9] [10] [11] Other personality traits include the zodiac and occupation of the author. [4] [9]


  1. Jump up^ Mendenhall, Thomas Corwin. “The characteristic curves of composition.” Science (1887): 237-249.
  2. Jump up^ Mikros, George K., and Kostas Perifanos. “Authorship Attribution in Greek Tweets Using Author’s Multilevel N-Gram Profiles.” 2013 AAAI Spring Symposium Series. 2013.
  3. Jump up^ Stamatatos, Efstathios. “A survey of modern authorship attribution methods.” Journal of the American Society for Information Science and Technology 60.3 (2009): 538-556.
  4. ^ Jump up to:b Rangel Francisco, et al. “Overview of the 3rd Author Profiling Task at PAN 2015.” KEY. 2015.
  5. Jump up^ Rangel, Francisco, et al. “Overview of the 2nd author profiling task at pan 2014.” THIS Workshop Proceedings. Flight. 1180. THIS Workshop Proceedings, 2014.
  6. Jump up^ Rangel, Francisco, et al. “Overview of the author profiling task at pan 2013.” Conference on Multilingual and Multimodal Information Access Evaluation. CELCT, 2013.
  7. Jump up^ Argamon, Shlomo, et al. “Mining the blogosphere: Age, gender and the types of selfexpression.” First Monday 12.9 (2007).
  8. Jump up^ Nguyen, Dong-Phuong, et al. “” How old do you think I am? “A study of language and age in Twitter. (2013).
  9. ^ Jump up to:b Schler, Jonathan, et al. “Effects of Age and Gender on Blogging.” AAAI Spring Symposium: Computational Approaches to Analyzing Weblogs. Flight. 6. 2006.
  10. Jump up^ Argamon, Shlomo, et al. “Gender, gender, and writing style in formal written texts.” TEXT-THE HAGUE THEN AMSTERDAM THEN BERLIN- 23.3 (2003): 321-346.
  11. Jump up^ Koppel, Moshe, Shlomo Argamon, and Anat Rachel Shimoni. “Automatically categorizing written texts by author gender.” Literary and Linguistic Computing 17.4 (2002): 401-412.