Date of Award
Natural language processing, Plot lines, Fiction, Computer science
When following a plot in a story, categorization is something that humans do without even thinking; whether this is simple classification like “This is science fiction” or more complex trope recognition like recognizing a Chekhov's gun or a rags to riches storyline, humans group stories with other similar stories. Research has been done to categorize basic plots and acknowledge common story tropes on the literary side, however, there is not a formula or set way to determine these plots in a story line automatically. This paper explores multiple natural language processing techniques in an attempt to automatically compare and cluster a fictional story into categories in an unsupervised manner. The aim is to mimic how a human may look deeper into a plot, find similar concepts like certain words being used, the types of words being used, for example an adventure book may have more verbs, as well as the sentiment of the sentences in order to group books into similar clusters.
Copyright is held by the author. User is responsible for all copyright compliance.
Crutchfield, Dalton J., "Using Natural Language Processing to Categorize Fictional Literature in an Unsupervised Manner" (2020). Electronic Theses and Dissertations. 1741.
Received from ProQuest
Dalton J. Crutchfield