Abstract Representations of Plot Struture

Micha Elsner

Abstract


Since the 18th century, the novel has been one of the defining forms of English writing, a mainstay of popular entertainment and academic criticism. Despite its importance, however, there are few computational studies of the large-scale structure of novels—and many popular repre- sentations for discourse modeling do not work very well for novelistic texts. This paper describes a high-level representation of plot structure which tracks the frequency of mentions of different characters, topics and emotional words over time. The representation can distinguish with high accuracy between real novels and artificially permuted surrogates; characters are important for eliminating random permutations, while topics are effective at distinguishing beginnings from ends. 


Keywords


ose; plot structure; novel; discourse modeling

Full Text:

PDF

References


Alm, Cecilia Ovesdotter and Richard Sproat. 2005. Emotional Sequencing and Development in Fairy Tales. In ACII , pages 668–674.

Ang, Robert. 2012. The Writer’s Toolkit. Master’s thesis, University of Edinburgh.

Bamman, David, Brendan O’Connor, and Noah A. Smith. 2013. Learning Latent Personas of Film Characters. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 352–361. Sofia, Bulgaria: Association for Computational Linguistics.

Bamman, David, Ted Underwood, and Noah A. Smith. 2014. A Bayesian Mixed Effects Model of Literary Character. In Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 370–379. Baltimore, Maryland: Association for Computational Linguistics.

Barzilay, Regina and Mirella Lapata. 2005. Modeling Local Coherence: an Entity-Based Approach . In Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL’05).

Bhattacharya, Indrajit and Lise Getoor. 2005. Relational clustering for multi-type entity resolution. In Proceedings of the 4th international workshop on Multi-relational mining, MRDM ’05, pages 3–12. New York, NY, USA: ACM. ISBN 1-59593-212-7.

Bishop, Christopher M. 2006. Pattern Recognition and Machine Learning (Information Science and Statistics). Secaucus, NJ, USA: Springer-Verlag New York, Inc. ISBN 0387310738.

Blei, David, Andrew Y. Ng, and Michael I. Jordan. 2001. Latent Dirichlet Allocation. Journal of Machine Learning Research 3:2003.

Blei, David M. and John D. Lafferty. 2006. Dynamic Topic Models. In ICML.

Boughorbel, Sabri, Jean-Philippe Tarel, and Francois Fleuret. 2004. Non-Mercer Kernels for SVM Object Recognition. In BMVC, pages 1–10.

Chambers, Nathanael and Dan Jurafsky. 2009. Unsupervised Learning of Narrative Schemas and their Participants. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP, pages 602–610. Suntec, Singapore: Association for Computational Linguistics.

Chang, Jonathan, Jordan Boyd-Graber, Chong Wang, Sean Gerrish, and David M. Blei. 2009. Reading Tea Leaves: How Humans Interpret Topic Models. In Neural Information Processing Systems.

Charniak, Eugene. 2001. Unsupervised learning of name structure from coreference data. In Second Meeting of the North American Chapter of the Association for Computational Linguistics (NACL-01).

Coll Ardanuy, Mariona and Caroline Sporleder. 2014. Structure-based Clus- tering of Novels. In Proceedings of Computational Linguistics for Literature (CLFL). Gothenburg, Sweden.

Crane, R.S. 2002. The Concept of Plot and the Plot of Tom Jones. In B. Richardson, ed., Narrative dynamics : essays on time, plot, closure, and frames. The Ohio State University Press.

Elsner, Micha. 2012. Character-based kernels for novelistic plot structure. In Proceedings of the 13th Conference of the European Chapter of the As- sociation for Computational Linguistics, pages 634–644. Avignon, France: Association for Computational Linguistics.

Elsner, Micha, Joseph Austerweil, and Eugene Charniak. 2007. A unified local and global model for discourse coherence. In Proceedings of HLT- NAACL ’07.

Elson, David, Nicholas Dames, and Kathleen McKeown. 2010. Extracting Social Networks from Literary Fiction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 138–147. Uppsala, Sweden: Association for Computational Linguistics.

Elson, David K. and Kathleen R. McKeown. 2010. Building a Bank of Semantically Encoded Narratives. In N. C. C. Chair), K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, M. Rosner, and D. Tapias, eds., Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC’10). Valletta, Malta: European Language Resources Association (ELRA). ISBN 2-9517408-6-7.

Feng, Vanessa Wei and Graeme Hirst. 2012. Extending the Entity-based Coherence Model with Multiple Ranks. In Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics, pages 315–324. Avignon, France: Association for Computational Linguistics.

Finlayson, Mark A. 2009. Deriving narrative morphologies via analogical story merging. In New Frontiers in Analogy Research: Proceedings of the Second International Conference on Analogy, pages 127–136. Sofia, Bulgaria: New Bulgarian University Press.

Goyal, Amit, Ellen Riloff, and Hal Daume III. 2010. Automatically Producing Plot Unit Representations for Narrative Text. In Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pages 77–86. Cambridge, MA: Association for Computational Linguistics.

Gretton, Arthur, Karsten M. Borgwardt, Malte Rasch, Bernhard Schlkopf, and Alexander J. Smola. 2007. A Kernel Method for the Two-Sample-Problem. In B. Schlkopf, J. Platt, and T. Hoffman, eds., Advances in Neural Information Processing Systems 19, pages 513–520. Cambridge, MA: MIT Press.

Gnen, Mehmet and Ethem Alpaydın. 2011. Multiple kernel learning algorithms. The Journal of Machine Learning Research 12:2211–2268.

Haussler, David. 1999. Convolution Kernels on Discrete Structures. Tech. Rep. UCSC-CRL-99-10, Computer Science Department, UC Santa Cruz.

Huff, Jason. 2010. Autosummarize. McNally Jackson Books. http://jason-huff.com/pro jects/autosummarize/.

Joachims, Thorsten. 2006. Training linear SVMs in linear time. In Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 217–226. ACM.

Karamanis, Nikiforos, Massimo Poesio, Chris Mellish, and Jon Oberlander. 2004. Evaluating Centering-Based Metrics of Coherence. In ACL, pages 391–398.

Kazantseva, Anna and Stan Szpakowicz. 2010. Summarizing short stories. Computational Linguistics pages 71–109.

Kim, Dae Il and Erik B. Sudderth. 2011. The Doubly Correlated Nonparametric Topic Model. In NIPS, pages 1980–1988.

Lehnert, Wendy. 1981. Plot Units and Narrative Summarization. Cognitive Science 4:293–331.

Li, Boyang, Stephen Lee-Urban, Darren Scott Appling, and Mark O Riedl. 2012. Crowdsourcing narrative intelligence. Advances in Cognitive Systems 2:25–42.

Liang, Percy, Ben Taskar, and Dan Klein. 2006. Alignment by agreement. In Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics, pages 104–111. Association for Computational Linguistics.

Matusov, Evgeny, Richard Zens, and Hermann Ney. 2004. Symmetric word alignments for statistical machine translation. In Proceedings of the 20th international conference on Computational Linguistics, page 219. Association for Computational Linguistics.

McCallum, Andrew. 2002. MALLET: A Machine Learning for Language Toolkit.

McIntyre, Neil and Mirella Lapata. 2009. Learning to tell tales: A data-driven approach to story generation. In Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1, pages 217–225. Association for Computational Linguistics.

McIntyre, Neil and Mirella Lapata. 2010. Plot Induction and Evolutionary Search for Story Generation. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, pages 1562–1572. Uppsala, Sweden: Association for Computational Linguistics.

Miller, G., A.R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. 1993. In- troduction to WordNet: an on-line lexical database. Tech. rep., Princeton University.

Mimno, David and David Blei. 2011. Bayesian Checking for Topic Models. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 227–237. Edinburgh, Scotland, UK.: Association for Computational Linguistics.

Mohammad, Saif. 2011. From Once Upon a Time to Happily Ever After: Tracking Emotions in Novels and Fairy Tales. In Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities, pages 105–114. Portland, OR, USA: Association for Computational Linguistics.

Mohammad, Saif and Peter Turney. 2010. Emotions Evoked by Common Words and Phrases: Using Mechanical Turk to Create an Emotion Lexicon. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, pages 26–34. Los Angeles, CA: Association for Computational Linguistics.

Mohammad, Saif M. 2012. From once upon a time to happily ever after: Tracking emotions in mail and books . Decision Support Systems 53(4):730 – 741.

O’Keefe, Timothy, Silvia Pareti, James R. Curran, Irena Koprinska, and Matthew Honnibal. 2012. A Sequence Labelling Approach to Quote Attribution. In Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 790–799. Jeju Island, Korea: Association for Computa- tional Linguistics.

O’Neill, Brian and Mark Riedl. 2011. Toward a computational framework of suspense and dramatic arc. In Affective Computing and Intelligent Interaction, pages 246–255. Springer.

Phelan, James and Peter J. Rabinowitz. 2012. Narrative as Rhetoric. In D. Herman, J. Phelan, P. J. Rabinowitz, B. Richardson, and R. Warhol, eds., Narrative Theory. The Ohio State University Press.

Post, Matt. 2011. Judging Grammaticality with Tree Substitution Grammar Derivations. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, pages 217–222. Portland, Oregon, USA: Association for Computational Linguistics.

Propp, Vladimir. 1968. Morphology of the Folktale. University of Texas Press, 2nd edn.

Ryan, Marie-Laure. 1991. Possible worlds, artificial intelligence and narrative theory. Bloomington: Indiana University Press.

Salway, Andrew and David Herman. 2011. Digital Corpora as Theory-building Resource. In R. Page and B. Thomas, eds., New Narratives: Stories and Storytelling in the Digital Age, pages 120–137. University of Nebraska.

Schank, Rogert and Robert Abelson. 1977. Scripts, plans, goals and understanding: An inquiry into human knowledge structures. Hillsdale, NJ.: Lawrence Erlbaum Associates.

Socher, Richard, Jeffrey Pennington, Eric H. Huang, Andrew Y. Ng, and Christopher D. Manning. 2011. Semi-Supervised Recursive Autoencoders for Predicting Sentiment Distributions. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 151– 161. Edinburgh, Scotland, UK.: Association for Computational Linguistics.

Vishwanathan, S. V. N., Nicol N. Schraudolph, Risi Kondor, and Karsten M Borgwardt. 2010. Graph kernels. The Journal of Machine Learning Re- search 11:1201–1242.

Volkova, Ekaterina P., Betty Mohler, Detmar Meurers, Dale Gerdemann, and Heinrich H. Blthoff. 2010. Emotional Perception of Fairy Tales: Achieving Agreement in Emotion Annotation of Text. In Proceedings of the NAACL HLT 2010 Workshop on Computational Approaches to Analysis and Generation of Emotion in Text, pages 98–106. Los Angeles, CA: Association for Computational Linguistics.

Wallace, Byron. 2012. Multiple Narrative Disentanglement: Unraveling Infinite Jest. In Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Lan- guage Technologies, pages 1–10. Montr ́eal, Canada: Association for Com- putational Linguistics.

Wilson, Theresa, Janyce Wiebe, and Paul Hoffmann. 2005. Recognizing Contextual Polarity in Phrase-Level Sentiment Analysis. In Proceedings of Human Language Technology Conference and Conference on Empiri- cal Methods in Natural Language Processing, pages 347–354. Vancouver, British Columbia, Canada: Association for Computational Linguistics.

Yessenalina, Ainur and Claire Cardie. 2011. Compositional Matrix-Space Models for Sentiment Analysis. In Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing, pages 172–182. Edinburgh, Scotland, UK.: Association for Computational Linguistics.

Yu, Chun-Nam John and Thorsten Joachims. 2009. Learning structural SVMs with latent variables. In Proceedings of the 26th Annual International Conference on Machine Learning, ICML ’09, pages 1169–1176. New York, NY, USA: ACM. ISBN 978-1-60558-516-1.


Refbacks

  • There are currently no refbacks.