Detection and Resolution of Verb Phrase Ellipsis

Marjorie McShane, Petr Babkin

Abstract


Verb phrase (VP) ellipsis is the omission of a verb phrase whose meaning can be reconstructed from the linguistic or real-world context. It is licensed in English by auxiliary verbs, often modal auxiliaries: She can go to Hawaii but he can’t [e]. This paper describes a system called ViPER (VP Ellipsis Resolver) that detects and resolves VP ellipsis, relying on linguistic principles such as syntactic parallelism, modality correlations, and the delineation of core vs. peripheral sentence constituents. The key insight guiding the work is that not all cases of ellipsis are equally difficult: some can be detected and resolved with high confidence even before we are able to build systems with human-level semantic and pragmatic understanding of text.
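
To make the licensing condition concrete, the following is a minimal sketch of how a candidate elided VP might be flagged: an auxiliary that is immediately followed by clause-final punctuation or by the end of the input. It is an illustration only and is not ViPER's actual detection procedure; the auxiliary list, the tokenizer, and the function names are all assumptions introduced here.

```python
import re

# Hypothetical, deliberately small inventory of licensing auxiliaries.
# This is an assumption for illustration, not the list used by ViPER.
AUXILIARIES = {
    "can", "can't", "cannot", "could", "couldn't",
    "will", "won't", "would", "wouldn't",
    "should", "shouldn't", "may", "might", "must",
    "do", "does", "did", "don't", "doesn't", "didn't",
}

# Punctuation treated as closing a clause in this sketch.
CLAUSE_END = {".", ",", ";", "!", "?"}


def tokenize(text):
    """Split text into word tokens (keeping contractions whole) and punctuation."""
    text = text.replace("\u2019", "'")  # normalize curly apostrophes
    return re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?|[.,;!?]", text)


def find_ellipsis_candidates(text):
    """Return (token_index, auxiliary) pairs where an auxiliary is stranded,
    i.e. followed directly by clause-final punctuation or the end of input."""
    tokens = tokenize(text)
    candidates = []
    for i, tok in enumerate(tokens):
        if tok.lower() not in AUXILIARIES:
            continue
        following = tokens[i + 1] if i + 1 < len(tokens) else None
        if following is None or following in CLAUSE_END:
            candidates.append((i, tok))
    return candidates


if __name__ == "__main__":
    print(find_ellipsis_candidates("She can go to Hawaii but he can't."))
```

On the abstract's example, only the second auxiliary is flagged, because the first is followed by an overt verb. A surface check like this covers detection only and would both over- and under-generate on real text; choosing the antecedent (here, "go to Hawaii") is the part that calls for principles such as syntactic parallelism and modality correlations.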


Keywords


VP ellipsis; rule-based NLP; syntactic parallelism; modality


