Ontology-based rhetorical figures recognition
Abstract
Automatic recognition of rhetorical figures (similes, irony, sarcasm, humor, metaphors, etc.) is increasingly used in natural language processing tasks, primarily to improve sentiment classification, machine translation, but also analyzis of linguistic structures on different levels. In this paper, we propose a method of automatic recognition and classification of rhetorical figures from a group of tropes that uses ontological inference rules in an ontology based on Serbian WordNet (SWN). A binary classification method was carried out on the rhetorical figure simile and evaluated by ROC curve (AUC = 0.696) which indicates that it can be successfully used in solving these types of tasks. Also it is proposed a semi-automatic ontology learning method, for further learning ontology SWN, by increasing the number and the type of relationships that can assist in the detection of figurative language in the texts in Serbian.References
Alistar Kennedy and Diana Inkpen, “Sentiment Classification of Movie Reviews Using Contextual Valence Shifters”, Computational Intelligence (special issue), 22, no. 2 (2006): 110–125.
Andrew Hardie, Veronica Koller, Paul Rayson and Elena Semino, „Exploiting a semantic annotation tool for metaphor analysis“, In Proceedings of the Corpus Linguistics 2007 Conference (2007).
Antonio Reyes and Paolo Rosso, „Building Corpora for Figurative Language Processing: The Case of Irony Detection“, In Proceedings of the 4th International Workshop on Corpora for Research on Emotion Sentiment & Social Signals, Satellite of LREC 2012 (ELRA, 2012), 94–98. (2012a).
Antonio Reyes and Paolo Rosso, “Making objective decisions from subjective data: Detecting irony in customer reviews”, Decision Support Systems, 53, no. 4 (2012): 754–760. (2012b).
Ashley R. Kelly, Nike A. Abbott, Randy Allen Harris, Chrysanne DiMarco and David R. Cheriton, „Toward an ontology of rhetorical figures“, In Proceedings of the 28th ACM International Conference on Design of Communication SIGDOC '10 (New York, NY, USA: ACM, 2010), 123–130.
Bo Pang, Lillian Lee and Shivakumar Vaithyanathan, „Thumbs up? Sentiment Classification using Machine Learning Techniques“, In Proceedings of the ACL-02 conference on Empirical Methods in Natural Language Processing (EMNLP) (Stroudsburg, PA, USA: ACL, 2002), 79–86.
Christiane Fellbaum, ed. WordNet: An Electronic Lexical Database (Cambridge, MA: MIT Press, 1998).
Cristina Nicolae, Gabriel Nicolae and Sanda Harabagiu, „UTD-HLT-CG: Semantic Architecture for Metonymy Resolution and Classification of Nominal Relations“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 454–459.
Dan Fass, „met*:A method for discriminating metonymy and metaphor by computer“ Computational Linguistics, 17, no. 1 (1991):49–90.
Daniel G. Bobrow. „A question-answering system for high school algebra word problems“, In Proceedings of AFIPS conference, 26, (New York, NY, USA: ACM, 1964), 591–614.
Daniel Devatman Hromada, „Initial Experiments with Multilingual Extraction of Rhetoric Figures by means of PERL-compatible Regular Expressions“, In Proceedings of the Student Research Workshop associated with The 8th International Conference on Recent Advances in Natural Language Processing (RANLP) (Hissar, Bulgaria: ACL, 2011), 85–90.
Dmitry Davidov, Oren Tsur and Ari Rappoport, „A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews“, In Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media (ICWSM-2010), (Stroudsburg, PA, USA: ACL, 2010), 107–116.
Ekaterina Shutova, Simone Teufel and Anna Korhonen, “Statistical Metaphor Processing”, Computional Linguistics, 39, no. 2 (2013): 301–353.
Elena Filatova, „Irony and sarcasm: Corpus generation and analysis using crowdsourcing“. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) (ELRA, 2012), 392–398.
Francesco Barbieri, Francesco Ronzano and Horacio Saggion, „UPF-taln: SemEval 2015 Tasks 10 and 11. Sentiment Analysis of Literal and Figurative Language in Twitter“, In Proceedings of the 9th International Workshop on Semantic Evaluation, (Denver, Colorado: SemEval, 2015), 704–708.
Jakub Gawryjolek, Chrysanne Di Marco and Randy A. Harris, „An Annotation Tool for Automatically Detecting Rhetorical Figures – System Demonstration“, In Proceedings of the IJCAI-09 workshop on Computational Models of Natural Argument. (Pasadena, CA, 2009).
Jelena Mitrović, “Electronic Tools and Resources for Multi-Word Unit Detection and Research in Serbian” (poster presented at Тhe 2th General Meeting of The IC1207 COST Action, PARSEME. Athens, Greece, 10-11 March, 2014).
Jelena Mitrović, Miljana Mladenović and Cvetana Krstev, “Adding MWEs to Serbian Lexical Resources Using Crowdsourcing” (poster presented at Тhe 5th PARSEME general meeting. Iași, Romania, 23–24 September, 2015).
Johannes Leveling, „Metonymy Recognition Using Different Kinds of Context for a Memory-Based Learner“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 153–156.
Katja Markert and Malvina Nissim, „Metonymy resolution as a classification task“, In Proceedings of EMNLP, (Stroudsburg, PA, USA: ACL, 2002), 204–213.
Krešimir Bagić, Rječnik stilskih figura (Zagreb: Školska knjiga, 2012).
Lowri Williams, Christian Bannister, Miguel Arribas-Ayllon, Alun Preece and Irena Spasic, „The role of idioms in sentiment analysis“, Expert Systems with Applications 42, no. 21 (2015): 7375 – 7385.
Massimo Poesio and Ron Artstein, „Anaphoric Annotation in the ARRAU Corpus“, In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) (ELRA, 2008), 1170–1174.
Miljana Mladenović and Jelena Mitrović, “Ontology of Rhetorical Figures for Serbian”, Text, Speech and Dialogue, TSD 2013, 8082 (2013): 386–393.
Miljana Mladenović and Jelena Mitrović, „Semantic Networks for Serbian: New Functionalities of Developing and Maintaining a WordNet Tool“, In Natural Language Processing for Serbian – Resources and Application, editors Gordana Pavlović Lažetić, Cvetana Krstev, Ivan Obradović and Duško Vitas (Beograd, University of Belgrade, Faculty of Mathematics: 2014), 1-11.
Miljana Mladenović, Jelena Mitrović and Cvetana Krstev, „Introducing a Language-independent Model for Adding a New Semantic Relation Between Adjectives and Nouns in a WordNet“, In Proceedings of Eight Global WordNet Conference (Bucharest, Romania, 2016), 218-225.
Miljana Mladenović, Jelena Mitrović, Cvetana Krstev and Duško Vitas, “Hybrid Sentiment Analysis Framework For A Morphologically Rich Language”, Journal of Intelligent Information Systems, 46, no. 3 (2016): 599-620.
Miloš Utvić, “Construction of a reference corpus of contemporary Serbian language” (PhD diss., University of Belgrade, Faculty of linguistics, 2014).
Paula Carvalho, Luís Sarmento, Jorge Teixeira and Mário J. Silva, „Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates“, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2, (Stroudsburg, PA, USA: ACL, 2011), 564–568.
Paula Carvalho, Luís Sarmento, Mário J. Silva and Eugénio de Oliveira, „Clues for Detecting Irony in User-generated Contents: Oh...!! It's "So Easy" ;-)“, In Proceedings of the 1st International CIKM Workshop on Topic-sentiment Analysis for Mass Opinion (New York, NY, USA: ACM, 2009), 53–56.
Randy A. Harris and Chrysanne Di Marco, “Constructing a rhetorical figuration ontology” (presented at Symposium on Persuasive Technology and Digital Behavior Intervention, Convention of the Society for the Study of Artificial Intelligence and Simulation of Behaviour (AISB), Edinburgh, 2009).
Richárd Farkas, Eszter Simon, György Szarvas and Dánel Varga, „Gyder: Maxent metonymy resolution“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 161–164.
Roberto González-Ibáñez, Smaranda Muresan and Nina Wacholder, „Identifying sarcasm in Twitter: a closer look“, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:shortpapers (ACL-2011) (Stroudsburg, PA, USA: ACL, 2011), 581-586.
Ruslan Mitkov, Anaphora Resolution (Cambridge, UK: Longman, 2002).
Stefano Baccianella, Andrea Esuli and Fabrizio Sebastiani, „SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining“, In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10) (ELRA, 2010).
Svetla Koeva, Cvetana Krstev and Duško Vitas, „Morpho-semantic Relations in WordNet - a Case Study for two Slavic Languages“, In Proceedings of Global WordNet Conference, (Stroudsburg, PA, USA: ACL, 2008), 239–253.
Thomas R. Gruber, “A translation approach to portable ontologies”, Knowledge Acquisition, 5, no. 2 (1993): 199–220.
Tony Veale, „Detecting and Generating Ironic Comparisons: An Application of Creative Information Retrieval“, Artificial Intelligence of Humor, Papers from the 2012 {AAAI} Fall Symposium (Arlington, Virginia, USA, November 2-4, 2012).
Tony Veale and Yanfen Hao, „Support structures for linguistic creativity: a computational analysis of creative irony in similes“, In Proceedings of CogSci 2009, the 31st annual meeting of the cognitive science society (Austin, Texas: Cognitive Science Society, 2009), 1376–1381.
Vassiliki Rentoumi, Stefanos Petrakis, Manfred Klenner, George A. Vouros and Vangelis Karkaletsis, „United we stand - improving sentiment analysis by joining machine learning and rule based methods“, In Proceedings of the 7th Language Resources and Evaluation Conference (LREC 2010) (ELRA, 2010).
Veronika Koller, Andrew Hardie, Paul Rayson and Elena Semino, „Using a semantic annotation tool for the analysis of metaphor in discourse“, In metaphorik.de, 15 (2008):141–160.
Vladan Devedžić, Semantic Web and Education, Monograph (Berlin Heidelberg New York: Springer, 2006).
Yachary J. Mason, “CorMet: a computationa l, corpus-based conventional metaphor extraction system”, Computational Linguistics, 30, no. 1 (2004): 23–44.
Yanfen Hao and Tony Veale, “An Ironic Fist in a Velvet Glove: Creative Mis-Representation in the Construction of Ironic Similes”, Journal Minds and Machines, 20, no. 4 (2010): 635–650.
Andrew Hardie, Veronica Koller, Paul Rayson and Elena Semino, „Exploiting a semantic annotation tool for metaphor analysis“, In Proceedings of the Corpus Linguistics 2007 Conference (2007).
Antonio Reyes and Paolo Rosso, „Building Corpora for Figurative Language Processing: The Case of Irony Detection“, In Proceedings of the 4th International Workshop on Corpora for Research on Emotion Sentiment & Social Signals, Satellite of LREC 2012 (ELRA, 2012), 94–98. (2012a).
Antonio Reyes and Paolo Rosso, “Making objective decisions from subjective data: Detecting irony in customer reviews”, Decision Support Systems, 53, no. 4 (2012): 754–760. (2012b).
Ashley R. Kelly, Nike A. Abbott, Randy Allen Harris, Chrysanne DiMarco and David R. Cheriton, „Toward an ontology of rhetorical figures“, In Proceedings of the 28th ACM International Conference on Design of Communication SIGDOC '10 (New York, NY, USA: ACM, 2010), 123–130.
Bo Pang, Lillian Lee and Shivakumar Vaithyanathan, „Thumbs up? Sentiment Classification using Machine Learning Techniques“, In Proceedings of the ACL-02 conference on Empirical Methods in Natural Language Processing (EMNLP) (Stroudsburg, PA, USA: ACL, 2002), 79–86.
Christiane Fellbaum, ed. WordNet: An Electronic Lexical Database (Cambridge, MA: MIT Press, 1998).
Cristina Nicolae, Gabriel Nicolae and Sanda Harabagiu, „UTD-HLT-CG: Semantic Architecture for Metonymy Resolution and Classification of Nominal Relations“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 454–459.
Dan Fass, „met*:A method for discriminating metonymy and metaphor by computer“ Computational Linguistics, 17, no. 1 (1991):49–90.
Daniel G. Bobrow. „A question-answering system for high school algebra word problems“, In Proceedings of AFIPS conference, 26, (New York, NY, USA: ACM, 1964), 591–614.
Daniel Devatman Hromada, „Initial Experiments with Multilingual Extraction of Rhetoric Figures by means of PERL-compatible Regular Expressions“, In Proceedings of the Student Research Workshop associated with The 8th International Conference on Recent Advances in Natural Language Processing (RANLP) (Hissar, Bulgaria: ACL, 2011), 85–90.
Dmitry Davidov, Oren Tsur and Ari Rappoport, „A Great Catchy Name: Semi-Supervised Recognition of Sarcastic Sentences in Online Product Reviews“, In Proceedings of the Fourth International AAAI Conference on Weblogs and Social Media (ICWSM-2010), (Stroudsburg, PA, USA: ACL, 2010), 107–116.
Ekaterina Shutova, Simone Teufel and Anna Korhonen, “Statistical Metaphor Processing”, Computional Linguistics, 39, no. 2 (2013): 301–353.
Elena Filatova, „Irony and sarcasm: Corpus generation and analysis using crowdsourcing“. In Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) (ELRA, 2012), 392–398.
Francesco Barbieri, Francesco Ronzano and Horacio Saggion, „UPF-taln: SemEval 2015 Tasks 10 and 11. Sentiment Analysis of Literal and Figurative Language in Twitter“, In Proceedings of the 9th International Workshop on Semantic Evaluation, (Denver, Colorado: SemEval, 2015), 704–708.
Jakub Gawryjolek, Chrysanne Di Marco and Randy A. Harris, „An Annotation Tool for Automatically Detecting Rhetorical Figures – System Demonstration“, In Proceedings of the IJCAI-09 workshop on Computational Models of Natural Argument. (Pasadena, CA, 2009).
Jelena Mitrović, “Electronic Tools and Resources for Multi-Word Unit Detection and Research in Serbian” (poster presented at Тhe 2th General Meeting of The IC1207 COST Action, PARSEME. Athens, Greece, 10-11 March, 2014).
Jelena Mitrović, Miljana Mladenović and Cvetana Krstev, “Adding MWEs to Serbian Lexical Resources Using Crowdsourcing” (poster presented at Тhe 5th PARSEME general meeting. Iași, Romania, 23–24 September, 2015).
Johannes Leveling, „Metonymy Recognition Using Different Kinds of Context for a Memory-Based Learner“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 153–156.
Katja Markert and Malvina Nissim, „Metonymy resolution as a classification task“, In Proceedings of EMNLP, (Stroudsburg, PA, USA: ACL, 2002), 204–213.
Krešimir Bagić, Rječnik stilskih figura (Zagreb: Školska knjiga, 2012).
Lowri Williams, Christian Bannister, Miguel Arribas-Ayllon, Alun Preece and Irena Spasic, „The role of idioms in sentiment analysis“, Expert Systems with Applications 42, no. 21 (2015): 7375 – 7385.
Massimo Poesio and Ron Artstein, „Anaphoric Annotation in the ARRAU Corpus“, In Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08) (ELRA, 2008), 1170–1174.
Miljana Mladenović and Jelena Mitrović, “Ontology of Rhetorical Figures for Serbian”, Text, Speech and Dialogue, TSD 2013, 8082 (2013): 386–393.
Miljana Mladenović and Jelena Mitrović, „Semantic Networks for Serbian: New Functionalities of Developing and Maintaining a WordNet Tool“, In Natural Language Processing for Serbian – Resources and Application, editors Gordana Pavlović Lažetić, Cvetana Krstev, Ivan Obradović and Duško Vitas (Beograd, University of Belgrade, Faculty of Mathematics: 2014), 1-11.
Miljana Mladenović, Jelena Mitrović and Cvetana Krstev, „Introducing a Language-independent Model for Adding a New Semantic Relation Between Adjectives and Nouns in a WordNet“, In Proceedings of Eight Global WordNet Conference (Bucharest, Romania, 2016), 218-225.
Miljana Mladenović, Jelena Mitrović, Cvetana Krstev and Duško Vitas, “Hybrid Sentiment Analysis Framework For A Morphologically Rich Language”, Journal of Intelligent Information Systems, 46, no. 3 (2016): 599-620.
Miloš Utvić, “Construction of a reference corpus of contemporary Serbian language” (PhD diss., University of Belgrade, Faculty of linguistics, 2014).
Paula Carvalho, Luís Sarmento, Jorge Teixeira and Mário J. Silva, „Liars and Saviors in a Sentiment Annotated Corpus of Comments to Political Debates“, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: Short Papers, 2, (Stroudsburg, PA, USA: ACL, 2011), 564–568.
Paula Carvalho, Luís Sarmento, Mário J. Silva and Eugénio de Oliveira, „Clues for Detecting Irony in User-generated Contents: Oh...!! It's "So Easy" ;-)“, In Proceedings of the 1st International CIKM Workshop on Topic-sentiment Analysis for Mass Opinion (New York, NY, USA: ACM, 2009), 53–56.
Randy A. Harris and Chrysanne Di Marco, “Constructing a rhetorical figuration ontology” (presented at Symposium on Persuasive Technology and Digital Behavior Intervention, Convention of the Society for the Study of Artificial Intelligence and Simulation of Behaviour (AISB), Edinburgh, 2009).
Richárd Farkas, Eszter Simon, György Szarvas and Dánel Varga, „Gyder: Maxent metonymy resolution“, In Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007) (Stroudsburg, PA, USA: ACL, 2007), 161–164.
Roberto González-Ibáñez, Smaranda Muresan and Nina Wacholder, „Identifying sarcasm in Twitter: a closer look“, In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics:shortpapers (ACL-2011) (Stroudsburg, PA, USA: ACL, 2011), 581-586.
Ruslan Mitkov, Anaphora Resolution (Cambridge, UK: Longman, 2002).
Stefano Baccianella, Andrea Esuli and Fabrizio Sebastiani, „SentiWordNet 3.0: An Enhanced Lexical Resource for Sentiment Analysis and Opinion Mining“, In Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10) (ELRA, 2010).
Svetla Koeva, Cvetana Krstev and Duško Vitas, „Morpho-semantic Relations in WordNet - a Case Study for two Slavic Languages“, In Proceedings of Global WordNet Conference, (Stroudsburg, PA, USA: ACL, 2008), 239–253.
Thomas R. Gruber, “A translation approach to portable ontologies”, Knowledge Acquisition, 5, no. 2 (1993): 199–220.
Tony Veale, „Detecting and Generating Ironic Comparisons: An Application of Creative Information Retrieval“, Artificial Intelligence of Humor, Papers from the 2012 {AAAI} Fall Symposium (Arlington, Virginia, USA, November 2-4, 2012).
Tony Veale and Yanfen Hao, „Support structures for linguistic creativity: a computational analysis of creative irony in similes“, In Proceedings of CogSci 2009, the 31st annual meeting of the cognitive science society (Austin, Texas: Cognitive Science Society, 2009), 1376–1381.
Vassiliki Rentoumi, Stefanos Petrakis, Manfred Klenner, George A. Vouros and Vangelis Karkaletsis, „United we stand - improving sentiment analysis by joining machine learning and rule based methods“, In Proceedings of the 7th Language Resources and Evaluation Conference (LREC 2010) (ELRA, 2010).
Veronika Koller, Andrew Hardie, Paul Rayson and Elena Semino, „Using a semantic annotation tool for the analysis of metaphor in discourse“, In metaphorik.de, 15 (2008):141–160.
Vladan Devedžić, Semantic Web and Education, Monograph (Berlin Heidelberg New York: Springer, 2006).
Yachary J. Mason, “CorMet: a computationa l, corpus-based conventional metaphor extraction system”, Computational Linguistics, 30, no. 1 (2004): 23–44.
Yanfen Hao and Tony Veale, “An Ironic Fist in a Velvet Glove: Creative Mis-Representation in the Construction of Ironic Similes”, Journal Minds and Machines, 20, no. 4 (2010): 635–650.
Published
2016-06-09
How to Cite
MLADENOVIĆ, Miljana.
Ontology-based rhetorical figures recognition.
Infotheca - Journal for Digital Humanities, [S.l.], v. 16, n. 1-2, june 2016.
ISSN 2217-9461.
Available at: <https://infoteka.bg.ac.rs/ojs/index.php/Infoteka/article/view/2016.16.1_2.2_en>. Date accessed: 18 nov. 2024.
doi: https://doi.org/10.18485/infotheca.2016.16.1_2.2.
Section
Articles
Keywords
rhetorical figures; simile; ontology-based classification; WordNet