Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information
Abstract
:1. Introduction
- We introduce the correlation information of argument roles to further improve joint movie scene event extraction.
- We propose an SRC-based GAT to capture the semantic features and integrate the correlation information of argument roles into the semantic features.
- We constructed a movie scene extraction dataset to verify the effectiveness of our model. The experimental results show that our model outperforms competitive models, and the correlation information between argument roles can help to improve the performance of movie scene event extraction.
2. Related Work
3. Model
3.1. Attentive High-Level Role Module
3.2. Event Trigger Extraction
3.3. Event Argument Extraction
4. Experiments
4.1. Experiment Setup
- MovieSceneEvent: We constructed a movie scene event extraction dataset named MovieSceneEvent for this research. To construct a movie-scene-specific event extraction dataset, we first summarized 12 common types of events based on the research needs and the suggestions of professionals in the film field. Then, we chose sentences related to these events from movie script texts. These movie scripts were selected from 13 common genres of movies (including romance, comedy, action, war movies, and so on). According to the defined event types, we first used the manually defined template to roughly screen out the texts related to the defined event type from the script text and then manually filter these texts. Finally, these sentences were further labeled manually. We asked two annotators to label each sample. If their labeling was consistent, that result was used for the sample. If not, a third annotator was used to ensure the accuracy of the labeling. The movie scene event extraction dataset contains 5852 training samples and 486 testing samples, with 12 event types and 18 argument roles.
- ACE2005: Following previous works [3,28], we also adopted ACE2005, the widely used event extraction dataset, to evaluate the effectiveness of our model. It contains 599 documents, with 13,672 labeled sentences in the ACE2005 dataset, and these sentences are labeled with 8 given event types, 33 event subtypes, and 35 argument roles. Following [3,26], we split the ACE2005 dataset into 529, 30, and 40 documents for training, development, and testing, respectively.
4.2. Overall Performance
- (1)
- Our model steadily outperforms all other competitive models in both the trigger extraction and argument extraction of movie scene event extraction and open domain event extraction, which indicates that the SRC information can benefit both trigger and argument extraction in event extraction.
- (2)
- In argument extraction, our model significantly outperforms prior work, which may be due to the fact that the SRC information has a more direct correlation with the argument role.
- (3)
- It worth noting that the drop of F1 between both argument identification and classification, as well as trigger identification and classification, is smaller than in previous works, which means the SRC information is able to benefit the classification of both argument role and trigger event types. SRC information helps to maintain more semantic information between identification and classification.
- (4)
- When concerning the performance on open domain datasets, the improvement of our model is much smaller. This is probably due to the composition of argument roles in movie scene event extraction being much easier to generalize into several superior role concepts. Thus, the influence of SRC information is more significant.
4.3. Effect of Superior Role Concept
4.4. Influence of Dataset Size
5. Discussion
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Mcclosky, D.; Surdeanu, M.; Manning, C.D. Event Extraction as Dependency Parsing. In Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies, Portland, OR, USA, 19–24 June 2011; pp. 1626–1635. [Google Scholar]
- Liao, R. Using Document Level Cross-Event Inference to Improve Event Extraction. In Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics, Uppsala, Sweden, 11–16 July 2010; pp. 789–797. [Google Scholar]
- Li, Q.; Ji, H.; Huang, L. Joint Event Extraction via Structured Prediction with Global Features. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Sofia, Bulgaria, 4–9 August 2013; pp. 73–82. [Google Scholar]
- Ji, H.; Grishman, R. Refining Event Extraction through Cross-Document Inference. In Proceedings of the ACL-08: HLT, Columbus, OH, USA, 19–20 June 2008; pp. 254–262. [Google Scholar]
- Huang, R.; Riloff, E. Modeling Textual Cohesion for Event Extraction. Proc. AAAI Conf. Artif. Intell. 2021, 26, 1664–1670. [Google Scholar] [CrossRef]
- Sha, L.; Qian, F.; Chang, B.; Chang, B.; Sui, Z. Jointly Extracting Event Triggers and Arguments by Dependency-Bridge Rnn and Tensor-Based Argument Interaction. Proc. AAAI Conf. Artif. Intell. 2018, 32, 5916–5923. [Google Scholar] [CrossRef]
- Nguyen, T.H.; Grishman, R. Graph Convolutional Networks with Argument-Aware Pooling for Event Detection. Proc. AAAI Conf. Artif. Intell. 2018, 32, 5900–5907. [Google Scholar] [CrossRef]
- Nguyen, T.H.; Grishman, R. Event Detection and Domain Adaptation with Convolutional Neural Networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Beijing, China, 26–31 July 2015; Association for Computational Linguistics: Stroudsburg, PA, USA, 2015. [Google Scholar]
- Chen, Y.; Xu, L.; Liu, K.; Zeng, D.; Zhao, J. Event Extraction via Dynamic Multi-Pooling Convolutional Neural Networks. In Proceedings of the 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), Beijing, China, 26–31 July 2015; Association for Computational Linguistics: Stroudsburg, PA, USA, 2015. [Google Scholar]
- Chen, Y.; Chen, T.; Ebner, S.; White, A.S.; Van Durme, B. Reading the Manual: Event Extraction as Definition Comprehension. arXiv 2019, arXiv:1912.01586. [Google Scholar]
- Du, X.; Cardie, C. Event Extraction by Answering (Almost) Natural Questions. arXiv 2020, arXiv:2004.13625. [Google Scholar]
- Petar, V.; Cucurull, G.; Casanova, A. Graph Attention Networks. arXiv 2017, arXiv:1710.10903. [Google Scholar]
- Yan, M.; Lou, X.; Chan, C.A.; Wang, Y.; Jiang, W. A semantic and emotion-based dual latent variable generation model for a dialogue system. CAAI Trans. Intell. Technol. 2023, 1–12. [Google Scholar] [CrossRef]
- Yi, Q.; Zhang, G.; Zhang, S. Utilizing Entity-Based Gated Convolution and Multilevel Sentence Attention to Improve Distantly Supervised Relation Extraction. Comput. Intell. Neurosci. 2021, 2021, 6110885. [Google Scholar] [CrossRef] [PubMed]
- Liu, W.; Pang, J.; Du, Q.; Li, N.; Yang, S. A Method of Short Text Representation Fusion with Weighted Word Embeddings and Extended Topic Information. Sensors 2022, 22, 1066. [Google Scholar] [CrossRef] [PubMed]
- Pota, M.; Ventura, M.; Catelli, R.; Esposito, M. An Effective BERT-Based Pipeline for Twitter Sentiment Analysis: A Case Study in Italian. Sensors 2021, 21, 133. [Google Scholar] [CrossRef] [PubMed]
- Yan, M.; Li, S.; Chan, C.A.; Shen, Y.; Yu, Y. Mobility Prediction Using a Weighted Markov Model Based on Mobile User Classification. Sensors 2021, 21, 1740. [Google Scholar] [CrossRef] [PubMed]
- Kriman, S.; Ji, H. Joint Detection and Coreference Resolution of Entities and Events with Document-Level Context Aggregation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Student Research Workshop, Bangkok, Thailand, 5–6 August 2021; Association for Computational Linguistics: Stroudsburg, PA, USA, 2021. [Google Scholar]
- Lyu, Q.; Zhang, H.; Sulem, E.; Roth, D. Zero-Shot Event Extraction via Transfer Learning: Challenges and Insights. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 2: Short Papers), Online, 1–6 August 2021; Association for Computational Linguistics: Stroudsburg, PA, USA, 2021. [Google Scholar]
- Lin, Y.; Ji, H.; Huang, F.; Wu, L. A Joint Neural Model for Information Extraction with Global Features. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, Online, 5–10 July 2020; pp. 7999–8009. [Google Scholar]
- Li, D.; Huang, L.; Ji, H.; Han, J. Biomedical Event Extraction Based on Knowledge-Driven Tree-LSTM. In Proceedings of the NAACL2019, Minneapolis, MN, USA, 2–7 June 2019; pp. 1421–1430. [Google Scholar]
- He, H.; Ning, Q.; Roth, D. QuASE: Question-Answer Driven Sentence Encoding. arXiv 2019, arXiv:1909.00333. [Google Scholar]
- Liu, J.; Chen, Y.; Liu, K.; Bi, W.; Liu, X. Event Extraction as Machine Reading Comprehension. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), Online, 16–20 November 2020; Association for Computational Linguistics: Stroudsburg, PA, USA, 2020. [Google Scholar]
- Finkel, J.R.; Grenager, T.; Manning, C. Incorporating Non-Local Information into Information Extraction Systems by Gibbs Sampling. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics—ACL’05, Ann Arbor, MI, USA, 25–30 June 2005; Association for Computational Linguistics: Morristown, NJ, USA, 2005. [Google Scholar]
- Devlin, J.; Chang, M.-W.; Lee, K.; Toutanova, K. BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding. arXiv 2018, arXiv:1810.04805. [Google Scholar]
- Weston, J.; Chopra, S.; Bordes, A. Memory Networks. arXiv 2014, arXiv:1410.3916. [Google Scholar]
- Tarjan, R. Depth-first search and linear graph algorithms. SIAM J. Comput. 1972, 1, 146–160. [Google Scholar] [CrossRef]
- Yang, B.; Mitchell, T.M. Joint Extraction of Events and Entities within a Document Context. arXiv 2016, arXiv:1609.03632. [Google Scholar]
- Lu, Y.; Lin, H.; Xu, J.; Han, X.; Tang, J.; Li, A.; Sun, L.; Liao, M.; Chen, S. Text2Event: Controllable Sequence-Tostructure Generation for End-to-End Event Extraction. arXiv 2021, arXiv:2106.09232. [Google Scholar]
- Yi, Q.; Zhang, G.; Liu, J.; Zhang, S. Movie Scene Argument Extraction with Trigger Action Information. In Proceedings of the 021 International Conference on Culture-oriented Science & Technology (ICCST), Beijing, China, 18–21 November 2021. [Google Scholar]
Parameter | Value |
---|---|
Word embedding size | 768 |
Batch size | 25 |
Epoch size | 20 |
Dropout rate | 0.5 |
Learning rate | 0.005 |
Optimizer | AdaGrad |
Model | Trigger | Argument | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Identification | Classification | Identification | Classification | |||||||||
P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
JOINTFEATURE | 61.0 | 63.2 | 62.1 | 70.1 | 51.6 | 59.4 | 51.0 | 40.9 | 45.4 | 44.3 | 41.6 | 42.9 |
DbRNN | 63.3 | 61.8 | 62.5 | 61.1 | 50.7 | 55.4 | 41.7 | 49.5 | 45.2 | 43.5 | 45.6 | 44.5 |
Joint3EE | 65.8 | 72.9 | 69.1 | 60.5 | 66.7 | 63.4 | 48.9 | 51.1 | 49.9 | 50.7 | 42.8 | 46.4 |
BS | 66.4 | 70.8 | 68.2 | 61.7 | 68.1 | 64.7 | 42.0 | 43.3 | 42.6 | 40.1 | 35.9 | 37.9 |
Text2Event | 68.2 | 70.3 | 69.2 | 62.1 | 66.2 | 64.1 | 45.3 | 47.2 | 46.2 | 47.3 | 48.4 | 47.8 |
Ours | 69.1 | 71.6 | 70.3 | 65.6 | 69.1 | 67.3 | 50.6 | 57.3 | 53.7 | 53.3 | 47.3 | 50.1 |
Model | Trigger | Argument | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Identification | Classification | Identification | Classification | |||||||||
P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
JOINTFEATURE | 77.6 | 65.4 | 70.1 | 75.1 | 63.3 | 68.7 | 73.7 | 38.5 | 50.6 | 70.6 | 36.9 | 48.4 |
dbRNN | - | - | - | 70.1 | 69.8 | 71.9 | - | - | 57.2 | - | - | 50.1 |
Joint3EE | 70.5 | 74.5 | 72.5 | 68.0 | 71.8 | 69.8 | 59.9 | 59.8 | 59.9 | 52.1 | 52.1 | 52.1 |
BS | 68.9 | 77.3 | 72.9 | 66.7 | 74.7 | 70.5 | 44.9 | 41.2 | 43.0 | 44.3 | 40.7 | 42.4 |
Text2Event | - | - | - | 71.2 | 72.5 | 71.8 | - | - | - | 54.0 | 54.8 | 54.4 |
Ours | 70.4 | 76.6 | 73.3 | 70.2 | 75.1 | 72.6 | 58.4 | 53.3 | 55.7 | 56.7 | 52.8 | 54.7 |
Model | Trigger | Argument | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Identification | Classification | Identification | Classification | |||||||||
P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
GAT | 65.8 | 68.3 | 67.0 | 66.7 | 64.8 | 65.7 | 50.9 | 52.1 | 51.5 | 45.1 | 42.1 | 43.5 |
GAT-TRI+SRC | 66.1 | 65.2 | 65.6 | 66.0 | 62.2 | 64.0 | 55.6 | 51.1 | 53.2 | 48.5 | 40.6 | 44.2 |
GAT-ARG+SRC | 65.3 | 72.4 | 68.6 | 65.5 | 70.0 | 67.7 | 52.8 | 50.9 | 51.8 | 50.9 | 47.1 | 48.9 |
Ours | 69.1 | 71.6 | 70.3 | 65.6 | 69.1 | 67.3 | 50.6 | 57.3 | 53.7 | 53.3 | 47.3 | 50.1 |
Size | Trigger | Argument | ||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|
Identification | Classification | Identification | Classification | |||||||||
P | R | F1 | P | R | F1 | P | R | F1 | P | R | F1 | |
25% | 45.8 | 32.3 | 37.9 | 29.6 | 35.4 | 32.2 | 25.6 | 21.1 | 23.1 | 21.1 | 28.1 | 24.1 |
50% | 49.1 | 47.2 | 48.1 | 46.0 | 52.2 | 48.9 | 45.3 | 49.1 | 47.1 | 38.5 | 42.6 | 40.4 |
75% | 65.5 | 68.4 | 66.9 | 64.5 | 70.0 | 67.1 | 50.8 | 52.9 | 51.8 | 49.9 | 46.1 | 47.9 |
100% | 69.1 | 71.6 | 70.3 | 65.6 | 69.1 | 67.3 | 50.6 | 57.3 | 53.7 | 53.3 | 47.3 | 50.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Yi, Q.; Zhang, G.; Liu, J.; Zhang, S. Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information. Sensors 2023, 23, 2285. https://doi.org/10.3390/s23042285
Yi Q, Zhang G, Liu J, Zhang S. Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information. Sensors. 2023; 23(4):2285. https://doi.org/10.3390/s23042285
Chicago/Turabian StyleYi, Qian, Guixuan Zhang, Jie Liu, and Shuwu Zhang. 2023. "Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information" Sensors 23, no. 4: 2285. https://doi.org/10.3390/s23042285
APA StyleYi, Q., Zhang, G., Liu, J., & Zhang, S. (2023). Movie Scene Event Extraction with Graph Attention Network Based on Argument Correlation Information. Sensors, 23(4), 2285. https://doi.org/10.3390/s23042285