skip to main content
10.1145/3459637.3482089acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
short-paper

Distilling Numeral Information for Volatility Forecasting

Published: 30 October 2021 Publication History

Abstract

The volatility of stock price reflects the risk of stock and influences the risk of investor's portfolio. It is also a crucial part of pricing derivative securities. Researchers have paid their attention to predict the stock volatility with different kinds of textual data. However, most of them focus on using word information only. Few touch on capturing the numeral information in textual data, providing fine-grained clues for financial document understanding. In this paper, we present a novel dataset, ECNum, for understanding the numerals in the transcript of earnings conference calls. We propose a simple but efficient method, Numeral-Aware Model (NAM), for enhancing the capacity of numeral understanding of neural network models. We employ the distilled information in the stock volatility forecasting task and achieve the best performance compared to the previous works in short-term scenarios.

Supplementary Material

MP4 File (ConCall-CIKM-2021.mp4)
The volatility of stock price reflects the risk of stock and influences the risk of investor's portfolio. It is also a crucial part of pricing derivative securities. Researchers have paid their attention to predict the stock volatility with different kinds of textual data. However, most of them focus on using word information only. Few touch on capturing the numeral information in textual data, providing fine-grained clues for financial document understanding. In this paper, we present a novel dataset, ECNum, for understanding the numerals in the transcript of earnings conference calls. We propose a simple but efficient method, Numeral-Aware Model (NAM), for enhancing the capacity of numeral understanding of neural network models. We employ the distilled information in the stock volatility forecasting task and achieve the best performance compared to the previous works in short-term scenarios.

References

[1]
Nilabhra Bhattacharya, Ervin L Black, Theodore E Christensen, and Chad R Larson. 2003. Assessing the relative informativeness and permanence of pro forma earnings and GAAP operating earnings. Journal of Accounting and Economics, Vol. 36, 1--3 (2003), 285--319.
[2]
Dirk E Black, Theodore E Christensen, Jack T Ciesielski, and Benjamin C Whipple. 2018. Non-GAAP reporting: Evidence from academia and current practice. Journal of Business Finance & Accounting, Vol. 45, 3--4 (2018), 259--294.
[3]
Tim Bollerslev. 1986. Generalized autoregressive conditional heteroskedasticity. Journal of econometrics, Vol. 31, 3 (1986), 307--327.
[4]
Mark T Bradshaw, Theodore E Christensen, Kurt H Gee, and Benjamin C Whipple. 2018. Analysts' GAAP earnings forecasts and their implications for accounting research. Journal of Accounting and Economics, Vol. 66, 1 (2018), 46--66.
[5]
Chung-Chi Chen, Hen-Hsen Huang, and Hsin-Hsi Chen. 2020. NumClaim: Investor's Fine-grained Claim Detection. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management. 1973--1976.
[6]
Chung-Chi Chen, Hen-Hsen Huang, Yow-Ting Shiue, and Hsin-Hsi Chen. 2018. Numeral understanding in financial tweets for fine-grained crowd-based forecasting. In 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI). IEEE, 136--143.
[7]
Chung-Chi Chen, Hen-Hsen Huang, Hiroya Takamura, and Hsin-Hsi Chen. 2019. Overview of the ntcir-14 finnum task: Fine-grained numeral understanding in financial social media data. In Proceedings of the 14th NTCIR Conference on Evaluation of Information Access Technologies. 19--27.
[8]
Necs at Dereli and Murat Saraclar. 2019. Convolutional Neural Networks for Financial Text Regression. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop. Association for Computational Linguistics, Florence, Italy, 331--337. https://doi.org/10.18653/v1/P19--2046
[9]
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1. Association for Computational Linguistics, Minneapolis, Minnesota, 4171--4186.
[10]
W Brooke Elliott. 2006. Are investors influenced by pro forma emphasis and reconciliations in earnings announcements? The Accounting Review, Vol. 81, 1 (2006), 113--133.
[11]
James R Frederickson and Jeffrey S Miller. 2004. The effects of pro forma earnings disclosures on analysts' and nonprofessional investors' equity valuation judgments. The Accounting Review, Vol. 79, 3 (2004), 667--686.
[12]
Katherine Keith and Amanda Stent. 2019. Modeling Financial Analysts' Decision Making via the Pragmatics and Semantics of Earnings Calls. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 493--503. https://doi.org/10.18653/v1/P19--1047
[13]
Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Doha, Qatar, 1746--1751. https://doi.org/10.3115/v1/D14--1181
[14]
Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).
[15]
Matthew Lamm, Arun Chaganty, Christopher D. Manning, Dan Jurafsky, and Percy Liang. 2018. Textual Analogy Parsing: What's Shared and What's Compared among Analogous Facts. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Brussels, Belgium, 82--92. https://doi.org/10.18653/v1/D18--1008
[16]
Daniel B Nelson. 1991. Conditional heteroskedasticity in asset returns: A new approach. Econometrica: Journal of the Econometric Society (1991), 347--370.
[17]
Yu Qin and Yi Yang. 2019. What You Say and How You Say It Matters: Predicting Stock Volatility Using Verbal and Vocal Cues. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 390--401. https://doi.org/10.18653/v1/P19--1038
[18]
Colin Raffel, Noam Shazeer, Adam Roberts, Katherine Lee, Sharan Narang, Michael Matena, Yanqi Zhou, Wei Li, and Peter J Liu. 2019. Exploring the limits of transfer learning with a unified text-to-text transformer. arXiv preprint arXiv:1910.10683 (2019).
[19]
Navid Rekabsaz, Mihai Lupu, Artem Baklanov, Alexander Dür, Linda Andersson, and Allan Hanbury. 2017. Volatility Prediction using Financial Disclosures Sentiments with Word Embedding-based IR Models. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Vancouver, Canada, 1712--1721. https://doi.org/10.18653/v1/P17--1157
[20]
Georgios Spithourakis and Sebastian Riedel. 2018. Numeracy for Language Models: Evaluating and Improving their Ability to Predict Numbers. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Melbourne, Australia, 2104--2115. https://doi.org/10.18653/v1/P18--1196
[21]
Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.
[22]
Elmar R Venter, David Emanuel, and Steven F Cahan. 2014. The value relevance of mandatory non-GAAP earnings. Abacus, Vol. 50, 1 (2014), 1--24.
[23]
Matthew M Wieland, Mark C Dawkins, and Michael T Dugan. 2013. The differential value relevance of S&P's core earnings versus GAAP earnings: the role of stock option expense. Journal of Business Finance & Accounting, Vol. 40, 1--2 (2013), 55--81.
[24]
Linyi Yang, Tin Lok James Ng, Barry Smyth, and Riuhai Dong. 2020. HTML: Hierarchical Transformer-Based Multi-Task Learning for Volatility Prediction. In Proceedings of The Web Conference 2020 (Taipei, Taiwan) (WWW '20). Association for Computing Machinery, New York, NY, USA, 441--451.

Cited By

View all
  • (2023)A Numeral and Affective Knowledge Enhanced Network for Aspect-based Financial Sentiment Analysis2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI59109.2023.00139(926-933)Online publication date: 6-Nov-2023

Index Terms

  1. Distilling Numeral Information for Volatility Forecasting

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    CIKM '21: Proceedings of the 30th ACM International Conference on Information & Knowledge Management
    October 2021
    4966 pages
    ISBN:9781450384469
    DOI:10.1145/3459637
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 October 2021

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. numeracy
    2. opinion mining
    3. volatility forecasting

    Qualifiers

    • Short-paper

    Funding Sources

    Conference

    CIKM '21
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 520 of 2,712 submissions, 19%

    Upcoming Conference

    CIKM '25

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)21
    • Downloads (Last 6 weeks)2
    Reflects downloads up to 07 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2023)A Numeral and Affective Knowledge Enhanced Network for Aspect-based Financial Sentiment Analysis2023 IEEE 35th International Conference on Tools with Artificial Intelligence (ICTAI)10.1109/ICTAI59109.2023.00139(926-933)Online publication date: 6-Nov-2023

    View Options

    Get Access

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media