Stock Prediction by Polyglot Sentiment Analysis on Twitter
DOI:
https://doi.org/10.47611/jsr.v12i2.1910Keywords:
Sentiment Analysis, Natural Language Processing, Twitter, Stock Prediction, Polyglot, DesjardinsAbstract
Research in the economics field has extensively documented the impact of media sentiments on the stock market. Sentiment analysis, as a tool to predict equity prices, has been popularized in the past years. Recently, Twitter has received a lot of attention due to the diversity of opinions on social media platforms. A common obstruction to sentiment analysis is the resource gap between English and other languages. This pilot study examines the effect of polyglotism in tweets concerning bilingual companies and develops a model that extracts sentiments from polyglot tweets to make price predictions. Results suggest that taking non-English tweets into consideration decreases errors in price predictions and that the random forest models have higher performance than linear regression models. The results of this pilot study need to be confirmed with larger sets of data.
Downloads
Metrics
References or Bibliography
Tetlock, Paul C. (2005) Giving Content to Investor Sentiment: The Role of Media in the Stock Market. Journal of Finance. https://ssrn.com/abstract=685145
Jin, Z., Yang, Y., & Liu, Y. (2020). Stock closing price prediction based on sentiment analysis and LSTM. Neural Computing & Applications, 32(13), 9713–9729. https://doi-org.ezproxy.marianopolis.edu/10.1007/s00521-019-04504-2
Mendoza-Urdiales, R. A., Núñez-Mora, J. A., Santillán-Salgado, R. J., & Valencia-Herrera, H. (2022). Twitter Sentiment Analysis and Influence on Stock Performance Using Transfer Entropy and EGARCH Methods. Entropy, 24(7), N.PAG. https://doi-org.ezproxy.marianopolis.edu/10.3390/e24070874
Kim, J., Jung, H.-Y., Lee, Y., & Lee, J.-H. (2009). Conveying Subjectivity of a Lexicon of One Language into Another Using a Bilingual Dictionary and a Link Analysis Algorithm. International Journal of Computer Processing of Languages, 22(2/3), 205–218. https://doi-org.ezproxy.marianopolis.edu/10.1142/S1793840609002044
Tellez, E. S., Miranda-Jiménez, S., Graff, M., Moctezuma, D., Suárez, R. R., & Siordia, O. S. (2017). A simple approach to multilingual polarity classification in Twitter. Pattern Recognition Letters, 94, 68–74. https://doi-org.ezproxy.marianopolis.edu/10.1016/j.patrec.2017.05.024
Xiu, D. (2022, November 8). Expected Returns and Foundation Models of Language. Seminar presented at the meeting of NYU Courant Mathematics Department.
Statistics Canada. (2011) Visual Census. https://www12.statcan.gc.ca/census-recensement/2011/dp-pd/vc-rv/index.cfm?Lang=ENG&VIEW=D&GEOCODE=462&TOPIC_ID=4
Published
How to Cite
Issue
Section
Copyright (c) 2023 Yizhou Wang
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.