Quantifying Trading Behavior in Financial Markets Using Google Trends
https://doi.org/10.1038/SREP01684…
Sign up for access to the world's latest research
Abstract
Crises in financial markets affect humans worldwide. Detailed market data on trading decisions reflect some of the complex human behavior that has led to these crises. We suggest that massive new data sources resulting from human interaction with the Internet may offer a new perspective on the behavior of market participants in periods of large market movements. By analyzing changes in Google query volumes for search terms related to finance, we find patterns that may be interpreted as ''early warning signs'' of stock market moves. Our results illustrate the potential that combining extensive behavioral data sets offers for a better understanding of collective human behavior.
Key takeaways
AI
AI
- Google Trends data reflects behavioral patterns that may serve as early warning signs for stock market movements.
- Analyzing Google search volumes from 2004-2011 reveals potential for constructing profitable trading strategies.
- A 'Google Trends strategy' yielded a 326% increase in portfolio value based on search terms related to finance.
- Search volume strategies outperform random investment strategies, with statistically significant higher returns.
- U.S. search volume data is more effective than global data in predicting U.S. stock market trends.
Related papers
International Journal of Advanced Computer Science and Applications
Financial decisions are among the most significant life-changing decisions that individuals make. There is a strong correlation between financial decision making and human behavior. In this research the relationship between what people think and how stock market moves is investigated. The data from 2010 to 2015 of some of business, political and financial events which directly impact the local stock market in Pakistan is analyzed. The data was collected from search engine Google via Google trends. The association between internet searches regarding the political or business events and how the subsequent stock market moves is established. It was found that increase in search of these topics may lead to stock market fall or rise. The overall objective of this research is to predict Karachi Stock Exchange (now known as Pakistan stock exchange) 100 index by quantifying the semantics of international market. In addition to that, the relation between what an individual thinks while searching on Google which affects the local market is also investigated. The collected data has been mined by Multiclass Neural Network and Multiclass Decision Trees. The result shows that Multiclass Decision Trees performed best with an accuracy of 94%.
2013
The complex behavior of financial markets emerges from decisions made by many traders. Here, we exploit a large corpus of daily print issues of the Financial Times from 2 nd January 2007 until 31 st December 2012 to quantify the relationship between decisions taken in financial markets and developments in financial news. We find a positive correlation between the daily number of mentions of a company in the Financial Times and the daily transaction volume of a company's stock both on the day before the news is released, and on the same day as the news is released. Our results provide quantitative support for the suggestion that movements in financial markets and movements in financial news are intrinsically interlinked.
The inference of interconnectivity structure between different stochastic processes is of particular interest, in many fields such as the study of stock markets. A typical assumption in the estimation process of causal inference is stationarity, which violates the applicability of existing statistical analysis techniques. For this purpose, the most recent numerical algorithm called Auto-Regressive Estimation (ARE) algorithm has been successfully applied to EEG Brain data for detecting diseases. This manuscript introduces its application, for the first time, to stock market data. The applications, here, can contribute to a substantially deeper understanding of stock market dynamics and interdependencies before, during and after the crisis. This provides investors and market players with valuable information, which enables them to efficiently selecting less risky portfolios. From this, I argue that the utilised algorithm provides novel information regarding the detection of early warnings, which previous approaches could not assess. Furthermore, the distinction between markets that will fall into a crisis and the ones that will not, was evidenced.
SSRN Electronic Journal, 2014
In this paper we look into the interaction of Google's search queries and several aspects of international equity markets. Using a novel methodology for selecting words and a VAR modeling approach, we study whether the search queries of finance related words can have an impact on market uncertainty, log returns and traded volume of four different english speaking countries. Our overall main results are positive, we find several words that have a robust impact over the explained variables. Particularly, the most robust result we find is that an increase of the search queries including the word stock predicts an increase of volatility and decrease of index prices in the next week. With the support of this finding, we investigate the performance of a market timing strategy based on the search frequency of this word and compare it against random words from the Word-Net database and a naive buy and hold strategy. The results of this empirical application are positive and particularly stronger during the global crisis of 2008, indicating the potential of this new set of data in predicting financial markets.
International journal of financial engineering and risk management, 2018
In this paper, we present a methodology for utilizing Google Search Indices obtained from the Google Trends website as a means for measuring potential investor interest in stocks listed on the Dow Jones Index (Dow 30). We accomplish this task by utilizing a Long Short-Term Memory network that correlates changes in the search volume for a given asset with changes in the actual trade volume for said asset. Additionally, by using these predictions, we formulate a concise trading strategy in the hopes of being able to outperform the market and analyze the results of this new strategy by backtesting across weekly closing price data for the last six months of 2016. With these tests, it was discovered that about 43% of the time, the machine learning based trading strategy outperformed the baseline sample indicating that there is indeed a correlation between price movements for certain assets on the Dow 30 and the number of Google searches for said assets. Furthermore, while the scope of our study was limited to the Dow 30 in order to mitigate selection bias, we nonetheless hypothesize that numerous other assets that similarly possess a predictable correlation to Google Search Volumes are likely to exist thereby making the trading algorithm described in this paper applicable beyond the narrow scope of this study.
SSRN Electronic Journal, 2000
The new digital revolution of big data is deeply changing our capability of understanding society and forecasting the outcome of many social and economic systems. Unfortunately, information can be very heterogeneous in the importance, relevance, and surprise it conveys, affecting severely the predictive power of semantic and statistical methods. Here we show that the aggregation of web users' behavior can be elicited to overcome this problem in a hard to predict complex system, namely the financial market. Specifically, our in-sample analysis shows that the combined use of sentiment analysis of news and browsing activity of users of Yahoo! Finance greatly helps forecasting intra-day and daily price changes of a set of 100 highly capitalized US stocks traded in the period 2012-2013. Sentiment analysis or browsing activity when taken alone have very small or no predictive power. Conversely, when considering a news signal where in a given time interval we compute the average sentiment of the clicked news, weighted by the number of clicks, we show that for nearly 50% of the companies such signal Granger-causes hourly price returns. Our result indicates a "wisdom-of-the-crowd" effect that allows to exploit users' activity to identify and weigh properly the relevant and surprising news, enhancing considerably the forecasting power of the news sentiment.
2014
We discuss the claims that data from Google Trends contain enough data to predict future nancial index returns. We rst review the many subtle (and less subtle) biases that may aect the backtest of a trading strategy, particularly when based on such data. Expectedly, the choice of keywords is crucial: by using an industry-grade backtest system, we verify that random nance-related keywords do not to contain more exploitable predictive information than random keywords related to illnesses, classic cars and arcade games. However, other keywords applied on suitable assets yield robustly protable strategies, thereby conrming the intuition of .
Scientific Reports, 2014
Search engines have changed the way people find relevant information on the Internet. It was suggested that investors use these Internet tools during the acquisition decision process and that the data sets generated by search engines are related to stock market movements. This study aims to analyze the relationship between this social phenomenon and the evolution of the Brazilian stock exchange. After previous studies, the correlation between Google's query volumes for terms related to a specific index of the Brazilian stock market (Ibovespa) and stock market metrics (opening, closing, high and low prices) was analyzed. The data showed a positive correlation between investor attention, represented by Google's search volumes and market prices. These results suggest that Google Trends data is more positively correlated to the highs of financial data, suggesting that investors tend to search more on the web when the market reaches its peak or vice versa. This article highlights the potential that this source of information has in understanding the Brazilian stock market.
arXiv (Cornell University), 2014
The new digital revolution of big data is deeply changing our capability of understanding society and forecasting the outcome of many social and economic systems. Unfortunately, information can be very heterogeneous in the importance, relevance, and surprise it conveys, affecting severely the predictive power of semantic and statistical methods. Here we show that the aggregation of web users' behavior can be elicited to overcome this problem in a hard to predict complex system, namely the financial market. Specifically, we show that the combined use of sentiment analysis of news and browsing activity of users of Yahoo! Finance allows to forecast intra-day and daily price changes of a set of 100 highly capitalized US stocks traded in the period 2012-2013. Sentiment analysis or browsing activity when taken alone have very small or no predictive power. Conversely, when considering a news signal where in a given time interval we compute the average sentiment of the clicked news, weighted by the number of clicks, we show that for more the 50% of the companies such signal Granger-causes price returns. Our result indicates a "wisdom-of-the-crowd" effect that allows to exploit users' activity to identify and weigh properly the relevant and surprising news, enhancing considerably the forecasting power of the news sentiment.
References (35)
- Vespignani, A. Predicting the Behavior of Techno-Social Systems. Science 325, 425-428 (2009).
- Lazer, D. et al. Computational Social Science. Science 323, 721-723. (2009).
- Perc, M. Evolution of the most common English words and phrases over the centuries. J. R. Soc. Interface 9, 3323-3328 (2012).
- Petersen, A. M., Tenenbaum, J. N., Havlin, S., Stanley, H. E. & Perc, M. Languages cool as they expand: Allometric scaling and the decreasing need for new words. Scientific Reports 2, 943 (2012).
- Christakis, N. A. & Fowler, J. H. Connected: The surprising power of our social networks and how they shape our lives (Little, Brown and Company, 2009).
- Fehr, E. Behavioural science -The economics of impatience. Nature 415, 269-272 (2002).
- Shleifer, A. Inefficient Markets: An Introduction to Behavioral Finance (Oxford University Press, Oxford, 2000).
- Lillo, F., Farmer, J. D. & Mantegna, R. N. Econophysics -Master curve for price- impact function. Nature 421, 129-130 (2003).
- Gabaix, X., Gopikrishnan, P., Plerou, V. & Stanley, H. E. A theory of power-law distributions in financial market fluctuations. Nature 423, 267-270 (2003).
- Preis, T., Kenett, D. Y., Stanley, H. E., Helbing, D. & Ben-Jacob, E. Quantifying the Behavior of Stock Correlations Under Market Stress. Scientific Reports 2, 752 (2012).
- Preis, T., Schneider, J. J. & Stanley, H. E. Switching processes in financial markets. PNAS 108, 7674-7678 (2011).
- Preis, T. Econophysics -complex correlations and trend switchings in financial time series. European Physical Journal Special Topics 194, 5-86 (2011).
- Bunde, A., Schellnhuber, H. J. & Kropp, J., eds. The Science of Disasters: Climate Disruptions, Heart Attacks, and Market Crashes (Springer, Berlin, 2002).
- Vandewalle, N. & Ausloos, M. Coherent and random sequences in financial fluctuations. Physica A 246, 454-459 (1997).
- Podobnik, B., Horvatic, D., Petersen, A. M. & Stanley, H. E. Cross-correlations between volume change and price change. PNAS 106, 22079-22084 (2009).
- Sornette, D., Woodard, R. & Zhou, W. X. The 2006-2008 oil bubble: Evidence of speculation, and prediction. Physica A 388, 1571-1576. (2009).
- Watanabe, K., Takayasu, H. & Takayasu, M. A mathematical definition of the financial bubbles and crashes. Physica A 383, 120-124 (2007).
- Bouchaud, J. P., Matacz, A. & Potters, M. Leverage effect in financial markets: the retarded volatility model. Physical Review Letters 87, 228701 (2001).
- Hommes, C. H. Modeling the stylized facts in finance through simple nonlinear adaptive systems. PNAS 99, 7221-7228 (2002).
- Haldane, A. G. & May, R. M. Systemic risk in banking ecosystems. Nature 469, 351-355 (2011).
- Lux, T. & Marchesi, M. Scaling and criticality in a stochastic multi-agent model of a financial market. Nature 397, 498-500 (1999).
- Krugman, P. The Self-Organizing Economy (Blackwell, Cambridge, Massachusetts, 1996).
- Sornette, D. & von der Becke, S. Complexity clouds finance-risk models. Nature 471, 166 (2011).
- Schweitzer, F. et al. Economic Networks: The New Challenges. Science 325, 422-425 (2009).
- Garlaschelli, D., Caldarelli, G. & Pietronero, L. Universal scaling relations in food webs. Nature 423, 165-168 (2003).
- Onnela, J. P., Arbesman, S., Gonzalez, M. C., Barabasi, A. L. & Christakis, N. A. Geographic Constraints on Social Network Groups. PLoS One 6, e16939 (2011).
- Buldyrev, S. V., Parshani, R., Paul, G., Stanley, H. E. & Havlin, S. Catastrophic cascade of failures in interdependent networks. Nature 464, 1025-1028 (2010).
- Simon, H. A. A behavioral model of rational choice. Quarterly Journal of Economics 69, 99-118 (1955).
- Mondria, J., Wu, T. & Zhang, Y. The determinants of international investment and attention allocation: Using internet search query data. Journal of International Economics 82, 85-95 (2010).
- Ginsberg, J. et al. Detecting influenza epidemics using search engine query data. Nature 457, 1012-1014 (2009).
- Preis, T., Reith, D. & Stanley, H. E. Complex dynamics of our economic life on different scales: insights from search engine query data. Phil. Trans. R. Soc. A 368, 5707-5719 (2010).
- Bordino, I. et al. Web Search Queries Can Predict Stock Market Volumes. PLoS One 7, e40014 (2012).
- Choi, H. & Varian, H. Predicting the Present with Google Trends. The Economic Record 88, 2-9 (2012).
- Preis, T., Moat, H. S., Stanley, H. E. & Bishop, S. R. Quantifying the Advantage of Looking Forward. Scientific Reports 2, 350 (2012).
- Kendall, M. A New Measure of Rank Correlation. Biometrika 30, 81-89 (1938).
FAQs
AI
How does Google Trends data correlate with stock market movements over time?add
The study shows that search volume changes for financial terms can predict stock price trends, particularly during the period from 2004 to 2011.
What specific trading strategy did the research develop using Google Trends data?add
The Google Trends strategy suggested selling or buying DJIA based on prior search volume changes, yielding a cumulative return of 326%.
What is the relevance of financial search terms according to the research?add
The research indicates that higher financial relevance of search terms correlates positively with trading strategy returns, evidenced by Kendall's tau of 0.275.
What were the comparative returns of different trading strategies in the research?add
Google Trends strategies outperformed both random investment strategies and buy-and-hold strategies, achieving returns of 60% compared to 16% and 33%, respectively.
Why are U.S. search volume data more effective for market predictions?add
The analysis revealed that U.S. search volume data yields better prediction accuracy due to the higher proportion of local traders compared to global data.
Eugene Stanley