Semi-strong Efficient Market of Bitcoin and Twitter: an Analysis of Semantic Vector Spaces of Extracted Keywords and Light Gradient Boosting Machine Models
–arXiv.org Artificial Intelligence
This study extends the examination of the Efficient-Market Hypothesis in Bitcoin market during a five year fluctuation period, from September 1 2017 to September 1 2022, by analyzing 28,739,514 qualified tweets containing the targeted topic "Bitcoin". Unlike previous studies, we extracted fundamental keywords as an informative proxy for carrying out the study of the EMH in the Bitcoin market rather than focusing on sentiment analysis, information volume, or price data. We tested market efficiency in hourly, 4-hourly, and daily time periods to understand the speed and accuracy of market reactions towards the information within different thresholds. A sequence of machine learning methods and textual analyses were used, including measurements of distances of semantic vector spaces of information, keywords extraction and encoding model, and Light Gradient Boosting Machine (LGBM) classifiers. Our results suggest that 78.06% (83.08%), 84.63% (87.77%), and 94.03% (94.60%) of hourly, 4-hourly, and daily bullish (bearish) market movements can be attributed to public information within organic tweets.
arXiv.org Artificial Intelligence
Sep-24-2024
- Country:
- Asia > Middle East
- UAE (0.14)
- North America > United States (0.46)
- Asia > Middle East
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Banking & Finance > Trading (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Ensemble Learning (0.61)
- Neural Networks (0.45)
- Performance Analysis (0.46)
- Supervised Learning > Representation Of Examples (0.71)
- Natural Language > Information Retrieval (0.48)
- Machine Learning
- Communications > Social Media (1.00)
- e-Commerce > Financial Technology (1.00)
- Artificial Intelligence
- Information Technology