Aligning Multilingual News for Stock Return Prediction
October 22, 2025 Β· Declared Dead Β· π Social Science Research Network
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Yuntao Wu, Lynn Tao, Ing-Haw Cheng, Charles Martineau, Yoshio Nozawa, John Hull, Andreas Veneris
arXiv ID
2510.19203
Category
q-fin.CP
Cross-listed
cs.CL
Citations
0
Venue
Social Science Research Network
Last Checked
3 months ago
Abstract
News spreads rapidly across languages and regions, but translations may lose subtle nuances. We propose a method to align sentences in multilingual news articles using optimal transport, identifying semantically similar content across languages. We apply this method to align more than 140,000 pairs of Bloomberg English and Japanese news articles covering around 3500 stocks in Tokyo exchange over 2012-2024. Aligned sentences are sparser, more interpretable, and exhibit higher semantic similarity. Return scores constructed from aligned sentences show stronger correlations with realized stock returns, and long-short trading strategies based on these alignments achieve 10\% higher Sharpe ratios than analyzing the full text sample.
Community Contributions
Found the code? Know the venue? Think something is wrong? Let us know!
π Similar Papers
In the same crypt β q-fin.CP
R.I.P.
π»
Ghosted
R.I.P.
π»
Ghosted
Deep Reinforcement Learning for Trading
R.I.P.
π»
Ghosted
Solving the Optimal Trading Trajectory Problem Using a Quantum Annealer
R.I.P.
π»
Ghosted
Neural networks for option pricing and hedging: a literature review
R.I.P.
π»
Ghosted
Lagged correlation-based deep learning for directional trend change prediction in financial time series
R.I.P.
π»
Ghosted
QLBS: Q-Learner in the Black-Scholes(-Merton) Worlds
Died the same way β π» Ghosted
R.I.P.
π»
Ghosted
Federated Learning: Strategies for Improving Communication Efficiency
R.I.P.
π»
Ghosted
In-Datacenter Performance Analysis of a Tensor Processing Unit
R.I.P.
π»
Ghosted
Deep Convolutional Neural Networks for Computer-Aided Detection: CNN Architectures, Dataset Characteristics and Transfer Learning
R.I.P.
π»
Ghosted