Cross-Lingual Sponsored Search via Dual-Encoder and Graph Neural Networks for Context-Aware Query Translation in Advertising Platforms
DOI:
https://doi.org/10.71222/j9qwrc36Keywords:
cross-lingual sponsored search, query translation, graph neural network, dual-encoder, advertising recommendation, business intelligenceAbstract
Cross-lingual sponsored search is crucial for global advertising platforms, where users from different language backgrounds interact with multilingual ads. Traditional machine translation methods often fail to capture query-specific contextual cues, leading to semantic ambiguities that negatively impact click-through rates (CTR) and conversion rates (CVR). To address this challenge, we propose AdGraphTrans, a novel dual-encoder framework enhanced with graph neural networks (GNNs) for context-aware query translation in advertising. Specifically, user queries and ad contents are independently encoded using multilingual Transformer-based encoders (mBERT/XLM-R), and contextual relations-such as co-clicked ads, user search sessions, and query-ad co-occurrence-are modeled as a heterogeneous graph. A graph attention network (GAT) is then applied to refine embeddings by leveraging semantic and behavioral context. These embeddings are aligned via contrastive learning to reduce translation ambiguity. Experiments conducted on a cross-lingual sponsored search dataset collected from Google Ads and Amazon Ads (EN-ZH, EN-ES, EN-FR pairs) demonstrate that AdGraphTrans significantly improves query translation quality, achieving a BLEU score of 38.9 and semantic similarity (cosine score) of 0.83, outperforming strong baselines such as mBERT and M2M-100. Moreover, in downstream ad retrieval tasks, AdGraphTrans yields +4.67% CTR and +1.72% CVR improvements over baseline methods. These results confirm that incorporating graph-based contextual signals with dual-encoder translation provides a robust solution for enhancing cross-lingual sponsored search in advertising platforms.
References
1. P. Biswas, S. G. Neogi, D. Arockiam, and A. Mitra, "The Importance of Multi-Lingual Sentiment Analysis in Digital Marketing," In 2024 11th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions)(ICRITO), March, 2024, pp. 1-8. doi: 10.1109/icrito61523.2024.10522115
2. J. Turian, L. Shen, and I. D. Melamed, "Evaluation of machine translation and its evaluation," In Proceedings of machine translation summit IX: papers., 2003.
3. E. Biçici, "Residual connections improve click-through rate and conversion rate prediction performance," Neural Computing and Applications, vol. 37, no. 17, pp. 10675-10688, 2025. doi: 10.1007/s00521-024-10617-0
4. B. Muller, A. Anastasopoulos, B. Sagot, and D. Seddah, "When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models," In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, June, 2021, pp. 448-462. doi: 10.18653/v1/2021.naacl-main.38
5. A. Kumar, and V. H. C. Albuquerque, "Sentiment analysis using XLM-R transformer and zero-shot transfer learning on resource-poor Indian language," Transactions on Asian and Low-Resource Language Information Processing, vol. 20, no. 5, pp. 1-13, 2021. doi: 10.1145/3461764
6. A. Mohammadshahi, V. Nikoulina, A. Berard, C. Brun, J. Henderson, and L. Besacier, "SMaLL-100: Introducing shallow multilingual machine translation model for low-resource languages," arXiv preprint arXiv:2210.11621, 2022. doi: 10.18653/v1/2022.emnlp-main.571
7. Z. Pan, E. Chen, Q. Liu, T. Xu, H. Ma, and H. Lin, "Sparse factorization machines for click-through rate prediction," In 2016 IEEE 16th International Conference on Data Mining (ICDM), December, 2016, pp. 400-409. doi: 10.1109/icdm.2016.0051
8. J. Gligorijevic, D. Gligorijevic, I. Stojkovic, X. Bai, A. Goyal, and Z. Obradovic, "Deeply supervised model for click-through rate prediction in sponsored search," Data Mining and Knowledge Discovery, vol. 33, no. 5, pp. 1446-1467, 2019. doi: 10.1007/s10618-019-00625-3
9. E. Reiter, "A structured review of the validity of BLEU," Computational Linguistics, vol. 44, no. 3, pp. 393-401, 2018. doi: 10.1162/coli_a_00322
10. P. Sitikhu, K. Pahi, P. Thapa, and S. Shakya, "A comparison of semantic similarity methods for maximum human interpretability," In 2019 artificial intelligence for transforming business and society (AITB), November, 2019, pp. 1-4. doi: 10.1109/aitb48515.2019.8947433
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Ziyang Gao, Yuanliang Qu, Yi Han (Author)

This work is licensed under a Creative Commons Attribution 4.0 International License.







