Personalization in E-Commerce: Optimizing Recommendations for Multimodal Content
Bulycheva Mariia, Senior Applied Scientist, Zalando, Germany
Abstract
This article examines modern approaches to personalizing multimodal content in e-commerce, motivated by the growing complexity of user requests and the need to adapt recommendations to diverse data formats and content modalities. The topic is relevant because the volume of information associated with each article keeps increasing, spanning text, images, video, and audio, and handling it calls for specialized methods that support precise, personalized recommendations. The purpose of the study is to develop original proposals for optimizing recommendation algorithms based on multimodal information, so that both context and individual user preferences are taken into account. The research reveals a gap in the literature: many studies focus on a single aspect of personalization, such as textual data or visual elements, while integrative approaches that combine modalities remain insufficiently addressed. The author proposes solutions that combine deep learning methods with behavioral model analysis to predict audience interests more accurately. The materials presented in this work will be useful for e-commerce professionals, developers of recommendation systems, and researchers who study behavioral patterns.
Keywords
algorithms, deep learning, multimodal content, personalization, user behavior, recommendations, e-commerce
Copyright License
Copyright (c) 2025 Bulycheva Mariia

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors retain the copyright of their manuscripts, and all Open Access articles are disseminated under the terms of the Creative Commons Attribution License 4.0 (CC-BY), which licenses unrestricted use, distribution, and reproduction in any medium, provided that the original work is appropriately cited. The use of general descriptive names, trade names, trademarks, and so forth in this publication, even if not specifically identified, does not imply that these names are not protected by the relevant laws and regulations.