| Abstract: |
Recommender Systems (RS) are critical for personalization across digital platforms. However, two fundamental limitations persist: data sparsity and linguistic low-resource conditions. The first refers to insufficient user-item interactions, while the second concerns the lack of linguistic resources such as annotated data, lexicons, or pre-trained embeddings. Both problems degrade the representational capacity of models and hinder accurate predictions. In this work, we propose a Multimodal recommender system based on DeBERTa, integrating multiple feature sources (ratings, textual reviews, and metadata) through an early fusion strategy to mitigate both challenges simultaneously. Experiments conducted on MovieLens, Amazon datasets, and the multilingual IndicHash dataset demonstrate the superiority of our approach over traditional models like GRU, SVD, and Transformer baselines, as well as specialized multilingual systems, particularly under extreme sparsity and low-resource conditions. |