Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Identifying the Correlation Between Language Distance and Cross-Lingual Transfer in a Multilingual Representation Space
PHILIPPY, Fred; Guo, Siwen; Haddadan, Shohreh
2023. In Beinborn, Lisa; Goswami, Koustava; Muradoğlu, Saliha et al. (Eds.), Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Peer reviewed
 

Files


Full Text
2023.sigtyp-1.3.pdf
Publisher postprint (266.82 kB) Creative Commons License - Attribution

Details



Keywords :
NLP; Multilingual; Cross-Lingual; Language Models
Abstract :
Prior research has investigated the impact of various linguistic features on cross-lingual transfer performance. In this study, we investigate the manner in which this effect can be mapped onto the representation space. While past studies have focused on the impact on cross-lingual alignment in multilingual language models during fine-tuning, this study examines the absolute evolution of the respective language representation spaces produced by MLLMs. We place a specific emphasis on the role of linguistic characteristics and investigate their inter-correlation with the impact on representation spaces and cross-lingual transfer performance. Additionally, this paper provides preliminary evidence of how these findings can be leveraged to enhance transfer to linguistically distant languages.
Disciplines :
Computer science
Author, co-author :
PHILIPPY, Fred; University of Luxembourg; Zortify S.A. > Zortify Labs
Guo, Siwen; Zortify S.A. > Zortify Labs
Haddadan, Shohreh; Zortify S.A. > Zortify Labs
External co-authors :
no
Language :
English
Title :
Identifying the Correlation Between Language Distance and Cross-Lingual Transfer in a Multilingual Representation Space
Publication date :
May 2023
Event name :
5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Event organizer :
Association for Computational Linguistics
Event place :
Dubrovnik, Croatia
Event date :
6 May 2023
Audience :
International
Main work title :
Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Author, co-author :
Goswami, Koustava
Muradoğlu, Saliha
Sorokin, Alexey
Kumar, Ritesh
Shcherbakov, Andreas
Ponti, Edoardo M.
Cotterell, Ryan
Vylomova, Ekaterina
Editor :
Beinborn, Lisa
Publisher :
Association for Computational Linguistics
Pages :
22–29
Peer reviewed :
Peer reviewed
Focus Area :
Computational Sciences
Available on ORBilu :
since 24 November 2023