TY - GEN
T1 - Late-Interaction InfoNCE
T2 - 10th International Conference on Computer and Information Processing Technology, ISCIPT 2025
AU - Bu, Yunhan
AU - Wang, Lun
AU - Huang, Tianwei
AU - Hamdulla, Askar
AU - Zhang, Huaping
AU - Zhang, Quan
N1 - Publisher Copyright:
© 2025 IEEE.
PY - 2025
Y1 - 2025
N2 - Multi-hop reasoning presents significant challenges in natural language processing, particularly due to asymmetric retrieval where evidence within a reasoning chain varies greatly in retrieval difficulty. Implicit evidence, which lacks explicit retrieval identifiers, is often overshadowed by more readily retrievable explicit evidence, leading models to favor superficial associations over deeper semantic reasoning. To address this, we propose Late-Interaction InfoNCE, a novel contrastive loss function that integrates both dot product and Hadamard product interactions to enhance the representation of semantic relationships between queries and evidence samples. This method improves the model's ability to capture fine-grained feature couplings and global semantic alignments simultaneously. Additionally, we introduce a false negative masking strategy to reduce interference from semantically related negative samples. Experiments on the EX-FEVER dataset demonstrate that our approach achieves state-of-the-art performance across multiple retrieval metrics, including Recall and MRR, validating its effectiveness in complex asymmetric retrieval scenarios.
AB - Multi-hop reasoning presents significant challenges in natural language processing, particularly due to asymmetric retrieval where evidence within a reasoning chain varies greatly in retrieval difficulty. Implicit evidence, which lacks explicit retrieval identifiers, is often overshadowed by more readily retrievable explicit evidence, leading models to favor superficial associations over deeper semantic reasoning. To address this, we propose Late-Interaction InfoNCE, a novel contrastive loss function that integrates both dot product and Hadamard product interactions to enhance the representation of semantic relationships between queries and evidence samples. This method improves the model's ability to capture fine-grained feature couplings and global semantic alignments simultaneously. Additionally, we introduce a false negative masking strategy to reduce interference from semantically related negative samples. Experiments on the EX-FEVER dataset demonstrate that our approach achieves state-of-the-art performance across multiple retrieval metrics, including Recall and MRR, validating its effectiveness in complex asymmetric retrieval scenarios.
KW - Asymmetric Retrieval
KW - Late Interaction
KW - Multi-hop Reasoning
UR - https://www.scopus.com/pages/publications/105030482161
U2 - 10.1109/ISCIPT67144.2025.11264929
DO - 10.1109/ISCIPT67144.2025.11264929
M3 - Conference contribution
AN - SCOPUS:105030482161
T3 - 2025 10th International Conference on Computer and Information Processing Technology, ISCIPT 2025
SP - 631
EP - 635
BT - 2025 10th International Conference on Computer and Information Processing Technology, ISCIPT 2025
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 12 September 2025 through 14 September 2025
ER -