跳到主要导航 跳到搜索 跳到主要内容

Unravelling the semantic mysteries of transformers layer by layer

  • Cheng Zhang
  • , Jinxin Lv
  • , Jingxu Cao
  • , Jiachuan Sheng*
  • , Dawei Song
  • , Tiancheng Zhang
  • *此作品的通讯作者
  • Tianjin University of Finance and Economics
  • Beijing Institute of Technology
  • Tianjin University

科研成果: 期刊稿件文章同行评审

摘要

Despite the significant success of transformer models and their successors in various natural language processing (NLP) applications, their internal workings are still not fully understood. Much of the current interpretability research has focused primarily on numerical components, often missing the complex semantic layers within these models. To fill this gap, this study explores the interpretability of the transformer model, a cornerstone of modern NLP, by addressing the semantic complexities of its multi-layer architecture. We identify three key questions: (i) the influence of the multi-layer structure on semantic processing, (ii) the unique contributions of each layer to model performance, and (iii) methodologies for determining optimal layer counts for the encoder and decoder. To tackle these issues, we introduce the semantic interpreter for transformer hierarchy, an innovative framework that employs convex hull metrics to visualize and assess semantic quality and quantity. Our contributions include novel methods for semantic assessment, a dual analytical framework that integrates cumulative and layer-to-layer perspectives, and insights into the dynamics of encoding and decoding. This comprehensive approach aims to enhance the understanding of Transformer models, ultimately guiding their refinement for improved interpretability and effectiveness in natural language tasks.

源语言英语
页(从-至)1237-1251
页数15
期刊Computer Journal
68
9
DOI
出版状态已出版 - 1 9月 2025
已对外发布

指纹

探究 'Unravelling the semantic mysteries of transformers layer by layer' 的科研主题。它们共同构成独一无二的指纹。

引用此