混合曲率空间中的几何自适应元学习方法

Translated title of the contribution: Geometry-Adaptive Meta-Learning in Mixed-Curvature Spaces

Zhi Gao, Yu Wei Wu*, Yun De Jia

*Corresponding author for this work

Research output: Contribution to journal › Article › peer-review

Abstract

Meta-learning helps learning models adapt quickly to new tasks by learning prior knowledge. During adaptation to a new task, how well the geometric structure of the space matches the geometric structure of the data plays an important role in the model's generalization ability. In many practical applications, data have diverse non-Euclidean structures: natural language, for example, has hierarchical structures, and face images have cyclical structures. Existing research has shown that the geometry of Riemannian manifolds can match the non-Euclidean structures of real-world data, which makes modeling data on Riemannian manifolds theoretically feasible. In this paper, we propose a geometry-adaptive meta-learning method in mixed-curvature spaces, which models data with multiple mixed-curvature spaces and produces a Riemannian geometry that matches the data's non-Euclidean structures. We build a multi-mixed-curvature neural network that represents the geometry of a mixed-curvature space by the curvature, number, and dimensionality of its constituent curvature spaces, so that the geometry is adapted to non-Euclidean structures through gradient descent. We further introduce a geometry initialization generation scheme and a geometry updating scheme. With only a few optimization steps, the geometry of the underlying space can match the non-Euclidean structures of the data, which accelerates the gradient descent process. We evaluate our method on few-shot classification, few-shot regression, and image completion. Compared with meta-learning methods in Euclidean space, our method improves accuracy by 3% on few-shot classification tasks and halves the mean squared error on few-shot regression tasks.
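
To make the idea of adapting a space's geometry by gradient descent concrete, the following is a minimal sketch and not the paper's network. It assumes that a mixed-curvature space is a product of constant-curvature components, each represented in the kappa-stereographic model, and that only the curvatures are adapted (the number and dimensionality of components stay fixed here). The function names, the squared-error objective on pairwise distances, and the finite-difference gradient (a stand-in for automatic differentiation) are all illustrative assumptions.

```python
import math

def _dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def mobius_add(x, y, k):
    """Moebius addition in the kappa-stereographic model of curvature k (assumed model)."""
    xy, xx, yy = _dot(x, y), _dot(x, x), _dot(y, y)
    coef_x = 1 - 2 * k * xy - k * yy
    coef_y = 1 + k * xx
    denom = 1 - 2 * k * xy + k * k * xx * yy
    return [(coef_x * a + coef_y * b) / denom for a, b in zip(x, y)]

def arctan_k(u, k):
    """Curvature-dependent inverse tangent: atan for k > 0, identity for k == 0, atanh for k < 0."""
    if k > 0:
        s = math.sqrt(k)
        return math.atan(s * u) / s
    if k < 0:
        s = math.sqrt(-k)
        return math.atanh(s * u) / s  # requires s * u < 1, i.e. points inside the ball
    return u

def component_dist(x, y, k):
    """Geodesic distance in one constant-curvature component."""
    diff = mobius_add([-a for a in x], y, k)
    return 2 * arctan_k(math.sqrt(_dot(diff, diff)), k)

def product_dist(x, y, dims, curvs):
    """Distance in the product space: root of the summed squared component distances."""
    total, start = 0.0, 0
    for n, k in zip(dims, curvs):
        total += component_dist(x[start:start + n], y[start:start + n], k) ** 2
        start += n
    return math.sqrt(total)

def fit_loss(pairs, targets, dims, curvs):
    """Hypothetical objective: mean squared error between product distances and targets."""
    errs = [(product_dist(x, y, dims, curvs) - t) ** 2 for (x, y), t in zip(pairs, targets)]
    return sum(errs) / len(errs)

def adapt_curvatures(pairs, targets, dims, curvs, lr=1.0, steps=5, eps=1e-5):
    """A few gradient-descent steps on the curvatures, using central finite differences."""
    curvs = list(curvs)
    for _ in range(steps):
        grads = []
        for i in range(len(curvs)):
            up, down = curvs[:], curvs[:]
            up[i] += eps
            down[i] -= eps
            grads.append((fit_loss(pairs, targets, dims, up)
                          - fit_loss(pairs, targets, dims, down)) / (2 * eps))
        curvs = [c - lr * g for c, g in zip(curvs, grads)]
    return curvs
```

Starting from flat components (`curvs=[0.0, 0.0]`) and calling `adapt_curvatures` on a few point pairs moves each curvature in the direction that lowers the distance-fitting error, which is the sense in which the geometry, rather than only the model weights, is treated as a learnable quantity in this sketch.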

Original language: Chinese (Traditional)
Pages (from-to): 2289-2306
Number of pages: 18
Journal: Jisuanji Xuebao/Chinese Journal of Computers
Volume: 47
Issue number: 10
Publication status: Published - Oct 2024
