TY - GEN
T1 - Few-shot Learning for Multi-Modality Tasks
AU - Chen, Jie
AU - Ye, Qixiang
AU - Yang, Xiaoshan
AU - Zhou, S. Kevin
AU - Hong, Xiaopeng
AU - Zhang, Li
N1 - Publisher Copyright:
© 2021 Owner/Author.
PY - 2021/10/17
Y1 - 2021/10/17
N2 - Recent deep learning methods rely on a large amount of labeled data to achieve high performance. These methods may be impractical in some scenarios, where manual data annotation is costly or the samples of certain categories are scarce (e.g., tumor lesions, endangered animals and rare individual activities). When only limited annotated samples are available, these methods usually suffer from the overfitting problem severely, which degrades the performance significantly. In contrast, humans can recognize the objects in the images rapidly and correctly with their prior knowledge after exposed to only a few annotated samples. To simulate the learning schema of humans and relieve the reliance on the large-scale annotation benchmarks, researchers start shifting towards the few-shot learning problem: they try to learn a model to correctly recognize novel categories with only a few annotated samples.
AB - Recent deep learning methods rely on a large amount of labeled data to achieve high performance. These methods may be impractical in some scenarios, where manual data annotation is costly or the samples of certain categories are scarce (e.g., tumor lesions, endangered animals and rare individual activities). When only limited annotated samples are available, these methods usually suffer from the overfitting problem severely, which degrades the performance significantly. In contrast, humans can recognize the objects in the images rapidly and correctly with their prior knowledge after exposed to only a few annotated samples. To simulate the learning schema of humans and relieve the reliance on the large-scale annotation benchmarks, researchers start shifting towards the few-shot learning problem: they try to learn a model to correctly recognize novel categories with only a few annotated samples.
KW - few-shot learning
KW - multi-modal learning
UR - http://www.scopus.com/inward/record.url?scp=85119359641&partnerID=8YFLogxK
U2 - 10.1145/3474085.3478873
DO - 10.1145/3474085.3478873
M3 - Conference contribution
AN - SCOPUS:85119359641
T3 - MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia
SP - 5673
EP - 5674
BT - MM 2021 - Proceedings of the 29th ACM International Conference on Multimedia
PB - Association for Computing Machinery, Inc
T2 - 29th ACM International Conference on Multimedia, MM 2021
Y2 - 20 October 2021 through 24 October 2021
ER -