TY - GEN
T1 - Self-adaptive Perception Model for Action Segment Detection
AU - Li, Jiahe
AU - Li, Kan
AU - Niu, Xin
N1 - Publisher Copyright:
© 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
PY - 2022
Y1 - 2022
N2 - Action segment detection is an important yet challenging problem, since we need to localize the proposals which contain an action instance in a long untrimmed video with arbitrary length and random position. This task requires us not only to find the precise moment of starting and ending of an action instance, but also to detect action instances as many as possible. We propose a new model Self-Adaptive Perception to address this problem. We predict the action boundaries by classifying start and end of each action as separate components, allowing our model to predict the starting and ending boundaries roughly and generate candidate proposals. We evaluate each candidate proposals by a novel and flexible architecture called Discriminator. It can extract enough semantic information and generate precise confidence score of whether a proposal contains an action within its region, which benefit from the self-adaptive architecture. We conduct solid and rich experiments on large dataset Activity-Net, the result shows that our method achieves a competitive performance, outperforming most published state-of-the-art method in the field. And further experiments demonstrate the effect of each module of our model.
AB - Action segment detection is an important yet challenging problem, since we need to localize the proposals which contain an action instance in a long untrimmed video with arbitrary length and random position. This task requires us not only to find the precise moment of starting and ending of an action instance, but also to detect action instances as many as possible. We propose a new model Self-Adaptive Perception to address this problem. We predict the action boundaries by classifying start and end of each action as separate components, allowing our model to predict the starting and ending boundaries roughly and generate candidate proposals. We evaluate each candidate proposals by a novel and flexible architecture called Discriminator. It can extract enough semantic information and generate precise confidence score of whether a proposal contains an action within its region, which benefit from the self-adaptive architecture. We conduct solid and rich experiments on large dataset Activity-Net, the result shows that our method achieves a competitive performance, outperforming most published state-of-the-art method in the field. And further experiments demonstrate the effect of each module of our model.
KW - Action detection
KW - Deep learning
KW - Video analysis
UR - http://www.scopus.com/inward/record.url?scp=85112547604&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-80119-9_54
DO - 10.1007/978-3-030-80119-9_54
M3 - Conference contribution
AN - SCOPUS:85112547604
SN - 9783030801182
T3 - Lecture Notes in Networks and Systems
SP - 834
EP - 845
BT - Intelligent Computing - Proceedings of the 2021 Computing Conference
A2 - Arai, Kohei
PB - Springer Science and Business Media Deutschland GmbH
T2 - Computing Conference, 2021
Y2 - 15 July 2021 through 16 July 2021
ER -