摘要
In this paper we present an empirical study of object category recognition using generalized samples and a set of sequential tests. We study 33 categories, each consisting of a small data set of 30 instances. To increase the amount of training data we have, we use a compositional object model to learn a representation for each category from which we select 30 additional templates with varied appearance from the training set. These samples better span the appearance space and form an augmented training set ΩT of 1980 (60x33) training templates. To perform recognition on a testing image, we use a set of sequential tests to project ΩT into different representation spaces to narrow the number of candidate matches in ΩT. We use "graphlets" (structural elements), as our local features and model ΩT at each stage using histograms of graphlets over categories, histograms of graphlets over object instances, histograms of pairs of graphlets over objects, shape context. Each test is increasingly computationally expensive, and by the end of the cascade we have a small candidate set remaining to use with our most powerful test, a top-down graph matching algorithm. We achieve an 81.4 % classification rate on classifying 800 testing images in 33 categories, 15.2% more accurate than a method without generalized samples.
源语言 | 英语 |
---|---|
DOI | |
出版状态 | 已出版 - 2007 |
活动 | 2007 IEEE 11th International Conference on Computer Vision, ICCV - Rio de Janeiro, 巴西 期限: 14 10月 2007 → 21 10月 2007 |
会议
会议 | 2007 IEEE 11th International Conference on Computer Vision, ICCV |
---|---|
国家/地区 | 巴西 |
市 | Rio de Janeiro |
时期 | 14/10/07 → 21/10/07 |