Dynamic Bayesian network based visual focus of attention recognition

Li Geng Dong; Hui Jun Di; Lin Mi Tao; Guang You Xu

Dynamic Bayesian network based visual focus of attention recognition

Li Geng Dong^*, Hui Jun Di, Lin Mi Tao, Guang You Xu

^*Corresponding author for this work

Tsinghua University

Research output: Contribution to journal › Article › peer-review

2 Citations (Scopus)

Abstract

Visual focus of attention recognition is usually based on head pose estimation. However, in a real application, it is difficult to accurately estimate the head pose due to large pose variations, low resolution images and varying illuminations. To handle the problem, we propose a dynamic Bayesian network model to infer the visual focus of attention. The head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face pose clusters. The model encodes the probabilistic relations among multiple foci of attention, multiple user locations and faces captured by multiple cameras. Data are collected in a prototype ambient kitchen and results show that the model is effective.

Original language	English
Pages (from-to)	140-146
Number of pages	7
Journal	Tien Tzu Hsueh Pao/Acta Electronica Sinica
Volume	39
Issue number	3 A
Publication status	Published - Mar 2011
Externally published	Yes

Keywords

Dynamic Bayesian network
The ambient kitchen
Visual focus of attention recognition

Cite this

@article{cd0950800249422490c3ee57fb981ac4,

title = "Dynamic Bayesian network based visual focus of attention recognition",

abstract = "Visual focus of attention recognition is usually based on head pose estimation. However, in a real application, it is difficult to accurately estimate the head pose due to large pose variations, low resolution images and varying illuminations. To handle the problem, we propose a dynamic Bayesian network model to infer the visual focus of attention. The head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face pose clusters. The model encodes the probabilistic relations among multiple foci of attention, multiple user locations and faces captured by multiple cameras. Data are collected in a prototype ambient kitchen and results show that the model is effective.",

keywords = "Dynamic Bayesian network, The ambient kitchen, Visual focus of attention recognition",

author = "Dong, {Li Geng} and Di, {Hui Jun} and Tao, {Lin Mi} and Xu, {Guang You}",

year = "2011",

month = mar,

language = "English",

volume = "39",

pages = "140--146",

journal = "Tien Tzu Hsueh Pao/Acta Electronica Sinica",

issn = "0372-2112",

publisher = "Chinese Institute of Electronics",

number = "3 A",

}

TY - JOUR

T1 - Dynamic Bayesian network based visual focus of attention recognition

AU - Dong, Li Geng

AU - Di, Hui Jun

AU - Tao, Lin Mi

AU - Xu, Guang You

PY - 2011/3

Y1 - 2011/3

N2 - Visual focus of attention recognition is usually based on head pose estimation. However, in a real application, it is difficult to accurately estimate the head pose due to large pose variations, low resolution images and varying illuminations. To handle the problem, we propose a dynamic Bayesian network model to infer the visual focus of attention. The head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face pose clusters. The model encodes the probabilistic relations among multiple foci of attention, multiple user locations and faces captured by multiple cameras. Data are collected in a prototype ambient kitchen and results show that the model is effective.

AB - Visual focus of attention recognition is usually based on head pose estimation. However, in a real application, it is difficult to accurately estimate the head pose due to large pose variations, low resolution images and varying illuminations. To handle the problem, we propose a dynamic Bayesian network model to infer the visual focus of attention. The head pose is not explicitly computed but measured by a similarity vector which represents the likelihoods of multiple face pose clusters. The model encodes the probabilistic relations among multiple foci of attention, multiple user locations and faces captured by multiple cameras. Data are collected in a prototype ambient kitchen and results show that the model is effective.

KW - Dynamic Bayesian network

KW - The ambient kitchen

KW - Visual focus of attention recognition

UR - http://www.scopus.com/inward/record.url?scp=79959611269&partnerID=8YFLogxK

M3 - Article

AN - SCOPUS:79959611269

SN - 0372-2112

VL - 39

SP - 140

EP - 146

JO - Tien Tzu Hsueh Pao/Acta Electronica Sinica

JF - Tien Tzu Hsueh Pao/Acta Electronica Sinica

IS - 3 A

ER -

Dynamic Bayesian network based visual focus of attention recognition

Abstract

Keywords

Other files and links

Fingerprint

Cite this