Hierarchical Attention Network for Visually-Aware Food Recommendation

Xiaoyan Gao, Fuli Feng, Xiangnan He, Heyan Huang*, Xinyu Guan, Chong Feng, Zhaoyan Ming, Tat Seng Chua

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

77 Citations (Scopus)

Abstract

Food recommender systems play an important role in assisting users to identify the desired food to eat. Deciding what food to eat is a complex and multi-faceted process, which is influenced by many factors such as the ingredients, appearance of the recipe, the user's personal preference on food, and various contexts like what had been eaten in the past meals. This work formulates the food recommendation problem as predicting user preference on recipes based on three key factors that determine a user's choice on food, namely, 1) the user's (and other users') history; 2) the ingredients of a recipe; and 3) the descriptive image of a recipe. To address this challenging problem, this work develops a dedicated neural network-based solution Hierarchical Attention based Food Recommendation (HAFR) which is capable of: 1) capturing the collaborative filtering effect like what similar users tend to eat; 2) inferring a user's preference at the ingredient level; and 3) learning user preference from the recipe's visual images. To evaluate our proposed method, this work constructs a large-scale dataset consisting of millions of ratings from AllRecipes.com. Extensive experiments show that our method outperforms several competing recommender solutions like Factorization Machine and Visual Bayesian Personalized Ranking with an average improvement of 12%, offering promising results in predicting user preference on food.

Original languageEnglish
Article number8859291
Pages (from-to)1647-1659
Number of pages13
JournalIEEE Transactions on Multimedia
Volume22
Issue number6
DOIs
Publication statusPublished - Jun 2020

Keywords

  • Collaborative Filtering
  • Food Recommender Systems
  • Hierarchical Attention
  • Ingredients
  • Recipe Image

Fingerprint

Dive into the research topics of 'Hierarchical Attention Network for Visually-Aware Food Recommendation'. Together they form a unique fingerprint.

Cite this