Estimating Nutritional Composition from Food Volume Via Deep Learning-Based Depth and Segmentation Models

Anh Le

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

Anh Do

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

Thanh Nguyen

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

Binh Nguyen

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

An Tran

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

Nha Tran *

Faculty of Information Technology, Ho Chi Minh City University of Education, Ho Chi Minh City, Vietnam.

*Author to whom correspondence should be addressed.


Abstract

Nutrition plays a critical role in human health, with a balanced diet being essential for preventing non-communicable diseases, enhancing immune function, and improving quality of life. However, dietary imbalances contribute to significant global health issues, including obesity and malnutrition, which have far-reaching economic and health consequences. This research aims to address these challenges by developing a method for estimating the nutritional content of food items from a single 2D image. Our approach integrates a U-Net architecture with a ResNet18 encoder for depth prediction and employs FoodSAM for precise food segmentation. These components enable the calculation of food volume and mass, which are then used to estimate nutritional content based on the USDA database. Experimental results show that our model achieves a mean relative error (MRE) ranging from 11.18% to 50.35% for individual food items. Furthermore, our method maintains consistent mass predictions across various scenarios, including complex food combinations. This method demonstrates robustness in handling foods with diverse shapes and colors, providing a solid foundation for practical dietary tracking applications. By enabling nutritional monitoring, our approach has the potential to support public health initiatives and promote healthier lifestyles.

Keywords: Computer vision, nutritional estimation, volume estimation, depth estimation, food segmentation


How to Cite

Le, Anh, Anh Do, Thanh Nguyen, Binh Nguyen, An Tran, and Nha Tran. 2025. “Estimating Nutritional Composition from Food Volume Via Deep Learning-Based Depth and Segmentation Models ”. Asian Journal of Research in Computer Science 18 (5):219-33. https://doi.org/10.9734/ajrcos/2025/v18i5650.

Downloads

Download data is not yet available.