A Deep Learning Approach to Semantic Segmentation for Architectural Blueprint Interpretation and Geographic-based Material Recommendation

Authors

  • Ashley Chon TJHSST
  • Peter Gabor Thomas Jefferson High School for Science and Technology

DOI:

https://doi.org/10.47611/jsrhs.v13i1.6186

Keywords:

Semantic Segmentation, Blueprint Image, Materials Recommendation

Abstract

The planning and building of houses is financially-consuming and time-consuming, causing housing development to struggle to keep up with rapidly growing populations. In the rapid expansion of residences, the environmental implications of such developments are frequently disregarded. Though few recent scientists have attempted to leverage machine learning to expedite the house modeling process, these models do not take into consideration environmental factors such as geographic location, temperature, and landscaping. There is a pressing need for developing an automated and accurate system that offers comprehensive interpretation of architectural blueprints, providing essential guidance and insights for various aspects of the construction process. This study incorporates these factors by leveraging machine learning to develop software models for cost-effective and environmentally sustainable housing. To address the aforementioned problem, I propose a machine learning-based semantic segmentation network for architectural blueprint interpretation. This method is designed to address the complex task of understanding blueprints, discerning room types and their spatial arrangement. To achieve this, I employ convolutional neural networks widely recognized for their effectiveness in image analysis tasks. I also introduce a material recommendation method that aligns with the geographical context of the construction site. Extensive experiments, prove that the proposed method outperforms the previous state-of-the-art method in accurately generating semantic segmentation maps for the inputted blueprint images. I expect that these superior segmentation results will significantly enhance the architectural planning process by providing architects and designers with a more detailed and informative representation of blueprint layouts, thus aiding in better decision-making and design refinement.

Downloads

Download data is not yet available.

References or Bibliography

Chen, L. C., Papandreou, G., Schroff, F., & Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587. https://doi.org/10.48550/arXiv.1706.05587

Fu, J., Liu, J., Wang, Y., Zhou, J., Wang, C., & Lu, H. (2019). Stacked deconvolutional network for semantic segmentation. IEEE Transactions on Image Processing. https://doi.org/10.1109/TIP.2019.2895460

Lin, G., Milan, A., Shen, C., & Reid, I. (2017). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1925-1934). https://doi.org/10.48550/arXiv.1611.06612

Minaee, S., Boykov, Y., Porikli, F., Plaza, A., Kehtarnavaz, N., & Terzopoulos, D. (2021). Image segmentation using deep learning: A survey. IEEE transactions on pattern analysis and machine intelligence, 44(7), 3523-3542.

Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18 (pp. 234-241). Springer International Publishing. https://doi.org/10.48550/arXiv.1505.04597

Wang, G., Luo, P., Lin, L., & Wang, X. (2017). Learning object interactions and descriptions for semantic image segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5859-5867). https://doi.org/10.1109/CVPR.2017.556

Wu, Z., Shen, C., & Van Den Hengel, A. (2019). Wider or deeper: Revisiting the resnet model for visual recognition. Pattern Recognition, 90, 119-133. https://doi.org/10.48550/arXiv.1611.10080

Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2881-2890). https://doi.org/10.48550/arXiv.1612.01105

Published

02-29-2024

How to Cite

Chon, A., & Gabor, P. (2024). A Deep Learning Approach to Semantic Segmentation for Architectural Blueprint Interpretation and Geographic-based Material Recommendation. Journal of Student Research, 13(1). https://doi.org/10.47611/jsrhs.v13i1.6186

Issue

Section

HS Research Articles