A Deep Learning Approach to Semantic Segmentation for Architectural Blueprint Interpretation and Geographic-based Material Recommendation
DOI:
https://doi.org/10.47611/jsrhs.v13i1.6186Keywords:
Semantic Segmentation, Blueprint Image, Materials RecommendationAbstract
The planning and building of houses is financially-consuming and time-consuming, causing housing development to struggle to keep up with rapidly growing populations. In the rapid expansion of residences, the environmental implications of such developments are frequently disregarded. Though few recent scientists have attempted to leverage machine learning to expedite the house modeling process, these models do not take into consideration environmental factors such as geographic location, temperature, and landscaping. There is a pressing need for developing an automated and accurate system that offers comprehensive interpretation of architectural blueprints, providing essential guidance and insights for various aspects of the construction process. This study incorporates these factors by leveraging machine learning to develop software models for cost-effective and environmentally sustainable housing. To address the aforementioned problem, I propose a machine learning-based semantic segmentation network for architectural blueprint interpretation. This method is designed to address the complex task of understanding blueprints, discerning room types and their spatial arrangement. To achieve this, I employ convolutional neural networks widely recognized for their effectiveness in image analysis tasks. I also introduce a material recommendation method that aligns with the geographical context of the construction site. Extensive experiments, prove that the proposed method outperforms the previous state-of-the-art method in accurately generating semantic segmentation maps for the inputted blueprint images. I expect that these superior segmentation results will significantly enhance the architectural planning process by providing architects and designers with a more detailed and informative representation of blueprint layouts, thus aiding in better decision-making and design refinement.
Downloads
References or Bibliography
Chen, L. C., Papandreou, G., Schroff, F., & Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587. https://doi.org/10.48550/arXiv.1706.05587
Fu, J., Liu, J., Wang, Y., Zhou, J., Wang, C., & Lu, H. (2019). Stacked deconvolutional network for semantic segmentation. IEEE Transactions on Image Processing. https://doi.org/10.1109/TIP.2019.2895460
Lin, G., Milan, A., Shen, C., & Reid, I. (2017). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1925-1934). https://doi.org/10.48550/arXiv.1611.06612
Minaee, S., Boykov, Y., Porikli, F., Plaza, A., Kehtarnavaz, N., & Terzopoulos, D. (2021). Image segmentation using deep learning: A survey. IEEE transactions on pattern analysis and machine intelligence, 44(7), 3523-3542.
Ronneberger, O., Fischer, P., & Brox, T. (2015). U-net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2015: 18th International Conference, Munich, Germany, October 5-9, 2015, Proceedings, Part III 18 (pp. 234-241). Springer International Publishing. https://doi.org/10.48550/arXiv.1505.04597
Wang, G., Luo, P., Lin, L., & Wang, X. (2017). Learning object interactions and descriptions for semantic image segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5859-5867). https://doi.org/10.1109/CVPR.2017.556
Wu, Z., Shen, C., & Van Den Hengel, A. (2019). Wider or deeper: Revisiting the resnet model for visual recognition. Pattern Recognition, 90, 119-133. https://doi.org/10.48550/arXiv.1611.10080
Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2881-2890). https://doi.org/10.48550/arXiv.1612.01105
Published
How to Cite
Issue
Section
Copyright (c) 2024 Ashley Chon; Peter Gabor
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.