Architectural Space Recognition from Blueprints Using Machine Learning-Based Semantic Segmentation
DOI:
https://doi.org/10.47611/jsrhs.v13i2.6483Keywords:
Blueprint, Convolutional Neural Network, Image SegmentationAbstract
Architectural blueprints have been providing fundamental processes for urban planning, construction, and interior design for professionals. It contains a visual representation of building plans and specifications in the construction and design industry. However, for the non-expert, it is difficult to understand these complex technical drawings. This is because of the specialized language, symbols, and technical knowledge required to understand architectural blueprints. This challenge often leads to misunderstandings which can extend the construction process and escalate costs. To bridge this knowledge gap, in this research paper, I propose a machine-based blueprint interpretation method using semantic segmentation. The proposed method takes blueprints as input and generates semantic segmentation maps. These maps categorize and isolate distinct architectural areas into predefined classes, including rooms, kitchens, and bathrooms. The proposed machine learning model is tested on a publicly available dataset of architectural blueprints. Through comprehensive quantitative and qualitative assessments, it is shown that the proposed method achieves state-of-the-art performance.
Downloads
References or Bibliography
Chen, L. C., Papandreou, G., Schroff, F., & Adam, H. (2017). Rethinking atrous convolution for semantic image segmentation. arXiv preprint arXiv:1706.05587. https://doi.org/10.48550/arXiv.1706.05587
Lin, G., Milan, A., Shen, C., & Reid, I. (2017). Refinenet: Multi-path refinement networks for high-resolution semantic segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 1925-1934). https://doi.org/10.48550/arXiv.1611.06612
Liu, X., Deng, Z., & Yang, Y. (2019). Recent progress in semantic image segmentation. Artificial Intelligence Review, 52, 1089-1106.
Wang, G., Luo, P., Lin, L., & Wang, X. (2017). Learning object interactions and descriptions for semantic image segmentation. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 5859-5867). https://doi.org/10.1109/CVPR.2017.556
Wu, Z., Shen, C., & Van Den Hengel, A. (2019). Wider or deeper: Revisiting the resnet model for visual recognition. Pattern Recognition, 90, 119-133. https://doi.org/10.48550/arXiv.1611.10080
Zhao, H., Shi, J., Qi, X., Wang, X., & Jia, J. (2017). Pyramid scene parsing network. In Proceedings of the IEEE conference on computer vision and pattern recognition (pp. 2881-2890). https://doi.org/10.48550/arXiv.1612.01105
Published
How to Cite
Issue
Section
Copyright (c) 2024 Shiho Yu; Su Jin
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.