Deep learning based anime character sketch quality recognition
DOI:
https://doi.org/10.47611/jsrhs.v13i2.6797
Keywords:
Deep learning, anime classification, transfer learning, image classification
Abstract
The rapid growth of the global anime market is poised to redefine the entertainment industry, with a valuation of USD 31.23 billion as of 2023 and a projected CAGR of 9.8% until 2030. At the heart of this growth is a vibrant community of artists and enthusiasts who face challenges such as artist scarcity and the lack of advanced tools for qualitative feedback and effective content promotion. Our study targets the core of anime creation, sketching, by evaluating the essential drawing elements and assessing their implementation in professional-quality anime portraits. We introduce an automated approach to predicting anime sketch quality using advances in deep learning. Using transfer learning, we extend three pre-trained models (MobileNetV2, ResNet50, and VGG16) with a customized dense layer, adapting them for the binary classification of anime character sketches. The dataset comprises 155 images, labeled 'Good' or 'Bad' to reflect the quality of the sketches. A balanced split into training, validation, and testing subsets allows a quantitative evaluation on data the model has not seen before. The models, pre-trained on ImageNet and fine-tuned on our dataset, show varied sensitivity to hyperparameters, with MobileNetV2 and ResNet50 attaining a peak validation accuracy of 94% and the highest test accuracy reaching 79%, indicating the approach's potential as a tool for quality assessment in the anime industry. An interesting outcome of the research is that the lightweight MobileNetV2 model, with far fewer parameters than the other models, achieved the highest test accuracy.
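The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' code: the 70/15/15 split, the per-class counts, the 224x224 input size, the 64-unit head, and the helper names `stratified_split` and `build_classifier` are all assumptions, since the abstract gives none of these details. The model uses the Keras MobileNetV2 application with a frozen ImageNet base, and TensorFlow is imported inside the function so the split helper runs on its own.

```python
import random
from collections import defaultdict


def stratified_split(labels, train_pct=70, val_pct=15, seed=0):
    """Split sample indices into train/val/test with similar class ratios.

    The 70/15/15 percentages are an assumption; the abstract gives no ratios.
    """
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    rng = random.Random(seed)
    train, val, test = [], [], []
    for indices in by_class.values():
        rng.shuffle(indices)
        n_train = len(indices) * train_pct // 100
        n_val = len(indices) * val_pct // 100
        train += indices[:n_train]
        val += indices[n_train:n_train + n_val]
        test += indices[n_train + n_val:]  # remainder goes to the test set
    return train, val, test


def build_classifier(input_shape=(224, 224, 3)):
    """Frozen ImageNet MobileNetV2 base plus a small dense head."""
    # Imported here so the split helper above works without TensorFlow.
    import tensorflow as tf

    base = tf.keras.applications.MobileNetV2(
        input_shape=input_shape, include_top=False, weights="imagenet")
    base.trainable = False  # keep the pre-trained features fixed

    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(64, activation="relu"),    # hypothetical head size
        tf.keras.layers.Dense(1, activation="sigmoid"),  # P(sketch is 'Good')
    ])
    model.compile(optimizer="adam", loss="binary_crossentropy",
                  metrics=["accuracy"])
    return model


# Hypothetical labels: 155 images in total; the per-class counts are made up.
labels = ["Good"] * 80 + ["Bad"] * 75
train_idx, val_idx, test_idx = stratified_split(labels)
print(len(train_idx), len(val_idx), len(test_idx))
```

Swapping `MobileNetV2` for `ResNet50` or `VGG16` from the same `tf.keras.applications` module would give the other two variants, although each expects its own input preprocessing.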
Copyright (c) 2024 Ishita Yanamadala; Atul Dubey, Divya Rajagiri
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distribute & display this article.