Deep learning based anime character sketch quality recognition

Authors

  • Ishita Yanamadala Dougherty Valley High School
  • Atul Dubey
  • Divya Rajagiri

DOI:

https://doi.org/10.47611/jsrhs.v13i2.6797

Keywords:

Deep learning, anime classification, transfer learning, image classification

Abstract

The explosion of the global anime market is poised to redefine the entertainment industry, boasting a valuation of USD 31.23 billion as of 2023 and projected to climb at a CAGR of 9.8% until 2030. At the heart of this growth is a vibrant community of artists and enthusiasts who navigate challenges such as artist scarcity and the lack of advanced tools for qualitative feedback and effective content promotion. Our study targets the core of anime creation sketching by evaluating the essential drawing elements and assessing their implementation in professional-quality anime portraits. We introduce an automated approach to predicting the anime quality using advances in deep learning. Utilizing transfer learning, we enhance three pre-trained models—MobileNetV2, ResNet50, and VGG16—with a customized dense layer, refining their capabilities for the binary classification of anime character sketches. The dataset employed comprises 155 images, categorized as 'Good' and 'Bad' to reflect the quality of sketches. A balanced split into training, validation, and testing subsets ensures a quantitative evaluation of data not seen previously by the model. The models, pre-trained on ImageNet and fine-tuned with our dataset, demonstrate varied sensitivity to hyperparameters, with the MobileNetV2 and ResNet50 model attaining a peak validation accuracy of 94% and the highest test accuracy of 79% indicating its potential as a robust tool for quality assessment in the anime industry. An interesting outcome of the research is that a lightweight MobileNetV2 model with much fewer parameters compared to other models resulted in the highest test accuracy.

Downloads

Download data is not yet available.

References or Bibliography

Anime Market Size, Share & Trends Analysis Report By Type (T.V., Movie, Video, Internet Distribution, Merchandising, Music), By Genre (Action & Adventure, Sci-Fi & Fantasy, Romance & Drama, Sports, and Others), By Region, And Segment Forecasts, 2024 - 2030. (2023, Jan 28). Anime Market Size & Trends. https://www.grandviewresearch.com/industry-analysis/anime-market

Anime Market Outlook. (2023, jan 25). https://www.futuremarketinsights.com/reports/anime-market

https://headphonesaddict.com/anime-statistics/

Pan, Y. (2018). Adapt or Die! The Social and Economic Dynamics of Japan’s Animation Industry.

https://repository.usfca.edu/capstone/762/

Pencil. (2023, April 6). Top 10 Fundamentals For Drawing and Sketching. Pencil Perceptions. https://www.pencilperceptions.com/10-fundamentals-for-drawing-and-sketching/

Li, H., Guo, S., Lyu, K., Yang, X., Chen, T., Zhu, J., & Zeng, H. (2022). A Challenging Benchmark of Anime Style Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 4721-4730). https://arxiv.org/pdf/2204.14034.pdf

Huang, Z., Xie, H., Fukusato, T., & Miyata, K. (2023). AniFaceDrawing: Anime Portrait Exploration during Your Sketching. arXiv preprint arXiv:2306.07476. https://dl.acm.org/doi/abs/10.1145/3588432.3591548

Hammond, T., Kumar, S. P. A., Runyon, M., Cherian, J., Williford, B., Keshavabhotla, S., ... & Linsey, J. (2018). It’s not just about accuracy: Metrics that matter when modeling expert sketching ability. ACM Transactions on Interactive Intelligent Systems (TiiS), 8(3), 1-47. https://dl.acm.org/doi/pdf/10.1145/3181673

Byrne, I., Kanaoka, Y., Pollack, N. E., Rhee, H. J., & Sommers, P. M. (2019). An Analysis of Airport Delays Across the United States, 2012-2018. Journal of Student Research, 8(2). https://doi.org/10.47611/jsr.v8i2.775

Dong, K., Zhou, C., Ruan, Y., & Li, Y. (2020, December). MobileNetV2 model for image classification. In 2020 2nd International Conference on Information Technology and Computer Application (ITCA) (pp. 476-480). IEEE. https://ieeexplore.ieee.org/abstract/document/9422058

Koonce, B., & Koonce, B. (2021). ResNet 50. Convolutional Neural Networks with Swift for Tensorflow: Image Recognition and Dataset Categorization, 63-72. https://link.springer.com/chapter/10.1007/978-1-4842-6168-2_6

O'Shea, K., & Nash, R. (2015). An introduction to convolutional neural networks. arXiv preprint arXiv:1511.08458. https://www.researchgate.net/publication/337105858_Transfer_learning_using_VGG-16_with_Deep_Convolutional_Neural_Network_for_Classifying_Images

Hosna, A., Merry, E., Gyalmo, J., Alom, Z., Aung, Z., & Azim, M. A. (2022). Transfer learning: a friendly introduction. Journal of Big Data, 9(1), 102. https://journalofbigdata.springeropen.com/articles/10.1186/s40537-022-00652-w

Li, H., Guo, S., Lyu, K., Yang, X., Chen, T., Zhu, J., & Zeng, H. (2022). A Challenging Benchmark of Anime Style Recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (pp. 4721-4730). https://openaccess.thecvf.com/content/CVPR2022W/VDU/html/Li_A_Challenging_Benchmark_of_Anime_Style_Recognition_CVPRW_2022_paper.html

Published

05-31-2024

How to Cite

Yanamadala, I., Dubey, A., & Rajagiri, D. (2024). Deep learning based anime character sketch quality recognition. Journal of Student Research, 13(2). https://doi.org/10.47611/jsrhs.v13i2.6797

Issue

Section

HS Research Projects