DIGITAL ARCHIVING OF FOLK ART THROUGH MACHINE LEARNING
DOI:
https://doi.org/10.29121/shodhkosh.v7.i1s.2026.7083Keywords:
Digital Archiving, Folk Art Preservation, Machine Learning, Computer Vision, Hybrid CNN–Transformer Models, Contrastive Learning, Class Imbalance, Content-Based Image RetrievalAbstract [English]
Computational problems in the digital preservation of folk art are distinct because of the visual diversities, long tail distributions of motifs, and incomplete meta-data. The proposed paper presents a machine learning-based intelligent system of digital archiving folk art, which incorporates the visual feature learning, the automatic mechanisms of semantic enrichment, and the scaling of the retrieval mechanisms. The method uses both a hybrid convolutional neural network and vision transformer architecture to learn local motifs patterns as well as global compositional patterns. In order to tackle the class imbalance and style difference, contrastive self-supervised learning is used together with imbalance-aware supervised losses, such as class reweighting and focal loss. The expert assisted and weakly supervised annotations are used to form a structured dataset and then it is rigorously statistically analyzed to inform the model design. The experimental outcomes indicate that there is a high enhancement in the macro-averaging classification performance and content-based image retrieval precision, especially in the underrepresented motifs. System surface analysis indicates that embedding-based indexing and use of approximate nearest-neighbor search has a low query latency and strong retrieval fairness of motif frequencies. The suggested framework can allow scalable, equitable, and semantically meaningful digital archiving which serves as a generalizable answer to cultural heritage preservation and multimedia archival systems.
References
Alpaydin, E. (2020). Introduction to Machine Learning. Mit Press.
Adegbite, M. A. (2025). Data Privacy and Data Security Challenges in Digital Finance. Journal of Digital Security and Forensics, 2(1), 6–19. https://doi.org/10.29121/digisecforensics.v2.i1.2025.40 DOI: https://doi.org/10.29121/digisecforensics.v2.i1.2025.40
Bassier, M., Bonduel, M., Derdaele, J., and Vergauwen, M. (2020). Processing Existing Building Geometry for Reuse as Linked Data. Automation in Construction, 115, 103180. https://doi.org/10.1016/j.autcon.2020.103180 DOI: https://doi.org/10.1016/j.autcon.2020.103180
Bongini, P., Becattini, F., and Del Bimbo, A. (2022). Is GPT-3 All You Need for Visual Question Answering in Cultural Heritage? In European Conference on Computer Vision (ECCV) (268–281). Springer. https://doi.org/10.1007/978-3-031-25056-9_18 DOI: https://doi.org/10.1007/978-3-031-25056-9_18
Casillo, M., Colace, F., Conte, D., Lombardi, M., Santaniello, D., and Valentino, C. (2023). Context-Aware Recommender Systems and Cultural Heritage: A Survey. Journal of Ambient Intelligence and Humanized Computing, 14, 3109–3127. Https://Doi.Org/10.1007/S12652-021-03438-9 DOI: https://doi.org/10.1007/s12652-021-03438-9
Castano, S., Ferrara, A., Montanelli, S., and Periti, F. (2021, September). From Digital to Computational Humanities: The VAST Project Vision. In Proceedings of the Italian Symposium on Advanced Database Systems (Vol. 2994, 24–35).
Europeana Pro. (2023). CRAFTED: Enrich and Promote Traditional and Contemporary Crafts.
Farella, E. M., Malek, S., and Remondino, F. (2022). Colorizing the Past: Deep Learning for the Automatic Colorization of Historical Aerial Images. Journal of Imaging, 8(10), 269. https://doi.org/10.3390/jimaging8100269 DOI: https://doi.org/10.3390/jimaging8100269
Fiorucci, M., Khoroshiltseva, M., Pontil, M., Traviglia, A., Del Bue, A., and James, S. (2020). Machine Learning for Cultural Heritage: A Survey. Pattern Recognition Letters, 133, 102–108. https://doi.org/10.1016/j.patrec.2020.02.017 DOI: https://doi.org/10.1016/j.patrec.2020.02.017
Haliassos, A., Barmpoutis, P., Stathaki, T., Quirke, S., and Constantinides, A. (2020). Classification and Detection of Symbols in Ancient Papyri. In F. Liarokapis, A. Voulodimos, N. Doulamis, and A. Doulamis (Eds.), Visual Computing for Cultural Heritage (121–140). Springer. https://doi.org/10.1007/978-3-030-37191-3_7 DOI: https://doi.org/10.1007/978-3-030-37191-3_7
Helm, J. M., Swiergosz, A. M., Haeberle, H. S., Karnuta, J. M., Schaffer, J. L., Krebs, V. E., Spitzer, A. I., and Ramkumar, P. N. (2020). Machine Learning and Artificial Intelligence: Definitions, Applications, and Future Directions. Current Reviews in Musculoskeletal Medicine, 13, 69–76. https://doi.org/10.1007/s12178-020-09600-8 DOI: https://doi.org/10.1007/s12178-020-09600-8
Konstantakis, M., Alexandridis, G., And Caridakis, G. (2020). A Personalized Heritage-Oriented Recommender System Based on Extended Cultural Tourist Typologies. Big Data and Cognitive Computing, 4(1), 12. https://doi.org/10.3390/bdcc4020012 DOI: https://doi.org/10.3390/bdcc4020012
Li, P., Shi, Z., Ding, Y., Zhao, L., Ma, Z., Xiao, H., and Li, H. (2021). Analysis of the Temporal and Spatial Characteristics of Material Cultural Heritage Driven by Big Data—Take Museum Relics as an Example. Information, 12(4), 153. https://doi.org/10.3390/info12040153 DOI: https://doi.org/10.3390/info12040153
Moral-Andrés, F., Merino-Gómez, E., Reviriego, P., and Lombardi, F. (2022). Can Artificial Intelligence Reconstruct Ancient Mosaics? Studies in Conservation, 1–14.
Nockels, J., Gooding, P., Ames, S., and Terras, M. (2022). Understanding the Application of Handwritten Text Recognition Technology in Heritage Contexts: A Systematic Review of Transkribus in published research. Archival Science, 22, 367–392. https://doi.org/10.1007/s10502-022-09397-0 DOI: https://doi.org/10.1007/s10502-022-09397-0
Rei, L., Mladenic, D., Dorozynski, M., Rottensteiner, F., Schleider, T., Troncy, R., Lozano, J. S., and Salvatella, M. G. (2023). Multimodal Metadata Assignment for Cultural Heritage Artifacts. Multimedia Systems, 29, 847–869. https://Doi.Org/10.1007/S00530-022-01025-2 DOI: https://doi.org/10.1007/s00530-022-01025-2
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Nagajayant Nagamani, Kalpana Rawat, Prashant Anerao, Dr. Rajesh Uttam Kanthe, Atish Baburao Mane, Ponmurugan Panneerselvam

This work is licensed under a Creative Commons Attribution 4.0 International License.
With the licence CC-BY, authors retain the copyright, allowing anyone to download, reuse, re-print, modify, distribute, and/or copy their contribution. The work must be properly attributed to its author.
It is not necessary to ask for further permission from the author or journal board.
This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.























