MANAGING DIGITAL PHOTOGRAPHY ASSETS WITH AI TOOLS
DOI:
https://doi.org/10.29121/shodhkosh.v7.i1s.2026.7116Keywords:
Digital Asset Management, Artificial Intelligence, Image Classification, Multimodal Learning, Automated Tagging, Photography WorkflowsAbstract [English]
The management of high-resolution photographs collections in large quantities has been complicated by the fact that the volumes of images, their heterogeneous file formats and full but usually sporadic metadata has grown exponentially. Manual and rule-based Digital Asset Management (DAM) systems are not conducive to the efficient organization, retrieval and re-use of photographic assets, especially by the professional photographer and creative industries that have a restrictive production schedule. The paper suggests an AI-powered model of digital photography management based on introducing the latest developments in computer vision, multimodal learning, and natural language processing. It uses the models of deep learning based on image classification, such as convolutional neural networks and vision transformer, to process visual information automatically and retrieve high-level semantic information. Such characteristics are incorporated in shared representation spaces to facilitate semantic clustering, similarity search and context based organization. Simultaneously, natural language processing methods are used to create descriptive labels and tags by combining visual intelligence with additional hidden data like EXIF information, capture settings and geospatial position. The offered system also includes image quality scoring and aesthetic evaluation modules to help photographers to curate, rank, and choose images according to technical and artistic requirements.
References
Achterberg, J., Arel, R., Grinberg, T., Chaibi, A., Bach, J., and Tzagkarakis, N. (2023). Generative Image Model Benchmark for Reasoning and Representation (GIMBRR). In Proceedings of the AAAI 2023 Spring Symposium Series (EDGeS), San Mateo, CA, United States (March 27–29).
Amer, S. (2023). AI Imagery and the Overton Window. SSRN. https://doi.org/10.2139/ssrn.4776793 DOI: https://doi.org/10.2139/ssrn.4776793
Betzalel, E., Penso, C., Navon, A., and Fetaya, E. (2022). A Study on the Evaluation of Generative Models. arXiv (arXiv:2206.10935).
Borji, A. (2023). Generated Faces in the Wild: Quantitative Comparison of Stable Diffusion, Midjourney and DALL-E 2. arXiv (arXiv:2210.00586).
Brisco, R., Hay, L., and Dhami, S. (2023). Exploring the Role of Text-to-Image AI in Concept Generation. Proceedings of the Design Society, 3, 1835–1844. https://doi.org/10.1017/pds.2023.184 DOI: https://doi.org/10.1017/pds.2023.184
Bubaš, G., Čižmešija, A., and Kovačić, A. (2024). Development of an Assessment Scale for Measurement of Usability and User Experience Characteristics of Bing Chat Conversational AI. Future Internet, 16, 4. https://doi.org/10.3390/fi16010004 DOI: https://doi.org/10.3390/fi16010004
Ferreira, Â., and Casteleiro-Pitrez, J. (2023). Inteligência Artificial no Design de Comunicação em Portugal: Estudo de Caso sobre as Perspetivas de 10 Designers Profissionais de Pequenas e Médias Empresas. ROTURA—Revista de Comunicação, Cultura e Artes, 3, 114–133.
Martínez, G., Watson, L., Reviriego, P., Hernández, J. A., Juarez, M., and Sarkar, R. (2024). Towards Understanding the Interplay of Generative Artificial Intelligence and the Internet. In F. Cuzzolin and M. Sultana (Eds.), Epistemic Uncertainty in Artificial Intelligence (Epi UAI 2023) (Lecture Notes in Computer Science, Vol. 14523). Springer. https://doi.org/10.1007/978-3-031-57963-9_5 DOI: https://doi.org/10.1007/978-3-031-57963-9_5
Oppenlaender, J. (2022). The Creativity of Text-to-Image Generation. In Proceedings of the 25th International Academic Mindtrek Conference (Academic Mindtrek ’22) (192–202). Association for Computing Machinery. https://doi.org/10.1145/3569219.3569352 DOI: https://doi.org/10.1145/3569219.3569352
Oppenlaender, J. (2024). The Cultivated Practices of Text-to-Image Generation. In R. Rousi, C. von Koskull, and V. Roto (Eds.), Humane Autonomous Technology (Chapter 14). Palgrave Macmillan. https://doi.org/10.1007/978-3-031-66528-8_14 DOI: https://doi.org/10.1007/978-3-031-66528-8_14
Paananen, V., Oppenlaender, J., and Visuri, A. (2023). Using Text-to-Image Generation for Architectural Design Ideation. International Journal of Architectural Computing, 22(3), 458–474. https://doi.org/10.1177/14780771231222783 DOI: https://doi.org/10.1177/14780771231222783
Pažin, L. (2024). USING PlatfoRMS and Tools To Create Business IntelligenCE. ShodhAI: Journal of Artificial Intelligence, 1(1), 68–75. https://doi.org/10.29121/shodhai.v1.i1.2024.10 DOI: https://doi.org/10.29121/shodhai.v1.i1.2024.10
Radford, A., Kim, J. W., Hallacy, C., Ramesh, A., Goh, G., Agarwal, S., Sastry, G., Askell, A., Mishkin, P., Clark, J., Krueger, G., and Sutskever, I. (2021). Learning Transferable Visual Models From Natural Language Supervision. In Proceedings of the 38th International Conference on Machine Learning (Proceedings of Machine Learning Research, Vol. 139, 8748–8763). PMLR.
Samuelson, P. (2023). Generative AI Meets Copyright. Science, 381, 158–161. https://doi.org/10.1126/science.adi0656 DOI: https://doi.org/10.1126/science.adi0656
Schetinger, V., Di Bartolomeo, S., El-Assady, M., McNutt, A., Miller, M., Passos, J., and Adams, J. (2023). Doom or Deliciousness: Challenges and Opportunities for Visualization in the Age of Generative Models. Computer Graphics Forum, 42, 423–435. https://doi.org/10.1111/cgf.14841 DOI: https://doi.org/10.1111/cgf.14841
Shneiderman, B. (2022). Human Centered AI. Oxford University Press. https://doi.org/10.1145/3538882.3542790 DOI: https://doi.org/10.1093/oso/9780192845290.001.0001
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Dr. Vikrant Nangare, Chandrashekhar Ramesh Ramtirthkar, Keerthika K, Dr. Gajanan P Arsalwad, Priyadarshani Singh, Prabhakar Sharma

This work is licensed under a Creative Commons Attribution 4.0 International License.
With the licence CC-BY, authors retain the copyright, allowing anyone to download, reuse, re-print, modify, distribute, and/or copy their contribution. The work must be properly attributed to its author.
It is not necessary to ask for further permission from the author or journal board.
This journal provides immediate open access to its content on the principle that making research freely available to the public supports a greater global exchange of knowledge.























