Alexandrov, P. A., A. A. Prusakov, G. N. Antonova, M. N. Shakhov, S. E. Stelmak, A. V. Beklemisheva, and V. G. Sarkisov. “The Potential of Current Multimodal Transformers for Image Analysis”. Russian Journal of Cybernetics, Vol. 7, no. 1, Mar. 2026, pp. 93-103, https://en.jcyb.ru/nisii_tech/article/view/486.