Pranav Durai*
Stanford Center for Innovation in In Vivo Imaging, Stanford University School of Medicine, Stanford, CA 94305, USA
*Corresponding Author: Pranav Durai, Stanford Center for Innovation in In Vivo Imaging, Stanford University School of Medicine, Stanford, CA 94305, USA.
Received: September 06, 2024; Published: December 13, 2024
Citation: Pranav Durai. “QuViT: Quantum Vision Transformer". Acta Scientific Computer Sciences 6.9 (2024):05-10.
Image classification has traditionally relied on Convolutional Neural Networks (CNNs) for their ability to extract visual features, recognize and learn patterns. However, the emergence of Vision Transformers (ViTs) as an alternative approach, inspired by Trans formers in language tasks, brings the potential for capturing global image relationships and achieving competitive performance, interpretability, and scalability. The field of quantum computing has shown great promise, heralding a new era in computation and problem-solving. By harnessing the principles of quantum mechanics, quantum computers offer the potential to perform calculations at a scale and speed that surpass classical computers. This paper introduces QuViT, a quantum-accelerated vision transformer. With a novel q-input engine, q-encoder, and q-decoder, the proposed QuViT model follows a hybrid- approach that provides a promising avenue for building a quantum vision transformer that can handle yottabyte- scale image classification tasks with high accuracy, ef ficiency and paradigm shifting performance.
Citation: Pranav Durai. “QuViT: Quantum Vision Transformer".Acta Scientific Computer Sciences 6.9 (2025): 05-10.
Copyright: © 2024 Pranav Durai. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.