Thoughts and Theory

The Vision Transformer (ViT) has been gaining momentum in recent years. This article explains the paper “Do Vision Transformers See Like Convolutional Neural Networks?” (Raghu et al., 2021), published by Google Research and Google Brain, and explores how Vision Transformers differ from conventionally used CNNs.

The abstract of this paper and the content of this blog

There are six…

Akihiro FUJII
