undefined | Better HN

0 pointscriticaltinker4y ago0 comments

Good suggestion, it was tough to narrow down the list! Here is a link to the ViT paper in case others are interested [1].

According to the latest ImageNet standings [2], ViT appears to have slipped to second place in Top-1 Accuracy. CoAtNet-7 is the new leader, but only by a slight margin and at the cost of what appears to be a significantly larger model.

[1] Scaling Vision Transformers https://paperswithcode.com/paper/scaling-vision-transformers

[2] https://paperswithcode.com/sota/image-classification-on-imag...

0 comments

kettleballroll4y ago

That isn't the ViT paper, this one is https://paperswithcode.com/paper/an-image-is-worth-16x16-wor...

j / k navigate · click thread line to collapse

0 pointscriticaltinker4y ago0 comments

Good suggestion, it was tough to narrow down the list! Here is a link to the ViT paper in case others are interested [1].

[1] Scaling Vision Transformers https://paperswithcode.com/paper/scaling-vision-transformers

[2] https://paperswithcode.com/sota/image-classification-on-imag...

0 comments

kettleballroll4y ago

That isn't the ViT paper, this one is https://paperswithcode.com/paper/an-image-is-worth-16x16-wor...

j / k navigate · click thread line to collapse