当前位置:网站首页>清华&商汤&上海AI&CUHK提出Siamese Image Modeling,兼具linear probing和密集预测性能!
清华&商汤&上海AI&CUHK提出Siamese Image Modeling,兼具linear probing和密集预测性能!
2022-06-26 17:32:00 【智源社区】
本文分享论文『Siamese Image Modeling for Self-Supervised Vision Representation Learning』,由清华(黄高组)&商汤(代季峰组)&上海AI Lab&CUHK提出Siamese Image Modeling,兼具linear probing和密集预测性能!

论文链接:
http://arxiv.org/abs/2206.01204
摘要
自监督学习(SSL)在各种下游视觉任务上都提供了优异的性能。目前提出了两种主流SSL框架,即实例鉴别(ID)和掩蔽图像建模(MIM)。ID将来自同一图像的不同视图的表示拉到在一起。它在 linear probing方面表现良好,但在检测性能方面较差。另一方面,MIM在给定mask图像的情况下重建原始内容。它擅长密集预测,但在linear probing上表现不佳。它们的区别是由于忽视了语义对齐或空间敏感性的表示要求。
具体而言,作者观察到:(1)语义对齐要求将语义相似的视图投影到附近的表示中,这可以通过对比不同的视图和强数据增强来实现;(2) 空间敏感性要求对图像中的局部结构进行建模。因此,使用掩蔽图像预测密集表示是有益的,因为它模拟了图像内容的条件分布。

边栏推荐
- [ten thousand words summary] starting from the end, analyze in detail how to fill in the college entrance examination volunteers
- 二分查找-2
- 牛客网:设计LRU缓存结构 设计LFU缓存结构
- Preparing for the Blue Bridge Cup and ccf-csp
- vue--vuerouter缓存路由组件
- Redis and database data consistency
- Leetcode HOT100 (22--- bracket generation)
- How sparksql returns a specific day of the week by date -dayofweek function
- Play with Linux and easily install and configure MySQL
- 20: Chapter 3: develop the pass service: 3: get through the redis server in the program; (it only connects with the redis server and does not involve specific business development)
猜你喜欢

Discussion: the next generation of stable coins

SQL injection for Web Security (3)

Redis OM . Net redis object mapping framework

LeetCode——226. Flip binary tree (BFS)

ACL 2022 | 基于神经标签搜索的零样本多语言抽取式文本摘要

10 cloud security best practices that enterprises need to know

The latest masterpiece of Alibaba, which took 182 days to produce 1015 pages of distributed full stack manual, is so delicious

Web3 decentralized storage ecological landscape
![[buuctf.reverse] 126-130](/img/df/e35633d85caeff1dece62a66cb7804.png)
[buuctf.reverse] 126-130

Vscode usage - Remote SSH configuration description
随机推荐
Prometeus 2.34.0 new features
类型多样的石膏PBR多通道贴图素材,速来收藏!
[suggested collection] 11 online communities suitable for programmers
Today, I met a "migrant worker" who took out 38K from Tencent, which let me see the ceiling of the foundation
Leetcode topic [array] -283- move zero
Platform management background and merchant menu resource management: merchant registration management design
并发之Synchronized说明
Army chat -- registration of Registration Center
玩转Linux,轻松安装配置MySQL
Implementation of MySQL master-slave architecture
Don't believe it, 98% of programmers are like this
Redis and database data consistency
【万字总结】以终为始,详细分析高考志愿该怎么填
Secrets of gear contract
The texstudio official website cannot be opened
When I was in the library, I thought of the yuan sharing mode
Interpretation of new plug-ins | how to enhance authentication capability with forward auth
分布式缓存/缓存集群简介
Classical synchronization problem
接水面试题