当前位置:网站首页>Talk about the multimodal project of fire
Talk about the multimodal project of fire
2022-06-21 09:59:00 【woshicver】
Multimodal machine learning , English full name MultiModal Machine Learning (MMML), The aim is to achieve the ability to process and understand multi-source modal information by means of machine learning .
Each source or form of information , You can call it a mode . for example , People have a sense of touch , auditory , Vision , The sense of smell ; The message has voice 、 video 、 Words and other media ; A variety of sensors , Such as radar 、 infrared 、 Accelerometer, etc. . Each of the above can be called a mode .
Modes can also be very broadly defined , For example, we can think of two different languages as two modes , Even the data sets collected in two different cases , You can think of it as two modes .
The present , Multimodal technology has a wide range of application scenarios , Such as Taobao Search 、AI subtitle 、AI Virtual digital human 、 Humanoid interaction 、 Intelligent assistant 、 Product recommendation and information flow advertising 、 Image vector retrieval of video frame and face frame 、 Voice interaction, etc .
We are honored to invite in-service senior algorithm researchers Clark teacher , utilize 1 About an hour or so , Systematically sort out multimodal technology for you .
Live sharing
01
PART
01 The development trend of multimodal models
02 Multimodal data set
03 Common multimodal downstream tasks
02
PART
Lecturer

Live time
03
PART
6 month 22 Friday night 20:00-21:00
Students interested in multimodal Technology , Scan the QR code below , Reservation live broadcast .

Sweep code payment 0.1 Yuan means the appointment is successful
Live broadcast when the party staff contact you ~
04
PART
Multimodal learning path

01 Fundamentals of multimodal theory
Study multimodal pre training related papers ——CLIP、ALIGN、VILT
02 Self supervised algorithm
Learn some self-monitoring schemes that may be used in multimodal pre training ——MAE、DINO、MOCO
03 Introduction to multimodal downstream tasks
Mainly understand VQA The tasks and nlvr Mission
04 Multimodal applications
Image Captioning Case study 、 Alibaba e-commerce cross modal retrieval case . Understand the task introduction 、baseline build 、 Model optimization 、 Result display .
05 Multimodal project
AI Smart copywriting 、 Mobile photo album management and retrieval based on multimodal pre training model 、AI Lip recognition 、 Automatic driving based on deep multimodal target detection and semantic segmentation
6 month 22 Friday night 20:00-21:00
Students interested in multimodal Technology , Scan the QR code below , Reservation live broadcast .

Sweep code payment 0.1 Yuan means the appointment is successful
Live broadcast when the party staff contact you ~

边栏推荐
- Definition of annotations and annotation compiler
- Verification code ----- SVG captcha
- Form Validation
- Embedded software project process and project startup instructions (example)
- 程序员新人周一优化一行代码,周三被劝退?
- Appareils pris en charge par Arcore
- 使用shapeit进行单倍型分析
- Several ways to trigger link jump
- Mid 2022 Summary - step by step, step by step
- 从零开始做网站11-博客开发
猜你喜欢

【实战】STM32 FreeRTOS移植系列教程1:FreeRTOS 二值信号量使用
![[practice] stm32mp157 development tutorial FreeRTOS system 3: FreeRTOS counting semaphore](/img/b1/e4b944877fecc079a772b81c55bfc8.jpg)
[practice] stm32mp157 development tutorial FreeRTOS system 3: FreeRTOS counting semaphore

Stm32mp1 cortex M4 Development Chapter 11: expansion board buzzer control

程序員新人周一優化一行代碼,周三被勸退?

stm32mp1 Cortex M4开发篇11:扩展板蜂鸣器控制

TC software outline design document (mobile group control)
![The most authoritative Lei niukesi in history --- embedded Ai Road line [yyds]](/img/0c/95930c7c49c5ebeee9c179c035b317.jpg)
The most authoritative Lei niukesi in history --- embedded Ai Road line [yyds]

开课报名|「Takin开源特训营」第一期来啦!手把手教你搞定全链路压测!

聊聊大火的多模态项目

Introduction to ground plane in unity
随机推荐
Underlying principle of Concurrency: thread, resource sharing, volatile keyword
How to select embedded hands-on projects and embedded open source projects
On the problem of class member variable pollution in the context of one-time concurrence
optional类,便利函数,创建Optional,Optional对象操作以及Optional流
110. JS event loop and setimmediate, process.nexttick
字符串
stm32mp1 Cortex M4开发篇9:扩展板空气温湿度传感器控制
DSP gossip: how to save the compiled variables on the chip when the variables are defined in the code
Are you still using localstorage directly? The thinnest in the whole network: secondary encapsulation of local storage (including encryption, decryption and expiration processing)
简易的安卓天气app(三)——城市管理、数据库操作
燎原之势 阿里云数据库“百城聚力”助中小企业数智化转型
Form Validation
Polymorphic & class object & registered factory & Reflection & dynamic proxy
Mid 2022 Summary - step by step, step by step
【实战】STM32 FreeRTOS移植系列教程5:FreeRTOS消息队列
Lodash real on demand approach
109. use of usereducer in hooks (counter case)
【实战】STM32 FreeRTOS移植系列教程2:FreeRTOS 互斥信号量
Clipboard learning records and pit encountered
[cloud native | kubernetes] kubernetes configuration (XV)