Notes on Flickr's dataset
2022-07-25 17:34:00 【Wsyoneself】
- Flickr8k image captioning dataset:
- The dataset contains 8,000 images. Each image is paired with five different captions that describe the objects and events in the picture.
- The dataset appears to be intended for the image description task (generating a text description for an image).
- Flickr8k is a good dataset for image caption generation: it is realistic and relatively small.
- Flickr30K is a curated dataset of about 30k images downloaded from Flickr, together with their corresponding descriptive sentences.
- IGEODATA dataset:
- The dataset consists of ten bzip2-compressed files (yfcc100m_dataset-0.bz2 to yfcc100m_dataset-9.bz2). Each file contains 10M lines, and each line contains the following tab-delimited fields, in this order: photo/video identifier, user NSID, user nickname, date taken, date uploaded, capture device, title, description, user tags (comma-separated), machine tags (comma-separated), longitude, latitude, accuracy, photo/video page URL, photo/video download URL, license name, license URL, photo/video server identifier, photo/video farm identifier, photo/video secret, photo/video secret original, extension of the original photo, photo/video marker (0 = photo, 1 = video).
- Fields containing free-form text are URL-encoded. Not all fields have values: in particular, the camera, title, description, tags, EXIF, longitude, latitude, and accuracy fields may be empty. Note that the original extension is only meaningful for photos, not for videos (inspect the first few bytes of a video to determine its file format).
- In addition to the dataset files, a file containing the photo/video identifiers and their corresponding MD5 hashes (yfcc100m_hash.bz2) is also provided. These hashes are used by externally hosted expansion packs (for example, features and annotations) as a layer of indirection that hides direct access to the photo/video information.
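The line layout described above can be read with a few lines of Python. The following is a minimal sketch, not an official loader: the snake_case field names are invented here for illustration, the field order simply follows the list in these notes, and decoding with `unquote_plus` assumes that spaces in the URL-encoded text are written as `+`. Check a few real lines before relying on either assumption.

```python
import bz2
from urllib.parse import unquote_plus

# Field order as listed in the notes above; the names themselves are illustrative.
FIELDS = [
    "identifier", "user_nsid", "user_nickname", "date_taken", "date_uploaded",
    "capture_device", "title", "description", "user_tags", "machine_tags",
    "longitude", "latitude", "accuracy", "page_url", "download_url",
    "license_name", "license_url", "server_id", "farm_id", "secret",
    "secret_original", "extension", "marker",
]

# Free-form text fields that are URL-encoded (assumption: "+" encodes a space).
TEXT_FIELDS = ("user_nickname", "capture_device", "title", "description")


def parse_line(line):
    """Turn one tab-delimited line into a dict, decoding text and empty fields."""
    values = line.rstrip("\r\n").split("\t")
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    record = dict(zip(FIELDS, values))

    for name in TEXT_FIELDS:
        record[name] = unquote_plus(record[name])

    # Tags are comma-separated; split first, then decode each tag.
    for name in ("user_tags", "machine_tags"):
        record[name] = [unquote_plus(t) for t in record[name].split(",") if t]

    # Geo fields may be empty, so map empty strings to None.
    for name in ("longitude", "latitude"):
        record[name] = float(record[name]) if record[name] else None
    record["accuracy"] = int(record["accuracy"]) if record["accuracy"] else None

    # Marker: 0 = photo, 1 = video.
    record["is_video"] = record.pop("marker") == "1"
    return record


def iter_records(path):
    """Stream records from one bzip2-compressed dataset file without unpacking it."""
    with bz2.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            yield parse_line(line)
```

Calling `iter_records("yfcc100m_dataset-0.bz2")` yields one dict per line. Because each file holds 10M lines, streaming is preferable to loading a whole file into memory.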








