Li Hongyi machine learning 2022-hw1
2022-07-23 10:42:00 【Fish tree (◔◡◔)】
Preface
This article is only a record of my own work, though it does borrow ideas from more experienced people ヾ(•ω•`)o
Homework Introduction
Kaggle link: link1
Google Colab link: link2
Given epidemic-related data reported by U.S. states over 4 days, predict the positive-test rate on day 5.
Evaluation metric
Tips 
Feature selection
This part is easier to do first in Jupyter or in the Python console in PyCharm.
Read in the training set and the test set and check their sizes.
As expected, the training set has one more column than the test set: the column that needs to be predicted. Its name is 'tested_positive.4' and its column index is 117, which makes it the last column.
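The check described above can be sketched as follows. This is a minimal illustration on synthetic data: the two small DataFrames stand in for the real CSV files (which would normally be loaded with `pd.read_csv`), and every column name except 'tested_positive.4' is a placeholder.

```python
import numpy as np
import pandas as pd

# Synthetic stand-ins for the real training and test CSV files.
rng = np.random.default_rng(0)
test_df = pd.DataFrame(rng.random((5, 3)), columns=["id", "feat_a", "feat_b"])
train_df = test_df.copy()
train_df["tested_positive.4"] = rng.random(5)

# The training set should have exactly one more column than the test set.
print(train_df.shape, test_df.shape)

# Columns that exist only in the training set are the prediction targets.
extra_cols = [c for c in train_df.columns if c not in test_df.columns]
target_col = extra_cols[0]
target_idx = train_df.columns.get_loc(target_col)
print(target_col, target_idx)
```

On the real data the same lookup should report index 117, as stated above.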
From the homework introduction, the first 37 columns describe the state and are one-hot encoded. Here the pandas corr function is used to measure how strongly each column from column 38 onward correlates with 'tested_positive.4'.
I select the features whose correlation is greater than 0.8. Note that 'tested_positive.4' itself must not be selected, because it is the target used to compute the loss during training.
Then look up the column indices of the selected features.
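A minimal sketch of this selection, assuming a synthetic DataFrame in place of the real training set. The names `state_a`, `strong_feat`, `weak_feat` and the variable `n_state_cols` are made up for illustration; on the real data the number of leading state columns comes from the homework description.

```python
import numpy as np
import pandas as pd

rng = np.random.default_rng(0)
n = 200
target = rng.random(n)
df = pd.DataFrame({
    "state_a": rng.integers(0, 2, n),                        # stands in for the one-hot state columns
    "strong_feat": target + 0.01 * rng.standard_normal(n),   # almost identical to the target
    "weak_feat": rng.random(n),                              # unrelated noise
    "tested_positive.4": target,
})
n_state_cols = 1                      # hypothetical; the text gives 37 for the real data
target_col = "tested_positive.4"

# Correlation of every non-state column with the target column.
corr = df.iloc[:, n_state_cols:].corr()[target_col]

# Keep features above the 0.8 threshold, but never the target itself.
selected = corr[corr > 0.8].index.drop(target_col)

# Convert the selected column names into positional indices.
feat_idx = [df.columns.get_loc(c) for c in selected]
print(list(selected), feat_idx)
```

The threshold here follows the text and keeps only strongly positive correlations; using the absolute value instead would also keep strongly negative ones.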
Modify the select_feat part of the code so that it uses these indices.
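The author's modified code is not shown, so the following is only a sketch of what such a function might look like. The signature, the `feat_idx` parameter, and the assumption that the target sits in the last column of the training and validation arrays are all illustrative.

```python
import numpy as np

def select_feat(train_data, valid_data, test_data, feat_idx=None):
    """Split off the target (last column) and keep only the chosen feature columns."""
    y_train, y_valid = train_data[:, -1], valid_data[:, -1]
    x_train, x_valid, x_test = train_data[:, :-1], valid_data[:, :-1], test_data
    if feat_idx is None:
        # Default: keep every feature column.
        feat_idx = list(range(x_train.shape[1]))
    return x_train[:, feat_idx], x_valid[:, feat_idx], x_test[:, feat_idx], y_train, y_valid

# Tiny synthetic arrays: 4 feature columns, plus a target column for train/valid.
train = np.arange(20, dtype=float).reshape(4, 5)
valid = np.arange(10, dtype=float).reshape(2, 5)
test = np.arange(8, dtype=float).reshape(2, 4)

x_train, x_valid, x_test, y_train, y_valid = select_feat(train, valid, test, feat_idx=[1, 3])
print(x_train.shape, x_valid.shape, x_test.shape)
```

In practice `feat_idx` would be the list of indices found through the correlation analysis.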
Different model architectures and optimizers
Because the data set is not large, making the model deeper may lead to overfitting and poor generalization (my guess), so here only the width of the model is changed.
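As an illustration of changing only the width, here is a small PyTorch regression model whose hidden size is a parameter. The class name `WideModel` and the layer sizes are assumptions, not the author's actual settings.

```python
import torch
import torch.nn as nn

class WideModel(nn.Module):
    """Small fully connected regressor; only the hidden width is meant to be tuned."""

    def __init__(self, input_dim, hidden_dim=64):
        super().__init__()
        self.layers = nn.Sequential(
            nn.Linear(input_dim, hidden_dim),
            nn.ReLU(),
            nn.Linear(hidden_dim, hidden_dim // 2),
            nn.ReLU(),
            nn.Linear(hidden_dim // 2, 1),
        )

    def forward(self, x):
        # (batch, 1) -> (batch,)
        return self.layers(x).squeeze(1)

model = WideModel(input_dim=10, hidden_dim=64)
out = model(torch.randn(8, 10))
print(out.shape)
```

Raising `hidden_dim` widens the network without adding layers, which matches the choice described above.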
The optimizer is changed from the original SGD to Adam. Mu Shen said that Adam is less sensitive to the choice of learning rate, and Li Hongyi's later lectures also explain this.
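Swapping the optimizer is a one-line change in PyTorch. The sketch below uses a toy linear model and synthetic data; the learning rate and the number of steps are placeholders, not values from the author's run.

```python
import torch
import torch.nn as nn

torch.manual_seed(0)
model = nn.Linear(4, 1)
x = torch.randn(32, 4)
y = 2.0 * x.sum(dim=1, keepdim=True)   # synthetic target with a known linear structure
criterion = nn.MSELoss()

# Previously an SGD optimizer would be created here, e.g. torch.optim.SGD(model.parameters(), lr=...).
optimizer = torch.optim.Adam(model.parameters(), lr=1e-2)

losses = []
for _ in range(100):
    optimizer.zero_grad()
    loss = criterion(model(x), y)
    loss.backward()
    optimizer.step()
    losses.append(loss.item())

print(losses[0], losses[-1])
```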
L2 regularization and try more parameters
I did not do much with L2 regularization; I only adjusted the parameters slightly.
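For reference, one common way to add an L2 penalty in PyTorch is the optimizer's `weight_decay` argument. The sketch below is not the author's configuration; the value `1e-4` is a placeholder. To isolate the effect, the loss is built so that the data gradient is zero, which leaves the penalty as the only thing moving the weights.

```python
import torch
import torch.nn as nn

model = nn.Linear(4, 1)
with torch.no_grad():
    model.weight.fill_(1.0)   # fixed starting values so the effect is easy to see
    model.bias.fill_(1.0)

# weight_decay adds weight_decay * w to each parameter's gradient (an L2 penalty).
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3, weight_decay=1e-4)

x = torch.randn(16, 4)
norm_before = torch.sqrt(sum((p ** 2).sum() for p in model.parameters())).item()

for _ in range(10):
    optimizer.zero_grad()
    loss = (model(x) * 0.0).sum()   # data gradient is exactly zero
    loss.backward()
    optimizer.step()

norm_after = torch.sqrt(sum((p ** 2).sum() for p in model.parameters())).item()
print(norm_before, norm_after)
```

With a real loss the penalty acts alongside the data gradient, pulling the weights toward zero while the model still fits the data.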
Submit results
After the steps above, passing the strong baseline should be no problem, and further parameter tuning might improve the score. However, repeatedly tuning parameters just to get a better score is not recommended, because it can hurt the model's generalization.