NLP model Bert: from introduction to mastery (2)
2020-11-06 01:22:00 【Elementary school students in IT field】
Named entity recognition
First, install the corresponding BERT package:
pip install bert-base==0.0.9 -i https://pypi.python.org/simple
You can also follow the installation instructions on the official site.
The package currently supports:
1. Training named entity recognition models
2. A client/server (C/S) service for named entity recognition
3. All the BERT services of bert_as_service (hanxiao), an excellent open-source project that this package builds on
4. A text classification service
More features will be added over time.
Training a named entity recognition model from the command line:
Installing bert-base generates two command-line tools. One of them, bert-base-ner-train, trains a named entity recognition model; you only need to specify the training data directory and the paths to the BERT files. A help command is also available for viewing the options.
An example training command:
bert-base-ner-train \
-data_dir {your dataset dir} \
-output_dir {training output dir} \
-init_checkpoint {Google BERT model dir} \
-bert_config_file {bert_config.json under the Google BERT model dir} \
-vocab_file {vocab.txt under the Google BERT model dir}
Parameter description
data_dir: the directory containing your data. The training, validation, and test files must be named train.txt, dev.txt, and test.txt respectively; otherwise an error is reported.
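Since a misnamed file only surfaces as an error once training starts, it can help to check the directory first. The following is a minimal sketch, not part of bert-base: the helper name `missing_data_files` is hypothetical, and the only thing taken from the text above is the three required file names.

```python
from pathlib import Path

# File names required in data_dir, as stated in the parameter description.
REQUIRED_FILES = ("train.txt", "dev.txt", "test.txt")


def missing_data_files(data_dir):
    """Return the required file names that are absent from data_dir."""
    root = Path(data_dir)
    return [name for name in REQUIRED_FILES if not (root / name).is_file()]


if __name__ == "__main__":
    import tempfile

    # Demonstrate on a temporary directory that only contains train.txt.
    with tempfile.TemporaryDirectory() as tmp:
        Path(tmp, "train.txt").write_text("", encoding="utf-8")
        print(missing_data_files(tmp))  # ['dev.txt', 'test.txt']
```

An empty list means all three files are present and the directory can be passed as -data_dir.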
The training data is formatted as follows (one Chinese character per line, here from the sentence "海钓比赛地点在厦门与金门之间的海域。"):
海 O
钓 O
比 O
赛 O
地 O
点 O
在 O
厦 B-LOC
门 I-LOC
与 O
金 B-LOC
门 I-LOC
之 O
间 O
的 O
海 O
域 O
。 O
The first item on each line is the character (token) and the second is its label, separated by a single space; be sure to use a space. Separate sentences with a blank line. The program reads your data automatically.
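The format described above (one "token label" pair per line, a blank line between sentences, B-/I- prefixes marking entity spans) can be sketched as a small reader. This is an illustration only and not the loader used by bert-base; the function names `read_sentences` and `extract_entities` are hypothetical, and the sample lines are illustrative: the characters 厦门 (Xiamen) and 金门 (Kinmen) tagged as locations.

```python
def read_sentences(lines):
    """Group 'token label' lines into sentences; a blank line ends a sentence."""
    sentences, current = [], []
    for raw in lines:
        line = raw.strip()
        if not line:
            if current:
                sentences.append(current)
                current = []
            continue
        parts = line.split(" ")
        if len(parts) != 2:
            # Tabs, double spaces, or a missing label all end up here.
            raise ValueError(f"expected 'token label', got: {raw!r}")
        current.append((parts[0], parts[1]))
    if current:
        sentences.append(current)
    return sentences


def extract_entities(sentence):
    """Collect (text, type) spans from the B-/I- labels of one sentence."""
    entities, chars, kind = [], [], None
    for token, label in sentence:
        if label.startswith("B-"):
            if chars:
                entities.append(("".join(chars), kind))
            chars, kind = [token], label[2:]
        elif label.startswith("I-") and chars and label[2:] == kind:
            chars.append(token)
        else:
            # An O label (or a stray I- label) closes any open span.
            if chars:
                entities.append(("".join(chars), kind))
            chars, kind = [], None
    if chars:
        entities.append(("".join(chars), kind))
    return entities


if __name__ == "__main__":
    sample = ["厦 B-LOC", "门 I-LOC", "与 O", "金 B-LOC", "门 I-LOC", "", "海 O", "域 O"]
    for sentence in read_sentences(sample):
        print(extract_entities(sentence))
```

Running a reader like this over train.txt before training is a cheap way to catch lines separated by tabs or double spaces, which the strict two-field check rejects.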
output_dir: the output path for the trained model. Model checkpoints and some label mapping tables are stored here. When running the service, this path can be passed as -ner_model_dir.
init_checkpoint: the Google BERT model you downloaded.
bert_config_file: bert_config.json under the Google BERT model directory.
vocab_file: vocab.txt under the Google BERT model directory.
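Because three of the five parameters point into the same BERT model directory, the command can be assembled from just three paths. The sketch below only builds the argument list shown in the example command; the helper name `build_train_command` and the example paths are hypothetical, and the value passed to -init_checkpoint simply mirrors the placeholder in the example command, so adjust it if your setup expects a specific checkpoint path.

```python
import shlex
from pathlib import Path


def build_train_command(data_dir, output_dir, bert_model_dir):
    """Return the bert-base-ner-train argument list for the given paths."""
    model = Path(bert_model_dir)
    return [
        "bert-base-ner-train",
        "-data_dir", str(data_dir),
        "-output_dir", str(output_dir),
        # Mirrors the example command, which passes the model directory here.
        "-init_checkpoint", str(model),
        "-bert_config_file", str(model / "bert_config.json"),
        "-vocab_file", str(model / "vocab.txt"),
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = build_train_command("data/ner", "output/ner", "models/bert")
    print(shlex.join(cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` once bert-base is installed; passing a list rather than a single string avoids shell quoting problems with paths that contain spaces.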
After training, you can find the results in the output_dir you specified.
For more operations, see:
https://blog.csdn.net/macanv/article/details/85684284
Another wrapper for the BERT model:
https://www.jianshu.com/p/1d6689851622
https://cloud.tencent.com/developer/article/1470051
https://www.h3399.cn/201908/714454.html

Copyright notice
This article was written by [Elementary school students in IT field]. Please include a link to the original when reposting. Thanks.