当前位置:网站首页>NLP model Bert: from introduction to mastery (2)
NLP model Bert: from introduction to mastery (2)
2020-11-06 01:22:00 【Elementary school students in IT field】
Named entity recognition
First download the corresponding bert modular
pip install bert-base==0.0.9 -i https://pypi.python.org/simple
Also can reference Official website Handle
install
What the package now supports
1. Named entity recognition training
2. Services for Named Entity Recognition C/S
3. Inherit excellent open source software :bert_as_service(hanxiao) Of BERT All services
4. Text categorization Services
The following functions will continue to increase
Training named entity recognition model based on named row :
installed bert-base after , Two tools based on named rows will be generated , among bert-base-ner-train Support the training of named entity recognition model , You just need to specify the directory of training data ,BERT The directory of relevant parameters can be . You can use the following command to view help
The examples of training are named as follows :
bert-base-ner-train \
-data_dir {your dataset dir}\
-output_dir {training output dir}\
-init_checkpoint {Google BERT model dir}\
-bert_config_file {bert_config.json under the Google BERT model dir} \
-vocab_file {vocab.txt under the Google BERT model dir}
Parameter description
among data_dir It's the directory where your data is located , Training data , The naming format of validation data and test data is :train.txt, dev.txt,test.txt, Please name the file in this format , Otherwise, an error will be reported .
The format of training data is as follows :
The sea O
fishing O
Than O
" O
The earth O
spot O
stay O
mansion B-LOC
door I-LOC
And O
gold B-LOC
door I-LOC
And O
between O
Of O
The sea O
Domain O
. O
The first word in each line is , The second is its label , Use spaces ’ ' Separate , Please make sure to use spaces . Use blank lines between sentences . The program will automatically read your data .
output_dir: Training model output file path , Model checkpoint And some tag mapping tables will be stored here , This path is used as a service , Can be specified as -ner_model_dir
init_checkpoint: Download Google BERT Model
bert_config_file : Google BERT Under the model bert_config.json
vocab_file: Google BERT Under the model vocab.txt
After training , You can specify in your output_dir To see the results of your training .
More operations :
https://blog.csdn.net/macanv/article/details/85684284
One more bert Encapsulation of models
https://www.jianshu.com/p/1d6689851622
https://cloud.tencent.com/developer/article/1470051
https://www.h3399.cn/201908/714454.html

版权声明
本文为[Elementary school students in IT field]所创,转载请带上原文链接,感谢
边栏推荐
- 6.4 viewresolver view parser (in-depth analysis of SSM and project practice)
- ES6学习笔记(五):轻松了解ES6的内置扩展对象
- This article will introduce you to jest unit test
- 5.4 static resource mapping
- 业内首发车道级导航背后——详解高精定位技术演进与场景应用
- 從小公司進入大廠,我都做對了哪些事?
- Filecoin的经济模型与未来价值是如何支撑FIL币价格破千的
- 数据产品不就是报表吗?大错特错!这分类里有大学问
- OPTIMIZER_ Trace details
- 做外包真的很难,身为外包的我也无奈叹息。
猜你喜欢
随机推荐
6.1.2 handlermapping mapping processor (2) (in-depth analysis of SSM and project practice)
5.4 static resource mapping
怎么理解Python迭代器与生成器?
IPFS/Filecoin合法性:保护个人隐私不被泄露
6.1.1 handlermapping mapping processor (1) (in-depth analysis of SSM and project practice)
Nodejs crawler captures ancient books and records, a total of 16000 pages, experience summary and project sharing
从海外进军中国,Rancher要执容器云市场牛耳 | 爱分析调研
Face to face Manual Chapter 16: explanation and implementation of fair lock of code peasant association lock and reentrantlock
Swagger 3.0 天天刷屏,真的香嗎?
What is the difference between data scientists and machine learning engineers? - kdnuggets
CCR炒币机器人:“比特币”数字货币的大佬,你不得不了解的知识
Keyboard entry lottery random draw
From zero learning artificial intelligence, open the road of career planning!
一篇文章带你了解CSS3圆角知识
如何将数据变成资产?吸引数据科学家
Filecoin主网上线以来Filecoin矿机扇区密封到底是什么意思
Filecoin的经济模型与未来价值是如何支撑FIL币价格破千的
在大规模 Kubernetes 集群上实现高 SLO 的方法
Analysis of react high order components
一篇文章带你了解CSS 分页实例