当前位置:网站首页>Forgotten Jieba participle
Forgotten Jieba participle
2022-06-26 01:37:00 【Green Lantern swordsman】
Stuttering participle , How many people still use ?
One . The principle of stuttering participle
Statistical dictionary 、 Prefix Dictionary 、 Directed acyclic graph 、 From the back forward ( Prefix Dictionary + The nature of dynamic programming determines ) Viterbi predicts the maximum probability path .
(1) Dictionary based string maximum matching algorithm
(2) Based on word frequency statistics : The more adjacent words appear at the same time , The more likely it is to form a word , It is a total segmentation method . At present, the commonly used algorithm is HMM、CRF、 Deep learning and other algorithms .jieba Word segmentation uses dynamic programming to find the maximum probability path , Find out the maximum segmentation combination based on word frequency , For unregistered words , Based on the ability of Chinese characters to form words HMM Model .
problem 3:jieba Word segmentation and word segmentation in knowledge map CoreNLP The principle is different ?
CoreNLP It was released by Stanford , Functionally, it can be used for part of speech tagging and entity recognition , and jieba No entity recognition function ;
Reference link 10
Two . The use of stuttering participles
边栏推荐
- Quickly generate 1~20 natural numbers and easily copy
- 15 `bs object Node name Node name String` get nested node content
- C disk cleaning strategy of win10 system
- Design and process analysis of anti backflow circuit for MOS transistor
- 2021-1-15 摸鱼做的笔记Ctrl+c /v来的
- Have you considered going or staying in graduation season
- leetcode 300. Longest Increasing Subsequence 最长递增子序列 (中等)
- From query database performance optimization to redis cache - talk about cache penetration, avalanche and breakdown
- 《网络是怎么样连接的》读书笔记 - 集线器、路由器和路由器(三)
- Test questions and answers for the 2022 baby sitter (Level 5) examination
猜你喜欢

Qt Cmake 纯C 代码调用系统控制台输入scanf 及 中文输出乱码

Data analysis slicer, PivotTable and PivotChart (necessary in the workplace)

shell正则表达式

The kth largest element in the array
![[Excel知识技能] Excel数据类型](/img/f6/e1ebe033d1a2a266ebda00b10098ed.png)
[Excel知识技能] Excel数据类型

Musk vs. jobs, who is the greatest entrepreneur in the 21st century

STM32 key development foundation

MySQL图书借阅系统项目数据库建库表语句(组合主键、外键设置)

Oracle database startup backup preparation

CityJSON
随机推荐
新库上线 | CnOpenData中国新房信息数据
What is the process of opening a mobile card account? Is it safe to open an account online?
《网络是怎么样连接的》读书笔记 - 集线器、路由器和路由器(三)
MySQL example - comprehensive case (multi condition combined query)
20. Hough line transformation
100ask seven day IOT training camp learning notes - bare metal program framework design
28. contour discovery
JSON基本语法
Data arrangement of machinetranslation
Laravel basic course routing and MVC - routing
手机卡开户的流程是什么?网上开户是否安全么?
24. histogram calculation
MySQL图书借阅系统项目数据库建库表语句(组合主键、外键设置)
shell正则表达式
leetcode 300. Longest Increasing Subsequence 最长递增子序列 (中等)
15 `bs object Node name Node name String` get nested node content
Is it safe to open a securities account online
远程增量同步神器rsync
CityJSON
C disk cleaning strategy of win10 system