Update readme.md

This commit is contained in:
Yixuan Weng
2022-04-07 20:56:04 +08:00
Committer: GitHub
Parent: bbe7423c8c
Commit: e277db8640

@@ -27,7 +27,7 @@
## Word Vector Comparison
### Medical Word Vectors
```
#### wv1.most_similar('海马')
#### Out[30]:
@@ -55,10 +55,10 @@
('头孢克洛缓释片', 0.5178096294403076),
('头胞克洛', 0.5159974098205566),
('罗红霉素', 0.5115748643875122)]
```
### General-Purpose Word Vectors (https://github.com/Embedding/Chinese-Word-Vectors)
```
#### wv2.most_similar('海马')
#### Out[31]:
@@ -72,13 +72,12 @@
('海马回', 0.5352568030357361),
('北汽', 0.5325318574905396),
('小海马', 0.5315144062042236)]
#### wv2.most_similar('头孢')
#### Out[33]:
[('头孢拉定', 0.7558029294013977),
('头孢菌素', 0.7490127086639404),
('头孢类', 0.7476578950881958),
('头孢氨苄', 0.7415952682495117),
('头孢曲松钠', 0.7406224608421326),
('头孢哌酮', 0.7398018836975098),
('头孢噻肟钠', 0.7393568158149719),
('头孢噻吩', 0.7348008751869202),
('头孢哌酮钠', 0.729317843914032),
('氨苄', 0.7292327284812927)]
```
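For readers unfamiliar with the scores in these listings: gensim's `most_similar` ranks vocabulary entries by cosine similarity to the query word's vector. The following is a minimal pure-Python sketch of that ranking, not the repository's code. The helper names `cosine` and `most_similar` and the 3-dimensional toy vectors are hypothetical and chosen only for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def most_similar(vectors, word, topn=10):
    """Rank every other word by cosine similarity to `word`, highest first."""
    query = vectors[word]
    scored = [
        (other, cosine(query, vec))
        for other, vec in vectors.items()
        if other != word  # the query word itself is excluded from the results
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:topn]


# Hypothetical toy vectors (assumption); the real medical model uses 512 dimensions.
toy_vectors = {
    "头孢": [1.0, 0.0, 0.0],
    "头孢拉定": [0.9, 0.1, 0.0],
    "罗红霉素": [0.5, 0.5, 0.0],
    "北汽": [0.0, 0.0, 1.0],
}

result = most_similar(toy_vectors, "头孢", topn=2)
print(result)
```

A score near 1 means the two vectors point in almost the same direction, so the lower scores in the medical listing and the higher ones in the general-purpose listing are only comparable within each model, not across them.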
### This medical word-vector set contains 278,256 biomedical terms, has 512 dimensions, and was trained with gensim.
```python