Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

配置自定义词典无效 #31

Open
suyuanhxx opened this issue Jan 11, 2018 · 3 comments
Open

配置自定义词典无效 #31

suyuanhxx opened this issue Jan 11, 2018 · 3 comments

Comments

@suyuanhxx
Copy link

根据说明中的配置方式配置了两种自定义词典方式均没有生成*.bin文件
solr版本为7.1

  1. schema.xml文件中配置customDictionaryPath
  <fieldType name="text_cn" class="solr.TextField">
    <analyzer type="index">
        <tokenizer class="com.hankcs.lucene.HanLPTokenizerFactory" enableIndexMode="true" customDictionaryPath="E:\Develop\solr-7.1.0\server\solr-webapp\webapp\WEB-INF\classes\hanlp\data\dictionary\custom\Organization.txt"/>
    </analyzer>
    <analyzer type="query">
        <tokenizer class="com.hankcs.lucene.HanLPTokenizerFactory" enableIndexMode="false"/>
    </analyzer>
  </fieldType>

core所在目录为E:\Develop\solr-7.1.0\server\solr\mycore
schema.xmlE:\Develop\solr-7.1.0\server\solr\mycore\conf
2. 在hanlp.properties中配置CustomDictionaryPath也没有生效

root=E:/Develop/solr-7.1.0/server/solr-webapp/webapp/WEB-INF/classes/hanlp/
CustomDictionaryPath=data/dictionary/custom/CustomDictionary.txt; Organization.txt;

hanlp.properties所在目录为E:\Develop\solr-7.1.0\server\solr-webapp\webapp\WEB-INF\classes

这两种配置方式在Organization.txt目录下均没有生成.bin,这样子对吗?

@suyuanhxx
Copy link
Author

windows下无效,在linux下第一种方式有效

@duringall
Copy link

第一种方式是路径错了,\应该是/

@duringall
Copy link

第二种方式root路径配置参考:https://github.com/hankcs/HanLP/tree/1.x。

image

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants