加入 Gitee
与超过 1200万 开发者一起发现、参与优秀开源项目,私有仓库也完全免费 :)
免费加入
文件
克隆/下载
jcseg.properties 2.01 KB
一键复制 编辑 原始数据 按行查看 历史
# jcseg properties file.
# bug report chenxin <chenxin619315@gmail.com>
# Jcseg function
#maximum match length. (5-7)
jcseg.maxlen=5
#recognized the chinese name.(1 to open and 0 to close it)
jcseg.icnname=1
#maximum chinese word number of english chinese mixed word.
jcseg.mixcnlen=2
#maximum length for pair punctuation text.
jcseg.pptmaxlen=15
#maximum length for chinese last name andron.
jcseg.cnmaxlnadron=1
#Wether to clear the stopwords.(set 1 to clear stopwords and 0 to close it)
jcseg.clearstopword=0
#Wether to convert the chinese numeric to arabic number. (set to 1 open it and 0 to close it)
# like '\u4E09\u4E07' to 30000.
jcseg.cnnumtoarabic=1
#Wether to convert the chinese fraction to arabic fraction.
jcseg.cnfratoarabic=1
#Wether to keep the unrecognized word. (set 1 to keep unrecognized word and 0 to clear it)
jcseg.keepunregword=1
#Wether to start the secondary segmentation for the complex english words.
jcseg.ensencondseg = 1
#min length of the secondary simple token. (better larger than 1)
jcseg.stokenminlen = 2
#thrshold for chinese name recognize.
# better not change it before you know what you are doing.
jcseg.nsthreshold=1000000
#The punctuations that will be keep in an token.(Not the end of the token).
jcseg.keeppunctuations=@#%.&+
####about the lexicon
#prefix of lexicon file.
lexicon.prefix=lex
#suffix of lexicon file.
lexicon.suffix=lex
#abusolte path of the lexicon file.
#Multiple path support from jcseg 1.9.2, use ';' to split different path.
#example: lexicon.path = /home/chenxin/lex1;/home/chenxin/lex2 (Linux)
# : lexicon.path = D:/jcseg/lexicon/1;D:/jcseg/lexicon/2 (WinNT)
lexicon.path={jar.dir}/lexicon
#lexicon.path=/java/JavaSE/jcseg/lexicon
#Wether to load the modified lexicon file auto.
lexicon.autoload=0
#Poll time for auto load. (seconds)
lexicon.polltime=120
####lexicon load
#Wether to load the part of speech of the entry.
jcseg.loadpos=1
#Wether to load the pinyin of the entry.
jcseg.loadpinyin=0
#Wether to load the synoyms words of the entry.
jcseg.loadsyn=1
Loading...
马建仓 AI 助手
尝试更多
代码解读
代码找茬
代码优化