pinyin

elasticsearch-analysis-pinyin可以用于繁体字吗？谢谢

Elasticsearch • weizhuang 发起了问题 • 2 人关注 • 0 个回复 • 2871 次浏览 • 2018-06-17 12:18 • 来自相关话题

使用ik+pinyin分词后，高亮显示的问题

贡献

Elasticsearch • medcl 回复了问题 • 3 人关注 • 1 个回复 • 6067 次浏览 • 2018-06-05 14:55 • 来自相关话题

请教汉字同音词pinyin分词器搜索问题

贡献

Elasticsearch • hnj1575565068 回复了问题 • 2 人关注 • 1 个回复 • 4488 次浏览 • 2018-04-27 14:29 • 来自相关话题

elasticsearch拼音提示指定字段问题

贡献

Elasticsearch • wengqiankun 回复了问题 • 4 人关注 • 2 个回复 • 5011 次浏览 • 2017-06-21 18:16 • 来自相关话题

ES 的拼音插件多音字有问题

Elasticsearch • huigy 发起了问题 • 1 人关注 • 0 个回复 • 5470 次浏览 • 2017-03-26 13:19 • 来自相关话题

elasticsearch-analysis-pinyin更新至es2.4.1和5.0.0-rc1

Elasticsearch • medcl 发表了文章 • 3 个评论 • 5169 次浏览 • 2016-10-13 21:49 • 来自相关话题

版本分别支持到最新的 es v2.4.1和 es v5.0.0-rc1 新增若干特性，支持多种选项配置，支持 pinyin 的切分，比之前需要结合 ngram 的方式更加准确，如：liudehuaalibaba13zhuanghan->liu,de,hua,a,li,ba,ba,13,zhuang,han，具体配置参加文档： https://github.com/medcl/elast ... inyin 下载： https://github.com/medcl/elast ... eases 欢迎测试：

curl -XPUT http://localhost:9200/medcl/ -d'
{
    "index" : {
        "analysis" : {
            "analyzer" : {
                "pinyin_analyzer" : {
                    "tokenizer" : "my_pinyin"
                    }
            },
            "tokenizer" : {
                "my_pinyin" : {
                    "type" : "pinyin",
                    "keep_separate_first_letter" : false,
                    "keep_full_pinyin" : true,
                    "keep_original" : false,
                    "limit_first_letter_length" : 16,
                    "lowercase" : true
                }
            }
        }
    }
}'

curl http://localhost:9200/medcl/_a ... lyzer
{
  "tokens" : [ {
    "token" : "liu",
    "start_offset" : 0,
    "end_offset" : 1,
    "type" : "word",
    "position" : 0
  }, {
    "token" : "de",
    "start_offset" : 1,
    "end_offset" : 2,
    "type" : "word",
    "position" : 1
  }, {
    "token" : "hua",
    "start_offset" : 2,
    "end_offset" : 3,
    "type" : "word",
    "position" : 2
  }, {
    "token" : "a",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 3
  }, {
    "token" : "b",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 4
  }, {
    "token" : "c",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 5
  }, {
    "token" : "d",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 6
  }, {
    "token" : "liu",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 7
  }, {
    "token" : "de",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 8
  }, {
    "token" : "hua",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 9
  }, {
    "token" : "wo",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 10
  }, {
    "token" : "bu",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 11
  }, {
    "token" : "zhi",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 12
  }, {
    "token" : "dao",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 13
  }, {
    "token" : "shi",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 14
  }, {
    "token" : "shui",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 15
  }, {
    "token" : "ldhabcdliudehuaw",
    "start_offset" : 0,
    "end_offset" : 16,
    "type" : "word",
    "position" : 16
  } ]
}

elasticsearch-analysis-pinyin可以用于繁体字吗？谢谢

Elasticsearch • weizhuang 发起了问题 • 2 人关注 • 0 个回复 • 2871 次浏览 • 2018-06-17 12:18 • 来自相关话题

使用ik+pinyin分词后，高亮显示的问题

Elasticsearch • medcl 回复了问题 • 3 人关注 • 1 个回复 • 6067 次浏览 • 2018-06-05 14:55 • 来自相关话题

请教汉字同音词pinyin分词器搜索问题

Elasticsearch • hnj1575565068 回复了问题 • 2 人关注 • 1 个回复 • 4488 次浏览 • 2018-04-27 14:29 • 来自相关话题

elasticsearch拼音提示指定字段问题

Elasticsearch • wengqiankun 回复了问题 • 4 人关注 • 2 个回复 • 5011 次浏览 • 2017-06-21 18:16 • 来自相关话题

ES 的拼音插件多音字有问题

Elasticsearch • huigy 发起了问题 • 1 人关注 • 0 个回复 • 5470 次浏览 • 2017-03-26 13:19 • 来自相关话题

elasticsearch-analysis-pinyin更新至es2.4.1和5.0.0-rc1

Elasticsearch • medcl 发表了文章 • 3 个评论 • 5169 次浏览 • 2016-10-13 21:49 • 来自相关话题

curl -XPUT http://localhost:9200/medcl/ -d'
{
    "index" : {
        "analysis" : {
            "analyzer" : {
                "pinyin_analyzer" : {
                    "tokenizer" : "my_pinyin"
                    }
            },
            "tokenizer" : {
                "my_pinyin" : {
                    "type" : "pinyin",
                    "keep_separate_first_letter" : false,
                    "keep_full_pinyin" : true,
                    "keep_original" : false,
                    "limit_first_letter_length" : 16,
                    "lowercase" : true
                }
            }
        }
    }
}'

curl http://localhost:9200/medcl/_a ... lyzer
{
  "tokens" : [ {
    "token" : "liu",
    "start_offset" : 0,
    "end_offset" : 1,
    "type" : "word",
    "position" : 0
  }, {
    "token" : "de",
    "start_offset" : 1,
    "end_offset" : 2,
    "type" : "word",
    "position" : 1
  }, {
    "token" : "hua",
    "start_offset" : 2,
    "end_offset" : 3,
    "type" : "word",
    "position" : 2
  }, {
    "token" : "a",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 3
  }, {
    "token" : "b",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 4
  }, {
    "token" : "c",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 5
  }, {
    "token" : "d",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 6
  }, {
    "token" : "liu",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 7
  }, {
    "token" : "de",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 8
  }, {
    "token" : "hua",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 9
  }, {
    "token" : "wo",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 10
  }, {
    "token" : "bu",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 11
  }, {
    "token" : "zhi",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 12
  }, {
    "token" : "dao",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 13
  }, {
    "token" : "shi",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 14
  }, {
    "token" : "shui",
    "start_offset" : 2,
    "end_offset" : 31,
    "type" : "word",
    "position" : 15
  }, {
    "token" : "ldhabcdliudehuaw",
    "start_offset" : 0,
    "end_offset" : 16,
    "type" : "word",
    "position" : 16
  } ]
}

更多...

elasticsearch-analysis-pinyin可以用于繁体字吗？谢谢

使用ik+pinyin分词后，高亮显示的问题

请教汉字同音词pinyin分词器搜索问题

elasticsearch拼音提示指定字段问题

ES 的拼音插件多音字有问题

elasticsearch-analysis-pinyin更新至es2.4.1和5.0.0-rc1

elasticsearch-analysis-pinyin可以用于繁体字吗？谢谢

使用ik+pinyin分词后，高亮显示的问题

请教汉字同音词pinyin分词器搜索问题

elasticsearch拼音提示指定字段问题

ES 的拼音插件多音字有问题

elasticsearch-analysis-pinyin更新至es2.4.1和5.0.0-rc1

话题描述

活动推荐

相关话题

4 人关注该话题