{"id":1637,"date":"2016-09-21T19:09:51","date_gmt":"2016-09-21T10:09:51","guid":{"rendered":"http:\/\/kazu.tv\/blog\/?p=1637"},"modified":"2016-11-07T14:03:00","modified_gmt":"2016-11-07T05:03:00","slug":"analyzer-settings-in-elasticsearch","status":"publish","type":"post","link":"https:\/\/kazu.tv\/blog\/2016\/09\/21\/analyzer-settings-in-elasticsearch\/","title":{"rendered":"Elasticsearch \u306e analyzer \u95a2\u9023\u306e\u8a2d\u5b9a\u3067\u77e5\u3063\u3066\u308b\u3053\u3068\u3092\u5168\u3066\u66f8\u304f"},"content":{"rendered":"<p>\u8a73\u3057\u3044\u4eba\u304b\u3089\u898b\u308c\u3070\u5927\u3057\u305f\u5185\u5bb9\u3058\u3083\u306a\u3044\u3068\u601d\u3046\u3051\u3069\u3001\u8abf\u3079\u305f\u308a\u8a66\u884c\u932f\u8aa4\u3057\u305f\u7d50\u679c\u3092\u307e\u3068\u3081\u308b\u3002\uff08\u9593\u9055\u3044\u306a\u3069\u304c\u3042\u308c\u3070\u3001\u3054\u6307\u6458\u9802\u3051\u308b\u3068\u3042\u308a\u304c\u305f\u3044\u3067\u3059\u3002\uff09<\/p>\n<h2>Elasticsearch \u3092\u4f55\u306b\u4f7f\u3063\u3066\u3044\u308b\u304b<\/h2>\n<h3>\u4ed6\u30b5\u30fc\u30d3\u30b9 \u2192 API\/webhook \u2192 \u81ea\u30b5\u30fc\u30d0\u30fc\u306e Elasticsearch<\/h3>\n<p>\u4ee5\u524d\u89e6\u308c\u305f\u3068\u601d\u3046\u3051\u3069\u3001\u958b\u767a\u30d7\u30ed\u30b8\u30a7\u30af\u30c8\u306e\u30c7\u30fc\u30bf\u3092\u5168\u90e81\u7b87\u6240\u306b\u307e\u3068\u3081\u3066\u691c\u7d22\u51fa\u6765\u308b\u3088\u3046\u306b\u3057\u3066\u3044\u3066\u3001\u305d\u3053\u3067 Elasticsearch \u3092\u4f7f\u3063\u3066\u3044\u308b\u3002<\/p>\n<p>\u81ea\u5206\u306e\u4ed5\u4e8b\u5185\u5bb9\u3068\u3057\u3066\u306f\u3001\u6642\u9593\u306e4\u5272\u4f4d\u3092\u53d7\u8a17\u958b\u767a\u306b\u3042\u3066\u3066\u3044\u308b\u3002\u53d7\u8a17\u958b\u767a\u3067\u306f\u3001\u304a\u5ba2\u69d8\u3084\u767a\u6ce8\u5143\u306e\u958b\u767a\u30d9\u30f3\u30c0\u30fc\u306b\u5408\u308f\u305b\u3066\u8272\u3093\u306a\u30c4\u30fc\u30eb\u3092\u4f7f\u308f\u3056\u308b\u3092\u5f97\u306a\u304f\u3066\u3001\u5177\u4f53\u7684\u306b\u306f ChatWork, Backlog, Google Drive \u3068\u304b\u3092\u4f7f\u3046\u3053\u3068\u304c\u7d50\u69cb\u591a\u3044\u3002\u305d\u308c\u306b\u5bfe\u3057\u3066\u3001\u81ea\u5206\u9054\u306e\u958b\u767a\u30c1\u30fc\u30e0\u5185\u90e8\u3067\u306f\u81ea\u5206\u9054\u7528\u306eSlack\u30c1\u30fc\u30e0\u304c\u3042\u308b\u3057\u3001\u305d\u308c\u4ee5\u5916\u306b\u3082\u5225\u306e\u30c4\u30fc\u30eb\u3092\u4f7f\u3063\u3066\u305f\u308a\u3057\u3066\u3044\u308b\u306e\u3067\u3001\u8272\u3093\u306a\u3068\u3053\u308d\u306b\u60c5\u5831\u304c\u5206\u6563\u3057\u304c\u3061\u3002<\/p>\n<p>\u306a\u306e\u3067\u3001\u5404\u30c4\u30fc\u30eb\u306eAPI\uff08\u3042\u308b\u3044\u306fwebhook\uff09\u7d4c\u7531\u3067\u81ea\u5206\u9054\u306e\u30b5\u30fc\u30d0\u30fc\u306b\u30c7\u30fc\u30bf\u3092\u9001\u3063\u3066\u3001\u305d\u308c\u3092 Elasticsearch \u306b\u6d41\u3057\u8fbc\u3093\u3067\u3044\u308b\u3002<\/p>\n<h3>\u691c\u7d22\u5bfe\u8c61<\/h3>\n<p>\u5143\u30c7\u30fc\u30bf\u306e\u7a2e\u985e\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3002<\/p>\n<ul>\n<li>\u30c6\u30ad\u30b9\u30c8\n<ul>\n<li>GitHub\/Bitbucket\/Backlog \u306e issue (\u30c1\u30b1\u30c3\u30c8), PR \u3067\u306e\u8b70\u8ad6<\/li>\n<li>Slack, ChatWork \u306e\u4f1a\u8a71<\/li>\n<li>wiki \u30da\u30fc\u30b8<\/li>\n<\/ul>\n<\/li>\n<li>\u30d5\u30a1\u30a4\u30eb\n<ul>\n<li>Backlog \u306e\u30d5\u30a1\u30a4\u30eb\u7f6e\u304d\u5834\u306b\u3042\u308b\u30d5\u30a1\u30a4\u30eb<\/li>\n<li>Google Drive \u306e\u30d5\u30a1\u30a4\u30eb<\/li>\n<li>Slack \u306b\u8cbc\u3089\u308c\u305f\u30d5\u30a1\u30a4\u30eb<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<p>\u8a00\u8a9e\u306f\u3001\u65e5\u672c\u8a9e\u304c\u30e1\u30a4\u30f3\u3060\u3051\u3069\u3001\u6d77\u5916\u306e\u4eba\u3068\u306e\u3084\u308a\u3068\u308a\u3084\u3001\u82f1\u8a9e\u306eweb\u30da\u30fc\u30b8\uff08StackOverflow\u3068\u304b\uff09\u304b\u3089\u306e\u30b3\u30d4\u30da\u3082\u3042\u308b\u306e\u3067\u3001\u82f1\u8a9e\u3082\u3042\u308b\u7a0b\u5ea6\u4f7f\u308f\u308c\u3066\u3044\u308b\u3002<\/p>\n<p><!--more--><\/p>\n<h2>\u5168\u822c\u7684\u306a\u8a71<\/h2>\n<p>Elasticsearch \u306e\u30a4\u30f3\u30b9\u30c8\u30fc\u30eb\u3068\u304b\u57fa\u672c\u7684\u306a\u8a2d\u5b9a\u3068\u304b\u306f\u3001\u8272\u3093\u306a\u30b5\u30a4\u30c8\u3067\u66f8\u304b\u308c\u3066\u3044\u308b\u306e\u3067\u3001\u3053\u3053\u3067\u306f\u89e6\u308c\u306a\u3044\u3002\u4eca\u56de\u306f\u3001analyzer \u306e\u8a2d\u5b9a\u5468\u308a\u306b\u3064\u3044\u3066\u66f8\u304f\u3002<\/p>\n<h3>\u305d\u306e\u524d\u306b\u3001\u53c2\u8003\u306b\u3057\u305f\u30b5\u30a4\u30c8<\/h3>\n<ul>\n<li>\u672c\u5bb6\u306e<a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/reference\/current\/index.html\" target=\"_blank\">\u30ea\u30d5\u30a1\u30ec\u30f3\u30b9<\/a>: \u304b\u306a\u308a\u8a73\u3057\u304f\u66f8\u3044\u3066\u3042\u308b\u306e\u3067\u3001\u7591\u554f\u70b9\u306f\u307e\u305a\u306f\u3053\u3061\u3089\u3092\u3042\u305f\u308b\u3079\u304d<\/li>\n<li><a href=\"https:\/\/medium.com\/hello-elasticsearch\" target=\"_blank\">Hello! Elasticsearch<\/a>: \u691c\u7d22\u3059\u308b\u3068\u3053\u306e\u30b5\u30a4\u30c8\u304c\u7d50\u69cb\u3072\u3063\u304b\u304b\u308b\u306e\u3067\u304a\u4e16\u8a71\u306b\u306a\u3063\u3066\u3044\u308b\u65b9\u3082\u591a\u3044\u306f\u305a<\/li>\n<li>\u305d\u306e\u4ed6\u3001\u8272\u3005\u691c\u7d22\u3057\u3066\u3001\u8272\u3093\u306a\u65b9\u306e\u30d6\u30ed\u30b0\u8a18\u4e8b\u3068\u304b\u3092\u8aad\u3093\u3060<\/li>\n<\/ul>\n<h3>\u74b0\u5883\u3068\u304b<\/h3>\n<ul>\n<li>Elasticsearch 2.3.5<\/li>\n<li>\u4f7f\u7528\u3057\u3066\u3044\u308b\u30a2\u30ca\u30e9\u30a4\u30b6\u30fc\n<ul>\n<li><a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/plugins\/current\/analysis-kuromoji.html\" target=\"_blank\">Japanese (kuromoji) Analysis Plugin<\/a><\/li>\n<li><a href=\"https:\/\/github.com\/codelibs\/elasticsearch-analysis-kuromoji-neologd#elasticsearch-analysis-kuromoji-neologd\" target=\"_blank\">Elasticsearch Analysis Kuromoji Neologd<\/a> (kuromoji \uff0b\u65b0\u3057\u3044\u8a9e\u5f59\u304c\u5927\u91cf\u306b\u5165\u3063\u305f\u8f9e\u66f8\u3001\u307f\u305f\u3044\u306a\u3084\u3064)<\/li>\n<li><a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/plugins\/current\/analysis-icu.html\" target=\"_blank\">ICU Analysis Plugin<\/a><\/li>\n<\/ul>\n<\/li>\n<li><a href=\"https:\/\/github.com\/elastic\/elasticsearch-mapper-attachments\" target=\"_blank\">Mapper Attachment Type Plugin<\/a> (tika \u3067\u30c6\u30ad\u30b9\u30c8\u3092\u629c\u304d\u51fa\u3057\u3066\u30a4\u30f3\u30c7\u30c3\u30af\u30b9\u306b\u767b\u9332)<\/li>\n<li><a href=\"https:\/\/github.com\/sksamuel\/elastic4s\" target=\"_blank\">elastic4s<\/a> \u3068\u3044\u3046\u30e9\u30a4\u30d6\u30e9\u30ea\u7d4c\u7531\u3067 Scala \u304b\u3089\u4f7f\u7528<\/li>\n<\/ul>\n<p>\u30c7\u30fc\u30bf\u91cf\u3082\u30e6\u30fc\u30b6\u30fc\u6570\u3082\u5c11\u306a\u3044\u306e\u3067\u3001\u8ca0\u8377\u3068\u304b\u306f\u3042\u307e\u308a\u8003\u616e\u3057\u3066\u3044\u306a\u3044\u3002<\/p>\n<h2>\u5b9f\u969b\u306e\u8a2d\u5b9a<\/h2>\n<p>\u307e\u305a\u306f\u8a2d\u5b9a\u5185\u5bb9\u306a\u3069\u304b\u3089\u6652\u3057\u3066\u3001\u305d\u306e\u5f8c\u3001\u8abf\u3079\u305f\u3053\u3068\u3068\u304b\u306b\u3064\u3044\u3066\u66f8\u3044\u3066\u3044\u304f\u3002<\/p>\n<h3>analyzer \u306e\u8a2d\u5b9a<\/h3>\n<p>elasticsearch.yml \u3067\u306f\u3044\u304f\u3064\u304b analyzer \u3092\u5b9a\u7fa9\u3057\u3066\u3044\u308b\u304c\u3001\u30e1\u30a4\u30f3\u3067\u4f7f\u3063\u3066\u3044\u308b analysis-kuromoji-neologd \u3092\u4f7f\u3063\u305f\u3082\u306e\u3092\u4ee5\u4e0b\u306b\u66f8\u304f\u3002<\/p>\n<pre>\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 analyzer:\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ja_kuromoji_neologd:\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 type: custom\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 tokenizer: kuromoji_neologd_tokenizer\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 char_filter: [\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 icu_normalizer,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 html_strip,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 kuromoji_neologd_iteration_mark\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ]\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 filter: [\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 kuromoji_neologd_stemmer,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 kuromoji_part_of_speech,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 kuromoji_neologd_baseform,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 icu_normalizer,\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ja_stop\r\n\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0\u00a0 ]<\/pre>\n<p>\u5404 char filter, (token) filter \u306b\u3064\u3044\u3066\u306f\u3001\u3053\u306e<a href=\"https:\/\/medium.com\/hello-elasticsearch\/elasticsearch-833a0704e44b#.wn8y9gu6e\" target=\"_blank\">\u30da\u30fc\u30b8<\/a>\u306b\u5206\u304b\u308a\u3084\u3059\u304f\u8aac\u660e\u3057\u3066\u3042\u308b\u3002<\/p>\n<p>\u3067\u3001\u3053\u306e analyzer \u306e\u8a2d\u5b9a\u306f\u3001\u540c\u30b5\u30a4\u30c8\u306e\u3053\u3061\u3089\u306e<a href=\"https:\/\/medium.com\/hello-elasticsearch\/elasticsearch-6d69b6ff5c26#.72nhkqf8l\" target=\"_blank\">\u30da\u30fc\u30b8<\/a>\u306e\u5185\u5bb9\u3092\u30d9\u30fc\u30b9\u306b\u3057\u3066\u3044\u308b\u304c\u3001\u82e5\u5e72\u9055\u3046\u70b9\u3082\u3042\u308b\u306e\u3067\u3001\u5f8c\u307b\u3069\u8aac\u660e\u3002<\/p>\n<h3>\u30b9\u30ad\u30fc\u30de\u5b9a\u7fa9<\/h3>\n<p>\u73fe\u72b6\u306f\u3001\u30a4\u30f3\u30c7\u30c3\u30af\u30b9\u4f5c\u6210\u6642\u306b\u8a00\u8a9e\u3092\u56fa\u5b9a\u3057\u3066\u3044\u308b\u3002\u4eca\u5f8c\u306f\u591a\u8a00\u8a9e\u5316\u3057\u305f\u3044\u304c\u3001\u305d\u308c\u3082\u5f8c\u8ff0\u3059\u308b\u3002<\/p>\n<p>\u4ee5\u4e0b\u3001elastic4s \u3067\u306e\u30b9\u30ad\u30fc\u30de\u5b9a\u7fa9\u304b\u3089\u306e\u629c\u7c8b\u3060\u304c\u3001DSL \u306a\u306e\u3067\u307e\u3041\u5927\u4f53\u5185\u5bb9\u306f\u5206\u304b\u308b\u3068\u601d\u3046\u3002<\/p>\n<pre>val messageMapping = mapping(\"message\").fields(\r\n  \"title\" typed StringType analyzer \"ja_kuromoji_neologd\",\r\n  \"body\" typed StringType analyzer \"ja_kuromoji_neologd\",\r\n  \"attached_files\" nested (\r\n    \"attached_file\" typed AttachmentType fields (\r\n      \"content\" typed StringType analyzer \"ja_kuromoji_neologd\" termVector (\"with_positions_offsets\") store (true)\r\n    )\r\n  ) includeInRoot (true)\r\n)<\/pre>\n<h3>\u30c6\u30ad\u30b9\u30c8\u30d5\u30a3\u30fc\u30eb\u30c9\u306e\u691c\u7d22<\/h3>\n<p>\u4ee5\u4e0b\u306e\u3088\u3046\u306a\u691c\u7d22\u6587\u5b57\u5217\u304c\u3042\u3063\u305f\u3068\u3059\u308b\u3002<\/p>\n<p>&#8220;google drive&#8221; \u9023\u643a \u8a2d\u5b9a<\/p>\n<p>\u305d\u308c\u3092\u4ee5\u4e0b\u306e3\u3064\u306b\u5206\u89e3\u3057\u3001<\/p>\n<ul>\n<li>\u300cgoogle drive\u300d\u306e phrase query<\/li>\n<li>\u300c\u9023\u643a\u300d\u306e match query<\/li>\n<li>\u300c\u8a2d\u5b9a\u300d\u306e match query<\/li>\n<\/ul>\n<p>\u305d\u308c\u3092 boolean query \u3067\u307e\u3068\u3081\u3066\u308b\u3002<\/p>\n<h3>\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306e\u691c\u7d22<\/h3>\n<p>\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306e\u691c\u7d22\u3082\u3001\u57fa\u672c\u7684\u306b\u306f\u30c6\u30ad\u30b9\u30c8\u30d5\u30a3\u30fc\u30eb\u30c9\u306e\u691c\u7d22\u3068\u540c\u3058\u3002\u305f\u3060\u3001\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306f\u4e0a\u8ff0\u306e\u30b9\u30ad\u30fc\u30de\u5b9a\u7fa9\u306e\u901a\u308a nested \u306a\u306e\u3067\u3001\u30af\u30a8\u30ea\u30fc\u306e\u6295\u3052\u65b9\u304c\u82e5\u5e72\u9055\u3046\u3002\u8a73\u3057\u304f\u306f\u3001\u30ea\u30d5\u30a1\u30ec\u30f3\u30b9\u3092\u53c2\u7167\u3002<\/p>\n<h3>\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306e\u30cf\u30a4\u30e9\u30a4\u30c8<\/h3>\n<p>\u691c\u7d22\u306b\u30d2\u30c3\u30c8\u3057\u305f\u5834\u5408\u306f\u3001\u30d2\u30c3\u30c8\u3057\u305f\u90e8\u5206\u3092\u30cf\u30a4\u30e9\u30a4\u30c8\u8868\u793a\u3057\u305f\u3044\u304c\u3001\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306e\u5834\u5408\uff08\u304b\u3064 nested \u306e\u5834\u5408\uff1f\uff09\u3001\u666e\u901a\u306b\u3084\u308b\u3068\u3046\u307e\u304f\u3044\u304b\u306a\u304b\u3063\u305f\u306e\u3067\u3001<a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/reference\/current\/search-request-highlighting.html#_highlight_query\" target=\"_blank\">highlight query<\/a> \u3068\u3044\u3046\u306e\u3092\u4f7f\u3063\u305f\u3002<\/p>\n<h2>\u8abf\u3079\u305f\u4e8b\u3068\u304b<\/h2>\n<h3>\u4f7f\u7528\u3057\u3066\u3044\u308b char filter, token filter \u306b\u95a2\u3057\u3066<\/h3>\n<p>\u4e0a\u306e analyzer \u8a2d\u5b9a\u306f\u3001\u3053\u3061\u3089\u306e<a href=\"https:\/\/medium.com\/hello-elasticsearch\/elasticsearch-6d69b6ff5c26#.72nhkqf8l\" target=\"_blank\">\u30da\u30fc\u30b8<\/a>\u306e\u5185\u5bb9\u3068\u82e5\u5e72\u7570\u306a\u308b\u3002\u4e3b\u306a\u9055\u3044\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3002<\/p>\n<ul>\n<li>kuromoji_* -&gt; kuromoji_neologd_*<\/li>\n<li>lowercase, cjk_width (\u5171\u306b token filter) \u3092\u4f7f\u308f\u306a\u3044\u3067 icu_normalizer char filter \u3092\u4f7f\u7528<\/li>\n<li>kuromoji_neologd_baseform \u3092\u4f7f\u7528<\/li>\n<li>icu_normalizer token filter \u3092\u6700\u5f8c\u306b\u3082\u3046\u4e00\u5ea6\u4f7f\u7528<\/li>\n<\/ul>\n<p>\u6700\u521d\u306e kuromoji_neologd_* \u306b\u95a2\u3057\u3066\u306f\u3001\u7279\u306b\u8aac\u660e\u3059\u308b\u3053\u3068\u3082\u306a\u3044\u306e\u3067\u98db\u3070\u3057\u3066\u3001\u6b8b\u308a\u306e3\u3064\u306b\u3064\u3044\u3066\u66f8\u304f\u3002<\/p>\n<h4>lowercase, cjk_width (token filter) -&gt; icu_normalize char filter<\/h4>\n<p><a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/reference\/current\/analysis-cjk-width-tokenfilter.html\" target=\"_blank\">\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8<\/a>\u306b\u3001\u300cCJK Width Token Filter \u306e\u51e6\u7406\u306f NFKC\/NFKD Unicode \u6b63\u898f\u5316\u306e\u30b5\u30d6\u30bb\u30c3\u30c8\u306a\u306e\u3067\u3001analysis-icu plugin \u3092\u4f7f\u3046\u3068\u826f\u3044\u3088\u300d\uff08\u8d85\u8a33\uff09\u3068\u66f8\u3044\u3066\u3042\u308b\u306e\u3067\u3001cjk_width \u3067\u306f\u306a\u304f icu_normalize char filter \u3092\u4f7f\u7528\u3002<\/p>\n<p>\u307e\u305f\u3001 icu_normalize \u306f\u3001\u5927\u6587\u5b57 \u2192 \u5c0f\u6587\u5b57\u306b\u3082\u3057\u3066\u304f\u308c\u308b\u306e\u3067\u3001lowercase token filter \u3082\u4e0d\u8981\u3002<\/p>\n<h3>kuromoji_neologd_baseform \u3092\u4f7f\u7528<\/h3>\n<p>\u5358\u8a9e\u3092\u539f\u578b\u306b\u3057\u3066\u304f\u308c\u308b\u3002\u3053\u308c\u3092\u4f7f\u3046\u304b\u3069\u3046\u304b\u306f\u8981\u4ef6\u306b\u3088\u3063\u3066\u5909\u308f\u308b\u3068\u601d\u3046\u3051\u3069\u3001\u81ea\u5206\u9054\u306e\u7528\u9014\u3060\u3068\u3001\u5358\u8a9e\u3084\u30d5\u30ec\u30fc\u30ba\u5358\u4f4d\u3067\u691c\u7d22\u3059\u308b\u3053\u3068\u304c\u6b86\u3069\u306a\u306e\u3067\u3001\u4f7f\u7528\u3057\u305f\u307b\u3046\u304c\u826f\u3044\u3068\u5224\u65ad\u3002<\/p>\n<h3>icu_normalizer token filter \u3092\u6700\u5f8c\u306b\u3082\u3046\u4e00\u5ea6\u4f7f\u7528<\/h3>\n<p>icu_normalizer \u306f char filter \u3068 token filter \u306e\u3069\u3061\u3089\u3068\u3057\u3066\u3082\u4f7f\u7528\u3067\u304d\u308b\u3002char filter \u3092\u4f7f\u3063\u3066\u308b\u306e\u306b\u3001\u6700\u5f8c\u3067\u3082\u3046\u4e00\u5ea6 token filter \u3068\u3057\u3066\u4f7f\u3063\u3066\u3044\u308b\u7406\u7531\u306f\u3001\u305d\u306e\u524d\u6bb5\u306e kuromoji_neologd_baseform token filter \u304c\u6b63\u898f\u5316\u3055\u308c\u3066\u3044\u306a\u3044\u6587\u5b57\u5217\u3092\u8fd4\u3059\u5834\u5408\u304c\u3042\u308b\u306e\u3067\u3001\u305d\u306e\u5bfe\u7b56\u3002<\/p>\n<p>\u5177\u4f53\u7684\u306b\u306f\u3001linux \u3084 LINUX \u3068\u3044\u3063\u305f\u3001\u8f9e\u66f8\u306b\u8f09\u3063\u3066\u3044\u308b\u3051\u3069\u539f\u578b\u3067\u306a\u3044\u5358\u8a9e\u306e\u5834\u5408\u3001kuromoji_neologd_baseform \u304c Linux \u3068\u3044\u3046\u539f\u578b\uff08\u5148\u982d\u304c\u5927\u6587\u5b57\uff09\u306b\u3057\u3066\u8fd4\u3057\u3066\u304f\u308c\u308b\u304c\u3001Linux \u3068\u3044\u3046\u539f\u578b\u3092\u6e21\u3057\u305f\u5834\u5408\u306f\u3001kuromoji_neologd_baseform \u306f\u51e6\u7406\u3092\u305b\u305a\u306b\u305d\u306e\u307e\u307e linux \u3092\u5f8c\u7d9a\u306b\u6e21\u3059\u3002\u3064\u307e\u308a\u3001\u5165\u529b token \u3068 filter \u5f8c\u306e token \u306e\u95a2\u4fc2\u306f\u4ee5\u4e0b\u306e\u901a\u308a\u3002<\/p>\n<ul>\n<li>linux -&gt; Linux<\/li>\n<li>LINUX -&gt; Linux<\/li>\n<li>Linux -&gt; linux<\/li>\n<\/ul>\n<p>\u3053\u308c\u3060\u3068\u56f0\u308b\u306e\u3067\u3001\u6700\u5f8c\u306b icu_normalizer \u3092\u518d\u5ea6\u304b\u3051\u3066\u3044\u308b\u3002<\/p>\n<p>\u4ee5\u4e0a\u3001\u304b\u306a\u308a\u306f\u3057\u3087\u3063\u305f\u8aac\u660e\u3002<\/p>\n<p>kuromoji_neologd_baseform \u304c\u6b63\u898f\u5316\u3055\u308c\u305f\u5f62\u5f0f\u306e\u3082\u306e\u3092\u8fd4\u3057\u3066\u304f\u308c\u308c\u3070\u305d\u308c\u304c\u3044\u3044\u306e\u304b\u3082\u3057\u308c\u306a\u3044\u3051\u3069\u3001baseform \u3084\u3001\uff08\u4eca\u56de\u306f\u89e6\u308c\u3066\u3044\u306a\u3044\u3051\u3069\uff09synonym \u306e\u5c55\u958b\u3068\u304b\u3001\u6b63\u898f\u5316\u3055\u308c\u305f\u6587\u5b57\u5217\u304c\u8fd4\u3055\u308c\u306a\u3044\u30b1\u30fc\u30b9\u3082\u8003\u616e\u3057\u3066\u3001\u6700\u5f8c\u306b\u5165\u308c\u3068\u3044\u305f\u307b\u3046\u304c\u5b89\u5fc3\u304b\u306a\u3041\u3068\u3044\u3046\u611f\u60f3\u3002<\/p>\n<h3>\u6dfb\u4ed8\u30d5\u30a1\u30a4\u30eb\u306e highlight \u306b\u3064\u3044\u3066<\/h3>\n<p>nest \u3055\u308c\u305f attachment \u30bf\u30a4\u30d7\u306e\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8\u306e\u30cf\u30a4\u30e9\u30a4\u30c8\u3092\u3046\u307e\u304f\u3084\u308b\u306e\u306b\u3001\u3061\u3087\u3063\u3068\u6642\u9593\u304c\u304b\u304b\u3063\u305f\u3002<\/p>\n<p>elasticsearch-mapper-attachments \u306e\u4ee5\u4e0b\u306e issue \u3092\u898b\u308b\u3068\u3001\u3069\u3046\u3082\u51fa\u6765\u306a\u3044\uff1f<\/p>\n<p><a href=\"https:\/\/github.com\/elastic\/elasticsearch-mapper-attachments\/issues\/153\">Highlighting in nested document \u00b7 Issue #153 \u00b7 elastic\/elasticsearch-mapper-attachments<\/a><\/p>\n<p>\u3082\u3046\u3061\u3087\u3044\u691c\u7d22\u3057\u305f\u3089\u3001\u3058\u3087\u30fc\u305f\u306b\u3055\u3093\u304c\u4f5c\u6210\u3057\u305f\u672c\u5bb6\u306e\u4ee5\u4e0b\u306e issue \u304c\u898b\u3064\u304b\u3063\u305f\u3002<\/p>\n<p><a href=\"https:\/\/github.com\/elastic\/elasticsearch\/issues\/5245\">Does not return stored field in nested object \u00b7 Issue #5245 \u00b7 elastic\/elasticsearch<\/a><\/p>\n<p>nested objects \u306e highlight \u306f inner hits \u3092\u4f7f\u3048\u3070\u51fa\u6765\u308b\u3063\u307d\u3044\u3093\u3060\u3051\u3069\u3001attachment \u306e\u5834\u5408\u306f\u305d\u308c\u3067\u826f\u3044\u306e\u304b\u3088\u304f\u5206\u304b\u3089\u306a\u304b\u3063\u305f\u3002\u7d50\u5c40\u3001\u4e0a\u8ff0\u306e\u901a\u308a highlight query \u3092\u4f7f\u7528\u3057\u305f\u3002<\/p>\n<h3>phrase \u691c\u7d22\u306f\u5fc5\u8981<\/h3>\n<p>\u5f62\u614b\u7d20\u89e3\u6790\u3092\u30d9\u30fc\u30b9\u3068\u3057\u305f analyzer \u306e\u5834\u5408\u3001\u4f8b\u3048\u3070\u300c\u81ea\u7136\u8a00\u8a9e\u51e6\u7406\u300d\u3068\u5165\u529b\u3059\u308b\u3068\u3001\u300c\u81ea\u7136\u8a00\u8a9e\u51e6\u7406\u300d\u3060\u3051\u3067\u306a\u304f\u300c\u81ea\u7136\u300d\u300c\u8a00\u8a9e\u300d\u300c\u51e6\u7406\u300d\u306e\u691c\u7d22\u7d50\u679c\u3082\u8fd4\u3063\u3066\u304d\u3066\u3057\u307e\u3046\u306e\u3067\u3001phrase \u691c\u7d22\u51fa\u6765\u308b\u3088\u3046\u306b\u3057\u3066\u304a\u304f\u3068\u3001\u4f59\u5206\u306a\u7d50\u679c\u3092\u5f3e\u3051\u3066\u4fbf\u5229\u3002<\/p>\n<h3>explain=true \u304c\u3081\u3061\u3083\u4fbf\u5229<\/h3>\n<p>analyzer \u306e\u6319\u52d5\u304c\u601d\u3063\u305f\u901a\u308a\u306b\u306a\u3089\u306a\u3044\u6642\u306b\u3001Analyze API \u306b explain=true \u3092\u3064\u3051\u308b\u3068\u3001\u8a73\u7d30\u306a\u60c5\u5831\u304c\u51fa\u3066\u304d\u3066\u3081\u3061\u3083\u4fbf\u5229\u3002\u3053\u306e<a href=\"http:\/\/d.hatena.ne.jp\/Kazuhira\/20160206\/1454729308\" target=\"_blank\">\u30da\u30fc\u30b8<\/a>\u304c\u4e01\u5be7\u306b\u8aac\u660e\u3057\u3066\u3042\u3063\u305f\u3002<\/p>\n<p>\u3061\u306a\u307f\u306b\u3001\u3053\u306e\u6a5f\u80fd\u3001johtani \u3055\u3093\u304c\u4f5c\u3063\u305f\u3089\u3057\u3044\uff08\u3069\u3063\u304b\u306e\u30da\u30fc\u30b8\u306b\u66f8\u3044\u3066\u3042\u3063\u305f\u3051\u3069\u3001URL\u898b\u3064\u304b\u3089\u305a\uff09\u3002\u3044\u3084\u30fc\u3001\u7d20\u6674\u3089\u3057\u3044\u3002\u3061\u306a\u307f\u306b\u3001 twitter \u3067\u4f55\u5ea6\u304b\u8cea\u554f\u306b\u56de\u7b54\u3057\u3066\u3044\u305f\u3060\u3044\u305f\u308a\u3068\u304b\u3001\u500b\u4eba\u7684\u306b\u306f\u5927\u8c37\u3055\u3093\u306b\u8db3\u3092\u5411\u3051\u3066\u5bdd\u3089\u308c\u306a\u3044\u3002\u3069\u3061\u3089\u306b\u304a\u4f4f\u307e\u3044\u304b\u306f\u5b58\u3058\u307e\u305b\u3093\u304c\u3002<\/p>\n<h2>\u4eca\u5f8c\u3084\u308a\u305f\u3044\u4e8b\u3001\u6c17\u306b\u306a\u308b\u4e8b<\/h2>\n<h3>\u3042\u308b\u7a0b\u5ea6\u82f1\u8a9e\u5411\u3051\u306e\u8a2d\u5b9a\u3082\u6df7\u305c\u308b\uff1f<\/h3>\n<p>\u6700\u7d42\u7684\u306b\u306f\u3001\u6b21\u9805\u3067\u66f8\u304f\u3088\u3046\u306b\u591a\u8a00\u8a9e\u5316\u3057\u305f\u3044\u3002\u305f\u3060\u3001\u81ea\u5206\u9054\u306e\u3088\u3046\u306a\u7528\u9014\u3060\u3068\u3001\u3042\u308b\u30c6\u30ad\u30b9\u30c8\u306e\u4e2d\u306b\u73fe\u308c\u308b\u8a00\u8a9e\u3063\u3066\u3001\u6bcd\u56fd\u8a9e\uff08\u65e5\u672c\u8a9e\u3084\u305d\u306e\u4ed6\u306e\u8a00\u8a9e\uff09\uff0b\u82f1\u8a9e\u3068\u3044\u3046\u5f62\u304c\u6b86\u3069\u306a\u306e\u3067\u3001analyzer \u306e\u8a2d\u5b9a\u3067\u3001<\/p>\n<ul>\n<li>\u6bcd\u56fd\u8a9e\u5411\u3051\u306e\u8a2d\u5b9a\uff08\u4eca\u56de\u306e\u8a2d\u5b9a\uff09<\/li>\n<li>\u57fa\u672c\u7684\u306a\u82f1\u8a9e\u5411\u3051\u306e\u8a2d\u5b9a \uff08baseform \u306b\u63c3\u3048\u308b\u3001\u3068\u304b\uff09<\/li>\n<\/ul>\n<p>\u30921\u3064\u306e analyzer \u306b\u307e\u3068\u3081\u3066\u3057\u307e\u3063\u3066\u3082\u826f\u3044\u3093\u3058\u3083\u306a\u3044\u304b\u3001\u3068\u3044\u3046\u30a2\u30a4\u30c7\u30a3\u30a2\u3092\u6301\u3063\u3066\u308b\u306e\u3067\u3001\u4eca\u5ea6\u8a66\u3057\u3066\u307f\u3088\u3046\u3068\u601d\u3046\u3002<\/p>\n<h3>\u591a\u8a00\u8a9e\u5316<\/h3>\n<p>\u524d\u8ff0\u306e Hello! Elasticsearch \u306e\u8457\u8005\u306e\u65b9\u304c Elasticsearch \u52c9\u5f37\u4f1a\u3067\u767a\u8868\u3057\u305f<a href=\"https:\/\/speakerdeck.com\/kunihikokido\/elasticsearch-ri-ben-yu-sukimaresuhuan-jing-gou-zhu-to-tuideniduo-yan-yu-dui-ying\" target=\"_blank\">\u8cc7\u6599<\/a>\u304c\u53c2\u8003\u306b\u306a\u308a\u305d\u3046\u3002\u3053\u306e\u65b9\u304c\u958b\u767a\u3055\u308c\u3066\u3044\u308b Siba \u3068\u3044\u3046\u30b5\u30fc\u30d3\u30b9\u306e\u5ba3\u4f1d\u3082\u517c\u306d\u305f\u767a\u8868\u306a\u306e\u3067\u3001\u809d\u5fc3\u306a\u3068\u3053\u308d\u306f\u7d50\u69cb\u30dc\u30ab\u3055\u308c\u3066\u3044\u308b\u3051\u3069\u3001\u3042\u308b\u7a0b\u5ea6\u53c2\u8003\u306b\u306a\u3063\u305f\u3002<\/p>\n<p>\u4f7f\u7528\u7528\u9014\u3068\u3057\u3066\u3001\u4f8b\u3048\u3070\u30d5\u30a3\u30ea\u30d4\u30f3\u3067\u306e\u958b\u767a\u30c1\u30fc\u30e0\u3060\u3068\u3001\u30bf\u30ac\u30ed\u30b0\u53c8\u306f\u30d3\u30b5\u30e4\u8a9e\uff0b\u82f1\u8a9e\u3060\u3051\u3069\u3001\u305d\u3053\u306b\u65e5\u672c\u8a9e\u304c\u4ea4\u3058\u308b\u3053\u3068\u306f\u6b86\u3069\u306a\u3044\u306e\u3067\u3001<\/p>\n<ul>\n<li>\u4f7f\u7528\u3059\u308b\u30c1\u30fc\u30e0\u5358\u4f4d\u3067\u7570\u306a\u308b\u30a4\u30f3\u30c7\u30c3\u30af\u30b9<\/li>\n<li>\u30a4\u30f3\u30c7\u30c3\u30af\u30b9\u5358\u4f4d\u3067\u5404\u30c1\u30fc\u30e0\u306e\u6bcd\u56fd\u8a9e\uff0b\u82f1\u8a9e\u304c\u4f7f\u3048\u308b\u8a2d\u5b9a<\/li>\n<li>\u3069\u306e\u300c\u6bcd\u56fd\u8a9e\u300d\u3092\u4f7f\u3046\u304b\u306f\u81ea\u52d5\u5224\u5225<\/li>\n<\/ul>\n<p>\u307f\u305f\u3044\u306a\u5f62\u306b\u3057\u3088\u3046\u304b\u3068\u8003\u3048\u3066\u3044\u308b\u3002<\/p>\n<h3>ingest \u306a\u306b\u305d\u308c\uff1f<\/h3>\n<p>Mapper Attachments Plugin \u304c\u3001<a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/plugins\/current\/mapper-attachments.html\" target=\"_blank\">Elasticsearch 5.0.0 \u3067 deprecated \u306b\u306a\u308a<\/a>\u3001<a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/plugins\/master\/ingest-attachment.html\" target=\"_blank\">Ingest Attachment Processor Plugin<\/a> \u306b\u306a\u308b\u3068\u306e\u3053\u3068\u3002Elasticsearch \u3063\u3066\u4fbf\u5229\u3060\u3051\u3069\u3001\u30d0\u30fc\u30b8\u30e7\u30f3\u30a2\u30c3\u30d7\u304c\u901f\u3044\u3057\u3001\u4ed5\u69d8\u5909\u66f4\u3082\u3081\u3061\u3083\u591a\u3044\u306e\u3067\u3001\u3064\u3044\u3066\u3044\u304f\u306e\u304c\u5927\u5909\u3002<\/p>\n<p>Ingest \u306b\u95a2\u3059\u308b\u30c9\u30ad\u30e5\u30e1\u30f3\u30c8\u306f<a href=\"https:\/\/www.elastic.co\/guide\/en\/elasticsearch\/reference\/master\/ingest.html\" target=\"_blank\">\u3053\u3061\u3089<\/a>\u3092\u53c2\u7167\u3002<\/p>\n<h2>\u307e\u3068\u3081<\/h2>\n<p>\u307e\u3068\u3081\u3089\u3057\u3044\u8a71\u306f\u7279\u306b\u7121\u3044\u3051\u3069\u3001Elasticsearch \u306e\u65e5\u672c\u8a9e\u306b\u95a2\u3059\u308b\u60c5\u5831\u3063\u3066\u7d50\u69cb\u30d0\u30e9\u30d0\u30e9\u306a\u306e\u3067\u3001\u8abf\u3079\u305f\u3053\u3068\u3092\u307e\u3068\u3081\u305f\u3002\u6c17\u304c\u5411\u3051\u3070\uff08\u591a\u5206\u5411\u304b\u306a\u3044\uff09\u8ffd\u8a18\u4e88\u5b9a\u3002<\/p>\n<p>\u8ffd\u8a18\uff1a\u591a\u8a00\u8a9e\u5316\u306b\u95a2\u3057\u3066\u3061\u3087\u3063\u3068\u66f8\u3044\u3066\u307f\u305f\u3002<\/p>\n<ul>\n<li><a href=\"http:\/\/kazu.tv\/blog\/2016\/11\/07\/i18n-elasticsearch-part-1\/\">Elasticsearch\u591a\u8a00\u8a9e\u5316\u305d\u306e1 \u2013 K blog<\/a><\/li>\n<\/ul>\n","protected":false},"excerpt":{"rendered":"<p>\u8a73\u3057\u3044\u4eba\u304b\u3089\u898b\u308c\u3070\u5927\u3057\u305f\u5185\u5bb9\u3058\u3083\u306a\u3044\u3068\u601d\u3046\u3051\u3069\u3001\u8abf\u3079\u305f\u308a\u8a66\u884c\u932f\u8aa4\u3057\u305f\u7d50\u679c\u3092\u307e\u3068\u3081\u308b\u3002\uff08\u9593\u9055\u3044\u306a\u3069\u304c\u3042\u308c\u3070\u3001\u3054\u6307\u6458\u9802\u3051\u308b\u3068\u3042\u308a\u304c\u305f\u3044\u3067\u3059\u3002\uff09 Elasticsearch \u3092\u4f55\u306b\u4f7f\u3063\u3066\u3044\u308b\u304b \u4ed6\u30b5\u30fc\u30d3\u30b9 \u2192 API\/webh&hellip;<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_monsterinsights_skip_tracking":false,"_monsterinsights_sitenote_active":false,"_monsterinsights_sitenote_note":"","_monsterinsights_sitenote_category":0,"footnotes":""},"categories":[15],"tags":[934],"class_list":["post-1637","post","type-post","status-publish","format-standard","hentry","category-15","tag-elasticsearch"],"_links":{"self":[{"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/posts\/1637","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/comments?post=1637"}],"version-history":[{"count":5,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/posts\/1637\/revisions"}],"predecessor-version":[{"id":1654,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/posts\/1637\/revisions\/1654"}],"wp:attachment":[{"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/media?parent=1637"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/categories?post=1637"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/kazu.tv\/blog\/wp-json\/wp\/v2\/tags?post=1637"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}