Details
-
Bug
-
Resolution: Fixed
-
High
-
5.1, 5.2-rc1
-
None
-
eZ Publish 5.1 / eZ Find 5.1 (SOLR 3.6.1)
Description
eZ Find's autocomplete functionality does not work, on both backend and frontend siteaccesses, with Kanji and Hiragana Japanese characters. However, is does work with katakana characters.
Steps to reproduce:
1. Configure CJKTokenizer in solr. Following SOLR's example (http://svn.apache.org/repos/asf/lucene/dev/branches/lucene_solr_3_6/solr/example/solr/conf/schema.xml), I added the following block to ./ezpublish_legacy/extension/ezfind/java/solr/conf/schema.xml:
<fieldType name="text_cjk" class="solr.TextField" positionIncrementGap="100"> <analyzer> <tokenizer class="solr.StandardTokenizerFactory"/> <!-- normalize width before bigram, as e.g. half-width dakuten combine --> <filter class="solr.CJKWidthFilterFactory"/> <!-- for any non-CJK --> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.CJKBigramFilterFactory"/> </analyzer> </fieldType>
...just after:
<fieldtype name="geohash" class="solr.GeoHashField"/>
Please note that you must re-start SOLR for the changes to take effect. Re-indexing in not necessary, though.
2. Create Japanese content. For the sake of completeness, I created content in Kanji, Hiragana and Katakana:
Kanji: 漢字(かんじ) no auto-complete
Hiragana: ひらがな no auto-complete
Katakana: カタカナ auto-complete works