http://bugzilla.opensuse.org/show_bug.cgi?id=917030

            Bug ID: 917030
           Summary: idzebra-2.0: Issue with ICU token processing
    Classification: openSUSE
           Product: openSUSE Distribution
           Version: 13.2
          Hardware: Other
               URL: http://git.indexdata.com/?p=idzebra.git;a=commit;h=704fd190292cb771df94553b0ed6f9f4b71660a6
                OS: Other
            Status: NEW
          Severity: Normal
          Priority: P5 - None
         Component: Maintenance
          Assignee: ke@suse.com
          Reporter: dcook@prosentient.com.au
        QA Contact: qa-bugs@suse.de
          Found By: ---
           Blocker: ---

Dear Karl,

Bjørn Lie mentioned that you are the maintainer of idzebra-2.0 and that I should assign this bug to you. He also mentioned that he has already prepared an update for the devel repo at https://build.opensuse.org/request/show/284830.

On to the actual bug report:

Zebra 2.0.59 has an issue where search queries containing hyphens are tokenised, but only the first token is used for searching. A search for "Mont-Royal" is actually just a search for "Mont", and a search for "up-to-date" is just a search for "up". This happens even when ICU transformation/transliteration rules are used to remove the hyphen before tokenising.

I reported the bug to Indexdata on February 4th. They fixed it and released version 2.0.60, which includes the fix, on February 7th.

Here is the link to the relevant git commit:
http://git.indexdata.com/?p=idzebra.git;a=commit;h=704fd190292cb771df94553b0...

and the relevant news patch:
http://git.indexdata.com/?p=idzebra.git;a=commitdiff;h=b51184e7cf9eabd2c609f...

I have already tested the fix on a Debian system using the Debian packages that Indexdata provides, and it works as expected.

Please let me know if you need any information on reproducing the bug, or anything else.

Thanks.