Copied to clipboard

Flag this post as spam?

This post will be reported to the moderators as potential spam to be looked at


  • Rotem Orbach 105 posts 578 karma points
    Aug 29, 2018 @ 13:08
    Rotem Orbach
    0

    Lucene Search Engine results Hebrew

    Hi, I have implemented Lucene Search Engine results in my website, and it works flawlessly, except one distinct issue: When I try to search a Hebrew term that has double quotation marks in it, I get no results. For example: דפ"ר

    this word should return results, but it doesn't.

    Note: wild card works: דפ*

    Any thoughts?

    thanks, Rotem

  • Yakov Lebski 427 posts 1654 karma points
    Sep 10, 2018 @ 16:33
    Yakov Lebski
    0

    You can try to use escape function - please see Lucene documentation https://lucene.apache.org/core/294/queryparsersyntax.html

  • Ismail Mayat 4511 posts 10059 karma points MVP 2x admin c-trib
    Sep 11, 2018 @ 08:43
    Ismail Mayat
    0

    Rotem,

    Are you using hebrew analyser? See https://our.umbraco.com/packages/developer-tools/hebrew-search/ this may get round the problem. Ideally when doing multilingual you should use the language specific analyser if available.

    Regards

    Ismail

  • Yakov Lebski 427 posts 1654 karma points
    Sep 13, 2018 @ 08:31
    Yakov Lebski
    0

    just note that HebMorph is not free for commercial use

    from HebMorph website:

    If you are developing a commercial application, please contact us to discuss a commercial license. This helps us support development of future versions that make Hebrew search even better. Paid licenses are bundled with a proprietary dictionary, which replaces the AGPL3'd hspell used by the open-source version, as well as integration support.

  • Ismail Mayat 4511 posts 10059 karma points MVP 2x admin c-trib
    Sep 13, 2018 @ 09:02
    Ismail Mayat
    0

    ah did not know that good spot.

  • Rotem Orbach 105 posts 578 karma points
    Sep 13, 2018 @ 08:34
    Rotem Orbach
    0

    Hi,

    Thank you for the answers. I tried https://our.umbraco.com/packages/developer-tools/hebrew-search/

    and it does not solve the issue :(

    also, when I debug the search I can clearly see that the word is already escaped (דפ\"ר) Meaning: I can see the slash being added to the string, but still, no value is being returned.

    any other thoughts? :)

    thanks

  • Ismail Mayat 4511 posts 10059 karma points MVP 2x admin c-trib
    Sep 13, 2018 @ 09:02
    Ismail Mayat
    100

    Rotem,

    I would reach out to itamar https://twitter.com/synhershko he ported lucene to lucene.net and put together HebMorph also he is native hebrew speaker.

    Regards

    Ismail

  • Rotem Orbach 105 posts 578 karma points
    Sep 20, 2018 @ 12:16
    Rotem Orbach
    0

    Thanks, I marked your answer as solution, but unfortunately, Itamar did not replay, so I guess i'll have to face this one myself.

    :)

  • This forum is in read-only mode while we transition to the new forum.

    You can continue this topic on the new forum by tapping the "Continue discussion" link below.

Please Sign in or register to post replies