Uploaded image for project: 'eZ Publish / Platform'
  1. eZ Publish / Platform
  2. EZP-31248

SolR breaks when content contains ascii codes < 32

    XMLWordPrintable

Details

    • Icon: Bug Bug
    • Resolution: Fixed
    • Icon: Medium Medium
    • 3.1.0, 2.5.10, 3.0.4
    • None
    • None
    • None

    Description

      Steps to reproduce:

      1. I imported a document, which seems to have a special character in it into a text field of a content object via Public API. (Find the text attached to this ticket)
      2. I run ezplatform:reindex

      It fails with the message

       Wrong HTTP status received from Solr: 400 on http://solr:8983/solr/snh_behat
            EzSystems\EzPlatformSolrSearchEngine\Gateway\Endpoint::__set_state(array(
               'scheme' => 'http',
               'user' => NULL,
               'pass' => NULL,
               'host' => 'solr',
               'port' => 8983,
               'path' => '/solr',
               'core' => 'snh_behat',
            ))
            EzSystems\EzPlatformSolrSearchEngine\Gateway\Message::__set_state(array(
               'headers' => 
              array (
                'version' => '1.1',
                'status' => 400,
                'Content-Type' => 'text/plain;charset=utf-8',
                'Content-Length' => '285',
              ),
               'body' => '{"responseHeader":{"status":400,"QTime":2},"error":{"metadata":["error-class","org.apache.solr.common.SolrException","root-error-class","com.ctc.wstx.exc.WstxUnexpectedCharException"],"msg":"Illegal character ((CTRL-CHAR, code 19))\\n at [row,col {unknown-source}]: 
      [4,5]","code":400}}
      ...

      Attachments

        Activity

          People

            Unassigned Unassigned
            9dcdb0dd-5caa-459b-93b0-1ce97332c3fa@accounts.ibexa.co Hannes Giesenow
            Votes:
            0 Vote for this issue
            Watchers:
            4 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: