  1. eZ Publish / Platform
  2. EZP-29958

Creating a paragraph longer than 32766 bytes in the RichText Editor fails with a Solr error


Details

    Description

      When Solr is set as the search engine, creating content with a paragraph longer than 32766 bytes in the RichText Editor results in a 500 error.
      Solr also reports the following error:

      error: Document contains at least one immense term in field="article_body_value_s" (whose UTF8 encoding is longer than the max length 32766), all of which were skipped.  Please correct the analyzer to not produce such terms.  The prefix of the first immense term is: '[108, 111, 114, 101, 109, 32, 105, 112, 115, 117, 109, 32, 100, 111, 108, 111, 114, 32, 115, 105, 116, 32, 97, 109, 101, 116, 44, 32, 115, 105]...', original message: bytes can be at most 32766 in length; got 566610. Perhaps the document has an indexed string field (solr.StrField) which is too large
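
The byte values in the error message can be decoded to confirm which text Solr rejected. The following is a minimal sketch, not part of the original report: the variable names are illustrative, and the numbers are copied from the message above.

```python
# Prefix of the first immense term, copied from the Solr error message.
prefix = [108, 111, 114, 101, 109, 32, 105, 112, 115, 117,
          109, 32, 100, 111, 108, 111, 114, 32, 115, 105,
          116, 32, 97, 109, 101, 116, 44, 32, 115, 105]

# Limits quoted in the error message.
MAX_TERM_BYTES = 32766    # "bytes can be at most 32766 in length"
reported_length = 566610  # "got 566610"

# The prefix is a sequence of UTF-8 byte values, so decode it as UTF-8.
decoded_prefix = bytes(prefix).decode("utf-8")
excess_bytes = reported_length - MAX_TERM_BYTES

print(decoded_prefix)  # lorem ipsum dolor sit amet, si
print(excess_bytes)    # 533844
```

The decoded prefix shows that the rejected term is the paragraph text itself, indexed as one term in the `article_body_value_s` field.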
      

      Steps to reproduce
      1. Start from a clean installation of eZ Platform 2.3.
      2. Set Solr as the search engine.
      3. Create an Article and put more than 32766 bytes of text in a single paragraph in the RichText Editor.
      4. Press Publish.
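
For step 3, a paragraph of the required size can be generated instead of pasted by hand. This is a hypothetical helper, assuming plain lorem ipsum filler is acceptable test input. `build_paragraph` and `utf8_length` are illustrative names, not part of eZ Platform. The limit is measured in UTF-8 bytes, so multi-byte characters reach it with fewer characters.

```python
MAX_TERM_BYTES = 32766  # limit quoted in the Solr error message


def utf8_length(text: str) -> int:
    """Return the size of text in bytes once encoded as UTF-8."""
    return len(text.encode("utf-8"))


def build_paragraph(min_bytes: int,
                    sentence: str = "lorem ipsum dolor sit amet, ") -> str:
    """Repeat sentence until the result is at least min_bytes long in UTF-8."""
    repeats = min_bytes // utf8_length(sentence) + 1
    return sentence * repeats


# One byte over the limit is enough to trigger the error condition.
paragraph = build_paragraph(MAX_TERM_BYTES + 1)
print(utf8_length(paragraph) > MAX_TERM_BYTES)  # True
```

The generated text can then be pasted into a single paragraph in the editor.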

      Results
      Publishing the content fails with an exception:

      HTTP 500 Internal Server Error
      Runtime Exception
      
      Wrong HTTP status received from Solr: 400 on http://localhost:8983/solr/collection1
      EzSystems\EzPlatformSolrSearchEngine\Gateway\Endpoint::__set_state(array(
      'scheme' => 'http',
      'user' => NULL,
      'pass' => NULL,
      'host' => 'localhost',
      'port' => 8983,
      'path' => '/solr',
      'core' => 'collection1',
      ))
      EzSystems\EzPlatformSolrSearchEngine\Gateway\Message::__set_state(array(
      'headers' => 
      array (
      'version' => '1.1',
      'status' => 400,
      'Content-Type' => 'text/plain;charset=utf-8',
      'Content-Length' => '880',
      ),
      ...
      

          People

            Unassigned Unassigned
            mateusz.bieniek@ibexa.co Mateusz Bieniek