Metron (Retired) / METRON-1677

UUIDv4 GUID is not Lucene friendly

Details

    • Type: Bug
    • Status: Done
    • Priority: Major
    • Resolution: Done

    Description

      Generating the GUID as a UUIDv4 with UUID.randomUUID() in Java is not Lucene friendly: it degrades Elasticsearch and Solr indexing and search performance and can make that performance unpredictable.

      http://blog.mikemccandless.com/2014/05/choosing-fast-unique-identifier-uuid.html
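
To illustrate the concern, here is a minimal sketch contrasting a random UUIDv4 with an ID that carries a fixed-width timestamp prefix, so that IDs created close together share their leading characters. The class `LuceneFriendlyIds` and the method `timePrefixedId` are hypothetical names for illustration only; they are not part of Metron, and this is not the scheme the project adopted.

```java
import java.security.SecureRandom;
import java.util.UUID;

// Hypothetical helper, for illustration only; not part of Metron.
final class LuceneFriendlyIds {
    private static final SecureRandom RANDOM = new SecureRandom();

    // Random UUIDv4, as described in the report: consecutive IDs
    // are unrelated and share no predictable prefix.
    static String randomId() {
        return UUID.randomUUID().toString();
    }

    // Assumed alternative: 16 hex digits of the timestamp, then 16 random
    // hex digits. IDs generated close in time share a long common prefix.
    static String timePrefixedId(long epochMillis) {
        return String.format("%016x-%016x", epochMillis, RANDOM.nextLong());
    }

    // Number of leading characters that two IDs have in common.
    static int commonPrefixLength(String a, String b) {
        int n = Math.min(a.length(), b.length());
        int i = 0;
        while (i < n && a.charAt(i) == b.charAt(i)) {
            i++;
        }
        return i;
    }

    public static void main(String[] args) {
        long now = System.currentTimeMillis();
        System.out.println("random:        " + randomId());
        System.out.println("random:        " + randomId());
        System.out.println("time-prefixed: " + timePrefixedId(now));
        System.out.println("time-prefixed: " + timePrefixedId(now + 1));
    }
}
```

Whether such an ID actually helps in a given deployment would need to be measured; the sketch only shows the structural difference between the two kinds of ID.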

      Moreover, specifying the document ID on the client side reduces indexing throughput, because it triggers Elasticsearch's deduplication check and effectively turns every insert into an upsert. Indexing throughput could therefore be increased by making it possible to disable client-side ID generation. Currently, the way the ID is generated can be overridden at the config level by replacing the default Metron guid via Stellar, but ID generation cannot be disabled completely so that Elasticsearch chooses the ID for each document.
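
The difference between the two indexing modes can be sketched at the level of the REST request line. This assumes the typeless `_doc` endpoint of recent Elasticsearch versions (older versions use a mapping type name in its place); the class `IndexRequestLine` and the index name are hypothetical, and the sketch is not Metron's writer code.

```java
import java.util.Optional;

// Hypothetical helper, for illustration only; not part of Metron.
final class IndexRequestLine {

    // With a client-supplied ID the document is addressed explicitly (PUT),
    // so Elasticsearch has to check whether that ID already exists.
    // Without an ID (POST), Elasticsearch generates one itself.
    static String requestLine(String index, Optional<String> clientId) {
        return clientId
                .map(id -> "PUT /" + index + "/_doc/" + id)
                .orElse("POST /" + index + "/_doc");
    }

    public static void main(String[] args) {
        System.out.println(requestLine("events", Optional.of("abc")));
        System.out.println(requestLine("events", Optional.empty()));
    }
}
```

An option to disable client-side ID generation would correspond to always taking the second branch.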

            People

              Assignee: Unassigned
              Reporter: Ali Nazemian (ali.nazemian)
              Votes: 0
              Watchers: 3

                Time Tracking

                  Original Estimate: Not Specified
                  Remaining Estimate: 0h
                  Time Spent: 3.5h