Uploaded image for project: 'Flume'
  1. Flume
  2. FLUME-2390

Flume-ElasticSearch Data gets posted multiple times when one of the event fail validation at elastic search sink for JSON Data

    XMLWordPrintableJSON

Details

    • Bug
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 1.4.0
    • None
    • Sinks+Sources
    • None
    • CDH4.5

    Description

      Hi,

      I am using Elastic Search Sink to post JSON data. I used the temporary fix mentioned in https://issues.apache.org/jira/browse/FLUME-2126 to get JSON data posted to elastic search. When one of the message fail validation at ElasticSearch mapping for JSON data ( For example - getting empty message) , Flume seems to post the entire batch again and again until I restart Flume. Because of that no of events went from an avg of 100 to avg of 2000 per 10 minutes. As a temporary fix I set a header in my FlumeHTTP Source for non valid JSON and used a interceptor to send data to multiple ESSINKS which has different index names.

      Attachments

        Activity

          People

            Unassigned Unassigned
            deepakas1 Deepak Subhramanian
            Votes:
            0 Vote for this issue
            Watchers:
            9 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: