Uploaded image for project: 'Apache Storm'
  1. Apache Storm
  2. STORM-2147

[Storm SQL] Support automatic spout parallelism based on DataSource metadata

    XMLWordPrintableJSON

Details

    • New Feature
    • Status: Closed
    • Major
    • Resolution: Abandoned
    • None
    • None
    • storm-sql

    Description

      It would be better to receive metadata from Data Source, especially producer which can give some hints to optimize.
      A notable kind of hint is parallelism hint. In storm-kafka we know that normally it's best to set parallelism to same as topic's partition count so that Spouts can pull the data from all partitions in parallel.

      We can apply non-query optimizations start from here.

      Attachments

        Issue Links

          Activity

            People

              Unassigned Unassigned
              kabhwan Jungtaek Lim
              Votes:
              0 Vote for this issue
              Watchers:
              3 Start watching this issue

              Dates

                Created:
                Updated:
                Resolved: