Uploaded image for project: 'Apache Drill'
  1. Apache Drill
  2. DRILL-6628

Possible incorporation of Twitter text processing UDFs into Drill-proper

    XMLWordPrintableJSON

    Details

    • Type: Improvement
    • Status: Open
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: Functions - Drill
    • Labels:

      Description

      Per the User mailing list thread — https://mail-archives.apache.org/mod_mbox/drill-user/201807.mbox/%3Caef1979d-f454-4691-8607-8267adf2ac1e%40getmailbird.com%3E — submitting the possibility for the inclusion of drill-twitter-text — https://github.com/hrbrmstr/drill-twitter-text — into Drill-proper.

      Shifting the conversation here since it's more appropriate and CC'ing Charles Givre who posited the idea.

      On the one hand, there are function groups such as "Phonetic" and "String Distance" so there's precedent for inclusion of "non-boring-SQL"-like functions into Drill-proper. On the other hand, this is a small addition of a handful of functions for Twitter text so would this be to niche for a "Twitter" function group?

      As noted in the mailing list thread, there are more "cyber"-ish UDFs on the way (still kinda hoping for that guava upgrade that I saw mentioned in various places in jira), so would the Twitter components be in a "Cyber" group?

      Regardless, I'll take a look at how the functions are structured in the Drill source tree and gladly machinate the necessary changes/inclusions if the result of this discussion results in that decision.

        Attachments

          Activity

            People

            • Assignee:
              Unassigned
              Reporter:
              hrbrmstr Bob Rudis
            • Votes:
              0 Vote for this issue
              Watchers:
              2 Start watching this issue

              Dates

              • Created:
                Updated: