Uploaded image for project: 'Apache DataLab (Retired)'
  1. Apache DataLab (Retired)
  2. DATALAB-2492

[GCP][RStudio][Apache Standalone cluster] Flight data visualization runs with error of collect method usage

ResolvedReopenedClosedVerifiedBlockedon hold
    XMLWordPrintableJSON

Details

    Description

      Preconditions:

      1. Apache Standalone cluster is created on RStudio

      Steps to reproduce:

      1. Run Flight data visualization on Apache Standalone cluster

      Actual result:

      1. Flight data visualization playbook running fails

      > delay <- collect(delay_sql)
      Error in UseMethod("collect") : 
        no applicable method for 'collect' applied to an object of class "SparkDataFrame"
      > delay_melt <- melt(delay[c('Carrier', 'WorkDayDelay', 'WeekendDelay')])
      Error in melt(delay[c("Carrier", "WorkDayDelay", "WeekendDelay")]) : 
        object 'delay' not found

      Expected result:

      1. Flight data visualization playbook running is successful

      Attachments

        1. RStudio-Spark.png
          105 kB
          Vira Vitanska

        Activity

          People

            lfrolov Leonid Frolov
            vvitanska Vira Vitanska
            Votes:
            0 Vote for this issue
            Watchers:
            1 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved: