Uploaded image for project: 'Apache Arrow'
  1. Apache Arrow
  2. ARROW-13253

[C++][FlightRPC] Segfault when sending record batch >2GB

Details

    Description

      When sending a record batch > 2GiB, the server will segfault. Although Flight checks for this case and returns an error, it turns out that gRPC always tries to increment the refcount of the result buffer whether the serialization handler returned successfully or not:

      // From gRPC 1.36
      Status CallOpSendMessage::SendMessagePtr(const M* message,
                                               WriteOptions options) {
        msg_ = message;
        write_options_ = options;
        // Store the serializer for later since we have access to the message
        serializer_ = [this](const void* message) {
          bool own_buf;
          // TODO(vjpai): Remove the void below when possible
          // The void in the template parameter below should not be needed
          // (since it should be implicit) but is needed due to an observed
          // difference in behavior between clang and gcc for certain internal users
          Status result = SerializationTraits<M, void>::Serialize(
              *static_cast<const M*>(message), send_buf_.bbuf_ptr(), &own_buf);
          if (!own_buf) {
            // XXX(lidavidm): This should perhaps check result.ok(), or Serialize should
            // unconditionally initialize send_buf_
            send_buf_.Duplicate();
          }
          return result;
        };
        return Status();
      }
      

      Hence when Flight returns an error without initializing the buffer, we get a segfault.

      Originally reported on StackOverflow: https://stackoverflow.com/questions/68230146/pyarrow-flight-do-get-segfault-when-pandas-dataframe-over-3gb

      Attachments

        Activity

          People

            lidavidm David Li
            lidavidm David Li
            Votes:
            0 Vote for this issue
            Watchers:
            3 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Time Tracking

                Estimated:
                Original Estimate - Not Specified
                Not Specified
                Remaining:
                Remaining Estimate - 0h
                0h
                Logged:
                Time Spent - 3h
                3h

                Slack

                  Issue deployment