Apache Arrow / ARROW-11472

[Python][CI] Kartothek integrations build is failing with numpy 1.20

Details

    Description

      See e.g. https://github.com/ursacomputing/crossbow/runs/1804464537. The failure looks like:

       ____________ ERROR collecting tests/io/dask/dataframe/test_read.py _____________
      tests/io/dask/dataframe/test_read.py:185: in <module>
          @pytest.mark.parametrize("col", get_dataframe_not_nested().columns)
      kartothek/core/testing.py:65: in get_dataframe_not_nested
          "unicode": pd.Series(["Ö"], dtype=np.unicode),
      /opt/conda/envs/arrow/lib/python3.7/site-packages/pandas/core/series.py:335: in __init__
          data = sanitize_array(data, index, dtype, copy, raise_cast_failure=True)
      /opt/conda/envs/arrow/lib/python3.7/site-packages/pandas/core/construction.py:480: in sanitize_array
          subarr = _try_cast(data, dtype, copy, raise_cast_failure)
      /opt/conda/envs/arrow/lib/python3.7/site-packages/pandas/core/construction.py:587: in _try_cast
          maybe_cast_to_integer_array(arr, dtype)
      /opt/conda/envs/arrow/lib/python3.7/site-packages/pandas/core/dtypes/cast.py:1723: in maybe_cast_to_integer_array
          casted = np.array(arr, dtype=dtype, copy=copy)
      E   ValueError: invalid literal for int() with base 10: 'Ö'
      

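      So it seems that pd.Series(["Ö"], dtype=np.unicode) stopped working with numpy 1.20.0: per the traceback above, pandas routes the string value through its integer-casting path (maybe_cast_to_integer_array), and the conversion fails in int().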
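
A minimal sketch of one possible workaround for the failing test fixture, not a fix confirmed in this issue: NumPy 1.20 deprecated the `np.unicode` alias in favour of the builtin `str` (or `np.str_`), so the dtype can be spelled without the alias. The variable names below are illustrative only.

```python
import numpy as np
import pandas as pd

# Use the builtin `str` instead of the deprecated `np.unicode` alias
# when asking pandas for a string-valued Series.
unicode_series = pd.Series(["Ö"], dtype=str)

# The NumPy-level equivalent uses the scalar type `np.str_`.
unicode_array = np.array(["Ö"], dtype=np.str_)

# The value round-trips unchanged as a Python string in both cases.
assert unicode_series.iloc[0] == "Ö"
assert unicode_array[0] == "Ö"
```

Whether kartothek's get_dataframe_not_nested should use `str` or `np.str_` is a judgment call for that project; both avoid touching the deprecated alias.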

          People

            jorisvandenbossche Joris Van den Bossche

              Time Tracking

                Estimated: Not Specified
                Remaining: 0h
                Logged: 2.5h
