Beam / BEAM-11155

Series.str.repeat zipping operation produces incorrect proxy

Details

    • Type: Bug
    • Status: Open
    • Priority: P3
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: sdk-py-core
    • Labels: None

    Description

      https://github.com/apache/beam/pull/13139#discussion_r513684704

      The proxy for this operation is incorrectly inferred as bool, even though both input columns have the expected dtypes:

      In [10]: proxy.dtypes
      Out[10]: 
      str        object
      repeats     int64
      dtype: object
      
      In [11]: proxy.str.str.repeat(proxy.repeats)
      Out[11]: Series([], Name: str, dtype: bool)
      

      The actual operation on non-empty data does produce object dtype, though:

      In [13]: df.str.str.repeat(df.repeats)
      Out[13]: 
      0      AAA
      1        B
      2     CCCC
      3    DDDDD
      4       EE
      Name: str, dtype: object
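
The two sessions above can be approximated with the following minimal sketch. The sample data is an assumption inferred from the printed output, and the proxy is taken to be an empty slice of the same frame. The dtype inferred for the empty case may depend on the pandas version, so the sketch prints it rather than assuming it.

```python
import pandas as pd

# Assumed sample data, inferred from the output shown in the report.
df = pd.DataFrame({
    "str": pd.Series(["A", "B", "C", "D", "E"], dtype=object),
    "repeats": pd.Series([3, 1, 4, 5, 2], dtype="int64"),
})

# Assumption: the proxy is an empty frame with the same columns and dtypes.
proxy = df.iloc[:0]

# Non-empty case: each string is repeated by the matching entry in `repeats`.
actual = df["str"].str.repeat(df["repeats"])

# Empty case: this is what proxy-based dtype inference gets to see.
inferred = proxy["str"].str.repeat(proxy["repeats"])

print("actual dtype:  ", actual.dtype)
print("inferred dtype:", inferred.dtype)
```

If the two printed dtypes differ, the installed pandas version reproduces the mismatch described here.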
      

      Currently we work around this by specifying the proxy manually. Maybe it can be fixed upstream?
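
As a rough illustration of the workaround, the sketch below builds the result proxy by hand instead of inferring it by running the operation on empty data. The helper name `manual_repeat_proxy` is hypothetical and is not Beam's actual code; it only shows the idea in plain pandas.

```python
import pandas as pd


def manual_repeat_proxy(str_proxy: pd.Series) -> pd.Series:
    """Hypothetical helper: hand-written proxy for str.repeat(<Series>).

    Rather than calling str.repeat on the empty proxy, declare the
    expected result directly: an empty object Series with the same name.
    """
    return pd.Series([], dtype=object, name=str_proxy.name)


# Assumed empty proxy with the dtypes shown in the report.
proxy = pd.DataFrame({
    "str": pd.Series([], dtype=object),
    "repeats": pd.Series([], dtype="int64"),
})

result_proxy = manual_repeat_proxy(proxy["str"])
print(result_proxy.dtype, result_proxy.name)
```

The trade-off is that a hand-written proxy has to be kept in sync with pandas' real behaviour, which is why a fix upstream would be preferable.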

          People

            Assignee: Unassigned
            Reporter: Brian Hulette (bhulette)
            Votes: 0
            Watchers: 3
