Details

    • Type: Sub-task
    • Status: In Progress
    • Priority: Major
    • Resolution: Unresolved
    • Affects Version/s: 3.2.0
    • Fix Version/s: None
    • Component/s: PySpark
    • Labels: None

    Description

      When reading a CSV file, pandas strips the thousands separator from numeric fields and parses them as numbers if the `thousands` argument is specified.

      >>> pd.read_csv(path, sep=";")
          name  age        job      money
      0  Jorge   30  Developer  1,000,000
      1    Bob   32  Developer    1000000
      
      >>> pd.read_csv(path, sep=";", thousands=",")
          name  age        job    money
      0  Jorge   30  Developer  1000000
      1    Bob   32  Developer  1000000

      However, `read_csv` in pandas-on-Spark doesn't support the `thousands` argument.
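
      The pandas behavior described above can be reproduced without a file on disk. The sketch below uses plain pandas only and an in-memory copy of the sample data; `CSV_TEXT`, `read_plain`, `read_with_thousands`, and `strip_thousands` are hypothetical names introduced for illustration. `strip_thousands` shows one possible manual cleanup (remove the separator from the string column, then cast). The same idea might carry over to a pandas-on-Spark column, but that is an assumption and is not tested here.

```python
import io

import pandas as pd

# In-memory copy of the semicolon-delimited sample from the description.
CSV_TEXT = (
    "name;age;job;money\n"
    "Jorge;30;Developer;1,000,000\n"
    "Bob;32;Developer;1000000\n"
)


def read_plain(text):
    """Read without `thousands`: the money column stays as strings."""
    return pd.read_csv(io.StringIO(text), sep=";")


def read_with_thousands(text):
    """Read with `thousands=","`: the money column is parsed as integers."""
    return pd.read_csv(io.StringIO(text), sep=";", thousands=",")


def strip_thousands(series, sep=","):
    """Hypothetical helper: drop the separator from a string column, then cast.

    This imitates what `thousands=` does for a single integer column.
    """
    cleaned = series.astype(str).str.replace(sep, "", regex=False)
    return cleaned.astype("int64")


if __name__ == "__main__":
    plain = read_plain(CSV_TEXT)
    print(plain["money"].tolist())
    print(read_with_thousands(CSV_TEXT)["money"].tolist())
    print(strip_thousands(plain["money"]).tolist())
```

      Cleaning the column after the read keeps the read call itself unchanged, at the cost of handling each affected column explicitly.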


          People

            Assignee: Unassigned
            Reporter: Haejoon Lee (itholic)