Hadoop Map/Reduce
  1. Hadoop Map/Reduce
  2. MAPREDUCE-774

Java/C++ word count examples have different outputs

    Details

    • Type: Bug Bug
    • Status: Open
    • Priority: Trivial Trivial
    • Resolution: Unresolved
    • Affects Version/s: None
    • Fix Version/s: None
    • Component/s: examples
    • Labels:
      None

      Description

      I ran the c++ word count example using pipes and got this result:

      Alethea 1488
      Arneb 1508
      Auriculariales 1518
      Aktistetae 92126
      Animalivora 91969
      Aplacentalia 92690
      Aktistetae 1503
      Animalivora 1518
      Aplacentalia 1452
      Alethea 91928
      Arneb 91926
      Auriculariales 92448

      The correct result generated by Java word count example is:

      Aktistetae 93629
      Alethea 93416
      Animalivora 93487
      Aplacentalia 94142
      Arneb 93434
      Auriculariales 93966

        Activity

        Hyunjung Park made changes -
        Field Original Value New Value
        Attachment MAPREDUCE-774.patch [ 12414039 ]
        Hide
        Hyunjung Park added a comment -

        Java/C++ examples used different delimiters for splitting strings.

        Show
        Hyunjung Park added a comment - Java/C++ examples used different delimiters for splitting strings.
        Hyunjung Park created issue -

          People

          • Assignee:
            Unassigned
            Reporter:
            Hyunjung Park
          • Votes:
            0 Vote for this issue
            Watchers:
            2 Start watching this issue

            Dates

            • Created:
              Updated:

              Development