  1. Apache Jena
  2. JENA-2225

TDB/TDB2 dataset size stat serialized incorrectly for large datasets


Details

    • Type: Bug
    • Status: Closed
    • Priority: Minor
    • Resolution: Fixed
    • Affects Version/s: Jena 4.3.1
    • Fix Version/s: Jena 4.5.0
    • Component/s: TDB, TDB2
    • Labels: None

    Description

      When the TDB/TDB2 stats are computed via the CLI, the dataset size (the triple count) is serialized incorrectly for large datasets.

      For example, for the latest Wikidata Truthy dump we get:

      (count -1983667112))

      This happens because, in both TDB and TDB2, the corresponding `Stats.java` class forces an integer-typed Node even though the value is a `long`; the `(int)` cast overflows for counts above `Integer.MAX_VALUE`:

      if ( count >= 0 )
          addPair(meta.getList(), StatsMatcher.COUNT, NodeFactoryExtra.intToNode((int)count)) ; 
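
The narrowing cast in the snippet above can be reproduced in plain Java without Jena. The following is a minimal sketch, not the project's fix: the class `CountOverflowDemo`, its methods `narrow` and `render`, and both input values are hypothetical and chosen only for illustration. The real triple count is not stated in this report; any count that differs from these by a multiple of 2^32 would be serialized as the same negative number.

```java
public class CountOverflowDemo {
    // Mirrors the (int) cast from the report: only the low 32 bits survive.
    static int narrow(long count) {
        return (int) count;
    }

    // One possible safe rendering (an assumption, not Jena's code):
    // keep the value as a long all the way to the serialized text.
    static String render(long count) {
        return "(count " + count + ")";
    }

    public static void main(String[] args) {
        long a = 2_311_300_184L; // hypothetical count, just above Integer.MAX_VALUE
        long b = 6_606_267_480L; // hypothetical count, equal to a + 2^32

        // Both values collapse to the same negative int after the cast.
        System.out.println(narrow(a)); // -1983667112
        System.out.println(narrow(b)); // -1983667112

        // Keeping the long avoids the wrap-around.
        System.out.println(render(b)); // (count 6606267480)

        // Math.toIntExact fails loudly instead of silently wrapping.
        try {
            Math.toIntExact(b);
        } catch (ArithmeticException e) {
            System.out.println("overflow detected");
        }
    }
}
```

Because the cast discards the high bits, the original count cannot be recovered from the serialized value alone, which is why the value needs to stay a `long` until it is written out.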

      Attachments

        1. stats.opt.gz
          265 kB
          Lorenz Bühmann

            People

              Assignee: Andy Seaborne
              Reporter: Lorenz Bühmann

              Dates

                Created:
                Updated:
                Resolved: