Uploaded image for project: 'Derby'
  1. Derby
  2. DERBY-2798

A new approach for main-memory database

Attach filesAttach ScreenshotVotersWatch issueWatchersCreate sub-taskLinkCloneUpdate Comment AuthorReplace String in CommentUpdate Comment VisibilityDelete Comments
    XMLWordPrintableJSON

Details

    • Improvement
    • Status: Closed
    • Major
    • Resolution: Won't Fix
    • 10.2.2.0
    • None
    • Store
    • None
    • all

    Description

      As a part of my Masters degree I have created an extension that allows data to reside in memory without the need to serialize it to Page-objects. This is a pretty big chunk of code and is sort of a proof of concept of another way to make an in-memory storage mode.

      I created two new conglomerates, called MemHeap and MemSkiplist. Derby interfaces them the same way as it does with Heap and BTree. These new conglomerates use RawStore for transaction support and logging, but not for storage. Instead it uses a new system service I've called MemStore. This data store only stores pointers to Slot-objects organized in arrays corresponding to its container/conglomerate/table. A Slot-object consists mainly of a DataValueDescriptor[]-object representing a row in a table.

      So, instead of just doing dummy-IO in memory where Derby still thinks its doing real IO and page caching, this new approach bypasses the cache and page-structure by keeping the DataValueDescriptor[]-objects in memory without serializing them.

      Manipulating operations on data in memory are done via new operation-objects (ex. MemInsertOperation, MemInsertUndoOperation, MemDeleteOperation...) with still uses RawStore for transaction control and persistence. Checkpointing is done by serializing the objects in MemStore fuzzily and completely unsynchronized to disk. Recovery consists of de-serializing the objects to MemStore before the existing REDO- and UNDO-phase of Derby recovery in RawStore will get the data transaction-consistent by replaying or undo the new operation-objects in the log.

      Locking is hard coded as row based with locking degree SERIALIZABLE.

      To get Derby to use the new conglomerates I hacked the SQL-layer to create MemHeap-tables and MemSkiplist-indexes when the table name starts with 'mem_'.

      Because this is a major rewrite of the access- and storage-layer there is a lot of known and unknown bugs and missing functionality. What is working is essentially select, insert, update and delete on tables with one primary key.

      Attachments

        1. Derby-10.2.2.0-memstore.diff
          368 kB
          Knut Magne Solem
        2. DERBY-2798.diff
          368 kB
          Knut Magne Solem
        3. DERBY-2798-10.3.1.0.diff
          340 kB
          Knut Magne Solem
        4. DERBY-2798-10.3.1.0.stat
          5 kB
          Knut Magne Solem
        5. select.png
          3 kB
          Knut Magne Solem
        6. update.png
          3 kB
          Knut Magne Solem

        Issue Links

        Activity

          This comment will be Viewable by All Users Viewable by All Users
          Cancel

          People

            Unassigned Unassigned
            knutmas Knut Magne Solem
            Votes:
            4 Vote for this issue
            Watchers:
            7 Start watching this issue

            Dates

              Created:
              Updated:
              Resolved:

              Slack

                Issue deployment