Relational Database Handler: Difference between revisions

From OPeNDAP Documentation
⧼opendap2-jumptonavigation⧽
Line 73: Line 73:
#:: - Make database query
#:: - Make database query
#:: - retrieve row set response
#:: - retrieve row set response
#:: - for each row in the row set, serialize() each child variable
#:: - set row set cursor to first row, first column.
#::: - pass row set to each variable
#:: - for each row in the row set:
#::: - serialize() each projected child variable
#:::: - pass row set to each variable
#::: - Move row set cursor to next row.
#:}
#:}
#:
#:

Revision as of 16:46, 5 May 2009

In order to provide support for a Sensor Observation Service (SOS) implementation Hyrax will need a handler that allows it to access data in a Relational Database Management System (RDBMS). (This requirement stems from the observation that many of our stake holders store their in situ measurement data in RDBMSs)

It is anticipated RDBMS handler will have a much broader application than just SOS related data sets.


Use Cases

  1. Adding the RDH to the BES
  2. RDH handles bes:showCatalog request
  3. RDH handles a DDX request
  4. RDH handles a DDS request
  5. RDH handles a DAS request
  6. RDH handles a DAP2 data request

Definitions

row set
A row set is an object which encapsulates a set of rows. Database tables are row sets. A database view is a row set. SQL queries return row sets. SQL JOIN operations take row sets as input and produce row sets.
Constraint Expression (CE)
The DAP constraint expression string as described in the DAP2 Specification.

Background

In the past a Java servlet called the DODS Relational Database Server (DRDS) was used to provide DAP access to RDBMS holdings. However this older implementation has a number of shortcomings that preclude it's direct use in our current server architecture:

  • No longer supported.
  • Uses the Java DAP implementation
  • Not a BES module
  • Significant memory limitations
  • Difficult to configure/localize

Although there has been continuing interest in a DRDS replacement within the OPeNDAP community, no funding has been available to develop a soution until recently. The IOOS project has a need to provide a Sensor Observation Service (SOS) interface for Hyrax. Since much of the sensor data is already held in RDBMSs it is a natural and necessary time to develop a DRDS replacement.

Design Overview

The RDH will be a BES module/plug-in. It will use an implementation of the ODBC (most likely unixODBC) to access the RDMS(s). ODBC Data Sources will be defined at the system level as usual. [1][2] [3] [4] [5]

Building the DDS object in memory.

The RDH will load (and cache) from it's specific configuration section of the BES configuration file a list of ODBC Data Sources that it will serve as DAP data sets. When the RDH receives a request for content (such as a DDS, DDX, DAP2 data etc.) it will identify the DAP dataset from the request, and determine which ODBC Data Source it must interact with to fulfill the request. Then, using ODBC (or SQL) introspection methods in the ODBC API, the RDH will identify the collection of tables and views available in the ODBC Data Source and use them to construct a specialized DDS instance in memory. The RDH will traverse the available tables and views in the ODBC Data Source and it will build a representation of each one as a DAP Sequence object. The columns in each table and view will represented by a DAP variable in the corresponding Sequence instance. Each of these Sequence instances will be added to the DDS instance. When building the Sequences and their variables the RDH will use a custom class factory to build instances of DAP objects that can utilize the ODBC API to extract data from the Result Set of an ODBC brokered database query. Each of the data types generated by the custom factory will have implemented read methods able to read data from the row set returned through the ODBC API.

Processing the Constraint Expression

  1. Once the DDS instance is constructed the CE can be processed.
  2. When the ConstraintEvaluator parses the constraint it will:
    1. Convert the CE to an SQL query string.
      • Each projected variable should appear in the SELECT section.
      • Each clause should be converted (where possible) into a condition in the WHERE section.

For example, the CE "?v,u&lat<26&lat>24&lon<-126&lon>-128" would

    1. Mark the projected variables in the DDS.
  1. The Constraint expression must be converted into a set of one or more SQL queries, where each query is associated with a Sequence variable in the DDS.
  2. Each Sequence requested will be associated with a separate SQL query. As the DDS is serialized, each Sequence will use the ODBC API to send the query to the Data Source and then use the returned row set as the data content for the serialization of it's (the Sequence's) variables.

Get DAP2 Data Request

  1. build dds
    1. Connect to datasource
    2. use introspection to colllect tables and views information.
    3. Build DDS instance with a sequence for each table and view.
    4. Cache Data source connection information in DDS (in each sequence?)
  2. parse constraint. For each sequence containing projected variables:
    1. Mark sequence (and varaibles) as projected.
    2. convert CE to SQL query string and cache it in the sequence.
  3. send data: In the DDS each sequence containing projected variables will be serialized:
    Sequence::serialize() {
    - Make database query
    - retrieve row set response
    - set row set cursor to first row, first column.
    - for each row in the row set:
    - serialize() each projected child variable
    - pass row set to each variable
    - Move row set cursor to next row.
    }
    BaseTypeVariable::serialize(){
    - read value from row set
    - move cursor to next column
    - Apply remaining constraints and array sub-setting
    - transmit data
    }


Details:

  • Each simple DAP type implementation will need to be able to:
    • read from the Result Set
    • move the Result Set Cursor to the next Column
(The implementer will have to decide the order of these two operations)
  • The Sequence type will have to manage moving the Result Set Cursor from one row to the next.
  • Some helper class (ConstraintEvaluator??? ) will have to convert DAP constraint expressions to SQL queries.

Mapping the ODBC data model to the DAP2 data model

RDH Catalog Organization

Deliverables

Period of use