Example 1: Quality Assessment of RDF Data Cubes

Prior to publishing the resulting RDF data on an existing data portal and thus enabling other users to download and exploit the data for various purposes, every dataset should be validated to ensure it conforms to the RDF Data Cube model.

Table 3. Example values for a Data Cube structure representing the Serbian economic statistics

Component property

Concept description

Identifier

Code list

Dimension

Geographical region

rs:geo

cl:geo

Dimension

Time

rs:time

cl:time

Dimension

Economic activity

rs:activityNACEr2

cl:nace rev2

Attribute

Unit of measurement

sdmx-

attribute:unitMeasure

cl:esa95-unit

Measure

Observed value

sdmx-measure:obsValue

Fig. 4. RDF Data Cube quality assessment

The data validation step is covered by the LOD2 stack, i.e. through the following software tools:

The RDF Data Cube Validation Tool[1];

CubeViz, a tool for visualization of RDF Data Cubes[2].

The RDF Data Cube Validation Tool aims at speeding-up the processing and publishing of Linked Data in RDF Data Cube format. Its main use is validating the integrity constraints defined in the RDF Data Cube specification. It works with the Virtuoso Universal Server as a backend and can be run from the LOD2 Statistical Workbench environment.

The main benefits of using this component are improved understanding of the RDF Data Cube vocabulary and automatic repair of identified errors. Figure 4 shows the component in action: the integrity constraints and their status are shown on the left side, while the results of analysis are shown on the right. A list of resources that violate the constraint, an explanation about the problem, and if possible, a quick solution to the problem is offered to the user. Once an RDF

Data Cube satisfies the Data Cube integrity constraints, it can be visualized with CubeViz. More details can be found in the LOD2 Stack Documentation[3].

Example 2: Filtering, Visualization and Export of RDF Data Cubes

The facetted browser and visualization tool CubeViz can be used to filter observations to be visualized in charts interactively. Figure 5 shows an exploration session that comprises of the following steps:

Fig. 5. RDF Data Cube exploration and analysis

1. Select one out of the available datasets in the RDF graph;

2. Choose observations of interest by using a subset of the available dimensions;

3. Visualize the statistics by using slices, or

4. Visualize the statistics in two different measure values (millions of national currency and percentages).

Example 3: Merging RDF Data Cubes

Merging[4] is an operation of creating a new dataset (RDF Data Cube) that compiles observations from the original datasets (two or more), and additional resources (e.g. data structure definition, component specifications). In order to obtain meaningful charts the observed phenomena (i.e. serial data) have to be described on the same granularity level (e.g. year, country) and expressed in same units of measurement (e.g. euro, %). Therefore alignment of the code lists used in the input data is necessary before the merging operation is performed.

  • [1] wiki.lod2.eu/display/LOD2DOC/RDF+Data+Cube+Quality+Assessment
  • [2] wiki.lod2.eu/display/LOD2DOC/Data+Subsetting+and+Export+Scenario
  • [3] wiki.lod2.eu/display/LOD2DOC/LOD2+Statistical+Workbench
  • [4] wiki.lod2.eu/display/LOD2DOC/Eurostat+Merge+and+Enhance+Scenario
 
< Prev   CONTENTS   Next >