OLAP4LD Demo at ESWC 2014

From www.b-kaempgen.de

Latest revision as of 13:24, 4 January 2023

On this page, we collect information about our demonstration at ESWC 2014.

In our demonstration, we will show how changes in modelling are propagated to LDCX by modifying a published QB dataset live. We also show common modelling errors in existing QB datasets, such as dimensions missing an rdfs:range or qb:codeList, and observations that do not adhere to their data structure definition.

Demonstrating the three-step interface

What will the audience learn? How to explore one dataset.
  1. Select dataset -> Explore dataset...
  2. Select measures
  3. Select dimensions on rows and columns -> Update table...
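The three selections map onto the parts of an MDX query of the kind shown in the FAQ: the dataset becomes the cube in FROM, the measures and column dimensions go ON COLUMNS, and the row dimensions go ON ROWS. The following is a minimal sketch of that mapping, not the LDCX implementation. The function names and the short identifiers in the usage example are hypothetical; the real identifiers are encoded URIs.

```python
from functools import reduce


def members(dimension):
    """Set expression selecting all members of one dimension."""
    return "{Members([" + dimension + "])}"


def cross_join(sets):
    """Nest CrossJoin calls left to right; a single set is returned unchanged."""
    return reduce(lambda left, right: f"CrossJoin({left}, {right})", sets)


def build_mdx(cube, measures, column_dims, row_dims):
    """Assemble an MDX query from the three interface selections."""
    if not measures or not row_dims:
        raise ValueError("select at least one measure and one row dimension")
    measure_set = "{" + ",".join(f"[{m}]" for m in measures) + "}"
    columns = cross_join([measure_set] + [members(d) for d in column_dims])
    rows = cross_join([members(d) for d in row_dims])
    return (
        f"SELECT NON EMPTY {columns} ON COLUMNS, "
        f"NON EMPTY {rows} ON ROWS FROM [{cube}]"
    )


# Hypothetical short names standing in for the encoded URIs.
query = build_mdx(
    "ds",
    ["lo_quantity", "lo_revenue"],
    ["lo_suppkeyCodeList"],
    ["lo_custkeyCodeList", "lo_orderdateCodeList"],
)
print(query)
```

Adding another entry to `row_dims` or `column_dims` nests one more CrossJoin, which corresponds to drilling down by adding a dimension.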

FAQ

  • What does the query look like? It is an MDX query, since datasets are represented as data cubes. Why not a SPARQL query directly? Because OLAP application designers are generally not familiar with SPARQL, whereas MDX is specifically designed for analytical queries over multidimensional datasets (cubes, measures, dimensions).
SELECT /* $session: 2e72789e-08d7-d14d-2450-c9f4004b04c1 */ NON EMPTY CrossJoin({[httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23lo_quantity],[httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23lo_revenue]}, {Members([httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23lo_suppkeyCodeList])}) ON COLUMNS
, NON EMPTY CrossJoin({Members([httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23lo_custkeyCodeList])}, {Members([httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23lo_orderdateCodeList])}) ON ROWS
FROM [httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2FOLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23ds]
  • How do you drill down? By adding dimensions.
  • What is done in the backend? 1. Loading the data cubes: running the normalisation algorithm, checking integrity constraints... 2. Executing the MDX query over the data cubes.
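The long identifiers in the MDX query above are URIs made safe for MDX. Judging only from that query, the scheme appears to be percent-encoding followed by replacing "%" with "XXX", "." with "YYY" and "-" with "ZZZ". The sketch below reverses and reapplies that mapping; the scheme is inferred from the example rather than taken from the OLAP4LD code, and the function names are hypothetical.

```python
from urllib.parse import quote, unquote


def decode_mdx_name(name):
    """Turn an encoded MDX identifier back into the URI it stands for."""
    percent_encoded = (
        name.replace("XXX", "%").replace("YYY", ".").replace("ZZZ", "-")
    )
    return unquote(percent_encoded)


def encode_mdx_name(uri):
    """Apply the inferred mapping in the other direction."""
    percent_encoded = quote(uri, safe="")  # also encodes ':', '/' and '#'
    return (
        percent_encoded.replace("%", "XXX").replace(".", "YYY").replace("-", "ZZZ")
    )


cube = (
    "httpXXX3AXXX2FXXX2Folap4ldYYYgooglecodeYYYcomXXX2FgitXXX2F"
    "OLAP4LDZZZtrunkXXX2FtestsXXX2Fssb001XXX2FttlXXX2FexampleYYYttlXXX23ds"
)
print(decode_mdx_name(cube))
```

Decoding the cube name from the FROM clause this way yields the URI of the dataset, with the fragment `#ds` at the end.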

Live modifying a published QB dataset

What will the audience learn? 
  • Pastebin Example Star Schema Dataset: http://pastebin.com/raw.php?i=839G2u72#ds
  • Pastebin: http://pastebin.com/839G2u72
  • Example modifications:
    • Change label of dataset.
    • Change discount of first observation.
    • Add a new dimension so that an error is thrown. ("Failed specification check: IC-4. Dimensions have range. Every dimension declared in a qb:DataStructureDefinition must have a declared rdfs:range.", "Failed specification check: IC-11. All dimensions required. Every qb:Observation has a value for each dimension declared in its associated qb:DataStructureDefinition.")
[ qb:dimension :lo_superkey ]
    • Remove rdfs:range or qb:codeList for skos:Concept dimensions.
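To see why adding a dimension to the data structure definition triggers IC-11, consider the following sketch. It is a simplified stand-in for the check over plain Python dictionaries, not the SPARQL-based check of the specification or the OLAP4LD code. The observation identifier and the dimension values are hypothetical.

```python
def missing_dimension_values(declared_dimensions, observations):
    """Return (observation, dimension) pairs that violate IC-11.

    declared_dimensions: dimensions listed in the data structure definition.
    observations: mapping from observation id to its dimension values.
    """
    violations = []
    for obs_id, values in observations.items():
        for dimension in declared_dimensions:
            if dimension not in values:
                violations.append((obs_id, dimension))
    return violations


# The structure after adding [ qb:dimension :lo_superkey ].
declared = [":lo_custkey", ":lo_orderdate", ":lo_superkey"]

# A hypothetical existing observation that predates the new dimension.
observations = {
    ":o1": {":lo_custkey": ":customer1", ":lo_orderdate": ":date1"},
}

print(missing_dimension_values(declared, observations))
```

Every existing observation lacks a value for the newly declared dimension, so each one is reported.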

Drill-Across Queries

What will the audience learn?

Example datasets:

Energy dependence: http://estatwrap.ontologycentral.com/id/tsdcc310 (2001-2012)
Energy productivity: http://estatwrap.ontologycentral.com/id/t2020_rd310 (2000-2012)
Energy intensity: http://estatwrap.ontologycentral.com/id/tsdec360 (2001-2012)
Greenhouse gas emissions per capita: http://estatwrap.ontologycentral.com/id/t2020_rd300 (2000-2011)
Share of renewable energy: http://estatwrap.ontologycentral.com/id/t2020_31 (2004-2012)
People at risk of poverty or social exclusion: http://estatwrap.ontologycentral.com/id/t2020_50 (2004-2012)
People living in households with very low work intensity: http://estatwrap.ontologycentral.com/id/t2020_51 (2004-2012)
People at risk of poverty after social transfers: http://estatwrap.ontologycentral.com/id/t2020_52 (2003-2012)
Severely materially deprived people: http://estatwrap.ontologycentral.com/id/t2020_53 (2003-2012)
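The year ranges above differ between datasets. Assuming a drill-across query only returns complete rows for years that every selected dataset covers (an assumption made for illustration, not something this page states), the shared range can be computed as follows. The dictionary keys are the dataset identifiers from the URLs above; the function name is hypothetical.

```python
# Year coverage as listed above: identifier -> (first year, last year).
COVERAGE = {
    "tsdcc310": (2001, 2012),
    "t2020_rd310": (2000, 2012),
    "tsdec360": (2001, 2012),
    "t2020_rd300": (2000, 2011),
    "t2020_31": (2004, 2012),
    "t2020_50": (2004, 2012),
    "t2020_51": (2004, 2012),
    "t2020_52": (2003, 2012),
    "t2020_53": (2003, 2012),
}


def common_years(coverage):
    """Years covered by every dataset: latest start up to earliest end."""
    start = max(first for first, _ in coverage.values())
    end = min(last for _, last in coverage.values())
    return list(range(start, end + 1)) if start <= end else []


years = common_years(COVERAGE)
print(years[0], years[-1])
```

Selecting fewer datasets widens the shared range; for example, the first three datasets alone share 2001 to 2012.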

Common modelling errors

What will the audience learn?

Missing range

  • Transparency International Linked Data - Corruption Perceptions Index 2011: http://transparency.270a.info/dataset/CPI2011
  • Missing rdfs:range for dimension "source" [1]: "Failed specification check: IC-4. Dimensions have range. Every dimension declared in a qb:DataStructureDefinition must have a declared rdfs:range."
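The idea behind IC-4 can be illustrated over a handful of triples. The RDF Data Cube specification expresses its integrity constraints as SPARQL ASK queries; the sketch below restates the same condition over plain tuples and is not the check that OLAP4LD runs. The triples are hypothetical and are not taken from the Transparency International dataset.

```python
def dimensions_without_range(triples):
    """Return dimension properties that have no rdfs:range (IC-4)."""
    dimensions = {
        s for s, p, o in triples
        if p == "rdf:type" and o == "qb:DimensionProperty"
    }
    with_range = {s for s, p, o in triples if p == "rdfs:range"}
    return sorted(dimensions - with_range)


# Hypothetical triples: one well-modelled dimension, one missing its range.
triples = [
    (":refArea", "rdf:type", "qb:DimensionProperty"),
    (":refArea", "rdfs:range", "skos:Concept"),
    (":source", "rdf:type", "qb:DimensionProperty"),
]

print(dimensions_without_range(triples))
```

Adding a triple such as `(":source", "rdfs:range", "skos:Concept")` would make the list empty and the check pass.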

No resolvable URIs

No DataStructureDefinition

No properly modelled cube