Skip to content

Commit

Permalink
First cut at implementing and documenting PobObs approach for schemao…
Browse files Browse the repository at this point in the history
  • Loading branch information
danbri committed Jul 24, 2019
1 parent eadc8da commit 3d91998
Show file tree
Hide file tree
Showing 2 changed files with 271 additions and 0 deletions.
138 changes: 138 additions & 0 deletions data/ext/pending/issue-2291.rdfa
Original file line number Diff line number Diff line change
@@ -0,0 +1,138 @@
<div>
<!-- Short description here, issue 2291 -->

<!--
new type: StatisticalPopulation
new properties: populationType, numConstraints, constrainingProperties
new type: Observation
new properties: observedNode, measuredProperty, measuredValue, observationDate, marginOfError, measurementMethod
-->

<div typeof="rdfs:Class" resource="http://schema.org/StatisticalPopulation">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">StatisticalPopulation</span>
<span property="rdfs:comment">A StatisticalPopulation is a set of instances of a certain given type that satisfy some set of constraints. The property [[populationType]] is used to specify the type. Any property that can be used on instances of that type can appear on the statistical population. For example, a [[StatisticalPopulation]] representing all [[Person]]s with a [[homeLocation]] of East Podunk California, would be described by applying the appropriate [[homeLocation]] and [[populationType]] properties to a [[StatisticalPopulation]] item that stands for that set of people.
The properties [[numConstraints]] and [[constrainingProperties]] are used to specify which of the populations properties are used to specify the population. Note that the sense of "population" used here is the general sense of a statistical
population, and does not imply that the population consists of people. For example, a [[populationType]] of [[Event]] or [[NewsArticle]] could be used. See also [[Observation]], and <a href="/docs/data-and-datasets.html">data and datasets</a> overview for more details.
</span>
<span>Subclass of: <a property="rdfs:subClassOf" href="http://schema.org/Intangible">Intangible</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>

<div typeof="rdf:Property" resource="http://schema.org/populationType">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">populationType</span>
<span property="rdfs:comment">Indicates the populationType common to all members of a [[StatisticalPopulation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/StatisticalPopulation">StatisticalPopulation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/Class">Class</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>


<div typeof="rdf:Property" resource="http://schema.org/numConstraints">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">numConstraints</span>
<span property="rdfs:comment">Indicates the number of constraints (not counting [[populationType]]) defined for a particular [[StatisticalPopulation]]. This helps applications understand if they have access to a sufficiently complete description of a [[StatisticalPopulation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/StatisticalPopulation">StatisticalPopulation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/Integer">Integer</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>

<div typeof="rdf:Property" resource="http://schema.org/constrainingProperty">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">constrainingProperty</span>
<span property="rdfs:comment">Indicates the number of constraints (not counting [[populationType]]) defined for a particular [[StatisticalPopulation]]. This helps applications understand if they have access to a sufficiently complete description of a [[StatisticalPopulation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/StatisticalPopulation">StatisticalPopulation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/Integer">Integer</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>






<div typeof="rdfs:Class" resource="http://schema.org/Observation">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">Observation</span>
<span property="rdfs:comment">Instances of the class [[Observation]] are used to specify observations about an entity (which may or may not be an instance of a [[StatisticalPopulation]]), at a particular time. The principal properties of an [[Observation]] are [[observedNode]], [[measuredProperty]], [[measuredValue]] (or [[median]], etc.) and [[observationDate]] ([[measuredProperty]] properties can, but need not always, be W3C RDF Data Cube "measure properties", as in the [[lifeExpectancy example]](https://www.w3.org/TR/vocab-data-cube/#dsd-example).
See also [[StatisticalPopulation]], and <a href="/docs/data-and-datasets.html">data and datasets</a> overview for more details.
</span>
<span>Subclass of: <a property="rdfs:subClassOf" href="http://schema.org/Intangible">Intangible</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>


<div typeof="rdf:Property" resource="http://schema.org/observedNode">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">observedNode</span>
<span property="rdfs:comment">The observedNode of an [[Observation]], often a [[StatisticalPopulation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/StatisticalPopulation">StatisticalPopulation</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>




<div typeof="rdf:Property" resource="http://schema.org/measuredProperty">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">measuredProperty</span>
<span property="rdfs:comment">The measuredProperty of an [[Observation]], either a schema.org property, a property from other RDF-compatible systems e.g. W3C RDF Data Cube, or schema.org extensions such as [GS1's](https://www.gs1.org/voc/?show=properties).</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/Property">Property</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>


<div typeof="rdf:Property" resource="http://schema.org/measuredValue">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">measuredProperty</span>
<span property="rdfs:comment">The measuredValue of an [[Observation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/DataType">DataType</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>

<div typeof="rdf:Property" resource="http://schema.org/observationDate">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">observationDate</span>
<span property="rdfs:comment">The observationDate of an [[Observation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/DateTime">DateTime</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>

<div typeof="rdf:Property" resource="http://schema.org/marginOfError">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">marginOfError</span>
<span property="rdfs:comment">A marginOfError for an [[Observation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/DateTime">DateTime</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div>


<!-- commented out, with a view to possibly adapting https://schema.org/measurementTechnique -->
<!--
<div typeof="rdf:Property" resource="http://schema.org/measurementMethod">
<span>Category: <span property="schema:category">issue-2291</span></span>
<span class="h" property="rdfs:label">measurementMethod</span>
<span property="rdfs:comment">The measurementMethod of an [[Observation]].</span>
<span>domainIncludes: <a property="http://schema.org/domainIncludes" href="http://schema.org/Observation">Observation</a></span>
<span>rangeIncludes: <a property="http://schema.org/rangeIncludes" href="https://schema.org/Thing">Thing</a></span>
<link property="http://schema.org/isPartOf" href="http://pending.schema.org" />
<span>Source: <a property="dc:source" href="https://github.com/schemaorg/schemaorg/issues/2291">#2291</a></span>
</div> -->

</div>
133 changes: 133 additions & 0 deletions docs/data-and-datasets.html
Original file line number Diff line number Diff line change
@@ -0,0 +1,133 @@
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" xml:lang="en" lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
<title>Data Model - schema.org</title>
<meta name="description" content="Schema.org is a set of extensible schemas that enables webmasters to embed
structured data on their web pages for use by search engines and other applications." />
<link rel="stylesheet" type="text/css" href="schemaorg.css" />
<link href="prettify.css" type="text/css" rel="stylesheet" />
<script type="text/javascript" src="prettify.js">
</script>
<script type="text/javascript" src="//ajax.googleapis.com/ajax/libs/jquery/1.5.1/jquery.min.js"></script>
<!-- ##### Generated insert [CSEScript-start] see scripts/genhtmlinserts.sh ##### -->
<!-- Enable CSE Search -->
<script>
(function() {
var cx = '013516846811604855281:nj5laplixaa'; // Insert your own Custom Search engine ID here
var gcse = document.createElement('script'); gcse.type = 'text/javascript'; gcse.async = true;
gcse.src = 'https://cse.google.com/cse.js?cx=' + cx;
var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(gcse, s);
})();
</script>
<!-- ##### Generated insert [CSEScript-end] see scripts/genhtmlinserts.sh ##### -->

<script type="text/javascript">
$(document).ready(function(){
prettyPrint();
setTimeout(function(){

$('.atn').each(function(i) {
if (($(this).text() == 'itemscope') || ($(this).text() == 'itemtype') || ($(this).text() == 'itemprop') || ($(this).text() == 'itemid') || ($(this).text() == 'time') || ($(this).text() == 'datetime') || ($(this).text() == 'vocab') || ($(this).text() == 'property') || ($(this).text() == 'typeOf') || ($(this).text() == 'rel'))
$(this).addClass('custom');
});

}, 500);
});
</script>

<style>

.tag { color: #000; } /* div, span, etc */
.atn { color: #000; } /* href, datetime, */
.custom { color: #660003; } /* itemscope, itemtype, etc,. */

</style>

</head>
<body>
<!-- ##### Generated insert [DOCSHDR-start] see scripts/genhtmlinserts.sh ##### -->
<div id="container">
<div id="intro">
<div id="pageHeader">
<div class="wrapper">
<div id="sitename">
<h1>
<a href="../">schema.org</a>
</h1>
</div>
<div id="cse-search-form" style="width: 400px;">
<div class="gcse-searchbox-only" data-resultsurl="search_results.html"></div>
</div>
</div>
</div>
</div>
</div>
<div id="selectionbar">
<div class="wrapper">
<ul>
<li>
<a href="documents.html" >Documentation</a>
</li>
<li>
<a href="schemas.html" >Schemas</a>
</li>
<li>
<a href="../" >Home</a>
</li>
</ul>
</div>
</div>
<!-- ##### Generated insert [DOCSHDR-end] see scripts/genhtmlinserts.sh ##### -->
<div id="mainContent">



<h1>Data and Datasets overview</h1>

<p>This note provides some background on the various notions of "data" and "dataset" related to Schema.org.</p>

<p>Schema.org as a project, and as a collection of terms, is entirely devoted to data. We define types
such as <a href="/Event">Event</a>, <a href="/NewsArticle">NewsArticle</a>, <a href="/Review">Review</a>, <a href="/Person">Person</a>, as well as properties that characterize and interlink instances of
these types. For example, the "alumni" property links <a href="/Person">people</a> with <a href="/EducationalOrganization">educational organizations</a>.
</p>

<p>Schema.org itself also contains some dedicated vocabulary that can be used in applications which publish,
discover or integrate different kinds of data. Just as schema.org defines vocabulary to help describe people, volcanos and
public toilets, it can also be used to describe data. This capability is in addition to schema.org's general nature as a
collection of structured data schemas, and complements numerous other data-related formats and standards.
</p>

<ul>
<li>
When describing collections of data, for example as published in scientific, scholarly or governmental
"open data" repositories, the <a href="/Dataset">Dataset</a> type can be used, alongside <a href="/DataCatalog">DataCatalog</a> to indicate the larger collection, and <a href="/DataDownload">DataDownload</a> for specific representations of a dataset.
These "datasets", unlike typical use of Schema.org, can be in arbitrary formats. For example, they may include data that is stored in collections of spreadsheet files, or as digital images, or in dedicated scientific, geospatial and engineering file formats. Such diversity reflects the complexity of real-world data, but the use of diverse and often incompatible
formats also makes it hard to integrate the information that they encode, e.g. for use in unified "knowledge graphs" such as <a href="https://wikidata.org">Wikidata</a> and <a href="https://DataCommons.org">DataCommons.org</a>.
Schema.org's <a href="/Dataset">Dataset</a> vocabulary was originally based on <a href="https://en.wikipedia.org/wiki/Data_Catalog_Vocabulary">DCAT</a>, which in turn used <a href="https://www.w3.org/TR/vocab-dcat/#basic-example">used</a> <a href="http://dublincore.org/">Dublin Core</a> and <a href="http://xmlns.com/foaf/spec/">FOAF</a> terms.
</li>
<li>
When aggregating and integrating statistical observations that describe collections ("populations") of
individual entities, the <a href="/StatisticalPopulation">StatisticalPopulation</a> and <a href="/Observation">Observation</a> types can be used. See <a href="https://github.com/schemaorg/schemaorg/issues/2291 ">proposal</a> and <a href="https://docs.google.com/document/d/139jXakeQk4ChwCkGjqq5wJfCPMDnwIV94oCH-JzJrhM/edit">overview document</a> for details, and <a href="http://datacommons.org">DataCommons.org</a> for an application of this approach to large scale knowledge graphs. This approach emphasises the use of schema.org vocabulary to integrate information from multiple independent statistical datasets, by using schema.org and related vocabulary to explain the content of the statistical data.
</li>
</ul>

<p>
Other related work includes W3C's <a href="https://www.w3.org/TR/tabular-data-primer/">CSVW</a>
and <a href="https://www.w3.org/TR/vocab-data-cube/">RDF Data Cube</a> specifications, as well as
the <a href="https://google.github.io/dspl/dspl2-spec.html">DSPL 2.0</a> specification. DSPL 2.0 combines Schema.org
for per-dataset metadata with the use of CSV files to represent code lists, enumerations and statistical observations.
These technologies all in turn depend on lower-level standards, such as for JSON-LD, RDFa, Microdata, XML, Unicode etc.,
and share a broadly <a href="https://en.wikipedia.org/wiki/Resource_Description_Framework">RDF-like</a> approach to
representing information.
</p>

</div>


<div id="footer"><p>
<a href="../docs/terms.html">Terms and conditions</a></p>
</div>

</body>
</html>

0 comments on commit 3d91998

Please sign in to comment.