Chapter 3: What is Open Data Spaces (ODS)?

3.1 Concepts of Open Dataspaces

Open Dataspaces is neither an extension of an enterprise's internal data infrastructure nor a shared platform controlled by a single company. What matters is not "collecting all data in one place (Push and Ingest)," but rather "handling data in a distributed state — where it is, what it means, and who can use it and how (Serving and Pull)."

As an approach to distributed data management, "Data Mesh" has already attracted significant attention in the United States and elsewhere, with a growing number of companies adopting it. However, data mesh focuses on how to manage data scattered within an organization while keeping it distributed, and does not address data management that crosses organizations and national boundaries. Open Dataspaces is an approach designed to fill this gap.

There are various challenges in cross-organizational data management, and Open Dataspaces provides solutions addressing primarily the following three:

  1. "Where to Get" Problem: Addresses the existence, identity, and discoverability of data. (e.g., Where does that data exist in the first place? Does that data refer to the same thing as this data?)

  2. "What to Mean" Problem: Addresses the meaning, vocabulary, and semantic consistency of data. (e.g., What does that data mean? Is the meaning of that data consistent with the meaning of this data?)

  3. "Who and How to Use" Problem: Addresses subject trust, access, and terms of use. (e.g., Who is attempting to access the data? Who is allowed to access that data? How must that data be used?)

These are organized as three pillars accordingly:

  • (1)DAD(Data Addressability and Discoverability)

  • (2)OSI(Ontology and Semantic Interoperability)

  • (3)IUC(Identity and Usage Control)

The design philosophy, principles, and architectural implications of these three pillars are documented in Design Philosophy and ODS-RAM. This document limits its coverage to mapping the role of each pillar to the entry opportunities available to users.

3.2 Differences from Conventional Data Management

Conventional data management has relied on two approaches:

  1. consolidating data internally for management and analysis

  2. establishing individual point-to-point connections with each required counterpart.

Open Dataspaces takes a different approach — neither consolidation nor individual negotiation — by handling existence, meaning, and terms of use while keeping data distributed. Table 2 presents a comparison with conventional data management approaches. For a more detailed comparison, please refer to "Design Philosophy".

Table 2 Comparison with Conventional Data Management Approaches

Perspective
Individual API Integration
Enterprise Internal Data Infrastructure
Open Dataspaces

Data placement

Handled individually per connection

Consolidated internally

Handled in a distributed state

Discovery

Assumes prior knowledge of connections and specifications

Assumes an internal catalog

Assumes addressability and discoverability (DAD)

Handling of meaning

Dependent on individual implementation

Managed under internal schema; meaning and structure tend to be tightly coupled

Meaning (semantics/ontology) is separated from structure (OSI)

Identity and usage control

Dependent on individual contracts and implementation

Centered on internal access management

Identity, access, and terms of use are handled explicitly (IUC)

3.3 Value Provided by Open Dataspaces

3.3.1 Value for Data Providers

The reasons data providers hesitate to share data externally go beyond the risk of information leakage alone. Compounding problems include: "We cannot accurately communicate what our data means to external parties," "Our data may be conflated with other companies' data and misused," and "It is difficult to operationalize terms of use." Open Dataspaces enables the separate handling of issues related to data existence and identification, data meaning and vocabulary, and access and terms of use. This separation makes it easier for data providing enterprises to design external data sharing within the necessary scope and conditions, without needing to unify their internal systems or internal identifiers.

3.3.2 Value for Data Users

By separately handling discoverability, semantic organization, and the management of entities and terms of use, Open Dataspaces reduces the operational burden of "interpreting meaning, ensuring consistency, and confirming terms of use" after acquiring external data. This makes it easier for utilizing enterprises to incorporate external data into BI/DI analysis, business applications, and AI initiatives.

3.4 Examples of Business Models Achievable with ODS

Entry opportunities for users in Open Dataspaces can be organized as shown in Table 4. These represent general business model perspectives applicable across industries and use cases.

Table 4 Entry Pattern to Open Dataspaces

Entry Pattern
Overview
Save Money / Make Money Perspective

Collection and utilization of external data

Acquire data from multiple providers and apply it to BI/DI/AI analysis and operational improvement

Reduction of individual API coordination costs, faster decision-making (Save) / Improved AI accuracy, new service creation (Make)

External provision and monetization of internal data

Provide internally held data to external parties under defined conditions, generating revenue

Acquisition of new revenue streams, expansion of partnerships (Make)

Ecosystem participation through mutual data exchange

Exchange data mutually with multiple parties to enable supply chain management, quality control, and traceability

Operational efficiency and risk reduction (Save) / Compliance with industry standards, enhanced brand value (Make)

Adoption of ODP-implemented services as a user

Use ODP-implemented managed services and applications as a user

Reduced initial investment, faster time to value (Save)

It should be noted that in Open Dataspaces, N:N connectivity becomes a realistic option through a degree of discipline and interoperability. The scale of business value that cannot be achieved through one-to-one individual integrations represents the fundamental motivation for participation in Open Dataspaces.

3.5 The Relationship Between AI and Open Dataspaces

Finally, the relationship between the AI industry and the Open Dataspaces market is examined. In AI applications — particularly in analytics based on operational data and the use of Agentic AI — the ability to continuously handle "data context, meaning, terms of use, and currency" is more important than simply having a large volume of data. The Design Philosophy also highlights that in the Agrntic AI era, unique context is the essence of data, and emphasizes the importance of domain owners explicitly providing that context as meaning (ontology). Open Dataspaces technology provides the technical foundation for addressing these challenges through endpoint discoverability, semantics and ontology, and identity and usage control.

As a result, the following kinds of benefits can be expected:

  • It becomes easier to perform reasoning and interpretation that presupposes relationships and context between data points

  • It becomes easier to construct AI pipelines that account for data provenance and currency

  • It becomes easier to incorporate external data into AI applications without relying solely on individual integration efforts

The next chapter builds on this foundation to examine, from the perspective of participation structure, the positions that users can occupy.

Last updated