More than half of publicly funded research data in Europe remains difficult to access outside the institutions that created it. In many cases the issue is not the quality of the data, but the architecture of the platforms that store it.
INFRA-ART contained over 2,000 spectral records derived from paintings, archaeological artifacts, and cultural heritage objects. 
The data was valuable and well maintained, but the platform had been designed through a web application, with no interoperability with external research infrastructures.

The Data Was Good. The Architecture Was Not Designed to Share It.

Most research platforms are not built to fail. They are built to solve a specific problem, at a specific point in time. The data inside them is often excellent but the architecture around them, less.

INFRA-ART worked well enough: a Django backend, a solid PostgreSQL database, thousands of spectral records collected from paintings, archaeological artifacts, and cultural heritage objects.

The European Commission has measured this pattern: over 60% of publicly funded research data in the EU stays locked inside the team that created it. EOSC alignment and FAIR compliance are now conditions for qualifying for the next round of European funding, not recommendations.

What FAIR Compliance Actually Requires

The FAIR acronym: Findable, Accessible, Interoperable, Reusable, appears frequently in policy documents. It appears far less frequently in engineering specifications, which is part of why so many platforms claim compliance without achieving it.

In practical terms, FAIR compliance requires five things to be true simultaneously:

  • Persistent identifiers assigned to every dataset, so records can be cited reliably across systems
  • Structured metadata schemas aligned with EOSC or Dublin Core standards
  • A public API with consistent, documented responses
  • Standardised export formats: JSON, CSV, RDF
  • Access policies that are documented and findable by external parties

Each of these is a concrete engineering requirement, not a policy aspiration.

A platform that satisfies four out of five is not FAIR compliant. It is close, which is a different thing entirely.

FAIR Principle What It Requires How We Implemented It
Findable Persistent identifiers, rich metadata, indexed by search engines PIDs assigned, JSON-LD metadata, EOSC-indexed
Accessible Standardized protocols, documented access policies Versioned REST API, documented endpoints, public access policy
Interoperable Shared knowledge representation formats JSON/CSV/RDF export, OpenAIRE-compatible schema
Reusable Usage licenses and provenance documentation Audit trail, GDPR-aligned, CC licensing

The Architecture: Separating What Should Not Be Mixed

When we reviewed the existing system, the data was not the problem. The data was the asset. The problem, rather, was that the architecture only supported access through the web application. No other systems could integrate with it programmatically.

No public API. Metadata in formats that external harvesters could not read. Infrastructure tied to institutional servers with an uncertain operational future. The data was there. The means to reach it were not.

Consequently, our starting point was to treat interoperability as a design constraint, not a post-launch improvement. That decision shapes everything that follows. It means the first weeks of a project move more deliberately, and the last weeks do not involve emergency retrofitting.

Separating the Data Layer from the Content Layer

Research records and editorial content have different requirements. A spectral database record needs to be fast, consistent, and machine-readable. A page describing the project, on the other hand, needs to be editable by a non-technical administrator without involving a developer.

The solution was to route them separately. Scientific data flows through direct PostgreSQL queries, bypassing the CMS entirely for performance-critical operations. Editorial content lives in Strapi, giving administrators independent control over pages and updates. Each layer has a clear responsibility and teams can maintain or update it without affecting the other.

Choosing Infrastructure That Meets Sovereignty Requirements

Beyond the application architecture, the hosting environment also required a deliberate decision. EU data sovereignty requirements mean research data must reside on EU-based infrastructure and comply with GDPR.

Layer Technology Responsibility
Scientific Data PostgreSQL (direct queries) High-performance spectral search, data integrity
Content Management Strapi (Headless CMS) Editorial content, non-technical admin access
API Layer REST API Machine-to-machine integration, EOSC harvesting
Frontend Modern web stack, Next.js, Material UI User-facing interface, accessibility
Infrastructure EU-hosted (Hetzner, Germany) GDPR compliance, data sovereignty

The Public API Is Not Optional

A point worth stating directly: without a public API, FAIR compliance remains an intention, not a capability.

External repositories cannot retrieve your data. Harvesting tools cannot index it. Partner institutions cannot integrate with it programmatically. Consequently, the quality of the underlying data is irrelevant if no stable, documented interface exists to access it.

We built the INFRA-ART API around consistent JSON responses across all endpoints, versioned documentation accessible to external developers, and rate limiting that protects against abuse without restricting legitimate access. Scientific data and editorial content serve different audiences with different performance requirements, so we routed them through separate API paths accordingly.

Results

Outcome Detail
Independence from institutional infrastructure EU-hosted, no operational dependency on institutional servers
Modular, maintainable architecture Clean separation between data, API, CMS, and frontend layers
Public REST API Documented, versioned, ready for EOSC harvesting
Non-technical administration Content managed independently through Strapi
Zero data loss All 2,000+ spectral records preserved through migration
92% FAIR compliance score Verified by F-UJI, one category at Advanced level

 


In Summary

Research platforms do not usually fail because of bad data. They fail because the architecture was built for a different purpose and never updated to reflect what the data became worth.

INFRA-ART had that data. What it needed was an architecture capable of making it available to the people and systems that could use it.

FAQ's

1. What is a FAIR compliant research data platform?

A system where scientific datasets can be found, accessed, and reused by external systems, not just the team that created them. It requires persistent identifiers, structured metadata, a public API, standardized export formats, and documented access policies. All five components, functioning together.

2. How does EOSC relate to FAIR compliance?

EOSC is the federated European infrastructure for open science. FAIR compliance is what allows a platform to participate in it. Without it, data cannot be harvested by OpenAIRE or satisfy the data management requirements of Horizon Europe funding. The connection between the two is increasingly direct.plication behavior, reducing reliance on static regression suites.

3. Can legacy research systems be modernized without data loss?

Yes. The approach is to preserve the original database layer and build modular API and CMS layers around it, rather than replacing the data infrastructure entirely. In the INFRA-ART project, all 2,000+ spectral records were preserved through a complete architectural modernization.

4. Why is a public API a requirement rather than an enhancement?

Because interoperability without an API is theoretical. External systems need a stable, documented interface to retrieve data programmatically. Without one, there is no EOSC integration, no OpenAIRE indexing, and no machine-to-machine access, regardless of the quality of the underlying data.

5. How is FAIR compliance measured objectively?

Through automated tools like F-UJI, which assess platforms against defined FAIR criteria and produce a quantified score. The score can be tracked over time, reported to funders, and used to identify specific improvement areas. It replaces subjective self-assessment with an externally verifiable result.

6. What hosting requirements apply to EU-funded research projects?

EU data sovereignty requirements mean research data must reside on EU-based infrastructure and be handled in accordance with GDPR. Hosting with providers like Hetzner in Germany satisfies both conditions and ensures the platform remains eligible for funding programmes that mandate sovereign, open data infrastructure.