More than half of publicly funded research data in Europe remains difficult to access outside the institutions that created it. In many cases the issue is not the quality of the data, but the architecture of the platforms that store it.
INFRA-ART contained over 2,000 spectral records derived from paintings, archaeological artifacts, and cultural heritage objects.
The data was valuable and well maintained, but the platform had been designed through a web application, with no interoperability with external research infrastructures.
The Data Was Good. The Architecture Was Not Designed to Share It.
Most research platforms are not built to fail. They are built to solve a specific problem, at a specific point in time. The data inside them is often excellent but the architecture around them, less.
INFRA-ART worked well enough: a Django backend, a solid PostgreSQL database, thousands of spectral records collected from paintings, archaeological artifacts, and cultural heritage objects.
The European Commission has measured this pattern: over 60% of publicly funded research data in the EU stays locked inside the team that created it. EOSC alignment and FAIR compliance are now conditions for qualifying for the next round of European funding, not recommendations.
What FAIR Compliance Actually Requires
The FAIR acronym: Findable, Accessible, Interoperable, Reusable, appears frequently in policy documents. It appears far less frequently in engineering specifications, which is part of why so many platforms claim compliance without achieving it.
In practical terms, FAIR compliance requires five things to be true simultaneously:
- Persistent identifiers assigned to every dataset, so records can be cited reliably across systems
- Structured metadata schemas aligned with EOSC or Dublin Core standards
- A public API with consistent, documented responses
- Standardised export formats: JSON, CSV, RDF
- Access policies that are documented and findable by external parties
Each of these is a concrete engineering requirement, not a policy aspiration.
A platform that satisfies four out of five is not FAIR compliant. It is close, which is a different thing entirely.
| FAIR Principle | What It Requires | How We Implemented It |
| Findable | Persistent identifiers, rich metadata, indexed by search engines | PIDs assigned, JSON-LD metadata, EOSC-indexed |
| Accessible | Standardized protocols, documented access policies | Versioned REST API, documented endpoints, public access policy |
| Interoperable | Shared knowledge representation formats | JSON/CSV/RDF export, OpenAIRE-compatible schema |
| Reusable | Usage licenses and provenance documentation | Audit trail, GDPR-aligned, CC licensing |
The Architecture: Separating What Should Not Be Mixed
When we reviewed the existing system, the data was not the problem. The data was the asset. The problem, rather, was that the architecture only supported access through the web application. No other systems could integrate with it programmatically.
No public API. Metadata in formats that external harvesters could not read. Infrastructure tied to institutional servers with an uncertain operational future. The data was there. The means to reach it were not.
Consequently, our starting point was to treat interoperability as a design constraint, not a post-launch improvement. That decision shapes everything that follows. It means the first weeks of a project move more deliberately, and the last weeks do not involve emergency retrofitting.
Separating the Data Layer from the Content Layer
Research records and editorial content have different requirements. A spectral database record needs to be fast, consistent, and machine-readable. A page describing the project, on the other hand, needs to be editable by a non-technical administrator without involving a developer.
The solution was to route them separately. Scientific data flows through direct PostgreSQL queries, bypassing the CMS entirely for performance-critical operations. Editorial content lives in Strapi, giving administrators independent control over pages and updates. Each layer has a clear responsibility and teams can maintain or update it without affecting the other.
Choosing Infrastructure That Meets Sovereignty Requirements
Beyond the application architecture, the hosting environment also required a deliberate decision. EU data sovereignty requirements mean research data must reside on EU-based infrastructure and comply with GDPR.
| Layer | Technology | Responsibility |
| Scientific Data | PostgreSQL (direct queries) | High-performance spectral search, data integrity |
| Content Management | Strapi (Headless CMS) | Editorial content, non-technical admin access |
| API Layer | REST API | Machine-to-machine integration, EOSC harvesting |
| Frontend | Modern web stack, Next.js, Material UI | User-facing interface, accessibility |
| Infrastructure | EU-hosted (Hetzner, Germany) | GDPR compliance, data sovereignty |
The Public API Is Not Optional
A point worth stating directly: without a public API, FAIR compliance remains an intention, not a capability.
External repositories cannot retrieve your data. Harvesting tools cannot index it. Partner institutions cannot integrate with it programmatically. Consequently, the quality of the underlying data is irrelevant if no stable, documented interface exists to access it.
We built the INFRA-ART API around consistent JSON responses across all endpoints, versioned documentation accessible to external developers, and rate limiting that protects against abuse without restricting legitimate access. Scientific data and editorial content serve different audiences with different performance requirements, so we routed them through separate API paths accordingly.
Results
| Outcome | Detail |
| Independence from institutional infrastructure | EU-hosted, no operational dependency on institutional servers |
| Modular, maintainable architecture | Clean separation between data, API, CMS, and frontend layers |
| Public REST API | Documented, versioned, ready for EOSC harvesting |
| Non-technical administration | Content managed independently through Strapi |
| Zero data loss | All 2,000+ spectral records preserved through migration |
| 92% FAIR compliance score | Verified by F-UJI, one category at Advanced level |
In Summary
Research platforms do not usually fail because of bad data. They fail because the architecture was built for a different purpose and never updated to reflect what the data became worth.
INFRA-ART had that data. What it needed was an architecture capable of making it available to the people and systems that could use it.
FAQ's
A system where scientific datasets can be found, accessed, and reused by external systems, not just the team that created them. It requires persistent identifiers, structured metadata, a public API, standardized export formats, and documented access policies. All five components, functioning together.
EOSC is the federated European infrastructure for open science. FAIR compliance is what allows a platform to participate in it. Without it, data cannot be harvested by OpenAIRE or satisfy the data management requirements of Horizon Europe funding. The connection between the two is increasingly direct.plication behavior, reducing reliance on static regression suites.
Yes. The approach is to preserve the original database layer and build modular API and CMS layers around it, rather than replacing the data infrastructure entirely. In the INFRA-ART project, all 2,000+ spectral records were preserved through a complete architectural modernization.
Because interoperability without an API is theoretical. External systems need a stable, documented interface to retrieve data programmatically. Without one, there is no EOSC integration, no OpenAIRE indexing, and no machine-to-machine access, regardless of the quality of the underlying data.
Through automated tools like F-UJI, which assess platforms against defined FAIR criteria and produce a quantified score. The score can be tracked over time, reported to funders, and used to identify specific improvement areas. It replaces subjective self-assessment with an externally verifiable result.
EU data sovereignty requirements mean research data must reside on EU-based infrastructure and be handled in accordance with GDPR. Hosting with providers like Hetzner in Germany satisfies both conditions and ensures the platform remains eligible for funding programmes that mandate sovereign, open data infrastructure.