More than half of publicly funded research data in Europe remains difficult to access outside the institutions that created it. In many cases the issue is not the quality of the data, but the architecture of the platforms that store it.
INFRA-ART contained over 2,000 spectral records derived from paintings, archaeological artifacts, and cultural heritage objects. 
The data was valuable and well maintained, but the platform had been designed through a web application, with no interoperability with external research infrastructures.

The Data Was Good. The Architecture Was Not Designed to Share It.

Most research platforms are not built to fail. They are built to solve a specific problem, at a specific point in time. The data inside them is often excellent but the architecture around them, less.

INFRA-ART worked well enough: a Django backend, a solid PostgreSQL database, thousands of spectral records collected from paintings, archaeological artifacts, and cultural heritage objects.

The European Commission has measured this pattern: over 60% of publicly funded research data in the EU stays locked inside the team that created it. EOSC alignment and FAIR compliance are now conditions for qualifying for the next round of European funding, not recommendations.

What FAIR Compliance Actually Requires

The FAIR acronym: Findable, Accessible, Interoperable, Reusable, appears frequently in policy documents. It appears far less frequently in engineering specifications, which is part of why so many platforms claim compliance without achieving it.

In practical terms, FAIR compliance requires five things to be true simultaneously:

Persistent identifiers assigned to every dataset, so records can be cited reliably across systems
Structured metadata schemas aligned with EOSC or Dublin Core standards
A public API with consistent, documented responses
Standardised export formats: JSON, CSV, RDF
Access policies that are documented and findable by external parties

Each of these is a concrete engineering requirement, not a policy aspiration.

A platform that satisfies four out of five is not FAIR compliant. It is close, which is a different thing entirely.

FAIR Principle	What It Requires	How We Implemented It
Findable	Persistent identifiers, rich metadata, indexed by search engines	PIDs assigned, JSON-LD metadata, EOSC-indexed
Accessible	Standardized protocols, documented access policies	Versioned REST API, documented endpoints, public access policy
Interoperable	Shared knowledge representation formats	JSON/CSV/RDF export, OpenAIRE-compatible schema
Reusable	Usage licenses and provenance documentation	Audit trail, GDPR-aligned, CC licensing

The Architecture: Separating What Should Not Be Mixed

When we reviewed the existing system, the data was not the problem. The data was the asset. The problem, rather, was that the architecture only supported access through the web application. No other systems could integrate with it programmatically.

No public API. Metadata in formats that external harvesters could not read. Infrastructure tied to institutional servers with an uncertain operational future. The data was there. The means to reach it were not.

Consequently, our starting point was to treat interoperability as a design constraint, not a post-launch improvement. That decision shapes everything that follows. It means the first weeks of a project move more deliberately, and the last weeks do not involve emergency retrofitting.

Separating the Data Layer from the Content Layer

Research records and editorial content have different requirements. A spectral database record needs to be fast, consistent, and machine-readable. A page describing the project, on the other hand, needs to be editable by a non-technical administrator without involving a developer.

The solution was to route them separately. Scientific data flows through direct PostgreSQL queries, bypassing the CMS entirely for performance-critical operations. Editorial content lives in Strapi, giving administrators independent control over pages and updates. Each layer has a clear responsibility and teams can maintain or update it without affecting the other.

Choosing Infrastructure That Meets Sovereignty Requirements

Beyond the application architecture, the hosting environment also required a deliberate decision. EU data sovereignty requirements mean research data must reside on EU-based infrastructure and comply with GDPR.

Layer	Technology	Responsibility
Scientific Data	PostgreSQL (direct queries)	High-performance spectral search, data integrity
Content Management	Strapi (Headless CMS)	Editorial content, non-technical admin access
API Layer	REST API	Machine-to-machine integration, EOSC harvesting
Frontend	Modern web stack, Next.js, Material UI	User-facing interface, accessibility
Infrastructure	EU-hosted (Hetzner, Germany)	GDPR compliance, data sovereignty

The Public API Is Not Optional

A point worth stating directly: without a public API, FAIR compliance remains an intention, not a capability.

External repositories cannot retrieve your data. Harvesting tools cannot index it. Partner institutions cannot integrate with it programmatically. Consequently, the quality of the underlying data is irrelevant if no stable, documented interface exists to access it.

We built the INFRA-ART API around consistent JSON responses across all endpoints, versioned documentation accessible to external developers, and rate limiting that protects against abuse without restricting legitimate access. Scientific data and editorial content serve different audiences with different performance requirements, so we routed them through separate API paths accordingly.

Results

Outcome	Detail
Independence from institutional infrastructure	EU-hosted, no operational dependency on institutional servers
Modular, maintainable architecture	Clean separation between data, API, CMS, and frontend layers
Public REST API	Documented, versioned, ready for EOSC harvesting
Non-technical administration	Content managed independently through Strapi
Zero data loss	All 2,000+ spectral records preserved through migration
92% FAIR compliance score	Verified by F-UJI, one category at Advanced level

In Summary

Research platforms do not usually fail because of bad data. They fail because the architecture was built for a different purpose and never updated to reflect what the data became worth.

INFRA-ART had that data. What it needed was an architecture capable of making it available to the people and systems that could use it.

FAQ's

1. What is a FAIR compliant research data platform?

A system where scientific datasets can be found, accessed, and reused by external systems, not just the team that created them. It requires persistent identifiers, structured metadata, a public API, standardized export formats, and documented access policies. All five components, functioning together.

2. How does EOSC relate to FAIR compliance?

EOSC is the federated European infrastructure for open science. FAIR compliance is what allows a platform to participate in it. Without it, data cannot be harvested by OpenAIRE or satisfy the data management requirements of Horizon Europe funding. The connection between the two is increasingly direct.plication behavior, reducing reliance on static regression suites.

3. Can legacy research systems be modernized without data loss?

Yes. The approach is to preserve the original database layer and build modular API and CMS layers around it, rather than replacing the data infrastructure entirely. In the INFRA-ART project, all 2,000+ spectral records were preserved through a complete architectural modernization.

4. Why is a public API a requirement rather than an enhancement?

Because interoperability without an API is theoretical. External systems need a stable, documented interface to retrieve data programmatically. Without one, there is no EOSC integration, no OpenAIRE indexing, and no machine-to-machine access, regardless of the quality of the underlying data.

5. How is FAIR compliance measured objectively?

Through automated tools like F-UJI, which assess platforms against defined FAIR criteria and produce a quantified score. The score can be tracked over time, reported to funders, and used to identify specific improvement areas. It replaces subjective self-assessment with an externally verifiable result.

6. What hosting requirements apply to EU-funded research projects?

EU data sovereignty requirements mean research data must reside on EU-based infrastructure and be handled in accordance with GDPR. Hosting with providers like Hetzner in Germany satisfies both conditions and ensures the platform remains eligible for funding programmes that mandate sovereign, open data infrastructure.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Blog

Modernising INFRA-ART: From Legacy Database to FAIR-Compliant Open Research Platform

The Data Was Good. The Architecture Was Not Designed to Share It.

What FAIR Compliance Actually Requires

The Architecture: Separating What Should Not Be Mixed

Separating the Data Layer from the Content Layer

Choosing Infrastructure That Meets Sovereignty Requirements

The Public API Is Not Optional

Results

In Summary

FAQ's

Share This!

Related Posts

A Love Letter to Business Analysis: Evozon at the IIBA Romania Event in Cluj

Cum alegi un partener Microsoft Dynamics 365 în 2026

How AI Is Redefining Software Testing Practices in 2026

TAGS