SciTransfer
CINECA · Project

Cross-Border Health Data Search Engine for Genomic and Patient Cohort Research

healthTestedTRL 5

Imagine you need medical data on a rare disease, but the patient records are scattered across hospitals in 12 different countries — each with its own rules, formats, and privacy laws. CINECA built a system that lets researchers search across all these databases at once without ever moving the sensitive data. Think of it like a Google for health cohorts: you type in what you're looking for, and it finds matching patients across Europe, Canada, and Africa — covering 1.4 million individuals — while keeping everything private and legally compliant.

By the numbers
1.4M
individuals in virtual cohort across three continents
11
diverse cohorts federated into a single searchable system
19
consortium partners across the project
12
countries represented in the federation
33
total deliverables produced
The business problem

What needed solving

Health and genomic data is trapped in national silos — hospitals, biobanks, and research cohorts across different countries cannot easily share or cross-query patient data due to incompatible systems, different data formats, and conflicting privacy regulations. This makes clinical trial recruitment slow, rare disease research fragmented, and precision medicine development unnecessarily expensive.

The solution

What was built

CINECA built a federated discovery and analytics platform that searches across 1.4 million individuals in 11 cohorts across 12 countries without moving sensitive data. Concrete outputs include: an open source GDPR-compliant diagnostic NGS service, a multi-dimensional cohort discovery portal, federated biomarker discovery tools, and demonstrated authentication interoperability between European and Canadian systems.

Audience

Who needs this

Pharmaceutical companies running multi-country clinical trials needing faster patient recruitmentGenomic diagnostic companies expanding GDPR-compliant NGS services across EuropeBiobank operators wanting to make their data discoverable to international researchersHealth data platform providers building cross-border interoperability solutionsPrecision medicine startups needing access to large, diverse patient cohorts
Business applications

Who can put this to work

Pharmaceutical & Biotech
enterprise
Target: Pharma companies running clinical trials or developing precision medicine therapies

If you are a pharma company struggling to recruit enough patients for clinical trials across multiple countries — CINECA developed a federated discovery platform that can search across 1.4 million individuals in 11 cohorts spanning Europe, Canada, and Africa. Instead of negotiating data-sharing agreements one by one, you query a single platform that finds matching patient profiles in real time while the data stays in its original location.

Genomic Diagnostics
mid-size
Target: Diagnostic labs and NGS service providers expanding into clinical genomics

If you are a diagnostic company looking to offer GDPR-compliant next-generation sequencing services across European markets — CINECA built a proof-of-concept open source diagnostic NGS service that is both GDPR and FAIR data compliant. This means you can process genomic data across borders without building your own compliance infrastructure from scratch, using federated workflows already tested across 12 countries.

Health Data Infrastructure
any
Target: Biobank operators and health data platforms needing interoperability solutions

If you are a biobank or health data platform that needs to make your datasets discoverable and queryable by international researchers — CINECA developed federated data analytics specifically for European biobanks, plus a discovery service catalog that searches datasets by genomic information and clinical metadata. The system demonstrated interoperability between European (ELIXIR) and Canadian (CanDIG) authentication systems, proving cross-continental federation works.

Frequently asked

Quick answers

What would it cost to adopt or integrate CINECA's federated search infrastructure?

The project's EU contribution amount is not available in the dataset, so exact development costs cannot be stated. However, CINECA built open source tools (including the diagnostic NGS service), which means the software itself is free to use. Integration costs would depend on your existing data infrastructure and compliance needs.

Can this work at industrial scale with millions of patient records?

Yes — CINECA assembled a virtual cohort of 1.4 million individuals from 11 diverse cohorts across 12 countries. The federated discovery platform was designed to perform real-time search across disparate cohort infrastructures in Canada, Europe, and Africa, which demonstrates population-scale capability.

What is the IP and licensing situation?

CINECA was a publicly funded Research and Innovation Action (RIA) with 19 partners. The diagnostic NGS service was explicitly developed as open source. Specific licensing terms for other components would need to be confirmed with the coordinator, the European Molecular Biology Laboratory (EMBL) in Germany.

How does this handle GDPR and cross-border data regulations?

CINECA specifically addressed this by developing a federated approach where data never leaves its home jurisdiction. They built GDPR and FAIR compliant diagnostic services and created an ELSI (ethical, legal, social) framework supporting data exchange across legal jurisdictions. The AAI interoperability demonstration proved authentication works between European and Canadian systems.

What was actually delivered and tested?

The project produced 33 deliverables including: a federated discovery service catalog, a GDPR-compliant diagnostic NGS proof of concept, federated data analytics for biobanks, a biomarker discovery service, query expansion services, and a public demonstration of Europe-Canada authentication interoperability. These were tested across 11 cohorts spanning 12 countries.

How does this integrate with existing health IT systems?

CINECA was built on established standards and infrastructures including the Global Alliance for Genomic Health, BBMRI, ELIXIR, and EOSC. The system uses ELIXIR AAI for authentication and demonstrated interoperability with Canada's CanDIG platform. This standards-based approach means it can connect to systems already using these widely adopted health data standards.

Is there ongoing support or a community behind this?

The project ran from 2019 to 2023 with 19 partners across 12 countries, anchored by EMBL — one of Europe's leading life science research organizations. The project website (cineca-project.eu) and connection to ELIXIR infrastructure suggest continued community support, though active development status should be confirmed with the coordinator.

Consortium

Who built it

The CINECA consortium of 19 partners across 12 countries is heavily research-oriented: 9 universities and 7 research organizations form the core, with only 2 industry partners (11% industry ratio). The coordinator is EMBL, a world-class molecular biology lab based in Germany — strong scientific credibility but not a commercial entity. The geographic spread across Europe, Canada, and South Africa is unusually broad for an H2020 project, which validates the cross-border federation claims. The low industry participation means commercialization will likely require new partnerships with health IT companies, biotech firms, or diagnostic service providers to bring these tools to market.

How to reach the team

European Molecular Biology Laboratory (EMBL), Germany — reach out to the CINECA project office via the project website

Next steps

Talk to the team behind this work.

Want to connect with the CINECA team for federated health data solutions? SciTransfer can arrange an introduction and help you evaluate fit for your use case.

More in Health & Biomedical
See all Health & Biomedical projects