SciTransfer
BiCIKL · Project

One-Stop Search Engine Linking All Biodiversity Data from Specimens to Published Research

environmentTestedTRL 6Thin data (2/5)

Imagine you need to look up everything known about a particular species — its DNA, where specimens are stored, what scientists have published about it, and what names it goes by. Right now, all that information is scattered across dozens of separate databases that don't talk to each other. BiCIKL built the plumbing to connect all these databases into one searchable knowledge graph, so you can start with a specimen and follow the trail all the way to published research and back. Think of it as Google for biodiversity data, where every search result links to the original evidence.

By the numbers
15
research infrastructure partners integrated across consortium
10
countries represented in the consortium
39
total deliverables produced
4
SMEs involved in the consortium
8
demo deliverables including APIs, knowledge hub, and training
The business problem

What needed solving

Companies in environmental consulting, pharma, and agriculture waste significant time manually searching fragmented biodiversity databases to compile species records, trace genetic data, or find relevant scientific literature. There is no single entry point to follow the evidence chain from a physical specimen to its DNA sequence, species classification, and published research. This fragmentation slows down environmental impact assessments, drug discovery screening, and crop diversity research.

The solution

What was built

BiCIKL built a connected biodiversity data infrastructure: three RESTful APIs (factoid question-answering, passage retrieval, and search/link association with confidence scoring), a browser-based workbench with HTML5/JavaScript interfaces, a Biodiversity Knowledge Hub, a MOOC training programme, and the architecture for a pan-European persistent identifier system for digital specimens.

Audience

Who needs this

Environmental impact assessment consultancies needing fast species data lookupsBiotech and pharma companies screening natural products for drug candidatesAgricultural genetics firms tracking wild crop relatives across gene banksScientific publishers and data platforms wanting to link biodiversity datasetsGovernment biodiversity monitoring agencies compiling species inventories
Business applications

Who can put this to work

Environmental Consulting
any
Target: Environmental impact assessment firms

If you are an environmental consulting firm spending days manually searching scattered databases to compile species records for impact assessments — this project developed a Biodiversity Knowledge Hub and RESTful APIs that let you query linked specimen, sequence, and publication data in one place. Instead of cross-referencing 10 separate portals, your analysts can retrieve connected evidence chains across 15 partner institutions' data holdings. The factoid question-answering API lets staff ask plain-language questions and get sourced answers from the scientific literature.

Pharmaceutical & Biotechnology
mid-size
Target: Natural product drug discovery companies

If you are a biotech company screening natural organisms for drug candidates and struggling to trace specimen records back to genetic sequences and published bioactivity data — this project built search and link association APIs that return ranked lists of connected data objects with confidence scores. Your discovery team can input a specimen accession number and instantly see linked sequences, species identifications, and relevant publications. The passage retrieval API lets researchers find relevant literature passages using keyword searches across the biodiversity corpus.

Agricultural Genetics & Seed Technology
mid-size
Target: Seed companies and crop biodiversity researchers

If you are a seed company or agricultural genetics firm that needs to track wild relatives of crop species across museum collections, gene banks, and published literature — this project created tools for seamless linking along the chain from specimens to sequences to species to publications. The workbench provides browser-based interfaces for navigating these connections, while the pan-European PID system for Digital Specimens ensures each biological sample has a permanent, traceable identifier across 10 countries' collections.

Frequently asked

Quick answers

What would it cost to access these biodiversity data tools?

BiCIKL was built as open science infrastructure with open access principles. The APIs (factoid Q&A, passage retrieval, search and link association) are RESTful services designed for public use. Based on available project data, access appears to be free, though enterprise-level service agreements or high-volume usage terms would need to be discussed with the coordinator.

Can these tools handle the volume of data my organization works with?

The project integrated data across 15 partner institutions in 10 countries, covering specimens, sequences, taxonomic names, literature, and figures. The system was designed for pan-European scale with 39 deliverables covering infrastructure from data extraction to knowledge graph construction. For specific throughput limits on the APIs, you would need to consult the technical documentation on the project website.

Who owns the intellectual property and can I license these tools?

The coordinator is Pensoft Publishers, a Bulgarian SME. As an EU-funded Research and Innovation Action, results are typically subject to open access requirements. The 4 SMEs in the consortium may hold IP on specific commercial implementations. Licensing terms for the APIs and workbench tools should be discussed directly with Pensoft.

How mature are these tools — are they production-ready?

The project delivered working RESTful APIs for factoid question-answering, passage retrieval, and search/link association services, plus a browser-based workbench built in HTML5/JavaScript. A MOOC training programme was also delivered, suggesting the tools reached a usable state. The project closed in April 2024, so current operational status should be verified.

Can these tools integrate with our existing data systems?

The APIs are RESTful, meaning they follow standard web service protocols and can integrate with most modern software. The workbench uses HTML5/JavaScript for browser-based access. The project also designed a pan-European PID (Persistent Identifier) system for Digital Specimens, which supports data interoperability standards. Integration with proprietary internal databases would likely require custom development.

Is there regulatory or compliance value in using these tools?

For companies conducting Environmental Impact Assessments or biodiversity-related regulatory reporting, having traceable evidence chains from specimens through sequences to published literature strengthens compliance documentation. The linked open data approach and reproducibility focus align with emerging biodiversity reporting requirements under EU regulations. Based on available project data, specific regulatory certifications are not mentioned.

What ongoing support is available after the project ended?

The project website (bicikl-project.eu) remains active and Pensoft Publishers, as coordinator, continues to operate as a going concern in scientific publishing. The MOOC training programme provides self-service learning. For dedicated technical support or custom implementations, direct engagement with the consortium partners would be needed.

Consortium

Who built it

The BiCIKL consortium brings together 15 partners from 10 countries, led by Pensoft Publishers — a Bulgarian SME specializing in scientific publishing. The consortium is research-heavy: 6 research organizations and 3 universities make up the core, with only 2 industry partners (13% industry ratio). The 4 SMEs in the group suggest some commercial orientation, but this is primarily a research infrastructure project. For a business looking to adopt these tools, the low industry ratio means the solutions were designed mainly for researchers, though the SME-led coordination by a publishing company adds practical, market-aware leadership to the technical outputs.

How to reach the team

Pensoft Publishers (Bulgaria) — contact via project website or SciTransfer can facilitate introduction

Next steps

Talk to the team behind this work.

Want to explore how BiCIKL's biodiversity data tools could streamline your species research or environmental compliance work? SciTransfer can arrange a direct introduction to the project team and help evaluate fit for your specific use case.

More in Environment & Climate
See all Environment & Climate projects