SciTransfer
Organization

ONTOTEXT AD

Bulgarian semantic technology SME providing knowledge graphs, NLP, and intelligent data integration across multilingual European research projects.

Technology SMEdigitalBGSMENo active H2020 projects
H2020 projects
9
As coordinator
0
Total EC funding
€1.2M
Unique partners
94
What they do

Their core work

Ontotext is a Bulgarian software company specializing in semantic technology, knowledge graphs, and intelligent data integration. They build graph databases and text analytics platforms that help organizations link, enrich, and query large volumes of unstructured and structured data. Across their H2020 portfolio, they have provided semantic search, natural language processing, and linked data capabilities to projects ranging from medical text analysis (KConnect) to business data integration (euBusinessGraph) and cultural heritage (EHRI). Their core value lies in turning messy, multilingual data into machine-readable, interconnected knowledge.

Core expertise

What they specialise in

Knowledge graphs and semantic data integrationprimary
5 projects

Central to euBusinessGraph, proDataMarket, EXA MODE, Cleopatra, and EHRI — all requiring linked data and ontology-based integration.

Multilingual text analytics and NLPprimary
3 projects

KConnect focused on multilingual medical text search; Cleopatra on cross-lingual event analytics; EXA MODE on multimodal ontology discovery.

Data marketplace and business data servicessecondary
2 projects

proDataMarket (property data marketplace) and euBusinessGraph (European business graph for data products).

Medical and biomedical data processingsecondary
2 projects

KConnect addressed medical text analysis and search; EXA MODE tackled medical image analysis via ontology-driven methods.

Disinformation detection and content verificationemerging
2 projects

WeVerify focused on content verification; COMPACT addressed social media and content convergence research.

Domain-specific big data analyticssecondary
1 project

BigDataGrapes applied big data techniques to grapevine and wine industries, showing cross-domain adaptability.

Evolution & trajectory

How they've shifted over time

Early focus
Data integration and semantic search
Recent focus
Multimodal knowledge discovery

In the early period (2015–2017), Ontotext focused on data marketplaces, medical text search, and research infrastructure — essentially applying their semantic technology to structured data products and domain-specific search. From 2018 onward, their work shifted toward multimodal analytics, knowledge graph construction, cross-lingual event processing, and medical image analysis via ontologies. The trajectory shows a clear move from text-centric data integration toward richer, multimodal knowledge discovery combining text, images, and multilingual content.

Ontotext is expanding from pure text/data semantics into multimodal AI and cross-lingual analytics, positioning themselves for projects that need to connect diverse data types — text, images, and structured records — through knowledge graphs.

Collaboration profile

How they like to work

Role: specialist_contributorReach: European28 countries collaborated

Ontotext operates exclusively as a participant or partner — they have never coordinated an H2020 project, which is typical for a technology SME that provides specialized components rather than leading research agendas. With 94 unique consortium partners across 28 countries, they are remarkably well-networked and clearly comfortable integrating into diverse consortia. Their role pattern suggests they are a trusted technology provider that teams recruit when they need semantic infrastructure, rather than a group that drives the research vision.

Ontotext has collaborated with 94 unique partners across 28 countries, giving them one of the broadest networks for a Bulgarian SME. Their partnerships span Western European universities, research institutes, and technology companies, with no obvious geographic concentration beyond pan-European coverage.

Why partner with them

What sets them apart

Ontotext is one of very few European SMEs that combines a commercial graph database product with deep NLP and semantic web expertise — they are not just researchers but technology vendors whose tools get deployed in production. Their Bulgarian base gives them a cost advantage while their 28-country network proves they operate at a fully European level. For consortium builders, they offer a rare combination: a proven SME partner that brings both reusable technology infrastructure and research capability in knowledge graphs and text analytics.

Notable projects

Highlights from their portfolio

  • EHRI
    Largest funding (EUR 342,419) and longest duration — a major research infrastructure project for Holocaust studies, showing Ontotext's ability to handle sensitive cultural heritage data at scale.
  • euBusinessGraph
    Directly aligned with their commercial knowledge graph expertise — building the European Business Graph as a data product, bridging research and market application.
  • EXA MODE
    Represents their expansion into medical image analysis and multimodal ontologies, signaling a new direction beyond text-only semantics.
Cross-sector capabilities
Health and biomedical data (medical text analytics, digital pathology)Cultural heritage and humanities (research infrastructure, multilingual archives)Agriculture and food industry (big data analytics for viticulture)Media and journalism (content verification, disinformation detection)
Analysis note: Ontotext is a well-established company in the semantic technology space with a recognizable commercial product (GraphDB). The H2020 data confirms their role as a specialist technology contributor. Early-period keywords were empty in the dataset, so evolution analysis relies on project titles/topics for 2015-2017 vs. explicit keywords available for 2019-2023 projects. Confidence is 4 rather than 5 because they never coordinated a project, limiting insight into their independent research agenda.