At its heart it is a data catalog and business glossary that documents the data assets of an organisation. Data can be classified and profiled, and business ownership can be assigned to the various data assets. Policies can be defined and enforced, so for example, workflow can be defined to check the authorisation of changes to a particular business hierarchy. Collibra has visual tools to assist users in navigating their data landscape, including a comprehensive data lineage solution that tracks the flow of data and can be used for impact analysis. This is the company’s own intellectual property, so they no longer rely, as they once did, on a partnership with a third party data lineage specialist. There is also a “knowledge graph” to help visualise the technical and business connections between data.
There is a search capability to allow data assets to be discovered. Collibra has a reputation for having an interface that is digestible by business users, and this will go further in the future versions with a comprehensive set of changes, including the ability to tailor the interface to specific data assets as well as different business roles. Collibra extends its capability to cover data access and security and overlaps to a degree with special security governance products.
The product uses artificial intelligence in several ways, some of which will be more visible in the upcoming releases, for example with the generation of content for data asset descriptions. This is already used in suggesting possible data duplication. The vendor (correctly) believes that although generative AI can be very productive, it still needs human review for generated content, as AI content can sometimes be overly imaginative (the so-called “hallucination” problem inherent in large language models),
Collibra has extensive partnerships, so for example, Collibra can ingest metadata from platforms such as Google Cloud, AWS, Microsoft Azure and SAP. Interestingly, there is what seems to be a quite productive partnership with SAP despite that vendor notionally having its own data governance offering. Additional bridges exist to products like Databricks and Snowflake, with over a hundred connectors provided. Collibra is planning integration to productivity tools like Slack and Teams also, enabling for example alerts in Collibra to be visible in Slack. Collibra has a robust data quality application which can be used standalone or as part of the overall data intelligence platform, and they continue to further integrate these experiences. For example, data quality scores will be made visible within the data catalog when a data asset is being viewed.