Collibra Data Quality & Data Observability
Collibra Data Quality proactively surfaces data quality issues in real time, making reliable and accurate data readily available to drive informed decisions. It enables you to leverage machine learning to generate explainable and autonomous data quality rules, reducing manual rule writing and errors and increasing trust in the data.
Features
- Predictive self service data quality in real time
- Automated data quality using machine learning for rule generation
- Reduce manual rule writing and errors to increase trust
- Scan large complex databases, files, streaming data and analyse results
- Continuously monitor and detect data quality issues
- Automatically uncover data drift, outliers, patterns and schema changes
- Leverage a unified scoring system reporting across all data sources
- Personal alerts to proactively detect, escalate and remediate quality issues
- Row, column and conformity value checks between source and target
- Automatically understand semantic schema so sensitive data can be masked
Benefits
- Better data quality means better decision making
- Manage data issues with a business friendly scorecard and dashboards
- Reduce complexity and drive better insights with auto-discovered rules
- Build high quality data pipelines
- Automate quality checks at every point in the data journey
- Manage risk and improve regulatory compliance
- Ensure data is complete, timely, accurate and valid
- Validate data integrity between source and target data systems
- Reduce risk and cost of migrating data
- Improve trust in the data through ensuring the quality improves
Pricing
£122,919 a licence a year
Service documents
Request an accessible format
Framework
G-Cloud 14
Service ID
1 3 6 6 2 9 3 9 2 5 0 4 8 7 0
Contact
Collibra UK Ltd
Rosalind Elmes
Telephone: 07770638107
Email: rosalind.elmes@collibra.com
Service scope
- Software add-on or extension
- Yes, but can also be used as a standalone service
- What software services is the service an extension to
- Collibra Data Quality can be used as an extension to Collibra's data governance and catalogue service. This allows the quality of the data to be reflected automatically within the core platform. Working together the two solutions ensure data is not only easy to find and understand, but also trustworthy.
- Cloud deployment model
-
- Public cloud
- Private cloud
- Hybrid cloud
- Service constraints
- Support hours for the standard support offering are Monday through Friday, 9:00 a.m. to 6:00 p.m. excluding public holidays. Severity 1 issues (production outages) are covered 24/7/365 for all customers. An optional premium support service is also available at an additional cost, offering 24/5 support.
- System requirements
-
- Software requirements
- Helm, kubectl
- Cloud CLI, Docker
- RHEL8.0
- Postgresql version 11+
- Collibra DQ
- Apache Spark
- Java IDE
User support
- Email or online ticketing support
- Email or online ticketing
- Support response times
-
Basic support is for the resolution of product defects/incidents and reporting feature enhancement requests. This is commonly referred as traditional ‘break-fix' support.
Collibra commits to maximum response and resolution times based on the criticality of the incident.
Severity 1: Target response time is 2 business hours.
Severity 2: Target response time is 4 business hours.
Severity 3: Target response time is 1 business day.
Standard support hours for North America and Europe are Monday through Friday, 9:00 a.m. – 6:00 p.m., excluding public holidays. Severity 1 issues (production outages) are covered 24/7/365 for all customers. - User can manage status and priority of support tickets
- Yes
- Online ticketing support accessibility
- WCAG 2.1 AA or EN 301 549
- Phone support
- No
- Web chat support
- No
- Onsite support
- No
- Support levels
-
Standard support is provided within the cost of the annual subscription. There is also a premium support option at an additional cost of £75,000 pa. This service includes 24/5 support, a dedicated support consultant, and regular meetings to provide advice.
In addition, further resources are available free of charge:
> The Collibra Community knowledge base and product Q&A forums are a great resource for self-service help from peers and experts.
> Collibra University provides an excellent range of courses covering all aspects of Collibra’s solution from getting started through to advanced.
> At the start of the contract, Collibra provides a series of onboarding sessions to get new customers started.
> An assigned Customer Success Manager will work with the customer to determine the best route forward, ensure all questions are answered, and discuss best practices to encourage adoption. - Support available to third parties
- Yes
Onboarding and offboarding
- Getting started
-
Collibra will appoint an onboarding manager to guide the customers through a series of sessions to get them set up. The Customer Onboarding Program includes:
> Guidance to build your success plan
> Initial high-level enablement sessions and training recommendations
> Access to Collibra experts to validate technical prerequisites
> Coordination with your implementation partner or Collibra Services
> Introduction to all our Customer Success resources
In addition, we offer:
> Collibra University: A free, self-paced training platform that is available at university.collibra.com. It includes modules that cover all aspects of Collibra product functionality including application configuration and customization.
> Community: The online community includes a wealth of resources including a knowledge base, solution template marketplace, product documentation and a very active product question and answer forum.
> Instructor-led education classes: A Collibra instructor remotely leads and monitors students in either public courses or private courses specific to the organisation. The objective is to train a group who can lead and guide others. - Service documentation
- Yes
- Documentation formats
-
- HTML
- End-of-contract data extraction
-
Collibra Data Quality is deployed in the customer's environment, with data stored on the client's infrastructure. Upon termination of the agreement, Customer’s license to the Software will cease, and Customer must immediately cease using the Software and delete (or, upon request, return) all copies of the Software.
Customers will have access to the metastore to extract the data from the Collibra Data Quality web application using the export function. Customers can also manage offboarding by taking regular backups of the metastore as desired. - End-of-contract process
-
Collibra Data Quality is deployed in the customer's environment, with data stored on the client's infrastructure. Upon termination of the agreement, Customer’s license to the Software will cease, and Customer must immediately cease using the Software and delete (or, upon request, return) all copies of the Software.
Customers will have access to the metastore to extract the data from the Collibra Data Quality web application using the export function. Customers can also manage offboarding by taking regular backups of the metastore as desired.
Using the service
- Web browser interface
- Yes
- Supported browsers
-
- Microsoft Edge
- Firefox
- Chrome
- Safari
- Application to install
- Yes
- Compatible operating systems
-
- MacOS
- Windows
- Designed for use on mobile devices
- No
- Service interface
- Yes
- User support accessibility
- None or don’t know
- Description of service interface
-
Collibra Data Quality offers out-of-the-box dashboards, scorecards, and reporting. The information of the scorecard can be configured within the application. For example, clients can fully customize scoring thresholds. We offer several methods (sliders, precise values, high-medium-low) for scoring thresholds. Users can easily adjust the score related to the rule break or change the threshold of breaking.
If you already have rules written in Oracle, Sybase, or DB2 syntax, you can copy-paste the rule directly into the Native SQL section of Collibra Data Quality. Additionally, Collibra Data Quality supports the building of rules through point-and-click operations, SQL statements, and library inclusion. - Accessibility standards
- None or don’t know
- Description of accessibility
- Collibra Data Quality has not yet been evaluated for accessibility.
- Accessibility testing
- Collibra Data Quality has not yet been evaluated for accessibility.
- API
- Yes
- What users can and can't do using the API
- Collibra Data Quality exposes the internal API and public APIs so that all potential operations are available. These API endpoints may change over time; always refer to the product documentation link for recent updates. Users can be authenticated using username/password or JWT tokens for accessing APIs. The APIs can be used against the application in live working mode, which is recommended. Swagger documentation of API endpoints can be found on Collibra DQ documentation.
- API documentation
- Yes
- API documentation formats
-
- Open API (also known as Swagger)
- HTML
- API sandbox or test environment
- Yes
- Customisation available
- Yes
- Description of customisation
-
Rules Customization:
Many rules will be autogenerated based on our proprietary ML models. They will also learn from the data and adapt over time. Additionally, subject matter experts can override auto-generated rules and generate any rules themselves to fine-tune the DQ program to your specific business and use case.
Threshold Customization:
We offer several methods (sliders, precise values, high-medium-low) for scoring thresholds. Users can easily adjust the score related to the rule break or change the threshold of breaking.
Dashboard Customization:
Robust dashboards and reporting are available in Collibra Data Quality. You can also build your own custom reports from the metastore.
Scaling
- Independence of resources
- Collibra Data Quality is deployed on the customer's own environment, using a Spark cluster. Collibra DQ is built to scale up horizontally and can scale to hundreds of nodes.
Analytics
- Service usage metrics
- Yes
- Metrics types
- Collibra DQ includes high-level usage metrics. Usage of Collibra DQ by user and dataset is tracked and statistics are compiled within the admin Dashboard. An audit history of all this is available based on the role-based access controls and within the UI, as the metastore has audit records stored.
- Reporting types
-
- API access
- Real-time dashboards
Resellers
- Supplier type
- Not a reseller
Staff security
- Staff security clearance
- Other security clearance
- Government security clearance
- None
Asset protection
- Knowledge of data storage and processing locations
- No
- Datacentre security standards
- Supplier-defined controls
- Penetration testing frequency
- At least once a year
- Penetration testing approach
- Another external penetration testing organisation
- Protecting data at rest
- Other
- Other data at rest protection approach
- Security is of the utmost importance for Collibra Data Quality and our customers. In order to not send around plain text passwords when owlchecks are executed from the CLI users/admins can encrypt passwords and execute owlchecks using the encrypted passwords instead of plain text.
- Data sanitisation process
- Yes
- Data sanitisation type
- Deleted data can’t be directly accessed
- Equipment disposal approach
- A third-party destruction service
Data importing and exporting
- Data export approach
-
Collibra Data Quality' export function extracts data in CSV files. Exports can also be initiated through APIs as explained in Collibra DQ documentation:
The API method uses the following general steps:
1. Find your dataset,
2. Pass your table to an API call.
3. Pass the output of the previous statement into the body of an import request, with the desired environment specified. - Data export formats
-
- CSV
- Other
- Other data export formats
-
- JSON
- XML
- DELTA
- PARQUET
- AVRO
- Data import formats
-
- CSV
- Other
- Other data import formats
-
- XML
- JSON
Data-in-transit protection
- Data protection between buyer and supplier networks
- Other
- Other protection between networks
- Not applicable. Collibra Data Quality is deployed on the customer's own environment.
- Data protection within supplier network
- Other
- Other protection within supplier network
- Collibra Data Quality is currently deployed on the customer's own environment. HTTPS can be enabled for the platform.
Availability and resilience
- Guaranteed availability
- Please note that the answers above apply to other areas of Collibra's business and not to Collibra Data Quality, which is deployed on the customer's own environment. Availability will depend on the client's infrastructure, so no SLAs are offered.
- Approach to resilience
- Collibra Data Quality is deployed on the customer's own environment so this is under the control of the user
- Outage reporting
- Not applicable. Collibra Data Quality is currently deployed on the customer's own environment so this is under the control of the user
Identity and authentication
- User authentication needed
- Yes
- User authentication
-
- Identity federation with existing provider (for example Google Apps)
- Username or password
- Access restrictions in management interfaces and support channels
- Collibra Data Quality supports access management through a local user store, integration with Active Directory, or SAML authentication. The solution uses role-based security. An admin can create many ROLEs, and a user can be part of one or many roles. A ROLE maps to a Dataset within Collibra Data Quality.
- Access restriction testing frequency
- At least once a year
- Management access authentication
- Other
- Description of management access authentication
- Not applicable. Collibra Data Quality is currently deployed on customer's own environment.
Audit information for users
- Access to user activity audit information
- Users have access to real-time audit information
- How long user audit data is stored for
- User-defined
- Access to supplier activity audit information
- No audit information available
- How long system logs are stored for
- User-defined
Standards and certifications
- ISO/IEC 27001 certification
- No
- ISO 28000:2007 certification
- No
- CSA STAR certification
- No
- PCI certification
- No
- Cyber essentials
- No
- Cyber essentials plus
- No
- Other security certifications
- No
Security governance
- Named board-level person responsible for service security
- Yes
- Security governance certified
- Yes
- Security governance standards
-
- ISO/IEC 27001
- Other
- Other security governance standards
- SOC 1 / SOC 2. Please note that these certifications do not apply to Collibra Data Quality, which is currently deployed on the customer's own environment.
- Information security policies and processes
- Information security policies are created by the information security team and approved by the security board. In the security board are members from the executive committee. The policies are reviewed at least annually and adapted depending on risk and changes within the organization.
Operational security
- Configuration and change management standard
- Supplier-defined controls
- Configuration and change management approach
- Each item is tracked in multiple ways. The first is through our asset management and configuration management, the second from a security point of view is agent based tracking. Each change is validated and tested in our test environment prior to release.
- Vulnerability management type
- Supplier-defined controls
- Vulnerability management approach
- Not applicable. Collibra Data Quality is deployed on the customer's own environment. Network vulnerabilities and operational security will depend on client's infrastructure and processes.
- Protective monitoring type
- Supplier-defined controls
- Protective monitoring approach
- Not applicable. Collibra Data Quality is deployed on the customer's own environment. Protective monitoring and operational security will depend on client's infrastructure and processes.
- Incident management type
- Supplier-defined controls
- Incident management approach
- A standard incident management process has been developed which includes reporting to stakeholders and affected customers, as well as authorities when required. At the end of the incident handling, a small report is created with an overview of items which occurred and lessons learned. This report is shared with the stakeholders. Customers can report issues to Collibra via the Community portal.
Secure development
- Approach to secure software development best practice
- Supplier-defined process
Public sector networks
- Connection to public sector networks
- No
Social Value
- Social Value
-
Social Value
Fighting climate changeFighting climate change
Climate change claimed its first bankruptcy in 2019. The detailed study is a benchmark in understanding how technology can help gain complete visibility in reporting and data-backed action plans for ESG. Organizations that can report on ESG effectively have a critical market opportunity compared to their competitors, reports Accenture. While driving your efforts towards better ESG scores, it is also essential to acknowledge that it requires a long-term commitment. You are in for a long cycle of setting goals, tweaking them with evolving regulations and standards, measuring your performance against them, and planning for new goals. And your data quality solution needs to be prepared for it, too.
Optimizing your ESG initiatives requires end-to-end visibility and complete trust in your data. Collibra data quality and observability offers:
Data pipeline monitoring
Proactive anomaly detection
ML-generated adaptive rules
Data discovery and enforcement
Self-service for faster time to decisions
Pricing
- Price
- £122,919 a licence a year
- Discount for educational organisations
- No
- Free trial available
- Yes
- Description of free trial
-
Collibra offers a 20 days free trial of Data Quality and Observability. Upon completion of the registration process, customers can choose from a GCP install or an AWS install.
Collibra also offers a Product Tour accessible from the website https://www.collibra.com/us/en/tour/start - Link to free trial
- https://www.collibra.com/us/en/dq-trial