Official

Accepted

Unanswered

Most Recent
Selected Most Recent

Advanced Filters

Search Results (84)

Restart Collibra Data Quality automatically on GCP

A startup script is a file that performs tasks during the startup process of a virtual machine (VM) instance. We are going to configure our DQ instance to be able to start Collibra DQ (Postgre, Spark, OwlAgent OwlWebapp) automatically when the instance is started. For more information about how this

Questions

2

0

0

0

Reference Data Validations or Checks in CDQ

Hi All - curious to see if anyone has encountered reference data checks and how collibra DQ handles it. 1. Table A (has values), Table B (has codes and values). Want to be able to validate/cross check the values in Table A against values in Table B to identify records with no match. Tables could be

Questions

63

3

0

0

How to configure DQ Connector

In this post we are going to review the DQ connector configuration step by step. Prerequisites Edge Create an Edge site via Collibra Data Intelligence Cloud Settings. Install the Edge site close to the data source you want to access. Collibra Data Quality Install Collibra Data Quality. Con

Questions

7

0

0

0

DQ download on Linux

I am not able to find the download link for Linux standalone installation. I need to play with my Linux machine with DQ on my interest, can anyone help me with the download link " Download full package tarball using the signed link to the full package tarball provided by the DQ Team. Replace <sig

Questions

24

2

0

0

How to launch a Collibra DQ installation in Azure from an Image

Microsoft Azure allows us to create images. These images contain the configuration for creating instances. A managed image resource can be created from a generalized virtual machine (VM) that is stored as either a managed disk or an unmanaged disk in a storage account. You can launch multiple VMs fr

Questions

223

2

0

0

DQ Agents: Restarting Agents In Kubernetes Deployment (GCP)

In order to restart an agent in a Kubernetes deployment, please review the guidance below: Open Google Cloud Console Navigate to Kubernetes Engine Select Workloads Tab Identify your correct namespace e.g. Collibra DQ Dev / Test / Prod Identify your workload name e.g. collibradq-agent-dev which corr

Questions

2

0

0

0

How to launch a Collibra DQ installation in GCP from an image

GCP allows us to create Images. These images contain the configuration for creating instances. You can launch multiple instances from a single image when you need multiple instances with the same configuration. This post explains how to configure Collibra DQ from a GCP image with DQ installed. Click

Questions

5

0

0

0

Functional: DQ Rules

Rules / General: Popular Questions Q: Can You Select Rules Altogether And Apply To Dataset? A: Yes, Select One By One For Now; Currently We Are Doing Custom But Productizing That Feature Is In Our Backlog Q: Is there a library of OOTB rules? A: Yes, please refer to the Rule Library Q: Can You Weight

Questions

83

1

0

0

Functional / Technical: DQ Capabilities

Profiling Q: Does Collibra DQ help identify PII / sensitive data? A: Yes, users can run customized PII discovery using OOTB RegEx or bring your own RegEx. A: https://dq-docs.collibra.com/dq-visuals/data-concepts-and-semantics will be included in 2021.11 Collibra DQ release Agent Q: How exactly does

Questions

15

0

0

0

How to Launch Collibra DQ from an AWS AMI

AWS allows us to create AMI (Amazon Machine Images). This AMI contains the configuration for creating an instance. You can launch multiple instances from a single AMI when you need multiple instances with the same configuration. This document explains how to configure Collibra DQ installed in an AMI

Questions

44

2

0

0

Exception Tracking

Has anyone users connected Collibra DQ to an in-house exception tracking tool? I would be interested to hear how successful it was and any lessons learned.

Questions

50

1

0

0

DQ Audit Trail for Rule Changes

A prospective client asked a fine question regarding “auditing the change of the custom rules.” After some research and testing this, indeed we do have auditing of rule changes at this level of granularity: Audit columns: dataset = Any rule is associated with a Dataset. So this is the name of the

Questions

22

2

0

0

Technical: Data Quality (DQ) Integration With Collibra Catalog

Below are some helpful resources to get you started with the Collibra DQ Integration released in the 2021.07 release of Collibra Data Intelligence Cloud Top Links Collibra Native DQ Connector Complete Documentation, FAQ, Troubleshooting Guide: [Click Here] Collibra Native DQ Connector Supplemental D

Questions

55

0

0

0

DQ Alerts for long running job?

Alerts only trigger off of the finished DQ score from a run, correct? I’m looking for a solution for when a job take longer than X hours.

Questions

13

1

0

0

Explanation on Row Limiter and escape character

When setting the row limiter on the profiling setup screen. Is this simply the first 1000 sequential rows to be pulled back from the select? is there an option to make this a random subset or even systematic? Secondly what does the escape character represent in this section?

Questions

4

1

0

0

Public API documentation

Is there a place online where customers can look at our DQ API’s to get a grasp of our functionality and how they might integrate with their solution? I see JSON REST and CLI APi’s within the demo environment but of course I cant share this. Thanks

Questions

13

2

0

0

Troubleshooting the Collibra DQ AMI

Hi Community, Sharing my experience below in trying to stand up the AMI instance in the Collibra AWS environment in case helpful for others. https://owl-analytics.gitbook.io/user-guide/-La5sAxci8GhitOM0qCp/learning-the-ropes-of-owldq-ami I was told that the AMI must be updated with a new SPRING_DAT

Questions

56

1

0

0

Apache Spark in Collibra DQ with Py4J (need Spark v. 3.01, and correct Scala version)

Python and Py4J code: owl.owlCheck() fails with an exception: File “/opt/spark/python/lib/py4j-0.10.9-src.zip/py4j/java_gateway.py”, line 1305, in call File “/opt/spark/python/lib/pyspark.zip/pyspark/sql/utils.py”, line 128, in deco File “/opt/spark/python/lib/py4j-0.10.9-src.zip/py4j/protocol.py”,

Questions

18

1

0

0

What causes AR Rule Conditions to update?

Hi, I’m looking at the AR rules and how they change over time to show the value of not having to go back and adjust manual rules over time. I noticed that the “Condition” which describes what the rule (198.7 - 3599) (See attached image) does not match the latest “learning” values 0-4586. My questio

Questions

18

1

0

0

Technical : Temp file (Run DQ Job) and Best practices

Question: When we run DQ job against the temp file and i can’t see “estimate job” button. that is valid scenario ? Answer : yes. Thats valid scenario Question: Are we doing any compute when upload the temp file ? Answer : No. We cannot do any kind of compute at any sort of scale in Web itself

Questions

55

1

0

0