V

Thursday, June 2nd, 2022 12:12 AM

Spring boot Integration to AWS Lake

Hi Team,

For AWS Lake to Collibra Bidirectional Integration
We have set up the Spring Boot Integration library with Collibra Integration Library and when trying to run the package we are getting the below error

com.collibra.marketplace.library.integration.exception.CollibraIntegrationLibraryException: {“message”:“There are one or more validation errors.”,“details”:“The Relation Key (:SOURCE) Asset ID ‘’ is not a valid UUID - it cannot be empty.”}

Can you please help us on this.

368 Messages

2 years ago

Hi @vani.nithya.indla,

Just to let you know that the updated AWS Lake Formation integration (having version 1.0.3) has been released and is available on Collibra’s Marketplace.

2 years ago

Hi @spring-team.collibra.com

Thank you very much !!!

We will test and get back for any queries.

368 Messages

Hi @vani.nithya.indla, please note that before you use the integration, make sure to remove all of the existing custom assets/relations/domains/etc that are related to the AWS Integration since the CMA file won’t update the UUIDs of the assets if they already exist.
Thanks

2 years ago

Hi @spring-team.collibra.com

Thanks for the update

We have few queries on the package shared

  1. Two grant workflows are shared
    1. Grant Tag Permissions
    2. Grant Permissions resource
    Will you be sharing revoke workflows separately ? Also there is a Grant/ revoke batch permission workflow as per the flow diagram , will you be developing those workflows as well?


  2. In Grant permissions resource workflow shared, we need to call Request Asset Access workflow , but when we call the workflow from data basket, it is directly going to approval without calling ARN attach / Grant AWS workflows and in the logs we could fine the below

We are selecting column ingested from AWS but still getting this error. Can you help us on this?

368 Messages

Hi @vani.nithya.indla,

Thanks for your feedback. The revoke permission workflows are currently in the backlog with no ETA assigned.

Regarding the ‘a non AWS database found’ issue, the function is doing the following steps;

  • Firstly, it fetches the related schemas, databases and tables of the column.

From the logs, we can see that this step successfully ran since we can see the log 'ARN to Data Usage: The database<DATABASE NAME>'

  • Secondly, the workflow fetches the assets from the ‘Databases Domain ID’ property of the workflow. (If you have used the CMA file, this should match the domain ID to which the integration imports the assets i.e. bd5d3952-4f4a-41c1-a895-90e2cabebb6f).
    It then filters through the results and finds the number of assets that matches the display name with the <DATABASE NAME> logged in the first step. If the number is not equal to 1 , it skips the asset and logs ‘a non AWS database found’.

So if the <DATABASE NAME> that was logged is an AWS asset but is being marked as a non-AWS asset, please make sure that there is an asset having type Database and its name matching exactly , in the domain whose resource ID is the value specified in the Attach ARN Attribute workflow, Databases Domain ID property.
Also, ensure that this domain’s UUID (i.e. Databases Domain ID property) is bd5d3952-4f4a-41c1-a895-90e2cabebb6f which should be the case if you have used the provided CMA file.

Thanks

2 years ago

Hi @spring-team.collibra.com

Thanks for your update , We will test it and get back to you.

We are testing the Grant Tag permissions workflow and found that after selecting ARN, while selecting LF-Tag key we are not getting task instead it is just showing the below screen with done option.

There is no pop up task to select the key or value. Can you please advise us on this?

368 Messages

2 years ago

Hi @vani.nithya.indla,
Thanks again for your feedback.
Can you provide us with a screenshot of the ‘view workflow diagram option’ and the logs, please?
Also, what is your Collibra version and is it a cloud or an on-premise instance?

Thanks

Hi @spring-team.collibra.com

Please find the details below


We are using collibra cloud instance version Collibra 2022.07.1


2 years ago

Hi @spring-team.collibra.com
I would like to raise an issue with the bidirectional integration

  1. When we ingest data from AWS to Collibra in Read only domains, Database has LF Tag relation , Table hasLF tag relation and Column has LF Tag relation should be populated

But if in AWS Lake formation, if the LF - Tag is inherited from Database to Table / Table to Column, The inherited LF-Tag relation is not ingested

For Example in AWS I have a database X with Table a and Table B, and I have assigned LF-Tag on Database X which inherited to Table a and Table B

When Data is ingested in Collibra Only Database X has LF tag relation is getting ingested but Table a and Table b doesnt have that relation populated.

This is very important to classify the data for our custom needs. Please let us know your thoughts on this.

368 Messages

Hi @vani.nithya.indla ,
thanks for your feedback. We have created a ticket for the issue with the boolean attributes hasLfTagOnTable/Database/Column. However, please note that the LF Tags are still represented using the asset’s relations. Can confirm this by checking the diagram view of the asset?

Thanks.

Hi @spring-team.collibra.com

Yes, Tags can still be represented as relations to the database/ Table / Columns. I agree with the approach.

2 years ago

Hi @spring-team.collibra.com

Suddenly we are getting the below error while syncing the results . Can you please help us to decode this error:

2022-08-08 12:35:41,291 [scheduling-1] ERROR com.collibra.aws.lakeformation.scheduler.ScheduledSyncProcess - Sync Scheduler Error {“message”:“An unexpected error occurred during execution.”,“details”:“Error reading entity from input stream.”,“cause”:“Unrecognized field “result” (class com.collibra.internal.api.model.output.Job), not marked as ignorable (17 known properties: “userId”, “createdOn”, “resourceType”, “endDate”, “name”, “system”, “startDate”, “state”, “visibility”, “cancelable”, “lastModifiedBy”, “type”, “id”, “lastModifiedOn”, “createdBy”, “progressPercentage”, “message”])\n at [Source: (org.glassfish.jersey.message.internal.ReaderInterceptorExecutor$UnCloseableInputStream); line: 1, column: 376] (through reference chain: com.collibra.internal.api.model.output.Job[“result”])”}

368 Messages

Hi Vani,

Please try switching to the latest version of the integration library (version 1.1.8), which you can find here https://marketplace.collibra.com/listings/spring-boot-integration-library/.

After following the instructions found in the installation section of the documentation, go to the pom.xml file in the AWS Lake Formation Spring Boot integration directory and switch the integration library version from 1.1.7 to 1.1.8 like in the screenshot below.

Thanks

2 years ago

Hi @spring-team.collibra.com

Thanks a lot for your support

For sbi-aws-lakeformation-v1.0.2 core as collibra-integration-library -1.1.6 (

com.collibra.internal.api.model.out put doesn’t have attribute result

) but still ingesting and syncing worked without any issue.

sbi-aws-lakeformation-v1.0.3 now added dependency core as collibra-integration-library -1.1.8( com.collibra.internal.api.model.output have attribute result) and working without job exception.

Is there anyway can check in collibra end or is there any update happened on the job level? on JOB API like response format or attribute changes

https://tfnsw-dev.collibra.com/rest/2.0/Jobs

2 years ago

Hi @spring-team.collibra.com

Is it possible to organize a 30 mins session to discuss on Grant LF-Tag workflow
We are receiving below errors as per the workflow inputs


Also after changing resource type to database we are getting the below error

Please let us know on this.


368 Messages

Hi @vani.nithya.indla,

This issue occurred in version 1.0.2 of the integration and was solved in version 1.0.3.
Are you using the latest version of the integration and workflows?

Could you confirm that the workflow creates the following attributes:
Entry Attributes:



Grant Permission Issue Asset:
image
Thanks.




2 years ago

Hi @spring-team.collibra.com

In v 1.0.2 we haven’t received any workflows.

With the latest version 1.0.3 we have received 3 workflows of which we are testing “Grant AWS Permissions (LF-TAG).bar”

When we triggered the workflow it is creating 4 asset types

  1. AWS Entry
    Attributes:
    Resource Type : Default value “TAG”
    Grantable-Permissions : Default Value “SELECT”
    Tag-Permissions : Default Value “DESCRIBE”
    ARN
    Catalog ID
    Tag Key






  2. Aws-Grant-Batch-Permissions-Tag
    Attributes:
    ARN
    Catalog ID
    Tag Key
    Tag Value
    Has aws-entry





  3. AWS- LF-Tag Key
    has Tag-value

4.AWS-LF-Tag Value

Also in the workflow both select and describe are coming up and it is so confusing to understand the LF Tag grant functionality

2 years ago

Hi @spring-team.collibra.com Can we please schedule a call to identify the issue , We require to check this functionality to collaborate with collibra team to raise our customizations if required.

368 Messages

2 years ago

Hi @vani.nithya.indla,

We will check regarding the 30-minute session and keep you updated. In the meantime, could you please provide us with the Spring Boot logs (via a private message) for each synchronisation? Thanks

2 years ago

Hi @spring-team.collibra.com We are good if session is delayed, Can you please provide us some resolution for the issue raised, I have attached the workflow code as well

368 Messages

Hi @vani.nithya.indla,
Thanks for your reply. I cannot see the workflow logs in your reply. Not sure whether they were removed. In that case, can you please send us the following via private message:

  • The error messages
  • The spring boot logs
  • The workflow logs

Thanks!

2 years ago

Hi @spring-team.collibra.com

We have a query on leaveraging Lineage harvester feature for this integration, Is there a way to load transformations from Glue jobs ? Can you please suggest us on any approach.

368 Messages

Hi @vani.nithya.indla,

The lineage harvester beta integration with AWS Glue (script annotations) might satisfy your requirements.

Please follow instructions in this page to request access to this beta integration.

2 years ago

Hi @spring-team.collibra.com

We noticed that while ingesting data from AWS lake formation to Collibra, partition columns are skipped and not getting ingested into Collibra.

Can you please let us know on this.

Loading...