feat(great-expectations): add SqlAlchemyDataset support #9225
base: master
Can we add some tests for this?
if "." in data_asset._table.name:
    # bigquery case
    schema_name, table_name = data_asset._table.name.split(".")
    sqlalchemy_uri = f"{data_asset.engine.url}/{schema_name}"
Does the engine URL already have the project name in it?
The full engine URL looks like bigquery://my-project
It is then parsed to fetch the project name and the database, which is why I've added the /{schema_name} here.
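For reference, the URI construction discussed above can be sketched in isolation. This is only an illustration under assumptions: `build_sqlalchemy_uri` is a hypothetical helper name that does not appear in the PR, and the engine URL is passed as a plain string rather than read from `data_asset.engine.url`.

```python
from typing import Optional, Tuple


def build_sqlalchemy_uri(engine_url: str, table_name: str) -> Tuple[str, Optional[str], str]:
    """Hypothetical helper mirroring the snippet under review.

    Returns (sqlalchemy_uri, schema_name, table_name).
    """
    if "." in table_name:
        # BigQuery case: the table name arrives as "schema.table".
        # Like the PR snippet, this assumes exactly one dot; a name with
        # more than one dot makes the unpacking raise ValueError.
        schema_name, bare_table = table_name.split(".")
        return f"{engine_url}/{schema_name}", schema_name, bare_table
    # No schema prefix: leave the URL and the table name untouched.
    return engine_url, None, table_name


uri, schema, table = build_sqlalchemy_uri("bigquery://my-project", "my_dataset.my_table")
print(uri, schema, table)
```

A helper shaped like this would also be a natural unit to cover with the tests requested in the review, since it needs neither a live engine nor a Great Expectations install.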
Should I also bump the great-expectations requirement? To use this I have to install
@seuf if we can upgrade the great expectations action code to be compatible with both older and newer versions of GX, then go for it. I suspect it'll just need some conditional imports e.g.
In our SQL sources, we also use great-expectations purely for data profiling. A good interim outcome would be that we leave those as-is (and so things like
Related to #8115.
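A conditional import of the kind suggested above might look like the sketch below. It is an assumption, not code from this PR: the module path is where legacy (pre-1.0) GX releases are understood to expose `SqlAlchemyDataset`, and `is_sqlalchemy_dataset` is a hypothetical helper name.

```python
try:
    # Assumed legacy (pre-1.0) GX location of the class this PR targets.
    from great_expectations.dataset.sqlalchemy_dataset import SqlAlchemyDataset
except ImportError:
    # GX is not installed, or the installed version no longer ships this module.
    SqlAlchemyDataset = None


def is_sqlalchemy_dataset(data_asset: object) -> bool:
    """Return True only when the legacy class is importable and matches the asset."""
    return SqlAlchemyDataset is not None and isinstance(data_asset, SqlAlchemyDataset)
```

With this guard, code paths specific to SqlAlchemyDataset are simply skipped on installs where the class cannot be imported.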
@seuf I know it's been a while on this one, but we've made some progress on making it easier to support other GX versions in the plugin and also support additional datasource types. See my comment here: #8115 (comment)
I think it makes sense to revisit this PR now that we have a separated gx-plugin package. Let me know if you're up for it!
@hsheth2 this and the outdated openlineage version in Airflow are the two places where DataHub is holding all other packages back.
Sorry, I didn't have time to answer. Now that GX 1.0 is out, I need to upgrade it on my stack, and afterwards I'll try to update this MR with tests. Or feel free to do it if you have some bandwidth 😅
Hi,
This pull request adds support for the SqlAlchemyDataset data asset type in the Great Expectations integration.
Tested with the BigQuery engine.
Checklist