Commit

Added metadata validation demo (#208)
* Added metadata validation demo

* Do not run hooks-metadata-validation notebook in GitHub action

---------

Co-authored-by: Your Name <[email protected]>
kesarwam and Your Name authored Jun 28, 2024
1 parent 75a08ad commit bb4ad0d
Showing 7 changed files with 1,064 additions and 0 deletions.
1 change: 1 addition & 0 deletions .github/workflows/notebooks_to_exclude.txt
@@ -8,6 +8,7 @@
./hooks-webhooks-demo.ipynb
./hooks-schema-and-pii-validation.ipynb
./hooks-schema-validation.ipynb
./hooks-metadata-validation.ipynb

# This only works against lakeFS cloud
./rbac-demo.ipynb
1 change: 1 addition & 0 deletions 00_notebooks/00_index.ipynb
@@ -34,6 +34,7 @@
"* [**Only allow specific file formats in data lake**](hooks-webhooks-demo.ipynb) (with lakeFS webhooks)\n",
"* [**Prevent unintended schema change**](hooks-schema-validation.ipynb) (with lakeFS Lua hooks)\n",
"* [**Avoid leaking PII data**](hooks-schema-and-pii-validation.ipynb) (shows how to use multiple Lua hooks)\n",
"* [**Validate metadata**](hooks-metadata-validation.ipynb) (shows how to use multiple Lua hooks)\n",
"* [**Import into a lakeFS repository from multiple paths**](./import-multiple-buckets.ipynb) \n",
"* [**ML Experimentation/Reproducibility 01 (Dogs)**](./ml-reproducibility.ipynb)\n",
"* [**ML Experimentation 02 (Wine Quality)**](./ml-experimentation-wine-quality-prediction.ipynb)</br>_See also the [accompanying blog](https://lakefs.io/blog/building-an-ml-experimentation-platform-for-easy-reproducibility-using-lakefs/)_\n",
10 changes: 10 additions & 0 deletions 00_notebooks/assets/lakefs_demo.py
@@ -22,3 +22,13 @@ def print_commit(log):
    print('Parents:', log.parents)
    print('Metadata:')
    pprint(log.metadata)

def lakefs_ui_endpoint(lakefsEndPoint):
    if lakefsEndPoint.startswith('http://host.docker.internal'):
        lakefsUIEndPoint = lakefsEndPoint.replace('host.docker.internal','127.0.0.1')
    elif lakefsEndPoint.startswith('http://lakefs'):
        lakefsUIEndPoint = lakefsEndPoint.replace('lakefs','127.0.0.1')
    else:
        lakefsUIEndPoint = lakefsEndPoint

    return lakefsUIEndPoint
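
The `lakefs_ui_endpoint` helper added in this hunk appears intended to turn a container-internal lakeFS endpoint into a loopback address that a browser on the host can open. Below is a minimal standalone sketch of that behaviour. The function body follows the diff; the sample endpoint values (ports and the cloud-style hostname) are assumptions chosen for illustration and do not come from the commit.

```python
def lakefs_ui_endpoint(lakefsEndPoint):
    """Return an endpoint usable from the host browser (logic mirrors the diff)."""
    if lakefsEndPoint.startswith('http://host.docker.internal'):
        # Docker Desktop alias for the host machine -> loopback address
        return lakefsEndPoint.replace('host.docker.internal', '127.0.0.1')
    elif lakefsEndPoint.startswith('http://lakefs'):
        # Compose-style service hostname "lakefs" -> loopback address
        return lakefsEndPoint.replace('lakefs', '127.0.0.1')
    # Any other endpoint is returned unchanged
    return lakefsEndPoint


# Hypothetical endpoints, for illustration only
samples = [
    'http://host.docker.internal:8000',
    'http://lakefs:8000',
    'https://example.lakefscloud.io',
]

for endpoint in samples:
    print(endpoint, '->', lakefs_ui_endpoint(endpoint))
```

Note that `str.replace` substitutes every occurrence of the substring, so an endpoint that also contains `lakefs` in its path would have that part rewritten as well. Passing `count=1` as the third argument to `replace` would limit the substitution to the hostname if that ever matters.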