Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Limit of 100 pipelines Imported from Airflow 2.7 (MySQL backend db) in OpenMetadata 1.5.5 #18278

Open
d-gorodnov opened this issue Oct 15, 2024 · 0 comments
Assignees
Labels
bug Something isn't working Ingestion

Comments

@d-gorodnov
Copy link

Affected module
Ingestion Framework

Describe the bug
When ingesting Airflow metadata using OpenMetadata version 1.5.5, only the top 100 DAGs are imported, even though the serialized_dag table in the Airflow database contains more records. There appears to be no configuration parameter to adjust this limit.

To Reproduce

  1. Start OpenMetadata using the docker-compose.yml file downloaded from GitHub.
  2. Generate more than 100 DAG records in the serialized_dag table within the airflow_db.
  3. Attempt to ingest internal Airflow metadata.
  4. Observe that only 100 pipelines are displayed in the OpenMetadata UI.

Also you can try to ingest pipelines from random Airflow with backend db MySQL 8 and DAGs amount >100

Expected behavior
All DAGs present in the serialized_dag table should be ingested and displayed in the OpenMetadata UI, not just the top 100.

Version:

  • OS: MacOS
  • Python version:
  • OpenMetadata version: 1.5.5
  • OpenMetadata Ingestion package version: docker.getcollate.io/openmetadata/ingestion:1.5.5

Additional context
This issue persists even after updating the Airflow MySQL database to version 8. The problem was reproducible using only OpenMetadata images. Further investigation is needed to determine if there is a hidden configuration parameter or a bug in the ingestion framework that limits the number of imported DAGs.

@pmbrull pmbrull added Ingestion bug Something isn't working labels Oct 16, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working Ingestion
Projects
None yet
Development

No branches or pull requests

3 participants