OpenLineage

An Open Standard for lineage metadata collection

APACHE-2.0 License

Downloads
5.1M
Stars
1.6K

Bot releases are visible (Hide)

OpenLineage - OpenLineage 0.8.1

Published by merobi-hub over 2 years ago

Added

Fixed

  • GreatExpectations: Fixed bug when invoking GreatExpectations using v3 API (#683) @collado-mike
OpenLineage - OpenLineage 0.7.1

Published by merobi-hub over 2 years ago

Added

Fixed

OpenLineage - OpenLineage 0.6.2

Published by merobi-hub over 2 years ago

Added

Fixed

OpenLineage - OpenLineage 0.6.1

Published by merobi-hub over 2 years ago

Fixed

  • Catch possible failures when emitting events and log them @mobuchowski
OpenLineage - OpenLineage 0.6.0

Published by merobi-hub over 2 years ago

Added

  • Extract source code of PythonOperator code similar to SQL facet @mobuchowski
  • Add DatasetLifecycleStateDatasetFacet to spec @pawel-big-lebowski
  • Airflow: extract source code from BashOperator @mobuchowski
  • Add generic facet to collect environmental properties (EnvironmentFacet) @harishsune
  • Update Airflow connection parsing to exclude AWS keys and extraneous Snowflake params @collado-mike
  • OpenLineage sensor for OpenLineage-Dagster integration @dalinkim
  • Java-client: make generator generate enums as well @pawel-big-lebowski

Fixed

  • Airflow: increase import timeout in tests, fix exit from integration @mobuchowski
  • Reduce logging level for import errors to info @rossturk
  • Remove AWS secret keys and extraneous Snowflake parameters from connection uri @collado-mike
  • Convert to LifecycleStateChangeDatasetFacet @pawel-big-lebowski
OpenLineage - OpenLineage 0.5.2

Published by collado-mike over 2 years ago

Added

  • Proxy backend example using Kafka @wslulciuc
  • Support Databricks Delta Catalog naming convention with DatabricksDeltaHandler @wjohnson
  • Add javadoc as part of build task @mobuchowski
  • Include TableStateChangeFacet in non V2 commands for Spark @mr-yusupov
  • Support for SqlDWRelation on Databricks' Azure Synapse/SQL DW Connector @wjohnson
  • Implement input visitors for v2 commands @pawel-big-lebowski
  • Enabled SparkListenerJobStart events to trigger open lineage events @collado-mike

Fixed

  • dbt: job namespaces for given dbt run match each other @mobuchowski
  • Fix Breaking SnowflakeOperator Changes from OSS Airflow @denimalpaca
OpenLineage - OpenLineage 0.5.1

Published by collado-mike over 2 years ago

Added

  • Support for dbt-spark adapter @mobuchowski
  • New backend to proxy OpenLineage events to one or more event streams 🎉 @mandy-chessell @wslulciuc
  • Add Spark extensibility API with support for custom Dataset and custom facet builders @collado-mike

Fixed

  • airflow: fix import failures when dependencies for bigquery, dbt, great_expectations extractors are missing @lukaszlaszko
  • Fixed openlineage-spark jar to correctly rename bundled dependencies @collado-mike
OpenLineage - OpenLineage 0.4.0

Published by collado-mike almost 3 years ago

Added

Fixed

  • dbt: column descriptions are properly filled from metadata.json @mobuchowski
  • dbt: allow parsing artifacts with version higher than officially supported @mobuchowski
  • dbt: dbt build command is supported @mobuchowski
  • dbt: fix crash when build command is used with seeds in dbt 1.0.0rc3 @mobuchowski
  • spark: increase logical plan visitor coverage @mobuchowski
  • spark: fix logical serialization recursion issue @OleksandrDvornik
  • Use URL#getFile to fix build on Windows @mobuchowski
OpenLineage - OpenLineage 0.3.1

Published by julienledem almost 3 years ago

Fixed

OpenLineage - OpenLineage 0.3.0

Published by julienledem almost 3 years ago

Added

Fixed

OpenLineage - OpenLineage 0.2.3

Published by wslulciuc about 3 years ago

Fixed

OpenLineage - OpenLineage 0.2.2

Published by collado-mike about 3 years ago

Added

  • Implement OpenLineageValidationAction for Great Expectations @collado-mike
  • facet: add expectations assertions facet @mobuchowski

Fixed

  • airflow: pendulum formatting fix, add tests @mobuchowski
  • dbt: do not emit events if run_result file was not updated @mobuchowski
OpenLineage - OpenLineage 0.2.1

Published by wslulciuc about 3 years ago

Fixed

  • Default --project-dir argument to current directory in dbt-ol script @mobuchowski
OpenLineage - OpenLineage 0.2.0

Published by wslulciuc about 3 years ago

Added

  • Parse dbt command line arguments when invoking dbt-ol @mobuchowski. For example:

    $ dbt-ol run --project-dir path/to/dir
    
  • Set UnknownFacet for spark (captures metadata about unvisited nodes from spark plan not yet supported) @OleksandrDvornik

Changed

Fixed

  • Remove instance references to extractors from DAG and avoid copying log property for serializability @collado-mike
OpenLineage - OpenLineage 0.1.0

Published by wslulciuc about 3 years ago

OpenLineage is an open standard for lineage metadata collection designed to record metadata for a job in execution. The initial public release includes:

  • An initial specification. The initial version 1-0-0 of the OpenLineage specification defines the core model and facets.
  • Integrations that collect lineage metadata as OpenLineage events:
  • Clients that send OpenLineage events to an HTTP backend. Both java and python are initially supported.
OpenLineage - 0.0.1-rc8

Published by mobuchowski about 3 years ago

OpenLineage - 0.0.1-rc6

Published by mobuchowski over 3 years ago

OpenLineage - 0.0.1-rc5

Published by mobuchowski over 3 years ago

OpenLineage - 0.0.1-rc3

Published by mobuchowski over 3 years ago

OpenLineage - 0.0.1-rc2

Published by mobuchowski over 3 years ago