OpenMetadata

Open Standard for Metadata. A Single place to Discover, Collaborate and Get your data right.

APACHE-2.0 License

Downloads: 107.7K
Stars: 4.2K
Committers: 225

OpenMetadata - OpenMetadata 1.0.3-Release

Published by akash-jain-10 over 1 year ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/1.0.2-release...1.0.3-release

OpenMetadata - OpenMetadata 1.0.2-Release

Published by Ashish8689 over 1 year ago

What’s New in Release 1.0.2

UI Improvements

  • Supports a separate column for Classification and Glossary in the following entities: Topic, Dashboard, Pipeline, ML Model, Container, and Data Model.
  • Improved Sample Data tab UX for Tables.
  • Email is now displayed on the Teams page. Users can edit their Email.
  • The custom logo can be configured from the UI. The logo will be displayed on the Login page and app bar.
  • UI supports updating the displayName for service, database, schema, and all other assets.

Ingestion

  • Supports custom database name for Glue.
  • Fixed Postgres lineage ingestion for version 11.6.
  • Added api_version and domain fields to Salesforce connector.
  • Kerberos libraries have been added to ingestion image.
  • PII flags have been further restricted for column names.
  • Fixed GitHub reader for LookML.
  • Restructured the Tableau data models ingestion.
  • Fixed the basic auth for Kafka.

Backend

  • Fixed vulnerabilities for dependencies.
  • Supports custom logo from backend.
  • Fixed a bug related to random password.
  • By default, service connection details will be masked for users, and unmasked for bots. Users will be able to view based on their view permissions.
  • Fixed Elasticsearch indexing issues for a large number of objects.

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/1.0.1-release...1.0.2-release

OpenMetadata - OpenMetadata 0.13.4-Release

Published by pmbrull over 1 year ago

Bring parenthesis to Entity Name from https://github.com/open-metadata/OpenMetadata/pull/11580

OpenMetadata - OpenMetadata 1.0.1-Release

Published by Ashish8689 over 1 year ago

What’s New in Release 1.0.1

UI Improvements

  • Improved search experience while editing manual lineage.
  • Improved security with masked API token for Looker connection.
  • The tier component has been revamped.
  • Added Pagination support on the Data Model page.
  • Added startDate to create ingestion flow.
  • Search improvements have been made on the Explore page.
  • Multiple UI tweaks have been made for a better user experience, such as image placeholder improvements, text alignment and custom connectors icons.

Ingestion

  • Included IBM dependency for i Series.
  • Fixed CVE vulnerability for ingestion docker image.
  • Now, we fetch views and view definitions from Hive and Impala.
  • Added a test connection step for verifying the Owner details in Tableau.
  • Profiler logs have been improved.
  • Fixed the issues reported around ingestion.

Notifications

  • Alert notifications have been added for Data Insights Report.

Glossary

  • Previously, the Glossary Owner could only be changed or updated. Now the Owner can also be removed from a Glossary or from Glossary Terms.

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/1.0.0-release...1.0.1-release

OpenMetadata - OpenMetadata 1.0.0-Release

Published by NiharDoshi99 over 1 year ago

What’s New in Release 1.0

APIs & Schema

  • Stabilized and improved the Schemas and APIs.
  • The APIs are backward compatible from this release onwards.

Ingestion

  • Connecting to your data sources has never been easier. Find all the necessary permissions and connection details directly in the UI.
  • When testing the connection, we now have a comprehensive list of validations to let you know which pieces of metadata can be extracted with the provided configuration.
  • Performance improvements when extracting metadata from sources such as Snowflake, Redshift, Postgres, and dbt.
  • New Apache Impala connector.

Storage Services

  • Based on your feedback, we created a new service to extract metadata from your cloud storage.
  • The Data Lake connector ingested one table per file, which covered only some of the use cases in a Data Platform. With Storage Services, you can present accurate metadata from your tables, even when partitioned.
  • The first implementation has been done on S3, and we will keep adding support for other sources in the upcoming releases.

Dashboard Data Models

  • Dashboard Services now support the concept of Data Models: data that can be directly defined and managed in the Dashboard tooling itself, e.g., LookML models in Looker.
  • Data Models will help us close the gap between engineering and business by providing all the necessary metadata from sources typically used and managed by analysts or business users.
  • The first implementation has been done for Tableau and Looker.

Queries

  • Improved UI for SQL Queries, with faster loading times and the ability for users to vote for popular queries!
  • Users can now create and share a Query directly from the UI, linking it to multiple tables if needed.

Localization

  • In 1.0, we have added Localization support for OpenMetadata.
  • Now, you can use OpenMetadata in English, French, Chinese, Japanese, Portuguese, and Spanish.

Glossary

  • New and Improved Glossary UI
  • Easily search for Glossaries and any Glossary Term directly in the global search.
  • Drag and drop Glossary Terms within and across Glossaries to adapt your Glossaries as you build them.
  • Instead of searching and tagging their assets individually, users can add Glossary Terms to multiple assets from the Glossary UI.

Auto PII Classification

  • Implemented an automated way to tag PII data.
  • The auto-classification is an optional step of the Profiler workflow. We will analyze the column names, and if sample data is being ingested, we will run NLP models on top of it.
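The column-name part of this step can be pictured with a small sketch. The patterns, the tag labels, and the function name `suggest_pii_tags` are assumptions made for illustration; the actual Profiler rules and the NLP models applied to sample data are not reproduced here.

```python
import re

# Hypothetical name patterns; the real classifier's rules are not documented here.
PII_NAME_PATTERNS = {
    "PII.Sensitive": re.compile(r"(ssn|passport|credit_?card|password)", re.IGNORECASE),
    "PII.NonSensitive": re.compile(r"(email|phone|first_?name|last_?name|address)", re.IGNORECASE),
}


def suggest_pii_tags(column_names):
    """Return {column_name: tag} for columns whose name matches a pattern."""
    suggestions = {}
    for name in column_names:
        for tag, pattern in PII_NAME_PATTERNS.items():
            if pattern.search(name):
                suggestions[name] = tag
                break  # first matching tag wins
    return suggestions


print(suggest_pii_tags(["id", "user_email", "SSN", "created_at"]))
```

Name matching alone is cheap but coarse, which is presumably why the release pairs it with models run on sample data when that data is available.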

Search

  • Improved Relevancy, with added support for partial matches.
  • Improved Ranking, with most used or higher Tier assets at the top of the search.
  • Support for Classifications and Glossaries in the global search.

Security

  • SAML support has been added.
  • Deprecation Notice: SSO Service accounts for Bots will be deprecated. JWT authentication will be the preferred method for creating Bots.

Lineage

  • Enhanced Lineage UI to display a large number of nodes (1000+).
  • Improved UI for better navigation.
  • Improved SQL parser to extract lineage in the Lineage Workflows.

Chrome Browser Extension

  • All the metadata is at your fingertips while browsing Looker, Superset, etc., with the OpenMetadata Chrome Browser Extension.
  • Chrome extension supports Google SSO, Azure SSO, Okta, and AWS Cognito authentication.
  • You can install the Chrome extension from the Chrome Web Store.

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.3-release...1.0.0-release

OpenMetadata - OpenMetadata 1.0.0-beta-release

Published by ulixius9 over 1 year ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.3-release...1.0.0-beta-release

OpenMetadata - 1.0.0-alpha-release

Published by ulixius9 over 1 year ago

What does Alpha mean?

This is a release from our WIP towards 1.0. We wanted to bring out some new features and fixes to the community (e.g., Glossary), for you to test things out and give feedback.

This release IS NOT INTENDED for any PROD environment or to be used to update any existing installation. Please, use an isolated run for these binaries. The docker quickstart would be the easiest way to handle this.

The OpenMetadata community will continue to patch things up and improve testing and stability for 1.0.

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.2-release...1.0.0-alpha-release

OpenMetadata - 0.13.3-release

Published by Sachin-chaurasiya over 1 year ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.2-release...0.13.3-release

OpenMetadata - OpenMetadata 0.13.2-Release

Published by aniketkatkar97 over 1 year ago

What’s New in Release 0.13.2

Improved SQL Lineage

  • We have collaborated with the sqllineage and sqlfluff communities to improve the parsing capabilities of sqllineage. We'll continue to collaborate to ship further improvements in new releases.

New Glossary UI

  • Moved from a tree view in the left panel to an easy-to-navigate list of the terms sorted alphabetically.
  • The term list shows the tags and descriptions in the cards.

Glossary Import & Export

  • You can now export your Glossary data as a CSV file.
  • In the same way, you can now bulk upload terms to a Glossary by adding their details in a CSV file.
  • The import utility will validate the file and show you a preview of the elements that are going to be imported to OpenMetadata.
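A validate-then-preview step of this kind might look roughly like the sketch below. The column names (`name`, `description`) and the function `preview_glossary_csv` are hypothetical; the real import format and its validation rules are defined by OpenMetadata and are not shown here.

```python
import csv
import io

REQUIRED_COLUMNS = ("name", "description")  # hypothetical required columns


def preview_glossary_csv(text):
    """Split CSV rows into (valid_rows, errors) without importing anything."""
    reader = csv.DictReader(io.StringIO(text))
    missing = [c for c in REQUIRED_COLUMNS if c not in (reader.fieldnames or [])]
    if missing:
        return [], [f"missing column(s): {', '.join(missing)}"]

    valid, errors = [], []
    # Data rows start on line 2, after the header.
    for line_no, row in enumerate(reader, start=2):
        empty = [c for c in REQUIRED_COLUMNS if not (row[c] or "").strip()]
        if empty:
            errors.append(f"line {line_no}: empty {', '.join(empty)}")
        else:
            valid.append(row)
    return valid, errors


sample = "name,description\nCustomer,A person who buys\nOrder,\n"
rows, problems = preview_glossary_csv(sample)
print(len(rows), problems)
```

Returning the valid rows and the errors separately lets a caller show the preview first and import only after the user confirms.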

Unified Tag Category API

  • Renamed Tag Categories to Classification, a more widely used term.
  • Updated the API to conform with the rest of the specifications. More info here.

Mutually Exclusive Tags

  • When creating a Classification or a Glossary term, you can now make the tags mutually exclusive.
  • If tags are set to be mutually exclusive, you won't be able to set multiple tags from the same category in the same asset.
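The rule can be illustrated with a short check. The `Classification.Tag` naming and the function `conflicting_tags` are assumptions for this sketch, not the server's validation code.

```python
def conflicting_tags(tags, exclusive_classifications):
    """Return mutually exclusive classifications that appear more than once.

    Tags are written as "Classification.Tag", e.g. "Tier.Tier1".
    """
    seen = {}
    for tag in tags:
        classification, _, _ = tag.partition(".")
        seen.setdefault(classification, []).append(tag)
    return {
        c: found
        for c, found in seen.items()
        if c in exclusive_classifications and len(found) > 1
    }


# "Tier" is marked mutually exclusive here, so two Tier tags conflict.
print(conflicting_tags(["Tier.Tier1", "Tier.Tier2", "PII.Sensitive"], {"Tier"}))
```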

EntityName

  • To better manage and harmonize entityName value and allow users to form better expectations around these values, the team introduced enforcement of the entityName format using a regex pattern. You can find more information about this in the docs.

Ingestion Framework

  • Performance Improvements: We are now getting descriptions in batch, making connectors such as Redshift or Snowflake way faster!
  • The Oracle connector now ships with the Thick mode enabled.
  • AWS QuickSight...

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.1-release...0.13.2-release

OpenMetadata - OpenMetadata 0.13.2-beta Release

Published by pmbrull over 1 year ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.1-release...0.13.2-beta-release

OpenMetadata - OpenMetadata 0.13.1-Release

Published by chirag-madlani almost 2 years ago

What’s New in Release 0.13.1

  • Freshness Metric has been introduced. Data freshness is defined by how often a table is being updated and the number of rows being affected. All this is displayed within the data profiler with filterable graphs. This is currently supported for BigQuery, Snowflake, and Redshift.
  • Data Quality Tests now support Data Lake services.
  • Notification Support experience has been improved. Users can define Alerts based on a Trigger (all data assets or a specific entity), Filters (events to consider), and Action (Slack, MS Teams, Email, Webhook) on where to send the alert.
  • dbt has its own workflow. Previously, dbt was a part of the metadata ingestion process. This allows users to ingest multiple dbt projects into the same database service.
  • Topic Schemas support field descriptions and tags. Previously, they were read-only. We now support JSON Schema, Avro and Protobuf parsing and field level details for topic schemas.
  • Data Insight Report has an improved layout. We now display a line graph instead of a bar graph. The Most Viewed Data Assets are clickable to view the asset details page.
  • Advanced Search improvements have been made. When a filter is applied, the details of the filter selected are displayed for clarity.
  • Side Preview on the Explore page UI is now available for all data assets. Previously it was only displayed for tables.
  • Airflow Lineage Operator and the OpenMetadata Hook are now part of the ingestion package. Send Airflow metadata from your DAGs and safely store the OpenMetadata server connection directly in Airflow.

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.13.0-release...0.13.1-release

OpenMetadata - OpenMetadata 0.13.0-Release

Published by ayush-shah almost 2 years ago

OpenMetadata 0.13.0 Release Highlights

Data Insights and KPIs

Data Insights, a game-changing feature, has been introduced that transforms the passive approach to data into a collaborative project toward improved data culture. Data Insights aims to provide a single-pane view of all the key metrics to best reflect the state of your data. OpenMetadata gathers all the metrics related to the metadata that you are extracting: the entities created, the types of entities, and the data evolution over a period of time. Based on these metrics, we provide analytics to assess the gathered data.

Admins can define the Key Performance Indicators (KPIs) and set goals within OpenMetadata to work towards better documentation, ownership, and tiering.
These goals are based on entities and driven to achieve targets within a specified time. For example, Admins can set goals to have at least 60% of the entities documented, owned and tiered by the end of Q1 2023.
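To make the 60% example concrete, here is a minimal sketch of how such a coverage figure could be computed and compared against a target. The `Entity` class and the functions `coverage` and `kpi_met` are hypothetical names for illustration; they are not OpenMetadata's KPI API.

```python
from dataclasses import dataclass


@dataclass
class Entity:
    name: str
    description: str = ""
    owner: str = ""


def coverage(entities, attribute):
    """Percentage of entities with a non-empty value for `attribute`."""
    if not entities:
        return 0.0
    filled = sum(1 for e in entities if getattr(e, attribute))
    return 100.0 * filled / len(entities)


def kpi_met(entities, attribute, target_percent):
    """True when the coverage for `attribute` reaches the target."""
    return coverage(entities, attribute) >= target_percent


catalog = [
    Entity("orders", description="Customer orders", owner="sales"),
    Entity("users", description="Registered users"),
    Entity("events", description="Clickstream events"),
    Entity("tmp_load"),
    Entity("staging_users", owner="data-eng"),
]
print(coverage(catalog, "description"), kpi_met(catalog, "description", 60))
```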

The data insights dashboard provides a quick glance at aspects like data ownership, description coverage, data tiering, and so on. Teams can view a timeseries report to track progress and monitor the health of your data. In addition to the metrics on data, Admins can view the aggregated user activity and get insights into user engagement and user growth. Admins can check for Daily active users and know how the tool is being used.

The Data Insights Report is emailed weekly, so that teams can assess their performance relative to the KPIs set at an organizational level to improve data culture on an ongoing basis.

Lineage

The lineage UI has been transformed to enhance user experience. Users can get a holistic view of an entity from the Lineage tab. When an entity is selected, the UI displays end-to-end lineage traceability for the table and column levels. Just search for an entity and expand the graph to unfold lineage. It’ll display the upstream and downstream for each node.

The Lineage Tab UI supports two-finger scrolling to zoom in or zoom out.

Data Quality

With the OpenMetadata UI, users can now create and deploy profiling workflows for the Datalake connector, which supports AWS S3 and GCS. In the next release, we’ll add support for running tests and for Azure ADLS.

Security

With the addition of LDAP SSO in the current release, OpenMetadata supports nine SSOs: LDAP, Google, Azure, Okta, OneLogin, Auth0, Amazon Cognito, Keycloak, and custom OIDC. In the 0.12.1 release, support was added for basic authentication to sign up using a Username/Password.

OpenMetadata Roles and Policies treat Bots as a special user with access to all the APIs and entities, just like an Admin. Bots have been in use for ingestion to extract metadata, as well as for data profiler and so on. In the 0.13 release, we’ve created multiple bots to serve different scenarios. For example, Ingestion Bot, Lineage Bot, Data Quality and Profiler Bot.

Given the varying roles for specific bots, the policies and access control for bots have been redefined. Now, Bots can have their own policies. For example, the Ingestion Bot can create and update entities. The Profiler Bot can only update the profile of a table; it has no policies for other entities and cannot update table descriptions, etc.

Advanced Search

OpenMetadata already supports advanced search syntax. Since it is syntax-driven, it is hard to use for anyone but advanced users. In the current release, a Syntax Editor has been introduced for advanced search with And/Or conditions that help discover assets quickly. A huge thank you to Cristian Osiac from Bloomberg for helping with this feature.

New Connectors

In the 0.13 release, we have introduced four new connectors:

Domo, a cloud-based dashboard service. The Domo Business Cloud is a low-code data app platform that takes the power of BI to the next level by combining all your data and putting it to work across any business process or workflow. OpenMetadata supports Domo as a Database, Dashboard, as well as a Pipeline service.

Hacktoberfest has been a complete success at the OpenMetadata community, with three connectors being developed as part of the event:

  • AWS SageMaker, a fully managed machine learning service, where data scientists and developers can quickly and easily build and train machine learning models, and then directly deploy them into a production-ready hosted environment.
  • AWS Kinesis, a cloud-based messaging service that allows real-time processing of large amounts of streaming data per second.
  • AWS QuickSight, a cloud-scale business intelligence (BI) service that allows everyone in the organization to understand the data by asking questions in natural language, exploring through interactive dashboards, or automatically looking for patterns and outliers powered by machine learning.

Big thanks and congratulations to Michael Zhou for developing AWS QuickSight and to Tushar Mittal for adding both AWS SageMaker and AWS Kinesis.

Several improvements have been made to the ingestion framework. In the 0.12.1 release, we shipped the ability to add a custom service type. Users can now develop their own connector and ingest it as with any other supported service! If you’d like to learn more about that, you can check out the demo!

Messaging Service Schemas

Major enhancements have been made to how data is extracted from Kafka and Redpanda Messaging services. Previously, OpenMetadata extracted all the Topics in the messaging queue and also connected to the Schema Registry to get the Schemas. These schemas were taken as one payload and published to OpenMetadata. We now parse Avro and Protobuf Schemas to extract the fields. Now, users can document each of these fields within a schema by adding descriptions and tags. Users can search based on the fields in the Schema of a Topic.
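The idea of splitting a schema payload into individual, documentable fields can be sketched for the Avro case, since an Avro schema is itself JSON. This example uses only the standard `json` module and handles top-level record fields; the function name `avro_fields` is an assumption, and OpenMetadata's own parser (and its Protobuf handling) is not shown.

```python
import json


def avro_fields(schema_text):
    """Return (name, type, doc) for each top-level field of an Avro record schema."""
    schema = json.loads(schema_text)
    if schema.get("type") != "record":
        raise ValueError("expected an Avro record schema")
    return [
        (field["name"], field["type"], field.get("doc", ""))
        for field in schema.get("fields", [])
    ]


schema_text = """
{
  "type": "record",
  "name": "Order",
  "fields": [
    {"name": "order_id", "type": "long", "doc": "Primary key"},
    {"name": "email", "type": ["null", "string"]}
  ]
}
"""
for name, avro_type, doc in avro_fields(schema_text):
    print(name, avro_type, doc)
```

Once each field is a separate item, descriptions and tags can be attached per field, and the field names can be indexed for search.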

Other Changes

  • Soft deleted entities can be restored. Currently, ML Models are the only entities not supported.
  • Soft deleted teams can be restored. When restoring a soft deleted parent team, the child teams will not be restored by default.

OpenMetadata - OpenMetadata 0.12.3-Release

Published by mohityadav766 almost 2 years ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.12.2-preview...0.12.3-release

OpenMetadata - OpenMetadata 0.13.0-Preview

Published by akash-jain-10 almost 2 years ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.12.2-release...0.13.0-preview

OpenMetadata - 0.12.2-Release

Published by Vj-L about 2 years ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.12.2-preview...0.12.2-release

OpenMetadata - OpenMetadata 0.12.2-Preview

Published by akash-jain-10 about 2 years ago

TBD - Preview Release

OpenMetadata - OpenMetadata 0.12.1-Release

Published by akash-jain-10 about 2 years ago

0.12.1 Release

Basic Authentication

  • User/Password signup and login
  • Email notifications for forgotten password
  • Admin can add new users and send an email

Elasticsearch full reindex through UI

  • Admins can now run a full Elasticsearch reindex from the UI itself

Versioning support for Custom Attributes

  • Any changes to entity custom attributes are now versioned

dbt Metadata - Tags

  • We support ingesting dbt tags into OpenMetadata

Bots Integration

  • Admins can create bots and configure their security mechanism from the UI itself.

Bug Fixes

  • Around 136 features, bug fixes, improvements, and tests made it into the 0.12.1 release

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.12.0-release...0.12.1-release

OpenMetadata - OpenMetadata 0.12.0-Release

Published by Vj-L about 2 years ago

0.12.0 Release - Sept 7th 2022 🎉

You can read the Release Blog here or watch an awesome video showing the new features!

Team Hierarchy

Prior releases supported a flat hierarchy of just Teams and Users. In 0.12, support has been added for the entire organizational hierarchy with Business Unit, Division, Department, and Groups. An organization from small to very large can now be modeled in OpenMetadata with this feature.

Roles and Policies

Access Control functionality has been revamped to support many use cases that were not possible before. Previously, a Role contained a single Policy, which consisted of simple Rules to Allow/Not Allow. The advanced rule configuration in the 0.12 release allows users to build more expressive rules using conditions.

  • A Role is a collection of Policies. Roles can be assigned to users or teams where all the users in the team inherit the team roles.
  • A Policy is a collection of Rules. A Policy can be reused as it can be part of a Role or can be directly assigned to Teams.
  • A Rule is defined by a set of Resources, a set of Operations, an Effect to either Deny or Allow the operation, and a condition written as a SpEL expression to add additional conditions based on metadata attributes. Examples of conditions: isOwner(), noOwner() && !matchTags('PII').
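The Role, Policy, and Rule relationship described above can be modeled in a few lines. This is only a sketch: conditions are plain Python predicates standing in for SpEL expressions, the operation names are made up, and the evaluation order (any matching Deny wins, otherwise at least one Allow is required) is an assumption rather than documented server behavior.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set


@dataclass
class Rule:
    resources: Set[str]
    operations: Set[str]
    effect: str  # "allow" or "deny"
    # Stand-in for a SpEL condition: a predicate over the asset's metadata.
    condition: Callable[[dict, str], bool] = lambda asset, user: True


@dataclass
class Policy:
    rules: List[Rule] = field(default_factory=list)


@dataclass
class Role:
    policies: List[Policy] = field(default_factory=list)


def is_allowed(role, user, resource, operation, asset):
    """Assumed semantics: any matching deny wins; otherwise one allow is needed."""
    allowed = False
    for policy in role.policies:
        for rule in policy.rules:
            matches = (
                resource in rule.resources
                and operation in rule.operations
                and rule.condition(asset, user)
            )
            if not matches:
                continue
            if rule.effect == "deny":
                return False
            allowed = True
    return allowed


# Rough analogues of isOwner() and !matchTags('PII').
is_owner = lambda asset, user: asset.get("owner") == user
no_pii = lambda asset, user: "PII" not in asset.get("tags", [])

role = Role([Policy([
    Rule({"table"}, {"EditDescription"}, "allow", is_owner),
    Rule({"table"}, {"ViewAll"}, "allow", no_pii),
])])
print(is_allowed(role, "ana", "table", "EditDescription", {"owner": "ana"}))
```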

Data Quality and Data Profiler

OpenMetadata began support for Data Quality in the 0.10 release, and support was added for publishing Great Expectations results in the 0.11 release. Our goal with OpenMetadata is to define metadata standards for all things data and in this release, we are standardizing Tests and Data Quality metadata. Data Quality Tests can be expressed in JSON schema and now these tests can be added dynamically using the Test Definitions API. We have also added a custom SQL data quality test that allows you to write your data quality tests using SQL statements.

An interactive dashboard helps to visualize and explore the data from the Data Profiler. You can explore how your data is changing over time, and identify data drifts using this dashboard. You can also see how data quality is changing by looking at how tests are doing over time. Even better, you can explore this at the table level or drill down to each column, going back up to 60 days.

The UI supports the detailed exploration of data quality tests, and users can drill down for the details of the test results present in a time series fashion. Tests can be added easily from the Profiler tab in the UI, both at the Table and Column levels. The UI provides a one-glance update on the metrics with a summary of data quality at the Table and Column levels.

Announcements

Informing users about upcoming changes to the data is a big challenge. In most organizations, a team sends an email well in advance about the change. But no one reads or tracks these emails, so when the change finally happens, many users are unprepared to handle it.

With Announcements, you can now inform your entire team of all the upcoming events and changes, such as deprecation, deletion, or schema changes. These announcements can be scheduled with a start date and an end date. All the users following your data are not only notified in Activity Feeds but a banner is also shown on the data asset details page for users to discover (or be reminded of) the announcement.

Activity Feed Notifications

In 0.12, we’ve also streamlined the Notifications menu with two separate tabs for Tasks and Mentions that display only the recent notifications. You can always navigate to your User Profile page to view more activities.

Slack & Microsoft Teams integration

Users can get timely updates about the metadata change events for all entities through APIs using webhooks. The webhook integration with Slack has been further improved in this release.

OpenMetadata also supports webhook integration to Microsoft Teams, just as it supports Slack. Users can choose to receive notifications for only the required entities by using event filters based on when an entity is created, updated, or deleted.
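Event filtering of this kind comes down to a lookup on the entity type and the event type. The sketch below assumes a simplified event payload with `entityType` and `eventType` keys and a hypothetical `should_notify` helper; it does not reproduce the webhook configuration schema.

```python
def should_notify(event, filters):
    """Return True if the event's entity type and event type pass the filters.

    `filters` maps an entity type to the set of event types to forward.
    """
    wanted = filters.get(event["entityType"], set())
    return event["eventType"] in wanted


# Forward table creations and deletions, and topic updates; drop everything else.
filters = {
    "table": {"entityCreated", "entityDeleted"},
    "topic": {"entityUpdated"},
}
print(should_notify({"entityType": "table", "eventType": "entityCreated"}, filters))
```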

Tasks

In the 0.11 release, a request to add or update descriptions for data assets could be converted to a Task. In the 0.12 release, Tasks can be created based on requests to create or update tags. Also, a glossary term approval workflow can be converted to a Task.

Secret Management Store Interface

In 0.12, we have completely revamped how secrets are stored and accessed, and by whom, by introducing a Secrets Manager Interface to communicate with any Key Management Store. The KMS will mediate between any OpenMetadata internal requirement and sensitive information. That way, users can choose to use the underlying database as the KMS, or any external system. The OpenMetadata community has already added support for AWS Key Management Service and AWS SSM.

Connectors

New connectors are an essential part of every release in OpenMetadata. We are introducing four new connectors in this release:

  • Redpanda is a Kafka API-compatible streaming data platform for developers that unifies historical and real-time data. OpenMetadata now supports Redpanda as a Messaging service, which allows users to document its topics and schemas. Refer to the Redpanda documentation for more info.
  • Dagster is a new-generation Python-based orchestrator that’s designed for developing and maintaining data assets, such as tables, data sets, machine learning models, and reports. It has been added as part of OpenMetadata’s pipeline connectors. Read more from the Dagster documentation.
  • Fivetran delivers ready-to-use connectors that automatically adapt as schemas and APIs change, ensuring consistent, reliable access to data. It has been added as a pipeline service. For more information, refer to the Fivetran documentation.
  • Apache NiFi automates the flow of data between systems. OpenMetadata now supports a NiFi connector as the third new pipeline service in this release.

Lineage

We’ve enhanced the performance of workflows by having a separate workflow for Lineage and Usage. By using two workflows for computing specific pieces of information, we can effectively filter down the queries to extract lineage.

  • During table usage ingestion, the tables retrieved successfully will be cached, so that there is no need to repeat the same calls multiple times as many queries would be referencing the same tables.
  • Usage queries have been optimized.
  • A result limit has been added to Usage queries.
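The caching idea mentioned above, where each table is looked up once however many queries reference it, can be sketched with `functools.lru_cache`. The `fetch_table` function simulates the lookup and is hypothetical; the ingestion framework's actual cache is not shown.

```python
from functools import lru_cache

lookups = []  # records which tables actually hit the (simulated) backend


@lru_cache(maxsize=None)
def fetch_table(fqn):
    """Simulated metadata lookup; a real one would call the server."""
    lookups.append(fqn)
    return {"fullyQualifiedName": fqn}


def tables_for_queries(queries_to_tables):
    """Resolve the tables referenced by each query, reusing cached results."""
    return [[fetch_table(t) for t in tables] for tables in queries_to_tables]


# Three queries reference five tables in total, but only two are distinct.
tables_for_queries([["db.orders", "db.users"], ["db.orders"], ["db.users", "db.orders"]])
print(lookups)
```

Note that this sketch caches every result, whereas the release notes mention caching only the tables retrieved successfully; a real implementation would avoid caching failed lookups.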

Global Settings

The OpenMetadata Settings dropdown menu has been transformed into a single, centralized Settings page for added convenience in viewing all the available options. The Global Settings page comprises options for Team Members, Access based on Roles and Policies, Services, Data Quality, Collaboration, Custom Attributes, and Integrations for webhooks and bots. Admins can view or update settings for various services like Slack, MS Teams, Webhooks, etc. from the Global Settings page.

UI/UX Improvements

The major UI/UX improvements have been made around Roles and Policies and the Global Settings page. Many smaller tweaks have also been made to the UI to improve the UX.

  • When creating a new user or when a user is registering for the first time, the dropdown menu for Teams now displays an option to ‘Show All’ teams. Previously, we supported the display of only the first 10 teams. An option has also been provided to search and filter.
  • UI improvements have been made on the Schema, Service, and Database details pages.
  • The Manage tab has been replaced with a Manage button in the UI.

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.11.0-release...0.12.0-release

OpenMetadata - OpenMetadata 0.11.5-release

Published by akash-jain-10 about 2 years ago

What's Changed

Full Changelog: https://github.com/open-metadata/OpenMetadata/compare/0.11.4-release...0.11.5-release

OpenMetadata - OpenMetadata 0.11.4-release

Published by akash-jain-10 about 2 years ago