metaflow - 2.9.1

Published by savingoyal over 1 year ago

Features

Introduce Slack notifications support for workflow running on Argo Workflows

With this release, Metaflow users can get notified on Slack when their workflows succeed or fail on Argo Workflows. Using this feature is straightforward:

Follow these instructions on Slack to set up incoming webhooks for your Slack workspace.
You should now have a webhook URL that Slack provides. Here is an example webhook:
```
https://hooks.slack.com/services/T0XXXXXXXXX/B0XXXXXXXXX/qZXXXXXX
```
To enable notifications on Slack when your Metaflow flow running on Argo Workflows succeeds or fails, deploy it using the --notify-on-error or --notify-on-success flags:
```
python flow.py argo-workflows create --notify-on-error --notify-on-success --notify-slack-webhook-url <slack-webhook-url>
```
You can also set METAFLOW_ARGO_WORKFLOWS_CREATE_NOTIFY_SLACK_WEBHOOK_URL=<slack-webhook-url> in your environment instead of specifying --notify-slack-webhook-url on the CLI every time.
Next time your workflow succeeds or fails on Argo Workflows, you will get a helpful notification on Slack.
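
Before deploying, it can help to confirm that the webhook URL itself works. The following is a minimal sketch, assuming only that Slack incoming webhooks accept a JSON body with a `text` field sent via POST; the helper names `build_slack_test_request` and `send` are hypothetical and not part of Metaflow. Nothing is sent until `send` is called.

```python
import json
import urllib.request


def build_slack_test_request(webhook_url: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a Slack incoming webhook."""
    payload = json.dumps({"text": text}).encode("utf-8")
    return urllib.request.Request(
        webhook_url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send(request: urllib.request.Request) -> int:
    """Send the request and return the HTTP status code."""
    with urllib.request.urlopen(request, timeout=10) as response:
        return response.status


# Placeholder URL copied from the example above; substitute your own webhook.
request = build_slack_test_request(
    "https://hooks.slack.com/services/T0XXXXXXXXX/B0XXXXXXXXX/qZXXXXXX",
    "Test message before deploying with --notify-on-error",
)
print(request.get_method(), request.full_url)
```

Calling `send(request)` with a real webhook URL should post the test message to the configured channel. If it does not, the problem lies with the webhook rather than with the Argo Workflows deployment.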

FAQ

I deployed my workflow following the instructions above, but I haven’t received any notifications yet?

This issue may very well happen if you are running Kubernetes v1.24 or newer.

Since v1.24, Kubernetes no longer automatically creates a secret for every serviceAccount. Argo Workflows relies on the existence of these secrets to run the lifecycle hooks that emit these notifications.

Follow these steps to explicitly create a secret for the service account that is responsible for executing Argo Workflows steps:

Run the following command, replacing service-account.name with the serviceAccount in your deployment. Also change the name of the secret so that it reflects the name of the serviceAccount that this secret is for:

```
cat <<EOF | kubectl apply -f -
apiVersion: v1
kind: Secret
metadata:
  name: default-sa-token #change according to the name of the sa
  annotations:
    kubernetes.io/service-account.name: default #replace with your sa
type: kubernetes.io/service-account-token
EOF
```

Edit the serviceAccount object to add the name of the above secret to it. You can use kubectl edit for this. The serviceAccount YAML should look like the following:

```
$ kubectl edit sa default -n jobs-default
...
apiVersion: v1
kind: ServiceAccount
metadata:
  creationTimestamp: "2023-05-05T20:58:58Z"
  name: default
  namespace: jobs-default
  resourceVersion: "6739507"
  uid: 4a708eff-d6ba-4dd8-80ee-8fb3c4c1e1c7
secrets:
- name: default-sa-token # should match the secret above
```
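
If you manage several service accounts, the secret manifest from the first step can also be generated programmatically. This is only a sketch: the function `service_account_token_secret` is hypothetical, and the `<name>-sa-token` naming scheme simply mirrors the `default-sa-token` example above rather than any Kubernetes requirement.

```python
import json


def service_account_token_secret(service_account: str) -> dict:
    """Return a Secret manifest equivalent to the YAML in the first step."""
    return {
        "apiVersion": "v1",
        "kind": "Secret",
        "metadata": {
            # Assumed naming scheme, mirroring "default-sa-token" above.
            "name": f"{service_account}-sa-token",
            "annotations": {
                "kubernetes.io/service-account.name": service_account,
            },
        },
        "type": "kubernetes.io/service-account-token",
    }


manifest = service_account_token_secret("default")
print(json.dumps(manifest, indent=2))
```

Since kubectl accepts JSON as well as YAML, the printed manifest can be piped into `kubectl apply -f -` in the same way as the heredoc above.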

That’s it! Try executing your workflow again on Argo Workflows. If you are still running into issues, reach out to us!

In case you need any assistance or have feedback for us, ping us at chat.metaflow.org or open a GitHub issue.

metaflow - 2.9.0

Published by savingoyal over 1 year ago

Features

Introduce support for composing multiple interrelated workflows through external events

With this release, Metaflow users can architect sequences of workflows that conduct data across teams, all the way from ETL and data warehouse to final ML outputs. Detailed documentation and a blog post to follow very shortly! Keep watching this space.

What's Changed

feature: add argo events environment variables to metaflow configure kubernetes by @saikonen in https://github.com/Netflix/metaflow/pull/1405
handle whitespaces in argo events parameters by @savingoyal in https://github.com/Netflix/metaflow/pull/1408
Add back comment for argo workflows by @savingoyal in https://github.com/Netflix/metaflow/pull/1409
Support ArgoEvent object with @kubernetes by @savingoyal in https://github.com/Netflix/metaflow/pull/1410
Print workflow template location as part of argo-workflows create by @savingoyal in https://github.com/Netflix/metaflow/pull/1411

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.6...2.9.0

metaflow - 2.8.6

Published by savingoyal over 1 year ago

Features

Introduce support for persistent volume claims for executions on Kubernetes

With this release, Metaflow users can attach existing persistent volume claims to Metaflow tasks running on a Kubernetes cluster.

To use this functionality, list your persistent volume claims and their mount points using the persistent_volume_claims argument of the @kubernetes decorator: @kubernetes(persistent_volume_claims={"pvc-claim-name": "mount-point", "another-pvc-claim-name": "another-mount-point"}).

Here is an example:

```
from metaflow import FlowSpec, step, kubernetes, current
import os

class MountPVCFlow(FlowSpec):

    @kubernetes(persistent_volume_claims={"test-pvc-feature-claim": "/mnt/testvol"})
    @step
    def start(self):
        print('testing PVC')
        mount = "/mnt/testvol"
        file = f"zeros_run_{current.run_id}"
        with open(os.path.join(mount, file), "w+") as f:
            f.write("\0" * 50)
            f.flush()

        print(f"mount folder contents: {os.listdir(mount)}")
        self.next(self.end)

    @step
    def end(self):
        print("finished")

if __name__ == "__main__":
    MountPVCFlow()
```

What's Changed

handle bools properly for argo-workflows task runtime cli by @savingoyal in https://github.com/Netflix/metaflow/pull/1395
fix: migrate R support to use importlib by @saikonen in https://github.com/Netflix/metaflow/pull/1396
Add configuration of username from metaflow_config.py by @tfurmston in https://github.com/Netflix/metaflow/pull/1400
feature: add Kubernetes support for PVC mounts by @saikonen in https://github.com/Netflix/metaflow/pull/1402
Update version to 2.8.6 by @savingoyal in https://github.com/Netflix/metaflow/pull/1404

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.5...2.8.6

metaflow - 2.8.5

Published by romain-intel over 1 year ago

Improvements

Make pickled Metaflow client objects accessible across namespaces

The previous release (2.8.4) broke a sequence of operations that used to work:

- Pickle a Metaflow object
- Access this Metaflow object in a different namespace
- Access a child or parent object of this object

This release restores the previous behavior.

What's Changed

feature: add sanitization for batch tags by @saikonen in https://github.com/Netflix/metaflow/pull/1376
fix: make metaflow config aware of profile environment variable by @saikonen in https://github.com/Netflix/metaflow/pull/1391
Fix an issue introduced in 2.8.4 that prevented pickled MetaflowObjec… by @romain-intel in https://github.com/Netflix/metaflow/pull/1392
Updating version to 2.8.5 by @pjoshi30 in https://github.com/Netflix/metaflow/pull/1393

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.4...2.8.5

metaflow - 2.8.4

Published by savingoyal over 1 year ago

Features

Introduce support for tmpfs for executions on Kubernetes

It is typical for the user code in a Metaflow step to download assets from an object store, e.g. S3. Examples include serialized models and raw input data, such as unstructured media or structured Parquet files. The amount of data loaded in a task is typically 10-100GB, allowing even terabytes to be handled in a foreach.

To reduce IO bottlenecks in such tasks, we provide an optimized client for S3, metaflow.S3, that makes it possible to download data using all available network bandwidth. Notably, in a modern instance the available network bandwidth can be higher than the local disk bandwidth. Consider: SATA 3.0 provides 6Gbit/s whereas a large instance can have 20Gbit/s network throughput. Even Gen3 NVMe provides just 16Gbit/s. To benefit from the full network bandwidth, local disk IO must be bypassed. The metaflow.S3 client accomplishes this by relying on the page cache: nominally, files are downloaded to a temporary directory on disk, but in practice all data stays in the page cache. This assumes that the downloaded data fits in memory, which can be ensured with a high enough @resources(memory=) setting.
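
To make the bandwidth comparison above concrete, here is a small back-of-the-envelope calculation for a 100GB download, the upper end of the typical range mentioned earlier. It is an idealized sketch that ignores protocol overhead and contention; the function name `transfer_seconds` is just for illustration.

```python
# Bandwidth figures quoted in the paragraph above, in Gbit/s.
BANDWIDTH_GBIT_PER_S = {
    "SATA 3.0 disk": 6,
    "Gen3 NVMe disk": 16,
    "large-instance network": 20,
}


def transfer_seconds(size_gb: float, gbit_per_s: float) -> float:
    """Ideal transfer time: 1 GB is 8 Gbit, ignoring protocol overhead."""
    return size_gb * 8 / gbit_per_s


for name, bandwidth in BANDWIDTH_GBIT_PER_S.items():
    print(f"{name}: {transfer_seconds(100, bandwidth):.0f} s for 100 GB")
# SATA 3.0 disk: 133 s for 100 GB
# Gen3 NVMe disk: 50 s for 100 GB
# large-instance network: 40 s for 100 GB
```

Under these assumptions the network finishes first, which is why writing through to a local disk would become the bottleneck.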

The above setup, which can provide excellent IO performance in general, has a small gotcha: The instance needs to have enough local disk space to back all the data, although no data actually hits the disk. Increasingly, instances may have more memory than local disk space available, so this superfluous requirement becomes a problem. This puts users in a strange situation: The instance has enough RAM to hold all the data in memory, and there are ways to download it quickly from S3, but the lack of local disk space (that is not even needed), makes it impossible to access the data.

Kubernetes supports mounting a tmpfs filesystem on the fly. Using this feature, the user can create a memory-backed file system which can be used as a temporary space for downloaded data. This removes the need to deal with any local disks. One can simply use a minimal root filesystem, which greatly simplifies the infrastructure setup.

With this release, we introduce a new config option, METAFLOW_TEMPDIR, which, if defined, is used as the default metaflow.S3(tmproot). If METAFLOW_TEMPDIR is not defined, tmproot='.' as before. In addition, a few new attributes are introduced for the @kubernetes decorator:

| Attribute (default) | Default behavior | Override semantics |
| --- | --- | --- |
| use_tmpfs=False | tmpfs disabled | use_tmpfs=True enables tmpfs |
| tmpfs_tempdir=True | sets METAFLOW_TEMPDIR=tmpfs_path | tmpfs_tempdir=False doesn't set METAFLOW_TEMPDIR |
| tmpfs_size=None | sets tmpfs size to 50% of @resources(memory) | tmpfs size in megabytes |
| tmpfs_path=None | use /metaflow_temp as tmpfs_path | custom mount point |

Examples

Handle large amounts of data in-memory with Kubernetes:

@kubernetes(memory=100000, use_tmpfs=True)

In this case, at most 50GB is available for tmpfs and it is used by S3 by default. Note that tmpfs only consumes the amount of memory corresponding to the data stored, so there is no downside in setting a large size by default.

Increase tmpfs size:

@kubernetes(memory=100000, tmpfs_size=100000)

Let tmpfs use all available memory. Note that use_tmpfs=True doesn’t have to be specified redundantly.

Custom tmpfs use case:

@kubernetes(memory=100000, tmpfs_size=10000, tmpfs_path='/data', tmpfs_tempdir=False)

Full control over settings - metaflow.S3 doesn’t use the tmpfs volume in this case.
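
The attribute table and the three examples above can be modeled in a few lines of plain Python. This is only a sketch of the documented semantics, not Metaflow's actual implementation: the function `resolve_tmpfs` and its return format are hypothetical, and it assumes that setting tmpfs_size implies use_tmpfs=True, as the second example suggests.

```python
from typing import Optional


def resolve_tmpfs(
    memory: int,
    use_tmpfs: bool = False,
    tmpfs_tempdir: bool = True,
    tmpfs_size: Optional[int] = None,
    tmpfs_path: Optional[str] = None,
) -> dict:
    """Illustrative model of the attribute table; sizes are in megabytes."""
    # Assumption: an explicit tmpfs_size enables tmpfs without use_tmpfs=True.
    enabled = use_tmpfs or tmpfs_size is not None
    if not enabled:
        return {"enabled": False}
    path = tmpfs_path or "/metaflow_temp"
    # Default size is 50% of the requested memory.
    size = tmpfs_size if tmpfs_size is not None else memory // 2
    # METAFLOW_TEMPDIR is only set when tmpfs_tempdir is left enabled.
    env = {"METAFLOW_TEMPDIR": path} if tmpfs_tempdir else {}
    return {"enabled": True, "path": path, "size_mb": size, "env": env}


print(resolve_tmpfs(memory=100000, use_tmpfs=True))
print(resolve_tmpfs(memory=100000, tmpfs_size=100000))
print(resolve_tmpfs(memory=100000, tmpfs_size=10000, tmpfs_path="/data", tmpfs_tempdir=False))
```

The three calls correspond to the three decorator examples: a 50000 MB volume at /metaflow_temp used by metaflow.S3, a 100000 MB volume, and a 10000 MB volume at /data that metaflow.S3 does not use.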

Besides metaflow.S3, the user may want to use the tmpfs volume for their own use cases. In particular, many modern ML libraries require a local cache. To support these use cases, tmpfs_path is exposed through the current object, as current.tempdir.
This allows the user to leverage the volume straightforwardly:

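```
from metaflow import current
from transformers import AutoModelForSeq2SeqLM

# model_path is assumed to be defined earlier in the step
AutoModelForSeq2SeqLM.from_pretrained(
    model_path,
    cache_dir=current.tempdir,  # cache on the tmpfs volume
    device_map='auto',
    load_in_8bit=True,
)
```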

Introduce current.run and current.task in the current singleton

With this release, you can access current.run and current.task within a running flow, allowing for use cases like:

```
from metaflow import current

# add tags from inside a run
current.run.add_tag('foobar')
```

Improvements

Make metaflow client objects backward compatible

The previous release broke backward compatibility in cases where the metaflow client object is deserialized from an older version of Metaflow. This release preserves the functionality and provides explicit compatibility guarantees going forward.

What's Changed

Fix: Check all steps for MetaflowCode and return if any by @bsridatta in https://github.com/Netflix/metaflow/pull/1338
chore: comment on run.code by @saikonen in https://github.com/Netflix/metaflow/pull/1357
Add kubernetes labels by @dhpollack in https://github.com/Netflix/metaflow/pull/1236
Revert "Add kubernetes labels" by @savingoyal in https://github.com/Netflix/metaflow/pull/1359
fix: batch tmpfs enabling logic by @saikonen in https://github.com/Netflix/metaflow/pull/1365
feature: tmpfs for kubernetes and argo by @saikonen in https://github.com/Netflix/metaflow/pull/1361
Fix: Validate pathspec argument for MetaflowObject by @bsridatta in https://github.com/Netflix/metaflow/pull/1350
Fix: METAFLOW_S3_ENDPOINT_URL as a part of airflow by @valayDave in https://github.com/Netflix/metaflow/pull/1368
Introduce support for event-triggered workflows by @savingoyal in https://github.com/Netflix/metaflow/pull/1271
feature: remove pylint dependency by @saikonen in https://github.com/Netflix/metaflow/pull/1378
Fixing a MetaflowObject backward compatibility issue by @pjoshi30 in https://github.com/Netflix/metaflow/pull/1363
added missing return statement by @felipeGarciaDiaz in https://github.com/Netflix/metaflow/pull/1383
fix: batch decorator missing metadata handling by @saikonen in https://github.com/Netflix/metaflow/pull/1385
mute argo event emmission by @savingoyal in https://github.com/Netflix/metaflow/pull/1386
Update current object adding run and task object. by @romain-intel in https://github.com/Netflix/metaflow/pull/1384
release 2.8.4 by @savingoyal in https://github.com/Netflix/metaflow/pull/1388

New Contributors

@dhpollack made their first contribution in https://github.com/Netflix/metaflow/pull/1236
@felipeGarciaDiaz made their first contribution in https://github.com/Netflix/metaflow/pull/1383

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.3...2.8.4

metaflow - 2.8.3

Published by savingoyal over 1 year ago

Features

Introduce support for tmpfs for executions on AWS Batch

It is typical for the user code in a Metaflow step to download assets from an object store, e.g. S3. Examples include serialized models and raw input data, such as unstructured media or structured Parquet files. The amount of data loaded in a task is typically 10-100GB, allowing even terabytes to be handled in a foreach.

To reduce IO bottlenecks in such tasks, we provide an optimized client for S3, metaflow.S3, that makes it possible to download data using all available network bandwidth. Notably, in a modern instance the available network bandwidth can be higher than the local disk bandwidth. Consider: SATA 3.0 provides 6Gbit/s whereas a large instance can have 20Gbit/s network throughput. Even Gen3 NVMe provides just 16Gbit/s. To benefit from the full network bandwidth, local disk IO must be bypassed. The metaflow.S3 client accomplishes this by relying on the page cache: nominally, files are downloaded to a temporary directory on disk, but in practice all data stays in the page cache. This assumes that the downloaded data fits in memory, which can be ensured with a high enough @resources(memory=) setting.

The above setup, which can provide excellent IO performance in general, has a small gotcha: The instance needs to have enough local disk space to back all the data, although no data actually hits the disk. Increasingly, instances may have more memory than local disk space available, so this superfluous requirement becomes a problem. The issue is further amplified by the fact that as of today, it is impossible to add ephemeral volumes on the fly on AWS Batch. This puts users in a strange situation: The instance has enough RAM to hold all the data in memory, and there are ways to download it quickly from S3, but the lack of local disk space (that is not even needed), makes it impossible to access the data.

AWS Batch supports mounting a tmpfs filesystem on the fly. Using this feature, the user can create a memory-backed file system which can be used as a temporary space for downloaded data. This removes the need to deal with any local disks. One can simply use a minimal root filesystem, which greatly simplifies the infrastructure setup.

With this release, we introduce a new config option, METAFLOW_TEMPDIR, which, if defined, is used as the default metaflow.S3(tmproot). If METAFLOW_TEMPDIR is not defined, tmproot='.' as before. In addition, a few new attributes are introduced for the @batch decorator:

| Attribute (default) | Default behavior | Override semantics |
| --- | --- | --- |
| use_tmpfs=False | tmpfs disabled | use_tmpfs=True enables tmpfs |
| tmpfs_tempdir=True | sets METAFLOW_TEMPDIR=tmpfs_path | tmpfs_tempdir=False doesn't set METAFLOW_TEMPDIR |
| tmpfs_size=None | sets tmpfs size to 50% of @resources(memory) | tmpfs size in megabytes |
| tmpfs_path=None | use /metaflow_temp as tmpfs_path | custom mount point |

Examples

Handle large amounts of data in-memory with Batch:

@batch(memory=100000, use_tmpfs=True)

In this case, at most 50GB is available for tmpfs and it is used by S3 by default. Note that tmpfs only consumes the amount of memory corresponding to the data stored, so there is no downside in setting a large size by default.

Increase tmpfs size:

@batch(memory=100000, tmpfs_size=100000)

Let tmpfs use all available memory. Note that use_tmpfs=True doesn’t have to be specified redundantly.

Custom tmpfs use case:

@batch(memory=100000, tmpfs_size=10000, tmpfs_path='/data', tmpfs_tempdir=False)

Full control over settings - metaflow.S3 doesn’t use the tmpfs volume in this case.

Besides metaflow.S3, the user may want to use the tmpfs volume for their own use cases. In particular, many modern ML libraries require a local cache. To support these use cases, tmpfs_path is exposed through the current object, as current.tempdir.
This allows the user to leverage the volume straightforwardly:

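```
from metaflow import current
from transformers import AutoModelForSeq2SeqLM

# model_path is assumed to be defined earlier in the step
AutoModelForSeq2SeqLM.from_pretrained(
    model_path,
    cache_dir=current.tempdir,  # cache on the tmpfs volume
    device_map='auto',
    load_in_8bit=True,
)
```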

Introduce auto-completion support for Metaflow client in IPython notebooks

With this release, Metaflow client objects support autocomplete in IPython notebooks:

```
from metaflow import Flow, Metaflow

Metaflow().flows
# [Flow('HelloFlow'), Flow('MovieStatsFlow')]

flow = Flow('HelloFlow')  # No autocomplete here
flow._ipython_key_completions_()
# ['1680815181013681',
#  '1680815178214737',
#  '1680432265121345',
#  '1680430310127401']

run = flow["1680815178214737"]
run._ipython_key_completions_()
# ['end', 'hello', 'start']

step = run["hello"]
step._ipython_key_completions_()
# ['2']

task = step["2"]
task._ipython_key_completions_()
# ['name']
```
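
For readers unfamiliar with the hook used above: IPython calls an object's `_ipython_key_completions_` method to collect candidates when completing a subscript such as `obj["<TAB>`. The toy class below sketches the protocol in isolation; `RunIndex` is a hypothetical stand-in and not a Metaflow class.

```python
class RunIndex:
    """Toy container that offers its keys to IPython's tab completion."""

    def __init__(self, steps):
        self._steps = dict(steps)

    def __getitem__(self, name):
        return self._steps[name]

    def _ipython_key_completions_(self):
        # IPython calls this hook when completing obj["<TAB>
        return sorted(self._steps)


run = RunIndex({"start": 1, "hello": 2, "end": 3})
print(run._ipython_key_completions_())  # ['end', 'hello', 'start']
```

Any object that supports `__getitem__` can opt in this way, which is presumably how the client objects expose their child identifiers.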

Improvements

Reduce metadata service network calls for faster execution of flows

With this release, Metaflow flows should execute slightly faster since a few network calls to Metaflow's metadata service are now cached. Expect further improvements in flow execution times over the next few releases.

Handle unsupported data types for pandas.DataFrame gracefully for Metaflow's default card

With this release, Metaflow card creation will handle non-JSON parseable types gracefully by replacing the column values with UnsupportedType : <TYPENAME>.
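
The fallback described above can be illustrated without pandas. The sketch below is not the card's actual code: the helper `json_safe` is hypothetical and uses `json.dumps` as a stand-in for whatever serializability check the default card performs.

```python
import json


def json_safe(value):
    """Return value unchanged if JSON can encode it, else a placeholder string."""
    try:
        json.dumps(value)
        return value
    except (TypeError, ValueError):
        return f"UnsupportedType : {type(value).__name__}"


column = [1, "a", 2.5, {1, 2}, complex(1, 2)]
print([json_safe(v) for v in column])
# [1, 'a', 2.5, 'UnsupportedType : set', 'UnsupportedType : complex']
```

Values that JSON can represent pass through untouched, while sets and complex numbers are replaced by a label naming their type.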

What's Changed

Introduce codeql by @savingoyal in https://github.com/Netflix/metaflow/pull/1272
fix: GitHub Workflow security recommendations by @saikonen in https://github.com/Netflix/metaflow/pull/1334
Add docstring style to contribution code style guide by @jimbudarz in https://github.com/Netflix/metaflow/pull/1328
remove METAFLOW_DATATOOLS_SYSROOT_S3 from configuration command by @tfurmston in https://github.com/Netflix/metaflow/pull/1312
Fix #1326 and strips ext_info from blobs passed to schedulers by @romain-intel in https://github.com/Netflix/metaflow/pull/1329
Namespace check skip feature from #1271 by @romain-intel in https://github.com/Netflix/metaflow/pull/1341
Introduce tmpfs config options for @batch by @savingoyal in https://github.com/Netflix/metaflow/pull/1287
fix: kubernetes ec2 instance metadata timeout by @saikonen in https://github.com/Netflix/metaflow/pull/1335
Make the contact information displayed by the Metaflow command configurable by @romain-intel in https://github.com/Netflix/metaflow/pull/1340
Safely parse pandas.DataFrame for default card by @valayDave in https://github.com/Netflix/metaflow/pull/1344
Reduce multiple metadata service rtts using cached version. by @shrinandj in https://github.com/Netflix/metaflow/pull/1347
Kubernetes running job cancellation to fallback to patching parallelism by @jackie-ob in https://github.com/Netflix/metaflow/pull/1353
Remove encoding for JSON.loads by @wangchy27 in https://github.com/Netflix/metaflow/pull/1352
Prep for 2.8.3 release by @savingoyal in https://github.com/Netflix/metaflow/pull/1354

New Contributors

@wangchy27 made their first contribution in https://github.com/Netflix/metaflow/pull/1352

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.2...2.8.3

metaflow - 2.8.2

Published by savingoyal over 1 year ago

Features

Introduce support for Metaflow sandboxes for Metaflow tutorials

With this release, the Metaflow tutorials can now be executed within the Metaflow sandboxes, making it trivial to evaluate whether Metaflow is a good fit for your organization without committing to deploying the necessary cloud infrastructure upfront.

Display Metaflow UI URL on the terminal when a flow is executed via `step-functions trigger` or `argo-workflows trigger`

With this release, if the Metaflow config (in ~/.metaflow_config) includes a reference to the deployed Metaflow UI (assigned to METAFLOW_UI_URL), the user-facing logs in the terminal will indicate the direct link to the relevant run view in the Metaflow UI.

What's Changed

Add a way to create aliases to other parts of metaflow by @romain-intel in https://github.com/Netflix/metaflow/pull/1304
feature: emit UI url for argo workflows and step-functions by @saikonen in https://github.com/Netflix/metaflow/pull/1311
fix: update cards dependencies by @saikonen in https://github.com/Netflix/metaflow/pull/1314
Sync tutorials for Outerbounds sandbox by @emattia in https://github.com/Netflix/metaflow/pull/1299
Fix the logs command in cases where the step/task hasn't finished by @romain-intel in https://github.com/Netflix/metaflow/pull/1315
Update version to 2.8.2 by @savingoyal in https://github.com/Netflix/metaflow/pull/1325

New Contributors

@emattia made their first contribution in https://github.com/Netflix/metaflow/pull/1299

Full Changelog: https://github.com/Netflix/metaflow/compare/2.8.1...2.8.2

metaflow - 2.8.1

Published by savingoyal over 1 year ago

Features

Add ec2 instance metadata in `task.metadata_dict` when a task executes on AWS Batch

With this release, task.metadata_dict includes the fields ec2-instance-id, ec2-instance-type, ec2-region, and ec2-availability-zone whenever the Metaflow task is executed on AWS Batch and the task container has access to the EC2 instance metadata endpoint (the "magic" URL).
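
As a rough sketch of how these fields might be consumed, the helper below picks them out of a metadata dictionary. The function `ec2_summary` is hypothetical, and the sample values are made up; in practice the dictionary would come from a task's metadata_dict in the Metaflow client.

```python
EC2_FIELDS = (
    "ec2-instance-id",
    "ec2-instance-type",
    "ec2-region",
    "ec2-availability-zone",
)


def ec2_summary(metadata: dict) -> dict:
    """Pick the EC2 fields out of a task's metadata, skipping absent ones."""
    return {key: metadata[key] for key in EC2_FIELDS if key in metadata}


# Made-up values standing in for a real task's metadata_dict.
sample = {
    "ec2-instance-id": "i-0123456789abcdef0",
    "ec2-instance-type": "m5.xlarge",
    "ec2-region": "us-west-2",
    "ec2-availability-zone": "us-west-2a",
    "attempt": "0",
}
print(ec2_summary(sample))
```

Skipping absent keys keeps the helper usable for tasks that ran locally or without access to the metadata endpoint, where these fields are not expected to be present.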

Display Metaflow UI URL on the terminal when a flow is executed either via `run` or `resume`

With this release, if the Metaflow config (in ~/.metaflow_config) includes a reference to the deployed Metaflow UI (assigned to METAFLOW_UI_URL), the user-facing logs in the terminal will indicate the direct link to the relevant run view in the Metaflow UI.

metaflow - 2.8.0

Published by savingoyal over 1 year ago

Features

Introduce capability to schedule Metaflow flows with Apache Airflow

With this release, we are introducing an integration with Apache Airflow, similar to our integrations with AWS Step Functions and Argo Workflows. Metaflow users can deploy and schedule their DAGs by executing

```
python myflow.py airflow create mydag.py
```

which creates an Airflow DAG for them. With this feature, Metaflow users can enjoy all the features of Metaflow on top of Apache Airflow, including a more user-friendly and productive development API for data scientists and data engineers, without needing to change anything in their existing pipelines or operational playbooks, as described in the announcement blog post. To learn how to deploy and operate the integration, see Using Airflow with Metaflow.

When running on Airflow, Metaflow code works exactly as it does locally: No changes are required in the code. With this integration, Metaflow users can inspect their flows deployed on Apache Airflow as before and debug and reproduce results from Apache Airflow on their local laptop or within a notebook. All tasks are run on Kubernetes respecting the @resources decorator as if the @kubernetes decorator was added to all steps, as explained in Executing Tasks Remotely.

The main benefits of using Metaflow with Airflow are:

- You get to use the human-friendly API of Metaflow to define and test workflows. Almost all features of Metaflow work with Airflow out of the box, except nested foreaches, which are not yet supported by Airflow, and @batch as the current integration only supports @kubernetes at the moment.
- You can deploy Metaflow flows to your existing Airflow server without having to change anything operationally. From Airflow's point of view, Metaflow flows look like any other Airflow DAG.
- If you want to consider moving to another orchestrator supported by Metaflow, you can test them easily just by changing one command to deploy to Argo Workflows or AWS Step Functions.

metaflow - Metaflow 2.7.23

Published by romain-intel over 1 year ago

What's Changed

New MF configs for Argo Workflows by @jackie-ob in https://github.com/Netflix/metaflow/pull/1267
Added typing information for all public APIs by @romain-intel in https://github.com/Netflix/metaflow/pull/1158
When packaging metaflow_extensions, add an empty __init__.py file. by @romain-intel in https://github.com/Netflix/metaflow/pull/1276
Replace instances of "INFO" with a constant by @romain-intel in https://github.com/Netflix/metaflow/pull/1275
Fix an issue with threading and the escape hatch. by @romain-intel in https://github.com/Netflix/metaflow/pull/1274

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.22...2.7.23

metaflow - Metaflow 2.7.22

Published by romain-intel over 1 year ago

What's Changed

Test metaflow.s3 on multiple Python versions across Linux and MacOS by @savingoyal in https://github.com/Netflix/metaflow/pull/1246
fix metaflow.s3 tests by @savingoyal in https://github.com/Netflix/metaflow/pull/1248
support timezone when scheduling with Argo workflows by @amerberg in https://github.com/Netflix/metaflow/pull/1250
Airflow V2 PR (Foreach + Sensors + GCP + MWAA Support) by @valayDave in https://github.com/Netflix/metaflow/pull/1256
Expose Kubernetes Node IP in task metadata by @savingoyal in https://github.com/Netflix/metaflow/pull/1254
Fix timezone code in Argo Workflows by @jackie-ob in https://github.com/Netflix/metaflow/pull/1258
Implement @secrets, with AWS support by @jackie-ob in https://github.com/Netflix/metaflow/pull/1251
Expose AWS instance metadata for @kubernetes by @savingoyal in https://github.com/Netflix/metaflow/pull/1263
Fix an issue with configurations for env escape extensions by @romain-intel in https://github.com/Netflix/metaflow/pull/1264
Bump version to 2.7.22 by @savingoyal in https://github.com/Netflix/metaflow/pull/1247

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.21...2.7.22

metaflow - Metaflow 2.7.21

Published by romain-intel over 1 year ago

What's Changed

Fix extension support on Python 3.5 by @romain-intel in https://github.com/Netflix/metaflow/pull/1245

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.20...2.7.21

metaflow - Metaflow 2.7.20

Published by romain-intel over 1 year ago

What's Changed

Bug/long card by @obgibson in https://github.com/Netflix/metaflow/pull/1233
Restrict token permissions for test jobs by @romain-intel in https://github.com/Netflix/metaflow/pull/1238
Since we're now providing type annotations, add typed marker as per PEP-561 by @oavdeev in https://github.com/Netflix/metaflow/pull/1239
Allow configuration of plugins/cmds using METAFLOW_ENABLED_* variable by @romain-intel in https://github.com/Netflix/metaflow/pull/1212
Bump setup.py to 2.7.20 by @romain-intel in https://github.com/Netflix/metaflow/pull/1244

Incompatible change

If you are using the unsupported Metaflow Extensions mechanism, you may have to change them slightly. Please see https://github.com/Netflix/metaflow-extensions-template/blob/master/CHANGES.md for more details.

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.19...2.7.20

metaflow - Metaflow 2.7.19

Published by savingoyal almost 2 years ago

What's Changed

Reduce @environment arg length for step-functions create by @savingoyal in https://github.com/Netflix/metaflow/pull/1215
Add support for Kubernetes tolerations by @odracci in https://github.com/Netflix/metaflow/pull/1207
Support for AWS inferentia instances on Batch by @DanCorvesor in https://github.com/Netflix/metaflow/pull/1205
Fix CVE-2007-4559 (tar.extractall) by @romain-intel in https://github.com/Netflix/metaflow/pull/1213
Support .conda packages by @savingoyal in https://github.com/Netflix/metaflow/pull/1221

New Contributors

@odracci made their first contribution in https://github.com/Netflix/metaflow/pull/1207
@DanCorvesor made their first contribution in https://github.com/Netflix/metaflow/pull/1205

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.18...2.7.19

metaflow - Metaflow 2.7.18

Published by romain-intel almost 2 years ago

What's Changed

Adds check for tutorials dir and flattens if necessary by @ashrielbrian in https://github.com/Netflix/metaflow/pull/1211
Fix bug with datastore backend instantiation by @savingoyal in https://github.com/Netflix/metaflow/pull/1210

New Contributors

@ashrielbrian made their first contribution in https://github.com/Netflix/metaflow/pull/1211

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.17...2.7.18

metaflow - Metaflow 2.7.17

Published by romain-intel almost 2 years ago

What's Changed

Fix regression causing CL tool to not work. by @romain-intel in https://github.com/Netflix/metaflow/pull/1209
Bump qs from 6.5.2 to 6.5.3 in /metaflow/plugins/cards/ui by @dependabot in https://github.com/Netflix/metaflow/pull/1208

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.16...2.7.17

metaflow - Metaflow 2.7.16

Published by romain-intel almost 2 years ago

What's Changed

Deal with transient errors (like SlowDowns) more effectively for S3 by @romain-intel in https://github.com/Netflix/metaflow/pull/1186
Fix/move data files by @romain-intel in https://github.com/Netflix/metaflow/pull/1206
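
As background for the S3 entry above, throttling responses such as SlowDown are commonly handled by retrying with exponential backoff and jitter. The sketch below shows that general pattern only; it is not Metaflow's implementation, and `TransientError`, `retry_with_backoff`, and all parameter names are hypothetical.

```python
import random
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class TransientError(Exception):
    """Hypothetical stand-in for a throttling response such as S3 SlowDown."""


def retry_with_backoff(
    op: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 0.5,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call `op`, retrying on TransientError with capped, jittered backoff."""
    for attempt in range(max_attempts):
        try:
            return op()
        except TransientError:
            # Out of attempts: surface the last error to the caller.
            if attempt == max_attempts - 1:
                raise
            # Double the delay ceiling each attempt, capped at max_delay.
            ceiling = min(max_delay, base_delay * 2 ** attempt)
            # Full jitter spreads retries so clients do not retry in lockstep.
            sleep(random.uniform(0, ceiling))
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the helper easy to test without real waiting.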

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.15...2.7.16

metaflow - Metaflow 2.7.15

Published by romain-intel almost 2 years ago

What's Changed

Handle aborted Kubernetes workloads. by @shrinandj in https://github.com/Netflix/metaflow/pull/1195
Bump loader-utils from 3.2.0 to 3.2.1 in /metaflow/plugins/cards/ui by @dependabot in https://github.com/Netflix/metaflow/pull/1194
Fix ._orig access for submodules for MF extensions by @romain-intel in https://github.com/Netflix/metaflow/pull/1174
Update black to latest version by @savingoyal in https://github.com/Netflix/metaflow/pull/1199
allow equal sign in decorator spec values by @amerberg in https://github.com/Netflix/metaflow/pull/1197
Typo repair and PEP8 cleanup by @jimbudarz in https://github.com/Netflix/metaflow/pull/1190
Pin GH tests to Ubuntu 20.04 by @savingoyal in https://github.com/Netflix/metaflow/pull/1201
Set gpu resources correctly "--with kubernetes" by @shrinandj in https://github.com/Netflix/metaflow/pull/1202
Clean up configuration variables by @romain-intel in https://github.com/Netflix/metaflow/pull/1183
GCP datastore implementation by @jackie-ob in https://github.com/Netflix/metaflow/pull/1135
Bump version; remove R tests by @romain-intel in https://github.com/Netflix/metaflow/pull/1204
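
To illustrate the "allow equal sign in decorator spec values" entry above: a spec such as `batch:cpu=2,memory=4000` is a decorator name followed by comma-separated `key=value` pairs, and splitting each pair on only the first `=` lets the value itself contain `=`. This is a simplified sketch under that assumption, not the parser from the linked pull request; `parse_decospec` is a hypothetical name.

```python
from typing import Dict, Tuple


def parse_decospec(spec: str) -> Tuple[str, Dict[str, str]]:
    """Split 'name:key=value,key=value' into (name, attributes).

    Hypothetical helper for illustration; values are kept as strings.
    """
    # Everything before the first ':' is the decorator name.
    name, _, attr_str = spec.partition(":")
    attrs: Dict[str, str] = {}
    for item in filter(None, attr_str.split(",")):
        # partition splits on the FIRST '=', so later '=' stay in the value.
        key, sep, value = item.partition("=")
        if not sep:
            raise ValueError(f"Expected key=value, got: {item!r}")
        attrs[key.strip()] = value.strip()
    return name, attrs
```

For example, `parse_decospec("environment:vars=A=B")` yields the name `environment` and the single attribute `vars` with value `A=B`.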

New Contributors

@shrinandj made their first contribution in https://github.com/Netflix/metaflow/pull/1195
@amerberg made their first contribution in https://github.com/Netflix/metaflow/pull/1197
@jimbudarz made their first contribution in https://github.com/Netflix/metaflow/pull/1190

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.14...2.7.15

metaflow - Metaflow 2.7.14

Published by romain-intel almost 2 years ago

What's Changed

fix pandas call bug by @mbalajew in https://github.com/Netflix/metaflow/pull/1173
Metaflow pathspec in Airflow UI by @valayDave in https://github.com/Netflix/metaflow/pull/1119
Allow the input paths to be passed via a file by @romain-intel in https://github.com/Netflix/metaflow/pull/1181
Check compatibility for R 4.2 by @savingoyal in https://github.com/Netflix/metaflow/pull/1160
issue 1040 fix: apply _sanitize to template names in Argo workflows by @johnaparker in https://github.com/Netflix/metaflow/pull/1180

New Contributors

@mbalajew made their first contribution in https://github.com/Netflix/metaflow/pull/1173
@johnaparker made their first contribution in https://github.com/Netflix/metaflow/pull/1180

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.13...2.7.14

metaflow - Metaflow 2.7.13

Published by romain-intel about 2 years ago

What's Changed

Add cmd extension point to allow MF extensions to extend it by @romain-intel in https://github.com/Netflix/metaflow/pull/1143
Fix periodic messages printed at runtime by @romain-intel and @jackie-ob in https://github.com/Netflix/metaflow/pull/1061, https://github.com/Netflix/metaflow/pull/1151 and https://github.com/Netflix/metaflow/pull/1159
Pass datastore_type to validate_environment by @romain-intel in https://github.com/Netflix/metaflow/pull/1152
Support kubernetes_conn_id in Airflow integration by @valayDave in https://github.com/Netflix/metaflow/pull/1153
Use json to dump/load decorator specs by @romain-intel in https://github.com/Netflix/metaflow/pull/1144
argo use kubernetes client class by @oavdeev in https://github.com/Netflix/metaflow/pull/1163
Rewrite IncludeFile implementation by @romain-intel in https://github.com/Netflix/metaflow/pull/1109
Add options to make card generation faster in some cases by @romain-intel in https://github.com/Netflix/metaflow/pull/1167
Env escape improvements and bug fixes by @romain-intel in https://github.com/Netflix/metaflow/pull/1166
Allow figures in Image.from_matplotlib by @valayDave in https://github.com/Netflix/metaflow/pull/1147
Bump for release by @romain-intel in https://github.com/Netflix/metaflow/pull/1168

Full Changelog: https://github.com/Netflix/metaflow/compare/2.7.12...2.7.13
