Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

What is this?

This projects provides an abstraction layer between secrets and services by externalizing the configuration using yaml. In constrast to other config libraries the library returns fully-configured and authenticated client SDK objects for services. Secrets can be fetched from a number of sources. It's expected that the briefcase.yaml is stored along side notebooks (e.g. in the root folder of a git repository).

Goals

Simplify authentication
Enable resource sharing between Notebooks and team members
Improve service specific SDK discoverability

Example: Azure Blob and Environment Variable

Accessing private blobs is usually performed using SAS tokens or by sharing account keys.

pip install mlbriefcase

Create briefcase.yaml

azure:
    storage:
        blob:
            -name: blob1
             url: https://myblob123.blob.core.windows.net/test/test.csv

# use Azure Storage Account key
export blob1=KwY...8w==

import mlbriefcase
import pandas as pd

# searches for briefcase.yaml in current directory and all parent directories
briefcase = mlbriefcase.Briefcase()

# let's get the resource by name
blob = briefcase['blob1']

# Performs
# - probe credential providers (e.g. environment variable, dotenv, ...) to find storage account key
# - create Azure Storage SDK object (available through blob.get_client())
# - generated authenticated url using SAS token
url = blob.get_url()  

df = pd.read_csv(url, sep='\t')

Example: Azure Cognitive Vision Service and .env file

This example demonstrate how to get the Azure Cognitive Vision service client.

Create briefcase.yaml

azure:
  cognitiveservice:
    vision:
      - name: vision1

vision1=<Insert Cog Service Key>

import mlbriefcase

# searches for briefcase.yaml in current directory and all parent directories
briefcase = mlbriefcase.Briefcase()

# Performs
# - probe credential providers (e.g. environment variable, dotenv, ...) to find cognitive service key
# - initialize the Cognitive Service Vision SDK object
vision = briefcase['vision1'].get_client()

vision.detect... # TODO

Rules of the Game

briefcase.yaml is searched in the current directory and if not found recursed until the root directory.
The name of a service (e.g. vision1) is used as the key name for the corresponding secret. The key can be customized (see Remapping Keys).
Credential providers are probed for keys in order

Jupyter Lab Credentials
Python Keyring
Environment variables
.env files
All credential providers defined in the briefcase.yaml
Behavior can be customized - see Specific credential provider

Any other required service property not found in the yaml is searched for in credential providers (e.g. one might not want to share the endpoint for a keyvault) using name.property (or name_property for environment variables)

Features

Remapping Keys

In the example below the Cognitive Service Vision token is searched using VISION_KEY. Since the url is not specified and remapped it's search using VISION_URL.

azure:
    cognitiveservice:
        vision:
         - name: vision1
           secret:
            key: VISION_KEY
           url:
            key: VISION_URL

Specific credential provider

As mentioned earlier which credential provider is used for lookup can be customized using the credentialprovider property.

azure:
    keyvault:
     - name: kv1
       dnsname: https://myvault.vault.azure.net/
    
    storage:
        account:
         - name: blob1
           accountname: test1
           credentialprovider: kv1
        account:
         - name: blob2
           accountname: test2
           credentialprovider: env

python:
    env: 
     - name: env

IntelliSense in VSCode

To ease authoring we provide a JSON schema used by VS Code yaml plugin and enables IntelliSense in VS Code.

Authentication

The default order for credential provider resolution:

Jupyter Lab Credential Provider
Python KeyRing
Environment variables
.env fiels
Any declared credential provider resource found in briefcase.yaml

For Azure resources the following authentication methods are supported

Service Principal
Azure Device Login
Azure Managed Service identity

FAQ

How to get the logging to work on Jupyter?

Add the following cells to your Jupyter notebook (and yes the first cell throws an error, but that seems to be required).

%config Application.log_level='WORKAROUND'

import logging
logging.getLogger('briefcase').setLevel(logging.DEBUG)

Python

Service SDK libraries are imported at time of usage (e.g. resource.get_client())
If import fails, exception contains the name of the pip package

Development

Run

pip install -e .[test]

cd tests
pytest -s . -k test_sql_alchemy

Note: most tests depend on secrets thus you won't be able to run them without setting up your own resources.

How to add a new resource?

Add the resource definition to JSON schema
The path to the resource is used as the package/class name (e.g. azure.cognitiveservice.vision). Name your resource accordingly.

azure:
    cognitiveservice:
        vision:
         - name: vision1

Inherit from Resource
Define pip_package variable
define get_client_lazy
Use self.get_secret() to trigger secret resolution
Use self. to access any other property required by your resource

YAML / JSON Schema

To get live updates of JSON schema and validate in VS Code, update the settings to directly reference the JSON schema.

    "yaml.schemas": {
        "file:///mnt/c/work/Workspace/mlbriefcase/briefcase-schema.json": ["briefcase.yaml"]
    }

Package Rankings

Top 25.66% on Pypi.org

Badges

Extracted from project README

Related Projects

msticpy

Microsoft Threat Intelligence Security Tools

21 Feb 2019 1,763

MLOps

MLOps examples

03 May 2019 1,814

computervision-recipes

Best Practices, code samples, and documentation for Computer Vision.

11 Feb 2019 9,470

msrc-appconfig

Type safe composable application configuration management in Python

25 Jun 2020 4

template-validation-action

17 Jul 2024 6

cloud-advocate-workshops

Workshops created by the Cloud Advocacy team at Microsoft.

01 Aug 2023 34

datashaper

Processing engine and React components for constructing configuration-based data transformation a...

22 Nov 2021 177

dstoolkit-mlops-v2

This repository contains the basic repository structure for machine learning projects based on Az...

12 Apr 2023 20

Docker-Provider

Azure Monitor for Containers

17 Mar 2016 139

azure-privacy-sandbox-kms

09 Jan 2024 4

dstoolkit-data-rafting-kit

A declarative pipeline framework that automatically makes PySpark pipelines.

06 Jun 2024 3

jupyter-Kqlmagic

Extension (Magic) to Jupyter notebook and Jupyter lab, that enable notebook experience working wi...

07 Aug 2018 85

mcsa-observability

MCSA-Observability

01 Feb 2024 7

AzureKeyVaultExplorer

Azure Key Vault Explorer — a cross platform GUI desktop application for aggregating secrets (and ...

26 Jan 2023 49

artifacts-credprovider

The Azure Artifacts Credential Provider enables dotnet, NuGet.exe, and MSBuild to interactively a...

19 Jun 2018 769