MLflow smooths out natural language processing and Microsoft support in 1.8 release

AI/ML

By Julia Schmidt

April 23, 2020

MLflow smooths out natural language processing and Microsoft support in 1.8 release

Databricks’ machine learning platform MLflow has learned to play nice with AzureML and spaCy models, making v1.8 ready for downloading.

MLflow is an Apache License 2.0 protected open source project that can be used with a variety of machine learning libraries and comprises of a tracking API to log and compare experiment results, a code packaging format for reproducible runs, a model packaging format and tools for deploying models, as well as a centralised registry meant to help teams manage a model’s lifecycle.

For the latest release, committers have done quite a bit to make the platform work better with Microsoft’s stable of products. Version 1.8 for example provides an API to deploy MLflow models to Azure Machine Learning and finally lets users on Windows machines deploy models to the AWS SageMaker via the platform’s CLI, which was an issue before.

Those who don’t care too much for connections to Redmond, might be more interested to learn that MLflow now comes with a module to save and load models using popular natural language processing library spaCy, thus adding a bit more flexibility to the platform.

Using MLflow with Docker should also have become a bit easier in this version, since it’s now possible to pass arguments to docker run, when running corresponding projects. The platform’s SearchRuns API as well as its UI also learned to recognise case-sensitive LIKE and case-insensitive ILIKE queries when running against a SQL backend, which can be used for pattern matching purposes.

To have a better idea of the exact state an application is in, Databricks fitted the REST API server with a health check endpoint, which returns a 200 status code as long as the app in question is live. Better oversight when comparing runs meanwhile is supposedly provided by a newly added change highlighting, which makes varying parameter values in the CompareRun view of the platform more visible.

Apart from that, metrics UI plots can now handle more input points, since the team switched from scatter to scattergl, and line smoothing has been improved.

A complete list of features and bug fixes is available in the project’s GitHub repository.

Sourcegraph coding assistant now supports Anthropic Claude 3 – though limited to 7K token input

Supabase moves out of beta, adds supports for Swift, plugs in Oriole storage engine

Go dev survey shows frustration with Python’s dominance of AI

AI coding: Hugging Face engineer extols benefits of open source models, but hard questions remain

.NET Smart Components experiment the "Visual Basic" of AI programming?

GitHub autofix progresses to public beta: insecure code corrected with AI, but only for enterprise

JetBrains bows to user pressure and unbundles AI Assistant in new IntelliJ IDEA beta

Hands On: Netlify AI-assisted deployment aims to reduce log-diving

Stack Overflow turns to Google for hosting and AI features, trusts in Gemini for tech answers

Employing your cloud data warehouse to scale up AI/ML

Rust-based Zed editor now open source – with built-in support for OpenAI and GitHub Copilot