Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!

We spend hours on Instagram and YouTube and waste money on coffee and fast food, but won’t spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!

Learn from Guru Rajesh Kumar and double your salary in just one year.

Get Started Now!

AI gets rigorous: Databricks announces MLflow 1.0

Source:-

MLflow, the open source framework for managing machine learning (ML) experiments and model deployments, has stabilized its API, and reached a version 1.0 milestone, now generally available.

One year ago yesterday, at the 2018 Spark and AI Summit in San Francisco, Matei ZahariaDatabricks‘ co-founder/Chief Technologist and creator of Apache Sparkpresented his new development focus, an open source project called MLflow. Today, the project has reached a major maturity milestone, with the release of a full version 1.0 to general availability.

Also read: Apache Spark creators set out to standardize distributed machine learning training, execution, and deployment

ORDER FROM ENTROPY

The data science workflow which, to this day, is chock full of ad hoc tasks in siloed development environments. While things are slowly changing, it’s all too common for data scientists to tinker on their laptops, with algorithms and hyperparameter values, until they have a trained ML model that they like, and then manually deploy to production.

MLflow aims to impose rigor on this process, allowing each training iteration to be logged and model deployment, to any number of cloud or private environments, to be automated. This allows the work to be discoverable by other data scientists (which hopefully will avoid them redoing the same work) and for automation of retraining and subsequent redeployment of the model.

V1 NAILS IT DOWN

MLflow allows this work to be done at the command line, through a user interface, or via an application programming interface (API). All three of these interfaces were subject to significant change during MLflow’s first year of development, but with this 1.0 release, developers can rely on these interfaces being stable from here on.

In addition, MLflow 1.0 offers several new features. Although some of these are pretty technically granular, I’ll try to summarize them:

  • Support for the Hadoop Distributed File System (HDFS) as an “Artifact Store”, allowing MLFlow to store its files in on-premises Hadoop clusters, in addition to cloud storage, local disks, Network File System (NFS) storage and Secure FTP
  • Support for the ONNX (the Open Neural Network eXchange) machine learning model format — originally backed (and used) by Microsoft, Amazon and Facebook — as an MLflow model “flavor”
  • Improved search features, allowing a SQL-like syntax to be used for filter expressions based on attributes and tags, in addition to metrics and parameters
  • Support for tracking metric values based on progressions other than time (officially this is referred to as “Support for X Coordinates in the Tracking API”). This is illustrated in the figure at the top of this post, showing how the MLflow UI allows the X axis of its Metrics visualization to be set to Step, in addition to two variants of Time.
  • Multiple metrics can be logged in “batch,” meaning they can be recorded via a single API call, instead of call per metric-value pair.

RESPECT AS A STANDARD, WITH MORE IN THE PIPELINE

That’s a nice set of features, and there’s more to come. The MLflow roadmap includes a model registry that can facilitate continuous integration/deployment (CI/CD), model check/code review, as well as insight into the usage and effectiveness of different model versions. There are plans for multi-step workflow support as well.

Databricks says MLflow now has over 100 contributors, and has been deployed at thousands of organizations. Add to that participation from Microsoft and support for MLflow in its Azure Machine Learning platform, and this project looks to have achieved the status of a standard, in a discipline strongly in need of them.

 

 

Related Posts

Complete Azure Certification Guide & tutorials

What is Azure? Azure is launched by Microsoft in February 2010. It is a cloud computing platform including Infrastructure as a Service (IaaS), Platform as a Service Read More

Read More

Microsoft and VMware Team Up to Offer the Next Evolution of the Azure VMware Solution

Source:-cio Today’s CIOs and enterprise IT departments continue to look for ways to rapidly expand their critical infrastructure to the cloud. The next evolution of Azure VMware Read More

Read More

New Microsoft program to help develop the quantum computing workforce of the future in India

Source:-microsoft Microsoft is creating a new program to build quantum computing skills and capabilities in the academic community in India. As part of this initiative, Microsoft Garage Read More

Read More

DATA SCIENCE AND MACHINE LEARNING PLATFORMS MARKET INSIGHTS BUSINESS OPPORTUNITIES 2027

Source:-primefeed Analysis of COVID-19 Impact on the Data Science and Machine Learning Platforms Market with Key Players Analysis The report highlights the current impact of COVID-19 on Read More

Read More

Microsoft targets its fastest Azure AI instance to date at large neural networks

SOURCE:-siliconangle Microsoft Corp. today previewed a new Azure instance for training artificial intelligence models that targets the emerging class of advanced, ultra-large neural networks being pioneered by Read More

Read More

AtScale Expands Reach of Cloud OLAP with Support for Microsoft Azure Databricks in

Source:-finance.yahoo the intelligent data virtualization provider for advanced analytics, today announced an enhanced platform integration with Azure Databricks. The integration extends AtScale’s Cloud Online Analytical Processing (COLAP) Read More

Read More
Subscribe
Notify of
guest
0 Comments
Oldest
Newest Most Voted
Inline Feedbacks
View all comments
0
Would love your thoughts, please comment.x
()
x