Data Versioning

Achieving the highest levels of reproducibility, and being able to forensically analyze deployed models and their decisions, requires going back in time to determine the exact state of the training data when a model was trained. Data Versioning is a critical element of Data Lineage. Many new products and systems are being developed in this area, and many of them rely on a Git-style workflow: they store the metadata (data about the data) and pointers to the source data rather than cloning massive datasets for every version. This sub-field of ML is evolving rapidly and we’ll be covering a lot more on this in the near future.
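
To make the "metadata plus pointers" idea concrete, here is a minimal sketch in Python. It is not how any particular product listed on this page works. The function names (`snapshot`, `manifest_id`, `restore`) and the flat content-addressed `store_dir` layout are assumptions made purely for illustration. Each version is a small manifest that maps file paths to content hashes, and a file's bytes are copied into the store only the first time that content is seen.

```python
import hashlib
import json
import shutil
from pathlib import Path


def file_digest(path: Path) -> str:
    """Return the SHA-256 hex digest of a file's contents."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        # Read in 1 MiB chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()


def snapshot(data_dir: Path, store_dir: Path) -> dict:
    """Record a version of data_dir as a manifest of pointers into store_dir.

    Hypothetical helper: store_dir is assumed to live outside data_dir.
    """
    store_dir.mkdir(parents=True, exist_ok=True)
    manifest = {}
    for path in sorted(p for p in data_dir.rglob("*") if p.is_file()):
        digest = file_digest(path)
        blob = store_dir / digest
        if not blob.exists():
            # Only content that has never been seen before is copied.
            shutil.copy2(path, blob)
        manifest[path.relative_to(data_dir).as_posix()] = digest
    return manifest


def manifest_id(manifest: dict) -> str:
    """Derive a stable version identifier from the manifest itself."""
    payload = json.dumps(manifest, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def restore(manifest: dict, store_dir: Path, target_dir: Path) -> None:
    """Rebuild the exact dataset state described by a manifest."""
    for rel_path, digest in manifest.items():
        dest = target_dir / rel_path
        dest.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(store_dir / digest, dest)
```

In this sketch the manifest is the only thing that would need to be committed alongside the training code. Recording `manifest_id(...)` with a trained model is one way to find the exact training data later, and `restore` rebuilds that state from the store.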

The MLOps Platform
IBM Cloud Pak for Data
Reinvent how you work
Modern MLOps focused on speed and simplicity
Weights & Biases
With a few lines of code, save everything you need to debug, compare and reproduce your models
AI and machine learning model management and operations for enterprise data science teams
Oracle – Data Science Platform
Data science platform
Open-source version control system for machine learning projects
Build better models faster
Industry-leading AI OS for machine learning
IBM Watson Studio
Build and scale trusted AI on any cloud. Automate the AI lifecycle for ModelOps