Data versioning in machine learning projects

In machine learning projects it is easy to lose track of the many versions of your data files. Data Version Control (DVC) is an open-source tool for data science projects, created to solve the problem of code and data files drifting out of sync. It works on top of Git: when you switch between Git branches, it checks out not only the source code but also the matching version of the data files. Slides: https://www.slideshare.net/DmitryPetrov15/pydata-berlin-2018-dvcorg --- www.pydata.org
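To make the idea of keeping data in step with code more concrete, here is a minimal conceptual sketch in Python. It is not DVC's code or its file format. It only illustrates one way such a tool could work: large data files live in a content-addressed cache, while a tiny pointer file (small enough to commit to Git) records which version belongs to the current branch. The names `snapshot`, `restore`, `demo`, the `.ptr` suffix, and the JSON pointer layout are all hypothetical and chosen for illustration.

```python
import hashlib
import json
import shutil
import tempfile
from pathlib import Path

POINTER_SUFFIX = ".ptr"  # hypothetical suffix, not a DVC convention


def snapshot(data_file: Path, cache_dir: Path) -> Path:
    """Store data_file in the cache under its hash and write a pointer file."""
    digest = hashlib.sha256(data_file.read_bytes()).hexdigest()
    cache_dir.mkdir(parents=True, exist_ok=True)
    cached = cache_dir / digest
    if not cached.exists():
        shutil.copyfile(data_file, cached)
    # The pointer is the only thing a version control system would need to track.
    pointer = data_file.with_name(data_file.name + POINTER_SUFFIX)
    pointer.write_text(json.dumps({"sha256": digest}))
    return pointer


def restore(pointer: Path, cache_dir: Path) -> Path:
    """Recreate the data file that a pointer file refers to."""
    digest = json.loads(pointer.read_text())["sha256"]
    data_file = pointer.with_name(pointer.name[: -len(POINTER_SUFFIX)])
    shutil.copyfile(cache_dir / digest, data_file)
    return data_file


def demo():
    """Simulate two data versions and switching between them."""
    with tempfile.TemporaryDirectory() as tmp:
        work = Path(tmp) / "project"
        cache = Path(tmp) / "cache"
        work.mkdir()
        data = work / "train.csv"

        # Version 1 of the data, as it might exist on one branch.
        data.write_text("x,y\n1,2\n")
        pointer = snapshot(data, cache)
        pointer_v1 = pointer.read_text()

        # Version 2 of the data, as it might exist on another branch.
        data.write_text("x,y\n1,2\n3,4\n")
        snapshot(data, cache)
        pointer_v2 = pointer.read_text()

        # "Switching branches" puts the old pointer back; restore() then
        # brings the data file in line with it.
        pointer.write_text(pointer_v1)
        restored_v1 = restore(pointer, cache).read_text()
        pointer.write_text(pointer_v2)
        restored_v2 = restore(pointer, cache).read_text()
        return restored_v1, restored_v2


if __name__ == "__main__":
    v1, v2 = demo()
    print(repr(v1))
    print(repr(v2))
```

In this sketch the pointer file plays the role of the small piece of metadata that travels with the source code, so checking out a branch and then calling `restore` yields the data that branch was developed against. How DVC itself stores and links data is covered in the talk and slides.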
