San Francisco, CA, USA
Pachyderm is an enterprise-grade, open-source data science platform that makes explainable, repeatable, and scalable ML/AI a reality. Its platform brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines while empowering users to use any language, framework, or tool they want. Pachyderm is“Git for Data Science.” It offers complete version control for data and gives your data science team the same first-class development tools as software developers. Pachyderm is ideal for building machine learning pipelines and ETL workflows because we track every model/output directly to the raw input datasets that created it (aka: Provenance).Something looks off?