Pachyderm

Pachyderm is an enterprise-grade, open source data science platform that makes explainable, repeatable, and scalable ML/AI a reality
Pachyderm Overview
Pachyderm is an enterprise-grade, open source data science platform that makes explainable, repeatable, and scalable machine learning and artificial intelligence a reality. The Pachyderm platform brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines while empowering users to develop their code in any language, framework, or tool of their choice. Pachyderm has been proven to be the ideal foundation for teams looking to use ML and AI to solve real-world problems in a reliable way.
Deploys On
  • Amazon Web Services
  • Google Cloud Platform
  • Microsoft Azure
  • Other Public Cloud
  • Kubernetes
  • Private Cloud or Datacenter
  • SaaS
Learn More About Pachyderm
Scaling Video AI at RTL with Daan Odijk
Scalable Data Science Workflows with Pachyderm
Pachyderm Details
Benefits
Data Lineage
Think "git for data" but better. Pachyderm version-controls all data types, and it also delivers true data lineage. Data lineage means knowing, with certainty, the complete journey of your data, code, and models, and the relationships between them.
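To make the idea of lineage concrete, here is a minimal Python sketch of a store that records, for every artifact, which upstream artifacts it was derived from. It is an illustration of the concept only and does not use Pachyderm's API; the names `LineageStore`, `commit`, `provenance`, and `content_id` are hypothetical and chosen for this example.

```python
import hashlib
from dataclasses import dataclass, field


def content_id(payload: bytes, parents: tuple[str, ...] = ()) -> str:
    """Hash the data together with the IDs of everything it was derived from."""
    digest = hashlib.sha256()
    for parent in parents:
        digest.update(parent.encode())
    digest.update(payload)
    return digest.hexdigest()[:12]


@dataclass
class LineageStore:
    # Hypothetical in-memory store: artifact ID -> (name, parent artifact IDs).
    records: dict[str, tuple[str, tuple[str, ...]]] = field(default_factory=dict)

    def commit(self, name: str, payload: bytes, parents: tuple[str, ...] = ()) -> str:
        """Record an artifact and the artifacts it was produced from."""
        artifact_id = content_id(payload, parents)
        self.records[artifact_id] = (name, parents)
        return artifact_id

    def provenance(self, artifact_id: str) -> list[str]:
        """Return the names of all upstream artifacts, nearest ancestors first."""
        seen: set[str] = set()
        names: list[str] = []
        queue = list(self.records[artifact_id][1])
        while queue:
            current = queue.pop(0)
            if current in seen:
                continue
            seen.add(current)
            name, parents = self.records[current]
            names.append(name)
            queue.extend(parents)
        return names


store = LineageStore()
raw = store.commit("raw_images", b"pixel data")
code = store.commit("train.py", b"print('training')")
model = store.commit("model_v1", b"weights", parents=(raw, code))
report = store.commit("eval_report", b"accuracy", parents=(model,))

print(store.provenance(report))  # ['model_v1', 'raw_images', 'train.py']
```

Because each ID is derived from both the content and its parents, changing any input or any piece of code yields a new ID downstream, which is what lets a report be traced back to the exact data and code that produced it.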

End-To-End Pipelines
Pachyderm makes it simple to build end-to-end data science workflows using any language or framework you want. Transform existing manual processes into fully automated event-driven workflows.

Enterprise Scale
Kubernetes makes software scalable. We built Pachyderm on top of Kubernetes to provide you with a direct path to production, using your choice of infrastructure. Whether you're still in the proof-of-concept (POC) phase or already processing petabytes of data, Pachyderm makes scaling simple.
Features
• Data versioning
• Containerized Pipelines
• Data lineage
• Distributed workloads
• GPU support
• Pachyderm dashboard
• Advanced statistics
• User Access controls
• S3 gateway
• Enterprise-Grade Support
• Custom Deployments
• Hosted Service
Pachyderm Vendor Information
Vendor Overview
Pachyderm is an enterprise-grade, open-source data science platform that makes explainable, repeatable, and scalable ML/AI a reality. Its platform brings together version control for data with the tools to build scalable end-to-end ML/AI pipelines while empowering users to use any language, framework, or tool they want.
Pachyderm is “Git for Data Science.” It offers complete version control for data and gives your data science team the same first-class development tools that software developers have. Pachyderm is well suited to building machine learning pipelines and ETL workflows because it traces every model and output directly back to the raw input datasets that produced it (also known as provenance).
Vendor Details
Year Founded
2014
HQ Location
San Francisco, California, United States
Ownership
Private