find AI list

Search by task, compare top tools, and use proven workflows to choose the right AI tool faster.

© 2026 findAIList. All rights reserved.

lakeFS

lakeFS is a data version control platform that manages the data lifecycle, provenance, and unified access for AI and data teams.

Category: Development
Good for: Version control for data lakes, Branching and merging data

About lakeFS

lakeFS is a data version control system that brings Git-like capabilities to data lakes and object storage. It lets data teams manage data the way they manage code, with branching, merging, and reverting, which supports isolated experimentation, reproducibility, and data quality enforcement. lakeFS works on top of object stores such as Amazon S3, Google Cloud Storage, and Azure Blob Storage, and integrates with compute engines and platforms such as Spark, Trino, and Databricks. It is format-agnostic, working with Parquet, CSV, Avro, and more. lakeFS is designed for data engineers, data scientists, and MLOps practitioners who need to manage large datasets, ensure data quality, and streamline data workflows for AI and machine learning projects.
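
As a rough illustration of the branch-and-merge idea described above, here is a toy in-memory sketch. The `ToyRepo` class and its methods are hypothetical names invented for this example; they are not the lakeFS API, and the merge shown is deliberately naive (no conflict detection, no handling of deletions).

```python
from dataclasses import dataclass, field


@dataclass
class ToyRepo:
    """Hypothetical in-memory model of Git-like data branches (not the lakeFS API)."""

    # Each branch maps an object path to its content.
    branches: dict = field(default_factory=lambda: {"main": {}})

    def create_branch(self, name: str, source: str = "main") -> None:
        # A new branch starts as a snapshot of the source branch's mapping.
        self.branches[name] = dict(self.branches[source])

    def put(self, branch: str, path: str, data: str) -> None:
        # Writes only affect the branch they target.
        self.branches[branch][path] = data

    def merge(self, source: str, dest: str) -> None:
        # Naive merge: entries from source overwrite entries in dest.
        self.branches[dest].update(self.branches[source])


repo = ToyRepo()
repo.put("main", "events/2024.parquet", "v1")

# Experiment on a branch without touching main.
repo.create_branch("experiment")
repo.put("experiment", "events/2024.parquet", "v2")
isolated = repo.branches["main"]["events/2024.parquet"]

# Promote the change once it has been validated.
repo.merge("experiment", "main")
merged = repo.branches["main"]["events/2024.parquet"]
```

In this sketch, `main` keeps its original object until the merge, which is the isolation property that makes experimentation and quality checks on a branch safe.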

Main Tasks

Version control for data lakes

Explore all tools that specialize in version control for data lakes. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Branching and merging data

Explore all tools that specialize in branching and merging data. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Reproducible data pipelines

Explore all tools that specialize in reproducible data pipelines. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Data quality enforcement

Explore all tools that specialize in data quality enforcement. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Collaboration on data projects

Explore all tools that specialize in collaboration on data projects. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Data lineage tracking

Explore all tools that specialize in data lineage tracking. This domain focus ensures lakeFS delivers optimized results for this specific requirement.

Decision Summary

What this tool is best suited for

Best Fit

  • MLOps
  • Data Management

Buying Signals

  • Pricing not specified
  • No API listed
  • Web-first workflow

Setup And Compliance

  • Not specified
  • No onboarding steps listed
  • No compliance tags listed

Trust Signals

  • Pricing freshness unavailable
  • URL health not shown
  • Verification date unavailable

No verified pros/cons are available yet for this tool.

Pros

  • No verified strengths listed yet.

Cons

  • No verified trade-offs listed yet.

Core Tasks

  • Version control for data lakes
  • Branching and merging data
  • Reproducible data pipelines
  • Data quality enforcement
  • Collaboration on data projects
  • Data lineage tracking

Target Personas

MLOps, Data Management

Categories

Development, Coding & DevOps

Alternative Tools

Sifflet

Data Observability

Business-aware data observability platform connecting data quality to business impact.

23d ago
Best for: Data Quality
Has API
Pricing: Paid
  • Data Quality Monitoring
  • Data Lineage Tracking
  • Incident Response

Bigeye

Data

The Enterprise AI Trust Platform built on lineage-enabled data observability.

23d ago
Best for: Data Quality & Governance
Has API
Pricing: Paid
  • Data Monitoring
  • Data Quality Assurance
  • Data Governance

Apache Atlas

Data

Enterprise-grade data governance and metadata management for hybrid-cloud ecosystems.

23d ago
Best for: Metadata Management
Has API
Pricing: Freemium
  • Automated Data Lineage tracking
  • Metadata classification and tagging
  • Business Glossary management

Activeloop Deep Lake

Developer

Activeloop Deep Lake is the AI data plane that allows you to store, retrieve, replay, and fine-tune AI agent interactions for continual learning.

23d ago
Best for: Data Lake
Has API
Pricing: Freemium
  • Store multimodal AI data (text, images, video, audio)
  • Retrieve relevant data for AI models
  • Version control datasets

Apache Avro

Developer

Apache Avro is a data serialization system providing rich data structures and a compact, fast, binary data format.

23d ago
Best for: Streaming Data
Pricing: Free
  • Define Avro schemas for data structures
  • Serialize data into the Avro binary format
  • Deserialize Avro data back into its original structure

DataGroomr

Business

DataGroomr is an AI-powered solution that makes Salesforce data quality fast, accurate, and effortless.

23d ago
Best for: Salesforce Data Management
Pricing: Freemium
  • Detect and merge duplicate records in Salesforce
  • Verify the accuracy of email addresses
  • Standardize contact and lead data formats

Data.world

Business

Data.world is an enterprise data catalog that helps organizations turn data chaos into clarity, enabling better data discovery, governance, and AI initiatives.

23d ago
Best for: Data Governance
Has API
Pricing: Freemium
  • Discover and understand data assets.
  • Govern and manage data quality.
  • Create a unified view of the data landscape.

Infoworks

Artificial Intelligence

Enterprise AI data agents platform for Zero Data AI.

23d ago
Best for: Data Management
Pricing: Paid
  • Data Discovery
  • Data Cataloging
  • Data Cleaning