Total Trust in Data

Stemma is a fully managed data catalog, powered by the leading open-source data catalog, Amundsen.

Make data-based decisions with absolute confidence.

Get Started

Sign up here to join our next community meeting on December 9th, 9AM PT.

Powered by Amundsen

Amundsen is the leading open-source data catalog. It has the largest and fastest-growing community and the most integrations. Stemma brings you Amundsen and more, with enterprise management and richer metadata.
From startups to Global 500 companies, Amundsen is powering the world’s best data teams. Stemma brings you all the benefits that are pushing them forward.
Learn More About Amundsen
The leading open-source data catalog

Gain Total Trust in Your Data

Everyone has access to data, but few know what exists, what’s trustworthy and how to use it. Stemma makes finding trustworthy data easy and offers an always up-to-date view of how your data is used.
Stemma for Data Scientists & Analysts
Rather than being the last to find out when data gets shut off or delayed, Stemma users are always in the loop and have visibility into what’s changing with their data.

We keep users up-to-date through automated documentation based on common usage patterns.
Stemma for Data Engineers
Not knowing who or what a data change will affect means that data engineers simply have to spray and pray.

The volume of data that companies need to collect and process is growing exponentially. Stemma removes the complexity and stress of tracking and navigating the flow of new and ever-evolving data, through automated lineage tracking.
Stemma for Data Governance
Organizations often rely on Slack conversations and continuous shoulder-tapping to gain trust in data. This is error-prone and time-consuming.

Stemma uniquely augments your data with automated documentation, so you don’t have to document and curate every single data set.

Do More with Stemma

Stemma takes you beyond Amundsen, unlocking a fleet of additional benefits. Lower your time to value, ensure your data catalog’s success, gain enterprise-grade security, migrate easily between Amundsen and Stemma in either direction, and simplify deployment.
Empowered by Automated Intelligence

Lower Your Time to Value

Automated query parsing to generate lineage removes risks and speeds up migrations. This applies to many of your other use cases.
See common join patterns on your data, derived automatically from existing usage patterns.
Get personalized suggestions regarding what data to use.
Use the Stemma bot to link Slack chats to your data catalog.
Data Enablement

Ensuring Your Data Catalog’s Success

Get the guidance and support needed to ensure deployment, adoption, and success for your automated data catalog.
Powerful launch support, including a shared sprint for deployment and integration.
Learn and deploy best practices to ensure adoption and high CSAT.
Fully Managing Your Data Catalog

Easy Deployment & Enterprise-Grade Security

Maintain an audit trail of all actions.
Never lose data, thanks to our industry-leading backup and restore functionality.
Protect your data through encryption at rest and in motion.
Integrate out of the box with your internal Single Sign-On (SSO) provider, such as Okta or Google Auth.

How it Works

01

Ingest metadata via push, pull & API methods

Ingest from:

  • Data warehouse
  • BI tools
  • Orchestration systems
  • Slack
  • Github, and more...

02

Automated metadata through intelligence

We use existing usage patterns, users’ roles & recent activity to power:

  • Related conversations in Slack
  • Common joins
  • Personalized experience

03

Accessible through Stemma User Interface & API
  • Access all the ingested and derived metadata through an intuitive data discovery experience in a UI
  • Power programmatic use-cases through the Stemma API

Catalog Use Cases

Discover Trustworthy Data

Enable your analysts and data scientists to discover trusted data on a topic, understand its nuances and the best practices on how to use it.

Faster Onboarding

Bring analysts and data scientists up to speed by sharing rich context about the most commonly used data within your team, populated through curation and usage patterns.

Understand the Impact of Changes

Help your data engineers to understand the full impact of changes through data lineage that is always up-to-date.

Debug Data Issues

Make it easy for your data engineers to understand where data is coming from and see recent changes that have happened on a table.

Data Governance

Define business terms and marry them with technical data.
Further reduce the burden of managing your data by simply certifying your most impactful data sets. Receive automatically augmented documentation for the rest.

Enable Data Mesh

Federate communication and ownership across data producers and consumers. Enable consumers to report issues and request metadata directly from producers, so that no central team becomes the bottleneck.

Seamless Integration with Your Favorite Data Sources

Stemma offers the broadest level of integration with data sources. Connect seamlessly with Snowflake, Redshift, Google BigQuery, Apache Airflow and many more.
Discover the full list of integrations
Browse Integrations
Snowflake
Apache Airflow
BigQuery
Looker
Databricks
Amazon Athena
Mode Analytics
Slack
Amazon Redshift
Tableau
dbt

Testimonials

"We knew that manual documentation wasn't going to work.
We chose Amundsen as our data catalog because of its focus on automation and its supportive community. We knew that manual documentation wasn't going to work. I am thrilled to see Stemma make managed Amundsen widely available."
Michelle Gulen
Manager of Data Analytics
"In just 10 months of adoption, the usage of Amundsen has grown 5x. Today, 80% of Convoy's tech org uses Amundsen every month.
Amundsen's adoption has been nothing short of amazing at Convoy. I am excited that Stemma is bringing the same experience to the larger market."
Chad Sanderson
Head of Product, Data Platform
"Amundsen is used by more than 700 analysts and data scientists at ING.
That usage is growing by about 10% each month. We are targeting usage by 50% of the company in the future."
Bolke de Bruin
VP of Engineering Advanced Analytics / Artificial Intelligence
"Amundsen helped analysts at Lyft spend less time figuring out what data to trust, and more time on what’s important - solving business problems.
I’m excited for Stemma to be bringing the power of Amundsen to other data-driven organizations."
George Xing
Formerly Head of Analytics

Case Studies

Widespread use of Amundsen by 700+ data analysts and data scientists helps ING enable data mesh

Read Case Study

iRobot leverages Amundsen to deliver improved customer experiences

Read Case Study

80% of Convoy's employees use Amundsen for discovering and trusting data

Read Case Study

Lyft increases data scientist & analyst productivity by 20% with Amundsen

Read Case Study