RaceTrac Customer Case Study

RaceTrac Fuels Real-Time Data Insights for Decision-Making with Alation and Databricks

Photo of a RaceTrac gas station


minutes to insight from POS data, down from 24 hours


reduction in redundant datasets fuels trusted decisions


database tables ingested from SQL Server and Databricks

Challenge: Siloed organization, siloed data

As the gas flows at a RaceTrac retail service station, so does data — a lot of data. 

RaceTrac stores sell 27 million cups of coffee and 9 million slices of pizza per year. Their fuel subsidiary sells 5 billion gallons annually to their locations and other leading retailers and fuel wholesalers. 

On top of 260 million annual transactions, RaceTrac receives feeds from store cameras and fuel pump IoT devices at their nearly 800 locations. 

Each business and department used its own formulas, processes, and definitions to create reports. As a result, and even with the same data, the reports could contain wildly different conclusions and recommendations. What’s worse, reports based on transactional data could take as long as 24 hours to produce since the company had no visibility into real-time data. This hindered the swift strategic decision-making needed to delight RaceTrac customers. 

“Our siloed data, [and a] lack of governance and a single source of the truth, hindered our CEO’s vision of becoming a data-driven organization,” says John (JW) Williams, Executive Director of Enterprise Data at RaceTrac. “Data users didn’t understand and trust the data, limiting their ability to make great strategic business decisions.”

Objective: Become more data-driven

RaceTrac wanted to use the vast data they collect to fuel strategic decision-making and become a more data-driven organization. They first needed to:

  • Break down organizational data silos

  • Standardize and certify datasets to create a single source of truth — and spot discrepancies faster

  • Gain visibility into real-time transaction data

Solution: Alation and Databricks create a data single source of the truth

RaceTrac sought to consolidate and unify the company’s data on the Microsoft Azure cloud. To govern that data, they selected Alation for its ability to easily implement data lineage, governance, and collaboration across the enterprise. The product’s leadership position in the Gartner Magic Quadrant for Metadata Management Solutions instilled confidence in that choice.

The first order of business was to ingest valuable assets into Alation, including more than 500 Power BI datasets and reports and 1,500 database tables across SQL Server and Databricks. RaceTrac then established 44 data owners and stewards across 10 business domains. These data experts, who had been duplicating efforts, worked closely to eliminate redundancies and identify and certify datasets in each domain. The certified datasets, curated in Alation, now provide the definitive source of truth for calculations and reports. 

Alation also provides a single-pane-of-glass view into real-time transaction data in the RaceTrac Databricks Lakehouse — an architecture that combines the best qualities of data warehouses and data lakes. RaceTrac uses Databricks to ingest and analyze real-time transactions at scale. APIs pull data into the lakehouse, including everything from store cash management and point-of-sale (POS) systems to up-to-the-minute fuel trading prices and gas pump information. RaceTrac can even drill down into specific POS transactions and map them to a camera feed in an API to spot theft and other issues and patterns.

John (JW) Williams, Executive Director of Enterprise Data at RaceTrac

At RaceTrac … data is an enterprise-wide asset. By establishing a strong data governance program which provides a standard approach to processing, retrieving, archiving, and restoring data, the organization can not only make good decisions, but they can now make great decisions with a much higher confidence level.

John (JW) Williams

Executive Director of Enterprise Data at RaceTrac

“At RaceTrac, everyone has a level of responsibility when it comes to data because data is an enterprise-wide asset,” says Williams. “By establishing a strong data governance program, which provides a standard approach to processing, retrieving, archiving, and restoring data, we have created a solid foundation for our enterprise data. The organization can now make great decisions with a much higher confidence level.”

Results: Real-time visibility leads to faster insights

With Alation, RaceTrac gained a trusted data governance platform that provides a single view of their organizational data, including that in their Databricks lakehouse. By breaking down data silos, Alation helps decision-makers get to data insights faster. Using Alation, RaceTrac now has visibility into the real-time data ingested through multiple APIs into Databricks. Real-time visibility has helped RaceTrac spot and resolve discrepancies between systems that otherwise would result in loss or theft.

Alation’s Popularity feature, which surfaces the most popular data assets in search, helped RaceTrac identify the most used datasets and their top users. With this visibility, data leaders could then collaborate to reduce the number of disparate datasets from 725 to 250, with a further goal to reduce that number to 150 by 2024. By reducing data redundancies by 65% and aligning on certified datasets and KPIs for reporting, RaceTrac has fostered greater trust in data. The result: faster insights, fewer discrepancies in reporting across the organization, and better decision-making to fuel business growth.

John (JW) Williams, Executive Director of Enterprise Data at RaceTrac

This kind of real-time visibility into our data is game-changing. By processing large volumes of data in our Databricks platform and cataloging it in Alation, our ability to react at the speed of business has been phenomenal. This gives RaceTrac a newfound competitive advantage.

John (JW) Williams

Executive Director of Enterprise Data at RaceTrac

With the Alation Data Intelligence Platform, RaceTrac avoids risk by ensuring that data used for decision-making is high-quality, trusted, and fit for purpose. In the context of AI, this issue is magnified due to the technology's reliance on data to learn and make predictions. However, the same holds true across all automated systems and decision-making processes. Utilizing ungoverned, low-quality, or untrusted data can lead to erroneous decisions, undermining the effectiveness of these processes.

RaceTrac integrated the Bigeye data observability platform with Alation to address this issue and create quality scores for certified enterprise-wide data and datasets. Now, each dataset has a data quality score,  which is calculated based on a set of metrics that measure the following: pipeline reliability, completeness, consistency, accuracy, availability, and timeliness. Consumers of these datasets are notified if the data quality score falls outside of its defined threshold. In addition, all metric violations are viewable within the Alation data intelligence platform through its data lineage feature. This promotes awareness, trust, and transparency.

RaceTrac continues to use Alation to identify and curate data to migrate to the cloud. This helps speed up the ongoing cloud migration and ensures that only the most valued and used data is migrated.


About RaceTrac


Financial Services

Use Case

Data Governance

Data Platforms

MS Azure, Databricks, Power BI, Google Data Studio, SQL Server Management Studio

Take a tour

Try a self-guided demo.

Book a demo

Schedule a live demo guided by an expert.