Making Data Accessible
By Satyen Sangani
Published on February 20, 2020
It’s likely that anyone reading this post believes in the power of data. Data guides people to make better decisions. Data helps people learn about the world around them. Data allows people to build amazing things. Data cuts through fiction, empowering people to work together around the facts. These truths are self evident.
These truths may well be evident, but they are actually realized by a small number of people who access a relatively small amount of data for a small percentage of the time. Stated differently, data is largely inaccessible to most people.
“Huh?” you might say. Don’t databases and BI platforms already make data accessible to anyone that can write a query or even open a link to a dashboard? Aren’t there already about 300 companies that store, process, and visualize data?
You’d be right. There certainly are many data technologies in the world today—and that’s actually part of the problem. We’re all aware that the world is producing more data every single day. People tend to think about this data production in terms of size—the number of terabytes or petabytes, but people rarely think about the fact that we’re also producing more containers of data every day. Every time we write a mobile app, build a web application, write a data pipeline, build a dashboard, or develop a sensor, we create a new container of data. All of these containers come with a lot of complexity, a lot of implicit knowledge. And this complexity, in turn, makes data inaccessible.
First, the content of these containers is written in a language known only to the programmers or the analysts who created them. The guy who created a column named “DB” knew that it meant date of birth; nobody else did. Second, the assumptions used to populate the data are also opaque. The programmer forgot to tell anyone that users that did not enter their date of birth would be assigned a default value of January 1, 1960.
As a consequence, if you want to consume this data to learn about the world, you have to re-discover all of this knowledge. The data is effectively encoded, gated, locked away. You don’t have the context to find the information you want or make sense of the information you find. Sometimes you can decode the data by reading stale documentation, finding the right colleague, or just trying to reverse engineer the knowledge based on what you see in the data itself. Often, you go without.
These examples seem trivial. But, imagine running a $2.6m marketing campaign segmented by age, where you have the wrong ages. Imagine deciding to build a brand new mobile site because you thought that 42% of the traffic came on mobile, where the real number was 4.2% There is a massive potential cost to errors and we bear these costs every single day.
On the other hand, having the right context can be tremendously powerful. Two and a half years ago, I found one of my co-founders, Venky, on LinkedIn. I could do this because LinkedIn contextualized Venky—where he worked, who he knew, and what he knew. Then Google allowed me to find the patents and papers that he had written. Before I met him, I knew that he could be a co-founder, that meeting him would be fruitful. Armed with the right context, at the right time, I was able to find a co-founder to start a company.
Similar to the way that LinkedIn and Google contextualized Venky, Alation exists to contextualize data, to make the right data easy to find, easy to understand, and easy to use. We do this by leveraging all the subtle signals that already exist inside of an enterprise. By providing this context, Alation helps people find the data and the knowledge they need to understand the data. They can learn where to go and who to ask. They can recognize bad information when they see it (not a week later when it’s resulted in a bad decision).
We hope one day, because of Alation, people will be able to instantly find the data they need and know they can trust what they find. Armed with the right information, we hope that our users will build better products, cut wasteful spending, find great new customers, figure out better ways to recruit employees, and maybe even save lives.
We believe that some of the most powerful technologies (like Wikipedia, LinkedIn, Google) are those that enable people to learn about the world around them. People can do amazing things when you put the right information in front of them at the right time. And, we’re so fortunate that so many others — particularly our colleagues, customers, investors, and teammates — share this vision.