But competition in the database business is heating up
CATCHING SNOWFLAKES is fun. It is about to become lucrative. Many investors will scramble for shares in Snowflake, a maker of database programs, when it goes public on the New York Stock Exchange on September 16th. It is expected to be the largest ever initial public offering of a software company, raising around $3bn and valuing the eight-year-old firm at more than $30bn. Even Warren Buffett, abandoning his customary tech-shyness, wants in on the action. The legendary investor’s conglomerate, Berkshire Hathaway, is investing more than $700m in the firm, through a separate private placement and by purchasing shares from a former chief executive.
The excitement shines a light on an obscure corner of the information-technology ecosystem: software for managing corporate data. This database market already generates $55bn a year in sales. It is expected to expand rapidly as data become if not the new oil, then at least an important input for most companies. And it is changing in intriguing ways—not all of them good news for Snowflake.
A database used to be best understood as a digital steam engine. Before electricity came along, a factory’s machines sat near a single power source. Similarly, corporate applications—programs that keep track of a firm’s finances or its supply chain, for example—were built around databases that housed all of a firm’s important information. Hard disks were pricey and had limited capacity so the best way to store it was in lean “relational” databases. Max Schireson, who used to run MongoDB, a database maker, and now works for Battery Ventures, an investment firm, likens these to “a parking garage where, to save space, you put all the seats in one place, the tyres in another and so on”. The industry quickly became dominated by a few firms, with Oracle leading the pack.
As storage has grown cheaper and data volumes have exploded, however, so has the number of startups erecting new kinds of digital carpark. Many focus not on tracking specific transactions but on analysing all manner of a firm’s data to glean relevant knowledge about its business, such as where certain products sell best. These more cluttered “data warehouses”, as they are known, were pioneered in the late 1970s by a firm called Teradata. Their latest iteration are “data lakes”, which take in all sorts of unstructured information, including text and pictures.
Snowflake has gone a step further. It was one of the first firms to lift both the data stockpiles and the software to trawl them from companies’ in-house data centres and into the computing clouds, the biggest of which are operated by Amazon, Google and Microsoft, a trio of tech giants. Snowflake’s customers can add capacity as needed—and pay depending on their use rather than a fixed price for a software licence, as was typical for relational databases. Better yet, its “multi-cloud” service works across the big three computing clouds, so customers need not get locked into any one of them. Recently Snowflake has also added features that let customers share and sell data, setting itself up as a data exchange of sorts.
This has convinced many that Snowflake could be the next Oracle. The firm is certainly on a roll. Although it is not yet making money, its losses, of $171m in the six months to July, have declined even as revenue has more than doubled year on year, to $242m. On current trends sales could reach nearly $1bn in the next 12 months.
Despite these promising numbers, and Mr Buffett’s blessing, Snowflake has its work cut out. The company’s uniqueness will not last much longer, says Donald Feinberg of Gartner, a research firm. Rival firms, in particular the big cloud providers, have been beefing up competing products and even dabbled with the multi-cloud. A few startups are already offering cheaper and more flexible “open source” alternatives such as ClickHouse, a particularly zippy data-management system developed by a startup called Altinity.
Other challengers are building more specialised digital repositories. Data generated by websites, for instance, are often stored on “document-oriented” databases that, in the garage analogy, keep cars intact rather than strip them for parts. MongoDB is the market leader in this segment. Confluent, another startup, is big in “streaming” databases that garner information from sources like sensors. These are more akin to a motorway service station: data are quickly checked to see if action is needed.
Much as today’s assembly lines are driven by dispersed electric motors rather than a single steam engine, then, corporate it systems will increasingly rely on sundry specialised databases, predicts Zane Chrane of Bernstein, a broker. That—and the fact that data will increasingly be analysed in real time, rather than saved in a conventional database—will limit the power and profits of any single supplier. So Snowflake is unlikely ever to become as dominant as Oracle. Snowflakes fly high in a flurry. They also melt.