Data explosion: The ugly truth facing modern marketing technology stacks

Marketing technology is a fast-growing industry. It’s worth $230 billion each year and growing 20% year over year, Singular CEO Gadi Eliashiv said recently at UNIFY.

But that’s slow growth compare to marketing data itself.

“Marketing data is exploding,” Eliashiv said. “It’s growing much faster than the industry itself.”

Why?

There are more connected people, many with multiple devices. That’s more digital activity, all of which generates more data and more statistics. There are more software solutions for both martech and adtech, and each of them ingests, consumes, and generates additional data.

And with that increased digital activity — more of the customer journey is digital now than ever before — marketers have built more metrics to understand what visitors and users and customers are doing.

The current marketing tech stack for an enterprise can easily include more than 100 martech tools, Eliashiv said. The average enterprise currently has 91 cloud services for marketing, according to Netskope data cited by Kleiner Perkins and “chief martech” Scott Brinker.

This puts huge power in the hands of marketers.

But it’s also a huge problem.

“This creates major challenges for marketers,” Eliashiv says. “The data is siloed, the data is non-standardized, and the data is not actionable.”

If it was only siloed, the solution might be simple, though tedious: logging into multiple dashboards, downloading multiple PDF reports, exporting multiple Excel spreadsheets, and combining them all in an internal BI system, or a monster spreadsheet.

And … doing the same task every single week (unless you want more real-time data, in which case you could do it more often.)

But the data is also non-standardized. Naming conventions differ. Definitions of terms like “viewable” differ. Percentages are on different base figures. Conversions mean different things in different systems. So the data needs to be normalized in order to make sense.

Only then is it truly actionable.

“We make sense of it all,” Eliashiv said. “We built an infrastructure that will collect all the information from every solution possible, and then offer insights on top of it.”

That includes marketing data: what the team is doing, where they’re spending money, and what campaigns are going on across all channels and partners. It includes attribution data, which is simply linking that marketing data with outcomes. And it includes customer data: the KPIs or actions that marketing departments are trying to drive.

“The core challenge for marketers is how you make your data actionable,” Eliashiv says.

“To help marketers succeed in this fragmented space, we’re doing three things: connecting all the data from all the silos, standardizing this information so it is ready for consumption and analysis, and analyzing the information and making it actionable.”

Those three simple-sounding steps?

They take the data explosion — an ugly, inconvenient challenge for many modern marketers — and make it an incomparable asset.

Go deeper: Find out how Lyft and Match accelerate their growth.

Unify: Bringing the Marketing World Together

A little over a month ago we held our inaugural UNIFY conference.

The dream for this conference started years ago. Singular is a community, and our employees, partners, and customers are our family. Our family’s mission is to unify data across the marketing ecosystem, and we do that for the best and brightest in the industry.

Our goal since day one — and our goal at the UNIFY conference — was to share the insights we unlock to marketers everywhere.

The best part about UNIFY?

We learned from some of the best about how to beat the rest. A literally all-star cast of speakers shared some of the ways they became all-stars.

Something even better, however, is that over the next week or so, we’re going to share their insights with you … right here on the Singular blog.

That includes top executives from Lyft and Poshmark. Senior leaders from LinkedIn, and VPs from companies like Calm, JamCity, Postmates, and N3twork. We learned about fraud from IronSource’s head of growth, and multi-touch attribution from Lyft’s head of marketing science. Top marketers from Instacart, Kabam, Small Giant, and Riot Games shared how they build their tech stacks and what drives outsized growth. Yelp execs spoke, along with leaders from Nexon, SmartNews, and Grow.co.

And Singular’s own CEO, Gadi Eliashiv, shared what’s changing in marketing technology, how those changes are impacting growth teams, and what marketing leaders can do to win today.

But there’s more. And it’s just for you.

We’re going to be sharing detailed insights on how all of these marketing superstars lead their world-class organizations … with full video.

I’ll update this post with the links as those posts go live over the next 7-10 days. So I strongly advise you to check this space regularly to learn from the leading stars of our industry. Over the next few days, you’ll get a chance to peek inside the curtain and see part of what made them so successful.

Our goal? Providing the tools that you need so you can join them.

If you came to UNIFY: thank you. You helped make it a truly special occasion.

But if you couldn’t get a ticket this year … we look forward to seeing you next year!

Throwback Thursday: A walk down memory lane

We recently celebrated Singular’s 4 year birthday, so for this Throwback Thursday we’d like to take you on a walk down memory lane – and we’re throwing in a few old pictures for posterity.

Let’s set the scene… The year was 2014. “The most famous selfie in the world” was taken, Kim K stopped being Selfish and got married, the selfie stick was EVERYTHING, and you knew at least one person who camped out for the Apple Watch. And somewhere in a San Francisco living room, a brighter future for marketers was being imagined.

When Singular was founded, we were just a small team of three with big dreams. Our founders Gadi Eliashiv, Eran Friedman, and Susan Kuo came up with the idea for Singular after working extensively in the mobile growth industry and noticing a huge void in the resources available to marketers. From their experience together at Onavo (acquired by Facebook), they knew that marketers were constantly facing issues resulting from a lack of data accessibility in their marketing stacks, and knew that this absence of structure was limiting marketers’ ability to strategize and optimize their campaigns.

So they formed Singular with the goal of creating a single source of truth for mobile marketers, and we’ve been building off that goal ever since. In June 2017, we acquired Apsalar and worked hard to merge our complementary technologies. Now, not only do we offer the most sophisticated campaign analytics available on the market, but we also combine that marketing data with attribution data in a single dashboard, allowing marketers to uncover ROI at the most granular levels possible.

Since 2014, Singular has accomplished a lot more:

  • Grown to 140+ employees around the world
  • Opened 7 offices internationally
  • Tracked over $10 billion in ad spend, 3x larger than any other provider in the analytics & attribution space
  • Integrated with over 1,600 partners
  • Live-streamed our CEO’s Tel Aviv wedding to our offices across the globe in September 2016
  • Flew high on a trapeze in December 2016
  • Released the first industry-wide ROI Index in February 2017, which has since become a staple annual report for marketers
  • Created our Unified offering (Singular Analytics + Singular Attribution) in June 2017, now used by 60% of our customers
  • Completed our 100,000th push-up in July 2017
  • Levitated and meditated in aerial yoga in July 2017
  • Released our first Fraud Index in November 2017
  • Survived an SF earthquake that decimated our whiskey collection in December 2017
  • Released our first State of the Industry report in January 2018
  • Held our 50th full company all-hands, Kablash, in May 2018
  • Belted out killer karaoke in the 4th international city on our world tour in June 2018
  • Hosted our inaugural growth summit, UNIFY, in June 2018

 

 

Though we have grown in both size and ambition (much like Drake’s repertoire of dance moves) since 2014, this team still works as hard and as passionately as we did when we were just a handful of people in shared office space chasing a crazy dream. We have overcome countless roadblocks and shifts in the industry over the past four years which have helped shape our roadmap and strengthen our collective vision to make marketers’ lives better. We’re more excited than ever about what the future holds – as long as it isn’t another new type of MacBook port.

Interested in joining our tribe? We’re hiring engineers, support, sellers, marketers and more. Apply today for a role in one of our seven international locations! Must love dogs and GIFs.

 

Top Takeaways From Singular’s ‘Spot the Fraud’ Event

Singular recently hosted our first-ever “Spot the Fraud” event, where mobile marketing and product leaders from companies like Airbnb, Instacart, Postmates and N3TWORK gathered to discuss the future of mobile ad fraud prevention.

At the event, Singular CEO Gadi Eliashiv unveiled Singular’s enhanced Fraud Prevention Suite as well as The Singular Fraud Index, a first-of-its-kind study revealing the industry’s most effective fraud prevention methods along with the 20 mobile ad networks driving the lowest rates of fraud.

The full presentation can viewed by clicking here and below are the top takeaways from the presentation.

Mobile Marketers, Still Unprotected

In the process of analyzing fraud data from hundreds of the world’s largest mobile apps, Singular’s Fraud Prevention Team uncovered a startling stat: 63 percent of marketers do not utilize fraud prevention techniques in their mobile marketing systems. Part of the blame lies with analytics providers that treat fraud prevention as a luxury, offering it to marketers as a “premium” add-on rather than a feature deeply embedded in the core platform. Equally problematic is the complexity of existing fraud offerings — for instance, attribution platforms that place the burden on marketers to run their own statistical analysis to detect fraud.

The Threat of Attribution Manipulation

Today the majority of mobile ad fraud is perpetrated via attribution manipulation, whereby fraudsters steal credit for installs from organic and paid channels. Each month, attribution manipulation attacks account for roughly 70 percent of prevented ad fraud, recent data from Singular shows, costing ad networks and marketers money and corrupting performance data in the process.

But big tech players are working to fix the analytics holes that have allowed fraudsters to steal credit for installs. For instance, Google’s latest analytics release, the Google Play Referrer API, helps marketers and their attribution providers combat attribution manipulation with Referrer and Timestamp data received directly and securely from Google.

Fraud Keeps Getting Smarter

Singular’s Fraud Prevention Team uncovered a new form of fraud being perpetrated by malicious mobile apps, some of which have millions of users. This form of fraud, called Referrer Injection, leverages Android’s internal messaging system to inject a fake referrer into an attributed ad click, thereby assigning credit to the fraudulent source. Because attribution systems prioritize referrer messages over other touchpoints, Referrer Injection is a particularly effective way for fraudsters to steal credit. Thankfully, Google’s new Referrer API allows Singular to stamp out this form of ad fraud. However, Referrer Injection highlights the fact that marketers must reject complacency given the constantly evolving threat of fraud and implement fraud solutions that adapt to increasingly sophisticated fraud tactics.

Safe Havens

Even with fraudsters siphoning off billions from advertising budgets each, there are safe havens for mobile ad dollars. Singular analyzed mobile ad fraud data anonymously collected from mobile marketers within a 30-day range to identify the most secure mobile media providers, or those with the least fraud. The 20 most secure media providers are capable of driving both significant volume to marketers while keeping fraud rates well below the industry average. To view the complete list of of mobile ad networks driving the lowest rates of fraud, download the full report here.

Comprehensive Protection

As The Singular Fraud Index shows, currently there isn’t one fraud prevention method that reigns supreme in blocking the majority of mobile ad fraud. Singular’s breakdown of the most effective fraud prevention methods reveals that the marketing industry uses a handful of prevention methods to combat mobile ad fraud, with each method blocking its share of total fraud prevented each month.

That’s why Singular’s new Fraud Prevention Suite utilizes all known prevention methods to provide maximum protection against existing fraud threats as well as unknown threats to come.

“You don’t want to chase fraud manually,” Gadi said in his presentation. Marketers don’t have the time and, in most cases, the expertise to identify what is fraud and what is legitimate activity.

Singular is excited to equip its customers with a transparent, flexible and proactive fraud solution that analyzes countless fraud signals at the most granular levels and automatically applies rules at the time of attribution to stop fraud in its tracks, keeping sources clean and media budgets focused on quality users.

Learn more about Singular’s enhanced Fraud Prevention Suite here and download The Singular Fraud Index here.

How Singular Delivers Blazingly Fast App ROI Analytics

Mobile app marketers need access to increasingly granular data to accurately measure app ROI — which inevitably requires processing huge volumes of data on the fly. This can dramatically slow down database query times, creating bottlenecks for marketers and preventing them from optimizing as quickly and as often as they’d like to. Call it the Catch-22 of modern-day mobile marketing: an ever-increasing need for speed amid an ever-increasing flood of data.

At Singular, the issue came to the fore ahead of our latest analytics offering for marketers, Publisher ROI. Publisher ROI allows marketers to quickly expose a breakdown of an ad network’s inventory by publisher and determine the individual sites and apps driving the best performance.

Early testing of the feature showed that customer queries for publisher-level data required an enormous increase in computing power — to the tune of 50-100X the data normally being ingested and processed for queries in Singular. As we saw query times spike during testing, it became clear that we had hit a major startup milestone: We had outgrown our database technologies.

In order to launch our latest innovation, and continue offering mobile app marketers fast and flexible access to increasingly granular marketing data, we would need to introduce a new data pipeline and datastore — one that was capable of enabling ad-hoc queries on a billion rows with sub-second performance.

In this post, we’ll describe how we rebuilt certain components of our database technologies and dramatically increased the speed of our customers’ queries — in some cases by a factor of 150X.

But before we dive into how we accelerated our customers’ queries, it’s worth giving some background on the evolving needs of mobile app marketers, and the sheer size of the data challenges we face in providing them with an analytics platform as flexible as Singular.

From the early days of Singular, we kept a close eye on the kinds of queries our customers were running in Singular. We tried to spot patterns in order to build systems that anticipated the dimensions a marketer might use in their queries. But we quickly found that there was a huge diversity in the reports and queries customers were running, with countless combinations of different dimensions and metrics.

Think about it in terms of rows in a table. Say you advertise on an ad network that runs your ads on 100 different publishers. You might check the overall ROI of the network — with Singular, that’s just one line of data. To then expose a breakdown of each individual publisher’s performance in order to segment high-performing from under-performing publishers, you would need to render 100 rows of data. Now say you’re running 4 multi-country campaigns and you want to see if certain publishers are performing particularly well in certain Geos — now you’re looking at 400 rows. Want to a break out data by iOS and Android? — that’s 800 rows.

This kind of complexity led to our first conclusion about the database architecture that could support our needs. Because usage of Singular is so diverse, with countless combinations of query dimensions, we needed to support ad-hoc queries, or queries that cannot be determined prior to the moment they are issued. As most data engineers know, databases that are truly optimized for ad-hoc querying are usually not good at updating data after it is ingested. We therefore needed to ingest data into our database in its final form.

Data Ingestion at Singular

At Singular, we run around 10,000 marketing data collection tasks a day. Each collection task is responsible for collecting data for a specific customer from one of its third-party marketing vendors from a window of 1-30 days back. Our ingestion pipeline is distinctive in the volume of data each task collects (granular Facebook stats for 30 days is huge) and because we collect data from vendors who tend to update data retroactively on a regular basis, which requires Singular to constantly swap out old statistics with newly updated data.

Our ingestion pipeline previously loaded data into MySQL and kept running various ingestion logic by querying the data and using updates. After the data was available, the queries triggered by our dashboard and API would hit MySQL as well.

This has been working well for us — that is, until we introduced publisher-level collection and querying. With the large increase in data volume, loading to MySQL became slower, creating bottlenecks in our pipeline. In addition, running analytics queries on this volume of data was simply too slow.

Designing a New Ingestion Pipeline

To support an infinite number of tasks, with an ever-growing size of ingested data, we aimed to build a pipeline that was horizontally scalable (unlike MySQL). The choice of Amazon Simple Storage Service or “S3” to support these dependencies was obvious. We had already been using S3 for backing up our data before inserting it into MySQL. Plus it is horizontally scalable, requires zero ops and offers fast download/upload speed when working within Amazon Web Services.

Thus, our new ingestion architecture relies on S3 with only the metadata stored in MySQL. Instead of querying and updating MySQL, each component of our pipeline receives an S3 file as input and passes on another S3 file containing the data it received together with the new information the component produces.

The end of our pipeline sends the data to an S3 bucket, containing the most up-to-date statistics per customer, marketing source and date. This enables running data collection tasks in parallel, and makes this bucket the source of truth for our system.

Towards a New Datastore

With our data in its final form, ready to be queried in S3, our choice of datastore was super flexible. We had previously built a small abstraction layer between our API and MySQL, which could be adapted to support any query language or schema. Thus, in evaluating new databases, we knew beforehand that the decision wasn’t final, as switching costs were so low. In the end, we selected Druid, an open-source data store developed for the exact need of aggregate queries over marketing analytics data.

Once implemented, we were thrilled with the results: With Druid, queries that once 60 seconds took 1-2 seconds, while queries that once took 30 seconds took less than a second. In certain cases, we saw improvements in database query times as high as 150X compared to the old system.

All these developments bring us to earlier this year, when we began switching on Publisher-Level ROI for select customers in a closed beta test of the feature. The beta would serve as the ultimate stress test for our new architecture.

Granularity & Speed for the Win

As a refresher on Publisher-Level ROI and why it’s so groundbreaking, here’s a little industry background. Ad networks typically purchase inventory consisting of ad slots in hundreds, sometimes thousands, of sites and apps. These sites and apps, known individually as “publishers”, are where marketers’ ads run.

In recent years, mobile app marketers have demanded more visibility into publisher-level performance in order to identify pockets of their most valuable traffic. Networks have responded by providing Publisher IDs in the performance data they expose to marketers. Marketers, in turn, use these Publisher or Site IDs to optimize, increasing spend in high-performing publishers and decreasing spend or “blacklisting” under-performers.

Historically, however, publisher optimization across multiple networks has been an arduous and error-prone process, requiring marketers to toggle between multiple dashboards and manually update unwieldy Excel files. The process is particularly painful for marketers who wish to analyze the performance of publishers not merely by click-through rates or raw install count but rather by the actual quality of those users, as measured by ROI.

Singular automates the process of collecting all your marketing data under one roof, before cleaning and combining the data with revenue and events retrieved from tracking links in order to expose app ROI and other full-funnel business metrics.

But, as we’ve shown here, once ingested, enriched and combined, making publisher-level data fast and flexible is a whole ‘nother beast. Which is precisely why we invested so heavily in our new pipeline and datastore. Mobile app marketers need both granularity and speed — and we’re proud to say that the results speak for themselves.

As we’ve rolled out Publisher-Level ROI to our beta test customers who are now running on our new Druid-based system, customers have reported lightning-fast load times, even with the massive increase in data volume.

And the performance improvements aren’t limited to publisher-level querying. With Singular, similarly data-intensive querying — to expose Campaign, Country, Creative and User-Level performance — is now faster than ever thanks to these advancements in our new database technologies.

Top Takeaways From AppBoy’s LTR Summit

At Singular, our goal is to help grow the app industry by participating in key mobile strategy discussions wherever they occur. Last week Singular attended AppBoy’s Long-Term Relationship Conference, where mobile marketing strategy and product leaders gathered to discuss the future of mobile marketing and user communication. The event focused on today’s rapidly evolving mobile app marketing stack and featured talks with innovators at companies like Lyft, Ibotta, OkCupid and Wallapop (all Singular customers!). Here are top mobile marketing strategy takeaways from the event:

1. An inflection point for “buy vs. build”

OkCupid historically built most of its mobile app analytics services in-house. Yet things have changed in recent years as the team has started to hook more third-party services into their mobile app marketing stack. “For most our history we felt we were ahead of the curve, but many of these components have rapidly advanced over the years,” said Mike Cirello, Software Architect at OkCupid, who mentioned mParticle, Looker, Amplitude, and AppBoy as some third-party tools used by his team. Cirello’s sentiment echoed that of several speakers who reported that their mobile app marketing teams are using more third-party vendors to achieve efficiency gains in areas like data management, data processing, product experimentation and customer support.

For Lyft, the decision to build or buy components of its mobile app marketing stack largely comes down to speed. As Milan Thakor, Lyft’s Passenger Engagement Lead, said:

“We need to know [the third party] moves really quickly and will build faster than us, and that their vision is aligned with ours.”

2. The danger of disconnected data feeds

In recent years there’s been an exponential increase in the number of mobile marketing data feeds inside organizations, said Michael Katz, Founder of mParticle. Marketers, meanwhile, have quickly learned that siloed data can lead to poor user experiences. For instance, if mobile app customer data is not connected to marketing automation, a user who submits a complaint because of an incorrect order might still get emailed with new deals prior to the complaint being resolved. Or an iOS or Android app user visiting a city might receive location-specific offers in that city even after they’ve departed. It’s clear how those two experiences could frustrate app users. To avoid such messaging mishaps, companies must ensure that their marketing tools effectively communicate with their customer data.

3. Optimizing for downstream events

Bait and switch ads might drive high iOS and Android app ad click rates, but they don’t pay off in the long run, said Rich Donahue, Ibotta’s SVP of Marketing. Instead, Ibotta optimizes its ads and messaging for downstream events, including testing 97 different onboarding flows in its Android or iOS app. In addition, Ibotta runs cross-channel messaging tests that leverage interconnected analytics systems — for instance, tests to determine how paid ads affect the open rates of emails and push notifications.

The Wallapop business, a mobile marketplace for secondhand goods, is similarly focused on optimizing its user acquisition and re-engagement for key actions in its app. Users of Wallapop who haven’t opened a conversation with a seller in the first 7 days after registration are much more likely to drop off, said Nicolás Herrero, Wallapop’s Lead Data Scientist. Such mobile app strategy findings have led to Wallapop building campaigns around custom events and triggering messages if a user hasn’t performed a certain action within a given period of time.

Download The Singular ROI Index to see the world’s first ranking of ad networks by app ROI.