Create High-Impact Data Visualizations: Nine Effective Strategies

Green_Visual I believe deeply in the value of making data accessible.

In service of that belief, there are few things that bring me as much joy as visualizing data (smart segmentation comes close). There is something magical about taking the tons and tons of complexity that lurks in our data, being able to find the core essence, and then illustrate that simply. The result then is both a mind and heart connection that drives action with a sense of urgency. #winning

While I am partial to the simplest of visualizations in a business data context, I love a simple Bar Chart just as much as a Chord or Fisher-Yates Shuffle. As we have all learned, tools matter a lot less than what we do with the tool. 🙂

In this post I want to inspire you to think differently. I’ve curated sixteen extremely diverse visualization examples to do that. By design none of them from the world of digital analytics, though I’ll stay connected to that world from a how could you use this idea perspective. My primary goal is to expand your horizon so that we can peek over and see new possibilities.

To spark your curiosity, the visuals I’ve worked hard to find for you cover the US debt, European politics, lynching and slavery, pandemics, movies, gun control, drugs and health, the Chinese economy, and where we spend our lives (definitely review this one!).

The sixteen examples neatly fall into nine strategies I hope you’ll cultivate in your analytics practice as you create data visualizations:

This post has quite a bit of depth, and loads for you to explore, reflect and internalize. It will take a few visits to absorb all the lessons. In as much, my recommendation is to read one section per day. Take time to really understand what’s going on, go to the site, play, look at the higher resolution versions (click on the images), make notes of what you’ll do for the first time or change about what you already do. Most importantly, practice taking action. Then, come back, read the next one and take action. I promise, the rewards will be rich.

Let’s go make you an even more effective influencer when it comes to data!

Strategy 1: The Simplicity Obsession

One of the reasons so many visuals are so very complex is that the Analyst/Creator is trying to demonstrate how clever they are. Sadly in the process of demonstrating aforementioned cleverness, the visuals ends up being incredibly complex crammed with every little bit of amazesomeness they  are trying to demonstrate…


us maximum personal income tax rate vs national debt burden per capita
(Click on the above image for a higher resolution version)

There is absolutely no doubt in my mind that the Creator worked very hard, and, I sincerely mean this, they are very clever.

The problem is that the essence of what they want to communicate is probably only known to them, or to any person willing to take the time to first learn the job of the analyst, dig into the data themselves, create this picture and then understand what is being said.

It breaks my heart.

Go on. Scroll back up. See if you can understand what is being said.

In my humble opinion there is an additional subtle problem. The Creator was asked to plot the data, or perhaps share the insights, but it is unclear whose job it was to answer this simple question at the end: So What?

When you start with that as your destination, so what, as the creator of any visualization you are going to ask for a lot more context, you are going to make sure the visual is in service of the answer, you’ll make sure your cleverness is focused on the outcome the data has to serve.

Please, please, please keep that in mind.

The complicated thing above is trying to highlight an important trend, is missing the context, and is simply not as dramatic as the reality of it actually is!

Here’s a better visual showing the National Debt Burden, with four additional elements of context…


debt_chart_2016_sm
(Source)

Did you get what the point was in zero seconds?

Are you a whale-load more scared as you contemplate the red and the green?

Are you freaked out that if there is one thing both political parties in the US seem to be good at it is the red (!)?

That is what a good data visual does.

For the few of you that are a part of the team I lead, in addition to creating a visual for your analysis that is simple and effective, you know that my expectation is that you’ll come with recommendations on what to do.

To demonstrate that there are many paths to JesusKrishnaAllah… Here is another simple view of the debt, with a different x-axis, a stretched out y-axis, along with a different set of context…


federal_debt_past_future
(Source: CBO)

Different questions, different arguments, different outcomes. But, you’ll get to them much, much, much faster than the first visual.

I can’t stress this enough: Don’t try to earn your performance review from the client/audience. Earn it from your boss. Tell your boss how hard you worked, show her how clever you are, earn her praise. Spare your client/audience – show them the simplest manifestation of your brilliant insight, with the NACR criteria applied.

(For more on using NACR to identify out-of-sights, see TMAI #66.)

Strategy 2: If Complex, Focus!

You are going to see my deep bias for simplicity for the rest of this post (or in the 745,540 words written on this blog thus far). I do not want to come across as a simplicity snob.

Deployed well, there are instances where I love complexity.

I thought this was exceptionally well done…


earth_s_oldest_trees__paukee-sm
(Source: Michael Paukner  |  His Flickr collection)
(Click on the above image for a higher resolution version)

While it is a little difficult to follow all the arrows back to the original country, the shape of the graphic is an homage to the visual’s topic. The background color could not have been more prefect. And, notice there is just the perfect amount of information about every tree.

There are other more subtle things to admire. I love, love, love that Michael put the US on the right. When we “trip up” our audiences like this,  it gives them a pause and forces them to look at all the other information more carefully.

There is of course data itself that gives you many pauses. Notice the youngest tree in the graphic is older than Jesus Christ. Or, that we should all be so glad that the American West was settled last (by then we were more appreciative of nature as humans).

I am fine with complexity, if the essential makes it through. I am fine with complexity, if someone who’ll spend 1/100th of the time on the visual compared to you get’s it.

Strategy 3: Venn Diagrams FTW!

I love Venn diagrams. Ok, strictly speaking Euler. But, let’s not get pedantic.

I’ve used them to simplify the presentation of complex topics. Ex: Six Visual Solutions To Complex Digital Marketing/Analytics Challenges

I am only slightly kidding but one of humanity’s most complex undertaking is to understand what the heck Europe is. One end’s up ruing even asking, because you hear back EU, EEA, Euro Zone, Schengen, EFTA, and more.

I felt Bloomberg did a wonderful job with, what looks like an amoeba-inspired, Euler diagram…


eu_explained

(Click on the above image for a higher resolution version)

The color schemes are contrasted enough to allow you to follow along nicely.

The context from the sizes of the economy is a nice touch. (This is embarrassing but I was surprised how big Italy is, and how small Sweden is.)

The clusters of countries next to each other, for the sake of cleaner lines, all by itself has a built-in message. Cyprus and Ireland. UK, Romania, Bulgaria and Croatia. So on and so forth.

Overall, this is a topic that has been tackled numerous times, with painful to see results. Bloomberg managed to make it as simple as possible, with valuable built-in context.

Staying in the same geographic area, and my Euler-love, here’s another fantastic visualization of often a very complicated answer: What is each political party in the UK promising?

I adore this as the answer…


uk_election_manifesto_economist
(Source: Economist)
(Click on the above image for a higher resolution version)

Would you have believed that the totally out there UKIP would have something on common with Labour? Or that Labour is completely alone in the minimum wage issue?

The visual makes it easier to understand what we might be most interested in from the thousands of pages that form each party’s manifesto. You, the audience, is now empowered to agree more passionately with your party or feel the uncomfortable squirming that comes with realizing what your party is solving for. Both. Fantastic. Outcomes.

Clearly this is a political picture, and someone has to decide what to include and what to exclude because the parties promise the Earth, Moon and the Andromeda galaxy. But that is the life of an Analyst… They have to make tough choices.

Two hopes.

1. I hope every single news organization in every single country in the world will copy this visualization and create it for their main political parties. (Also see related NYT example on Guns below.)

2. What will you do with this? Can you pull out all the content types from your digital existence and create a visual like this one for which goal (overlapping goals) each type is solving for? How about displaying countries and products purchased? Oh, or your main traffic sources and the visitor acquisition metrics?

So much to do, so simply, and so little time!

Strategy 4: Interactivity With Insightful End-Points.

There is a common belief that your company’s decision makers would use data more if they could explore it – more efficiently, deeper, etc. This is almost never true, primarily due to the problem outlined in the orange and blue triangles that outline skill/competency and insights/action.

Hence, in a business context I rarely advocate for initiatives whose only purpose is to allow the broad collection of company employees to go on random fishing expeditions.

Exploratory environments can be useful, especially when they are 1. sharply focused 2. have an ability to eliminate dead end-points and 3. allow for smart elements like modeling. Let’s look at the first two below and the third one in the following example.

Here’s a valuable dataset from the Equal Justice Initiative on Lynchings in America.


lynchings_america_eji
(Click on the above image for a higher resolution version)

Even at a glance the data is useful, along multiple dimensions.

In this case exploration of the data makes it even more valuable. You hover your mouse over your area of interest, and click…


lynchings_alabama_eji

You get your data drill-down, but what’s of most impactful is that you also get an end-point with a valuable insight providing meaning to the data.

In this case the number 29 for Jefferson County would be an insufficiently valuable end-point. The inclusion of Elizabeth Lawrence’s story on the other hand provides meaning. That is what gives the exploration a purposeful end-point.

You can now zoom out, move on to exploring other areas, continuing to get enriched value from the data.

In a business context when you are working with interactive data visualizations, ask this very valuable question: In a sea of data, whose job is it to include a logical end-point with an insight of value?

Surely, your terabytes of Google Analytics data dumped into a Tableau exploratory thingamagigy won’t magically throw them out there.

Surely, lay business decision makers, even senior ones, won’t have all the context they need to have to convert thingamagigy fishing expeditions, sorry, explorations, into the brilliance you feel the data contains.

Interactive visualization are great, only when packaged with insights for actions at logical end-points in exploration. Tweet that.

[SIDEBAR]
This is a difficult example to share because of the deeply emotional content it contains. But, those who do not learn from history are doomed to repeat it. Beyond the value of the lessons from the visualizations, I encourage you to explore rest of the EJI website. At the very minimum please consider spending five minutes listening to the story of John Hartfield told by Tarabu Kirkland, and six minutes on the story of Thomas Miles Sr told by Shirah Dedman. Thank you.
[/SIDEBAR]

Bonus: Another insightful visualization on this topic is at pudding.cool, The Shape of Slavery…

mapping_slavery

A bit more complex of a visualization, a function of the depth of data populated.

Follow the story of Louisiana as you reflect on the data.

Lots of data visualization, storytelling and life lessons in this data set as well.

Strategy 5: What-if Analysis Models.

Building on the thought above, if you create exploratory environments it can be exceedingly accretive to decision-making if we build in what-if type models. Rather than stopping at an end-point, provide an option of doing some type of sensitivity analysis with the goal of prodding the audience to take action.

For example… Let’s say they end up looking at Visitors, Conversion Rates, and Revenue. You can easily imagine how you want someone to explore that data by traffic sources or campaigns or geo or myriad valuable dimensions. You can create an environment where they press buttons to get that data.

Necessary, but not sufficient.

Why not build in a model where the decision maker can change Conversion Rates, to see the impact on Revenue? Move it from 1% to 1.5% to 8%. See what happens by traffic sources. Then, make a smarter decision.

Or, empower them to play with discounting strategies. What happens if they offer a 5%, 10% or 18% discount? Show impact on Revenue and Profit.

Even without bundling insights into your prepackaged environment, the what-if models allow your decision makers to play with scenarios, understand impact and make smarter decisions about what to do.

That’s the key. Don’t make visualizations with dead ends.

Here’s a great example of that from Mosaic. The visualization is about outpacing pandemics.

Quoting them: Vaccines are an essential weapon in fighting disease outbreaks. But how does the time taken to develop vaccines compare to the speed and frequency of outbreaks? And how can we do it better?

This is the simple view that greets you, outbreaks from 1890 to 2016 with vaccine development during that same time…

outpacing_pandemics

Each element is clickable.

As an illustration, the longest bar is Typhoid fever and the smallest, mercifully, is Measles. For each bar, click on Measles, you’ll see the first big outbreak (1917, 3,000 deaths) and the last (1989, 123 deaths). It is really easy to explore the data.

What I love is the sensitivity analysis.

Click on the yellow dot, and you’ll see that in action. First, you see what actually happened…

outpacing_pandemics_ebola

Simple exploration. Good reporting. Easy to understand.

The buttons with the number of weeks represent what I wanted to highlight here. Click on them, and it demonstrates what the outcome would have been if action was taken earlier.

I choose 22 weeks…

outpacing_pandemics_ebola_22_weeks

Even if the vaccine had been introduced after 22 weeks, a long time, we could have saved 1,628 lives!

The team also built in some hypothetical scenarios to help inform decision-making.

You can play with the implications of a fast-moving flu-like pandemic. It would have grievous overall impact, 30 mil deaths in 12 months.

But, what if we restrict 50% of the travel since we don’t have a vaccine yet. That would have an impact…

outpacing_pandemics_flu_travel_restrictions

Not quite as material as one might imagine, but it slows things down.

What if a vaccine was introduced 22 weeks in?

outpacing_pandemics_flu_22_weeks

Insanely helpful. 17 mil lives saved.

This type of modeling is rarer than seeing a rhino in the Ngorongoro crater. (We were there last week, you should go, it is pretty awesome.)

As an analyst, as a Big Data person, as a Data Scientist, pouring the right data on humanity is only marginally effective. In this example, in others above, I hope you’ll see the type of additional creativity we can bring to our work to power smarter decision-making. Starting with no dead end-points.

Strategy 6: Turbocharging Data Visuals with Storytelling.

You know this. Even if data is shared in a simple environment, most people are unable to internalize it. As has been hinted in most examples today, the problem is that the Analyst’s brain has not been packaged with the data.

The Global Gender Gap Report is a fabulous example how to solve this problem. The team nor only shares in a simple and beautiful environment, they also include the story they want to tell in that same environment. The output is not the reporting, the output are the conclusions from the Analyst’s brain.

It is very difficult for me to show the beauty of what they have done in static screenshots. You just have to go there and scroll.

Explore how the initial trend in the gender gap morphs into multiple visualizations, note the subtle but important emphasis on trends, and, most importantly, feel joy from how the story is presented with the data (text on the right).

The website and visualization will work on your mobile device (yea!), but it is best admired on the largest screen you can find.

To tempt you, let me just contrast the gender gap performance of the United States (precipitous decline in the last two years!) with… with… inspired by FLOTUS, the 10 year performance of Slovenia…

gendar_gap_browser

Play with the histogram and scatterplot options.

Go back and forth a few times (yes, gender parity is an issue I care deeply about), make sure you absorb the many nuances both in the story (why the above stinky performance by the US?) and the way the text (story) and the visualization (data) play together.

When you send data out, is it bundled with a piece of your brain?

Remember, you’ll be the last person with the intelligence and skills to understand the deep layers and nuances in what the data is actually saying (assuming you are an Analysis Ninja!). It is imperative that your brain go with the data.

Bonus 1: Another fantastic example of this type of sequential storytelling is Film Money….

film_money

Lars Verspohl takes you along on a wonderful journey through cost and profit structures of movies. Like me, you’ll love the simple and delightful visualizations, how gracefully flow it all flows, and that all the charts and data are primarily there to support the story that emerges from his analysis.

Please also note the thought put into the order in which the story is told, if and when the visualizations switch (from the one above) and the techniques deployed to keep you interested. All excellent, loads to learn.

Bonus 2: This is one subject, storytelling, that I just love, love, love. Indulge me as I pile on and share one more, dramatically different, example of storytelling where data and text go hand in hand.

The team at Reuters Graphics does a fab job of explaining China’s debt problem.

china_debt_reuters

Almost all the visuals are extremely simple. As you scroll through, observe though how they peel back layers of the onion one by one, segment the data, and zero in on the core point they want to make.

Really lovely. Worth emulating.

Strategy 7: The Magic of 2 x 2 Matrices

If you’ve read anything on this blog, you’ve read the importance of seeking why answers to provide critical context to the what answers that you get out of Adobe or Google Analytics. Hence, the amazing value of Surveys, Usability Studies (on or offline), Heuristic Evaluations, shadowing Customer Service calls, and more.

Customers are an amazing source of problems they are having, sometimes they are also a good source of ideas. The challenge is that if you ask people for their opinions you get tons of ideas.

How do you value them? How do you present them? How fast can you get from data to action?

One solution I love is a visualization strategy used by the team at the New York Times. The example illustrates, simply, the ideas related to an emotionally charged topic: Gun Control.

Everyone knows this is a polarizing topic. Friend against friend. Blue vs. Red. Police and minorities and every other combination thrown in. It is a mess.

But. Is it really as fraught with angst as we believe?

No. It turns out if you ask Americans about individual ideas that will reduce gun deaths… A vast majority of us agree!!

nyt_gun_control_ideas

The lowest supported idea is “Demonstrate need for a gun.” Support for it is just shy of 50%. A number that simply sounds unbelievable. 

Did you think vast majorities in our countries agree with these common-sense ideas? I have to admit I did not. It is hopeful data.

But, this is not the reason for the inclusion of this visual on our list.

Rather than just share the ideas, the NYT team added incremental value (remember packing the Analyst’s brain?) by asking Experts to opine on the effectiveness of each idea. That’s what you are seeing in the distribution above.

From the 2×2 matrix, here is the slice of ideas American’s support and the ones Experts say are effective…

nyt_gun_control_ideas_effective

There are only two ideas rated as ineffective by Expert, but are supported by over 70% of the Americans (national stand your ground law and honor out-of-state conceal and carry permits).

We all basically agree on ideas, and a lot of them will have an impact.

I love the presentation of the ideas and the fact that Experts were brought in to give valuable context. This is what I meant in my above example by not simply taking all the customer ideas and running with them. A wonderful way for you to visualize multiple ideas, and you can combine it with an Expert dimension or a Customer Satisfaction dimension or even a Revenue dimension to give context to the ideas.

One last element of value from NYT.

I’ve said that all data in aggregate is crap. I’m so happy that the NYT team also segmented the data.

What does Mr. Trump support…

nyt_gun_control_trump

What do American law enforcement support…

nyt_gun_control_police

And, lots more slices that make the data even more meaningful.

Segment. Always, always, always segment!

It is beyond the scope of this humble analytics blog to explore why in the face of such unanimity that nothing actually happens when it comes to reducing gun violence in the US. But, for lovers of data, for believers in the power of data to drive smart decision-making, this is one more reminder on the limitation of data if you can’t tell the story properly.

Strategy 8: Close Contextual Clusters.

Let’s close with examples of work that you’ll normally include in your enterprise analytics efforts.

Usually data we have is lonely. Just the Visits or Assisted Conversions or Order Size. Without other contextual elements, it turns out this data is less useful.

Consider this, conversion rate could go up by a statistically significant percentage… While revenue actually goes down. Or, the overall Visits to the site stay steady… But drop dramatically from your usually second highest source.

The European Monitoring Center for Drugs and Drug Addiction, also known by the gorgeous acronym EMCDDA (!), publishes a ton of data. Their Statistical Bulletin 2017 has a lovely collection of graphs and charts that we all use in some shape or form. The only difference is that we rarely report on Heroin Price and Purity. 🙂

emcdda_europa_heroin

Along with the use of (mostly) simple visuals to illustrate the data, I appreciated the context that they provide. Sometimes using the time dimensions, sometimes using geographic breakdowns, sometimes using two likely interplaying elements (like above), so on and so forth.

This simple strategy is quite effective at delivering insights – or at least causing the audience to ask relevant interesting questions.

I encourage you to take some time and explore the numerous examples on the site…

emcdda_europa_charts

I’m confident the visualization strategies will spark upgrades to the work you are doing at your company to communicate data more effectively.

Our friends at the EMCDDA mostly avoid two things that I find as poor practices in data visualization. They triggered this in my mind, let me take the opportunity of sharing them with you.

1. Never ever, never, never, never create the loooooooooonnnnnnnggggggg infographics that seem to be in vogue these days. Essentially they are taking 69 “slides”/graphs/tables and shoving them into a 9-meter-long thing that no browser can render decently. By the time you absorb the third screen full of stuff in tiny font/image, you’ve already forgotten what’s on the second.  You have many examples in this post as to how you can avoid making yourself look like sub-optimal Reporting Squirrel.

2. Pie-charts are a very poor data visualization choice. Humans find comparison by angles significantly harder than, for example, by length. I explain this a lot more in the May 14th edition of my newsletter The Marketing Analytics Intersect: Eat pies, don’t share them.

[You should subscriber to TMAI for a weekly dose of intelligence that’ll keep you at the bleeding edge of our industry.]

Bonus: In the spirit of government data, I’ll be remiss if I did not share with you three examples of interactive scatter plots from Our World in Data (produced by the University of Oxford).

The second one is timely, it shows how when we look at health spending and life expectancy the United States is a massive outlier (and not the good kind)…

owid_life_expectancy_health_expenditure

I love fusion charts, the first one on the site, Child Mortality vs. Mean Years of Schooling, is a good example of that as well. And, it shows great news.

Please review all three. Then, consider plotting one for your digital data. Conversion Rates by Discounts for Top Ten Traffic Sources. Time on Site by Visits to site for Content Types. And, more.

Strategy 9: Multi-dimensional Related Line Graphs.

One final example, to cause introspection about the final years of your life.

Wait. Things really got serious.

They did. But, I really do want you to lean into this one.

A small reason is that you are likely creating graphs like these every single day for your dashboards. I hope you’ll find lessons in how to make yours simpler. Notice the use of fonts and colors. Notice the labeling, or not, of the axis. And other little things.

A big reason is that I care for you deeply and I want this data to be a cautionary signal to all of us to possibly start making new choices.

The plots are from the American Time Use Survey, a multi-year study from 2003 to 2015 conducted by the US Bureau of Labor Statistics.

Age on the x-axis and hours we spend per day with on the y-axis…


american_time_use_survey
(Source: halhen on Reddit  |  Github)

In our 20s we’ll spend most time with our friends and our parents. Our partner and co-workers will take over our lives from then on through our 50s.

I’ll let you internalize the rest, and please share via comments what you see as the lessons in this data.

Three things stood out for me, as I consider the larger latter chunk of life. 1. We might be giving an extraordinary amount of importance to our co-workers, perhaps worth a rethink. 2. I love my spouse, regardless of who goes first, I felt very sad after staring at the Partner and Alone graphs. 3. The data demonstrated the value of loving oneself – of being proud of who you are, of being comfortable in one’s own skin. After all each individual will spend huge chunks of a decade plus… alone. You have from now until you are 50 or so to get there. Hurry!

: )

The power of great data visualized simply.

Closing Thoughts.

The sixteen diverse sources and visualization strategies help you think differently about how you are bridging the critical last-mile when it comes to impact from data – from you to the person who’ll take and action of business value. We don’t give enough time and attention to this last-mile.

While some of these clearly take special skills (especially the ones that tell integrated stories), I hope you’ll note that most of them are simple and ones that you can create with just a little more effort.

What’s most important today is that I’ve sparked your commitment to upgrading your personal data visualization skills.

Good luck!

As always, it is your turn now.

Which one or two examples did you like the most? Why? Is there a visualization technique you deploy in your analytics practice that’s not covered in this post? What barriers prevent you from improving your data viz skills? What are your pet peeves when it comes to data visualizations? Do you have go-to sources when it comes to inspiring you?

Please share your tips, best practices, critique, and praise for the people who created the above examples, via comments.

Thank you.

PS: I was not kidding in the opening of this post… I’ve written a lot about data visualization and shared guidance for this type of storytelling in numerous different contexts. To continue your immersion, here’s another collection of knowledge…

I hope you love it, and paint more beautiful pictures with your data.

Create High-Impact Data Visualizations: Nine Effective Strategies is a post from: Occam’s Razor by Avinash Kaushik



[Read More …]