The blue pill or the green pill – following the rabbit-hole to Tableau Wonderland
The company I work for is evaluating tools for BI/Visualization, and Tableau is one of the front runners that we are considering.
I’ve been fortunate to be able to attend this week’s Advanced Tableau training in Vancouver. Our trainer is one of the Tableau Jedis, Interworks’ Director of Business Intelligence (BI) Dan Murray, and boy does he know how to impress.
At first, because of budget constraints, I was contemplating on whether I should just watch the videos – since Tableau has graciously published a number of them allowing anyone to learn the product. But I’ve always thought that there’s always something to learn from in class training – especially if you get an awesome instructor. And I am fortunate – and very thankful – I did get an awesome instructor. I cannot believe how much he packed in two (2) days of training. Don’t get me wrong, my brain was full after two days – but even after the class ended I just wanted to keep on going and try out more Tableau stuff.
I won’t go through each topic that we covered, but for anyone interested, the curriculum for the Advanced Tableau training is posted in the Tableau site. Upcoming training sessions from Interworks are posted in the Interworks site.
Anyway, just wanted to share some of the tidbits I have learned:
- Extracts allow you to work with your data really fast.
- True story – in v6 an 80M record Tableau extract took > 30 mins to load. In v7, it took ~ 1 second.
- The color of the pill matters. BLUE means discrete. GREEN means continuous.
- Tableau can do inner, left, right join, or union, although Tableau usually doesn’t recommend unions, they don’t guarantee performance
- Tableau works great with relational data, even flat files. Within the same connection, you can do JOINs on your tables, but within different connections you can BLEND. JOINs happen at the server (or source), BLENDs happen locally.Tableau JOIN
- Ctrl + Dragging a pill allows you to copy the pill and repurpose
- Actions are almost always faster than parameters
- Quick filters are great, but use sparingly. When dashboarding, real estate is expensive (sounds like the Vancouver real estate market)
- When adding reference lines, invoke reference line from axis you want to reference from
- Annotations can be risky – might fly around when your data changes
- Tableau tooltips look awesome by default.
Data Exploration and Visualization
- Easy is hard. There’s a lot of work in making something easy.
- Don’t be afraid to explore your data. Sometimes, you know which questions to ask, but sometimes you don’t. In either case, don’t be afraid to experiment, explore. You might be surprised at what you can discover.
- Dan differentiates data into 3 Types:
- Type 1 – Data you know (normal BI) – ex sales, profit etc
- Type 2 – Data that comes from Type 1, is the blip, is the explanation to the question.
- Type 3 – Data you needed to know that you didn’t know you needed to know; Real data discovery; Can usually be explored with scatter plots; This is where jaw dropping starts
- Scatter plots are a great way to explore data.
- If you’re serious about visualization, you’ll read Stephen Few’s books and watch Hans Rosling TED talks. Also check out Deiter Rams and Iain McGilchrist‘s The Divided Brain on Youtube
- When dashboarding – always ask yourself the question – what story am I trying to tell?
I am pretty excited to start working with Tableau. I plan to do more Tableau-related posts and tutorials in the future (of course incorporating all the tricks I’ve learned from this training session).
I work with SSRS a lot, and I can see how SSRS and Tableau can be great complements.